Simple (mostly) variable parser

Question

Simple (mostly) variable parser

In one of my projects, I need to provide a very simple parser for finding and replacing variables (mainly for use in paths). Variables are used primarily during startup, and sometimes for accessing files (and not the main function of the program, just loading resources), so the parser should not be high-performance. However, I would prefer it to be thread safe.

The parser should be able to store a set of variables ( map<string, string> at the moment) and be able to replace tokens with the corresponding value in the lines. Variable values may contain other variables that will be resolved when using a variable (and not when adding it, since variables can be added over time).

The current grammar variable looks something like this:

 $basepath$/resources/file.txt /$drive$/$folder$/path/file

My current parser uses a pair of stringstream ("output" and "varname"), writes to the stream "output" until it finds the first stream $, "varname" to the second $, then looks up the variable (using the contents of varname.str() ) This is very simple and works great, even when navigating through variable values.

 String Parse(String input) { stringstream output, varname; bool dest = false; size_t total = input.length(); size_t pos = 0; while ( pos < total ) { char inchar = input[pos]; if ( inchar != '$' ) { if ( dest ) output << inchar; else varname << inchar; } else { // Is a varname start/end if ( !dest ) { varname.clear(); dest = true; } else { // Is an end Variable = mVariables.find(varname.str()); output << Parse(Variable.value()); dest = false; } } ++pos; } return output.str(); }

(error checking and removal)

However, this method does not give me the opportunity when I try to apply it to my desired grammar. I would like something similar to what Visual Studio uses for project variables:

 $(basepath)/resources/file.txt /$(drive)/$(folder)/path/file

I would also like to be able to:

 $(base$(path))/subdir/file

The recursion in the variable name launched me into the wall, and I'm not sure what is the best way to proceed.

I currently have two possible concepts:

Iterate over the input line until I find $, find (as the next character, then find a match) (counting the levels of inputs and outputs until the correct close pair is reached). Send this bit for analysis, then use the return value as the variable name. It looks like it will be messy and, nevertheless, will cause many copies.

The second concept is to use char * or possibly char * & , and move it forward until it reaches the final zero value. The parser function can use the pointer in recursive calls for itself when parsing variable names. I'm not sure how best to implement this technique, except that each call keeps track of the name it parses and adds the return value of any calls it makes.

The project should only be compiled in VS2010, so STL streams and strings, supported C ++ 0x bits and Microsoft-specific functions are fair game (a general solution is preferable if these changes change, but it is not necessary at this moment). However, using other libraries is not very good, especially not Boost.

Both of my ideas seem more complicated and messy than necessary, so I'm looking for a good clean way to handle this. Codex, ideas or documents discussing how best to do this are very welcome.

+4

c ++ string-parsing

ssube Apr 4 '11 at 2:30

source share

1 answer

Tony delroy · Accepted Answer · 2011-04-04T02:45:03+0000

A simple solution is to search for the first ")" in the string, and then jump back to see if there is an identifier preceding "$ (". If so, replace it and restart scanning. If you do not find "$ (" Identifier, then find the next ')' - if you are not done.

To explain: by searching ) you can be sure that you will find the full identifier for your substitution, which then has the opportunity to contribute to some other identifier used in the subsequent replacement.

Example

 Had a great time on $($(day)$(month)), did you? Dictionary: "day" -> "1", "month" -> "April", "1April" -> "April Fools Day" Had a great time on $($(day)$(month)), did you? ^ find this Had a great time on $($(day)$(month)), did you? ^^^^^^ back up to match this complete substitution Had a great time on $(1$(month)), did you? ^ substitution made, restart entire process... Had a great time on $(1$(month)), did you? ^ find this etc.

Simple (mostly) variable parser

More articles: