ANTLR: no viable alternative error

Question

I have a task to write a simple parser, so I wrote an ANTLR-like grammar and tried to parse a simple file, for example "foo: bar;", but I got the following output:

    [@0,0:2='foo',<1>,1:0]
    [@1,3:3=':',<16>,1:3]
    [@2,4:6='bar',<1>,1:4]
    [@3,7:7=';',<18>,1:7]
    [@4,8:7='<EOF>',<-1>,1:8]
    line 1:0 no viable alternative at input 'foo'
    (rule foo : bar ;)

My grammar looks like

    grammar parsGen;

    gram : rule SEMICOLON (NEWLINE+ rule SEMICOLON)* ;
    rule : lRule | pRule ;
    lRule : LRULEID COLON lRule1 ;
    lRule1 : (((LRULEID | STRING | SET) | LBRACE lRule1 PIPE lRule1 RBRACE) modificator? SPACE+)+ ;
    pRule : PRULEID COLON pRule1 ;
    pRule1 : (((LRULEID | PRULEID) | LBRACE lRule1 PIPE lRule1 RBRACE) modificator? SPACE+)+ ;
    modificator : PLUS | ASTERISK | QUESTION ;

    ID : LRULEID | PRULEID ;
    LRULEID : UPPERLETTER (UPPERLETTER | LOWERLETTER | DIGIT)* ;
    PRULEID : LOWERLETTER (UPPERLETTER | LOWERLETTER | DIGIT)* ;
    STRING : ('\''.*?'\'') ;
    SET : '\''.*?'\'..\''.*?'\'' ;
    UPPERLETTER : [A-Z] ;
    LOWERLETTER : [a-z] ;
    DIGIT : [0-9] ;
    NEWLINE : '\r\n'|'\n'|'\r' ;
    PLUS : '+' ;
    ASTERISK : '*' ;
    QUESTION : '?' ;
    LBRACE : '(' ;
    RBRACE : ')' ;
    SPACE : ' ' ;
    COLON : ':' ;
    PIPE : '|' ;
    SEMICOLON : ';' ;

So where did I make a mistake? I searched everywhere (Google, SO, etc.) for the "no viable alternative" error, but nothing I found really helped.

antlr parser-generator

Yaroslav Skudarnov May 31 '13 at 12:11

1 answer

Sam Harwell · Accepted Answer · 2013-05-31T14:36:47+0000

ANTLR lexers fully assign unambiguous token types before the parser is ever used. When several lexer rules can match the same text, the rule that appears first in the grammar is the one that gets used. For your grammar, a token cannot be of type ID and type LRULEID (or PRULEID) at the same time. Since the input foo matches both ID and PRULEID, and ID appears first in the grammar, your tokens are: ID , COLON , ID , SEMICOLON , <EOF> . The parser rules only refer to LRULEID and PRULEID, so no alternative of rule can start with an ID token, which is why the error is reported at 'foo'.
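The tie-breaking rule can be imitated with a small Python sketch. It is only a toy model, not ANTLR's implementation: the regular expressions, the `tokenize` helper, and the two rule lists are assumptions made for illustration, and the letter classes are written as `[A-Z]` and `[a-z]`. At each position the longest match wins, and when two rules match the same length, the one listed first is kept.

```python
import re

# Toy model of lexer tie-breaking; this is NOT how ANTLR is implemented.
# Rules are listed in grammar order, which matters when matches tie in length.
RULES_WITH_ID = [
    ("ID", r"[A-Za-z][A-Za-z0-9]*"),
    ("LRULEID", r"[A-Z][A-Za-z0-9]*"),
    ("PRULEID", r"[a-z][A-Za-z0-9]*"),
    ("COLON", r":"),
    ("SEMICOLON", r";"),
]

# Hypothetical variant of the same list with the ID rule dropped.
RULES_WITHOUT_ID = [rule for rule in RULES_WITH_ID if rule[0] != "ID"]


def tokenize(text, rules):
    """Return token type names: longest match wins, earliest rule wins ties."""
    compiled = [(name, re.compile(pattern)) for name, pattern in rules]
    types, pos = [], 0
    while pos < len(text):
        best_name, best_len = None, 0
        for name, regex in compiled:
            match = regex.match(text, pos)
            # Only a strictly longer match replaces the current best,
            # so on equal length the earlier rule is kept.
            if match and match.end() - pos > best_len:
                best_name, best_len = name, match.end() - pos
        if best_name is None:
            raise ValueError(f"no rule matches at offset {pos}")
        types.append(best_name)
        pos += best_len
    return types


print(tokenize("foo:bar;", RULES_WITH_ID))
print(tokenize("foo:bar;", RULES_WITHOUT_ID))
```

With ID listed first, foo and bar both come out as ID, which no parser rule accepts. With the ID entry removed from the list, the same text comes out as PRULEID, the token type that pRule expects.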

Since the ID token is never actually referenced in the parser, I suggest one of the following changes. Either option will solve the problem you described, so the choice depends entirely on how you want the final grammar to look.

Preface

You will need to change the whitespace references from SPACE+ to SPACE* , or the rule will require at least one space between bar and ; .

Option 1

Remove the ID lexer rule altogether.

Option 2

Change ID to a parser rule so that the lexer does not try to assign the ID token type to all your identifiers.
```
 id : LRULEID | PRULEID; 
```

Update the pRule1 rule to use id .

 pRule1 : ((id | LBRACE lRule1 PIPE lRule1 RBRACE) modificator? SPACE+)+ ;

Unrelated side note

You can make the grammar easier to read if you remove the outer + closure inside lRule1 and pRule1 , and instead add it to the references to those rules. Note that I have modified the SPACE references as described in the preface.

    lRule : LRULEID COLON lRule1+ ;
    lRule1 : ((LRULEID | STRING | SET) | LBRACE lRule1 PIPE lRule1 RBRACE) modificator? SPACE* ;
    pRule : PRULEID COLON pRule1+ ;
    pRule1 : ((LRULEID | PRULEID) | LBRACE lRule1 PIPE lRule1 RBRACE) modificator? SPACE* ;
