How to control error handling and synchronization

2019-01-27 02:20发布

问题:

I'm using Antlr 4 with c# target. Here is a subset of my grammar:

/*
 * Parser Rules
 */
text : term+  EOF;
term : a1 a2 a3;
a1: ....
...
...

I want to accept valid data blocks as (term)s, when error exists I want to search for the next valid term and print out the whole text which caused the error for user to analyze manually.

How to synchronize input to the next valid term? and How to get the ignored text?

回答1:

You will need to create your own implementation of IAntlrErrorStrategy for this, and then set the Parser.ErrorHandler property to an instance of your error strategy. The documentation for the Java versions of the ANTLRErrorStrategy interface and default implementation DefaultErrorStrategy may provide useful information for implementing an error strategy, but I must warn you going in that creating a custom error strategy is an advanced feature with limited documentation. It's expected that the implementer is already an expert in ANTLR 4's implementation of the Adaptive LL(*) parsing algorithm (we're talking researcher-level understanding).



回答2:

For the first question (How to synchronize input to the next valid term?) I found some useful information that led me to acceptable solution.

Antlr generates next subcode for previous grammar:

public TextContext text() {
    TextContext _localctx = new TextContext(_ctx, State);
    EnterRule(_localctx, 0, RULE_text);
    int _la;
    try {
        EnterOuterAlt(_localctx, 1);
        State = 49;
        _errHandler.Sync(this);
        _la = _input.La(1);
        do {
            State = 48; term();
            State = 51;
            _errHandler.Sync(this);
            _la = _input.La(1);
        } while ( _la==KEYWORD );
        State = 53; Match(EOF);
    }
    catch (RecognitionException re) {
        _localctx.exception = re;
        _errHandler.ReportError(this, re);
        _errHandler.Recover(this, re);
    }
    finally {
        ExitRule();
    }
    return _localctx;
}

The call '_errHandler.Sync(this);' makes the parser advances through the input stream in an attempt to find next valid turn (as a result of "term+" component). To stop parser from sync in other subrules accept "term" rule", I Extended DefaultErrorStrategy Class as next:

public class MyErrorStrategy : Antlr4.Runtime.DefaultErrorStrategy
{
    public EventErrorStrategy() : base()
    { }

    public override void Sync(Antlr4.Runtime.Parser recognizer)
    {
        if(recognizer.Context is Dict.TextAnalyzer.DictionaryParser.TextContext)
            base.Sync(recognizer);
    }
}

then provided it to the parser:

parser.ErrorHandler = new MyErrorStrategy();