Re: Two-pass parser or AST with Bison?

help-bison

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Two-pass parser or AST with Bison?

From:	Evan Lavelle
Subject:	Re: Two-pass parser or AST with Bison?
Date:	Tue, 06 Jan 2009 09:57:34 +0000
User-agent:	Thunderbird 2.0.0.19 (Windows/20081209)

I had this problem some years ago, when I was first using Bison. I had asimple language, which I could handle in a single parser pass; this waseasy, and I could do everything in the Bison actions.

I then added some functionality, which was primarily forward-referencingof externals (mainly function calls, like you). I fixed this quickly byadding a second Bison pass; the parser re-scanned the original sourcecode (I didn't bother saving and re-scanning flex tokens, as you'resuggesting).

(2) (for two-pass:)
    Don't want to surround every single rule with an if like this:
    E = E '+' E {
        if(global.pass==1) {
            // do nothing in first pass
        } else {
            code_append(OP_ADD);
        }
    }

I had a lot of this, but I mainly put it in the C++ code called from theactions, where it was a trivial addition - if you're in the wrong pass,you just return. This keeps the grammar clean. Alternatively, you mightalso consider writing two grammars - in the first pass, you use asimplified grammar which ignores the contents of functions, and whichjust extracts function names and parameters.

If you're just forward-referencing function names, then two Bison parsesis *much*, *much* easier than creating and processing an AST. In mycase, the language got more complicated over time, and I needed a thirdpass. It was only then that I started using an AST - the parser passcreated the AST, and subsequent passes manipulated the AST (there wereeventually 6 passes).

Using an AST for the first time is hard work and there's a steeplearning curve (I used the C++ AST library from ANTLR to make thiseasier). You should also remember that writing an AST-based compiler isa completely different programming paradigm from a simple parsingexercise. The former is all about data and manipulating data structures;the latter is just actions and functions.

(1) (for the AST:)
Don't want to spell out every single AST node class explicitly, butrather, if possible, derive these from the grammar?

I don't think that would be desirable (even if it's possible). As aprogrammer, you'll be constantly modifying, adding, or removing nodeclasses to make life easier. For example, some passes might want a newnode class because they've simplified part of the tree and want toreplace it with an equivalent node, where that node has no equivalent inthe source code. Eventually, your AST will transform to something whichlooks nothing like your source code, so why should it uses node typeswhich are defined by the source?


Good luck -

Evan

[Prev in Thread]

Current Thread

[Next in Thread]

Two-pass parser or AST with Bison?, Matthias Kramm, 2009/01/06
- Re: Two-pass parser or AST with Bison?, Evan Lavelle <=
  - Re: Two-pass parser or AST with Bison?, Matthias Kramm, 2009/01/06
    - Re: Two-pass parser or AST with Bison?, Evan Lavelle, 2009/01/06
    - Re: Two-pass parser or AST with Bison?, Matthias Kramm, 2009/01/30
    - Re: Two-pass parser or AST with Bison?, Hans Aberg, 2009/01/30
- Re: Two-pass parser or AST with Bison?, Luca, 2009/01/06

Prev by Date: Two-pass parser or AST with Bison?
Next by Date: Re: Two-pass parser or AST with Bison?
Previous by thread: Two-pass parser or AST with Bison?
Next by thread: Re: Two-pass parser or AST with Bison?
Index(es):
- Date
- Thread