Bison - how to print a parse tree

2020-06-06 05:33发布

问题:

Hi I'm working on a small bison to learn how it works. The bison is supposed to parse a sentence. The sentence is made of expressions and expressions are made of words.

Following is my code:

%{
#include <stdio.h>
#include <string.h>


void yyerror(const char *str)
{
    fprintf(stderr,"error: %s\n",str);
}

int yywrap()
{
    return 1;
}

main()
{
    yyparse();
}

%}

%token ASSIGN RANGE OR AND WHITESPACE QUOTE LPAREN RPAREN NOT GREATER LESS

%union 
{
        int number;
        char *string;
}

%token <number> VALUE
%token <string> WORD

%type <string> term
%type <string> expression
%%

query:   /* empty */
    | query expression 
    {
        printf("WOrd:%s",$2);
    }
    ;

expression:
     term
    |expression term
    |expression AND term
        {
            printf("AND");
        }
    ;

term:
    WORD
    {
        $$=$1;
    }
    ;

So, when user enters a word, it supposed to print out the word. User should be able to type in: word,word word,word and word

I'm not sure how to pass a word using $$ and print it out from the "query expression" rule. How do I do this?

this is my flex:

%{
#include <stdio.h>
#include <string.h>
#include "y.tab.h"
%}
%%
[0-9]+                  yylval.number=atoi(yytext);return VALUE;
[a-zA-Z][a-zA-Z]*       yylval.string=strdup(yytext);return WORD;
":"                     return ASSIGN;
"and"|"&"|"&&"          return AND; 
".."                    return RANGE;
"-"                     return NOT;
"|"                     return OR;
"\""                    return QUOTE;
">"                     return GREATER;
"<"                     return LESS;
\n                      /* ignore end of line */;
\t                      /* ignore end of line */;

%%

THanks so much in advance. Sarah

回答1:

Usually, the point of writing a parser is so that you end up with a data structure that represents the input. You then transform the structure in some way, or, in your case, just print it out.

At each expression production, you want to construct a node in that structure that represents what you have recognized so far.

I'm a little rusty, but it would be something like this:

query:   /* empty */
     | query expression { printNode($2); /* printf()s are in here */ }
;

expression: term { $$ = makeTermNode($1); }
          | expression OR term { $$ = makeOrNode($1, $3); }
          | expression AND term  { $$ = makeAndNode($1, $3); }
;

The data structure to hold your nodes:

struct Node {
    int nodeType;          /* WORD or operator token like AND, OR */
    node* leftOperand;
    node* rightOperand;    /* will be null if the node is a term */
}

%union 
{
    int number;
    char *string;
    Node *node;
}

Update:

It's been a while since I coded in C, so I will have to resort to pseudocode. There is no code here to reclaim memory once we're done with it. Apologies for any other blunders.

struct Node *makeTermNode(int word) {
    Node *node = malloc(sizeof struct Node);
    node->nodeType = word;
    node->rightOperand = null;
    node->leftOperand = null;
    return node;
}

Notice that your WORD token just denotes that a string of letters of some sort was scanned; the specific sequence of letters is discarded. (If you want to know the sequence, have your lexer return a copy of yytext instead of the WORD token.)

struct Node *makeAndNode(struct Node* leftOperand, struct Node *rightOperand) {
    Node *node = malloc(sizeof struct Node);
    node->nodeType = AND;
    node->leftOperand = leftOperand;
    node->rightOperand = rightOperand;
    return node;
}

And likewise for makeOrNode(). Alternatively, you could write just makeNodeWithOperator(int operator, struct Node* leftOperand, struct Node *rightOperand) to handle the "and" and "or" cases.

I changed printAllNodes() to printNode(). It starts at the root of the expression tree structure we have built, recursively visiting the left side of each subexpression first, then the right. It goes something like this:

void printNode (struct Node* node) {
    switch (node->nodeType) {
    case WORD:
        printf("%i", node->nodeType);
        return;
    case AND:
    case OR:
        printf("(");
        printNode(node->leftOperand);
        printf("%i", node->nodeType);
        printfNode(node->rightOperand);
        printf(")");
        return;
    }
}


标签: bison