]> git.saurik.com Git - bison.git/blame - TODO
* src/gram.h, src/gram.c (nitems): Remove, it is an alias of
[bison.git] / TODO
CommitLineData
416bd7a9
MA
1-*- outline -*-
2
fa770c86
AD
3* Unit rules
4Maybe we could expand unit rules, i.e., transform
5
6 exp: arith | bool;
7 arith: exp '+' exp;
8 bool: exp '&' exp;
9
10into
11
12 exp: exp '+' exp | exp '&' exp;
13
14when there are no actions. This can significantly speed up some
15grammars.
16
17* Useless rules
18We have all the needed material to actually remove them. Do it.
19Or maybe not, but at least do not include them in the automaton.
20
21* read_pipe.c
22This is not portable to DOS for instance. Implement a more portable
23scheme. Sources of inspiration include GNU diff, and Free Recode.
24
52d1aeee
MA
25* NEWS
26Sort from 1.31 NEWS.
27
bcb05e75
MA
28* Prologue
29The %union is declared after the user C declarations. It can be
704a47c4 30a problem if YYSTYPE is declared after the user part. []
bcb05e75 31
704a47c4
AD
32Actually, the real problem seems that the %union ought to be output
33where it was defined. For instance, in gettext/intl/plural.y, we
34have:
35
36 %{
37 ...
38 #include "gettextP.h"
39 ...
40 %}
41
42 %union {
43 unsigned long int num;
44 enum operator op;
45 struct expression *exp;
46 }
47
48 %{
49 ...
50 static int yylex PARAMS ((YYSTYPE *lval, const char **pexp));
51 ...
52 %}
53
54Where the first part defines struct expression, the second uses it to
55define YYSTYPE, and the last uses YYSTYPE. Only this order is valid.
bcb05e75
MA
56
57* --graph
58Show reductions. []
59
704a47c4 60* Broken options ?
c3995d99 61** %no-lines [ok]
04a76783 62** %no-parser []
fbbf9b3b 63** %pure-parser []
04a76783
MA
64** %semantic-parser []
65** %token-table []
66** Options which could use parse_dquoted_param ().
67Maybe transfered in lex.c.
68*** %skeleton [ok]
69*** %output []
70*** %file-prefix []
71*** %name-prefix []
ec93a213 72
fbbf9b3b 73** Skeleton strategy. []
c3a8cbaa
MA
74Must we keep %no-parser?
75 %token-table?
fbbf9b3b 76*** New skeletons. []
416bd7a9 77
c111e171 78* src/print_graph.c
31b53af2 79Find the best graph parameters. []
63c2d5de
MA
80
81* doc/bison.texinfo
1a4648ff 82** Update
c3a8cbaa 83informations about ERROR_VERBOSE. []
1a4648ff 84** Add explainations about
c3a8cbaa
MA
85skeleton muscles. []
86%skeleton. []
eeeb962b 87
704a47c4 88* testsuite
c3a8cbaa
MA
89** tests/pure-parser.at []
90New tests.
0f8d586a
AD
91
92* Debugging parsers
93
94From Greg McGary:
95
96akim demaille <akim.demaille@epita.fr> writes:
97
98> With great pleasure! Nonetheless, things which are debatable
99> (or not, but just `big') should be discuss in `public': something
100> like help- or bug-bison@gnu.org is just fine. Jesse and I are there,
101> but there is also Jim and some other people.
102
103I have no idea whether it qualifies as big or controversial, so I'll
104just summarize for you. I proposed this change years ago and was
105surprised that it was met with utter indifference!
106
107This debug feature is for the programs/grammars one develops with
108bison, not for debugging bison itself. I find that the YYDEBUG
109output comes in a very inconvenient format for my purposes.
110When debugging gcc, for instance, what I want is to see a trace of
111the sequence of reductions and the line#s for the semantic actions
112so I can follow what's happening. Single-step in gdb doesn't cut it
113because to move from one semantic action to the next takes you through
114lots of internal machinery of the parser, which is uninteresting.
115
116The change I made was to the format of the debug output, so that it
117comes out in the format of C error messages, digestible by emacs
118compile mode, like so:
119
120grammar.y:1234: foo: bar(0x123456) baz(0x345678)
121
122where "foo: bar baz" is the reduction rule, whose semantic action
123appears on line 1234 of the bison grammar file grammar.y. The hex
124numbers on the rhs tokens are the parse-stack values associated with
125those tokens. Of course, yytype might be something totally
126incompatible with that representation, but for the most part, yytype
127values are single words (scalars or pointers). In the case of gcc,
128they're most often pointers to tree nodes. Come to think of it, the
129right thing to do is to make the printing of stack values be
130user-definable. It would also be useful to include the filename &
131line# of the file being parsed, but the main filename & line# should
132continue to be that of grammar.y
133
134Anyway, this feature has saved my life on numerous occasions. The way
135I customarily use it is to first run bison with the traces on, isolate
136the sequence of reductions that interests me, put those traces in a
137buffer and force it into compile-mode, then visit each of those lines
138in the grammar and set breakpoints with C-x SPACE. Then, I can run
139again under the control of gdb and stop at each semantic action.
140With the hex addresses of tree nodes, I can inspect the values
141associated with any rhs token.
142
143You like?
cd6a695e
AD
144
145* input synclines
146Some users create their foo.y files, and equip them with #line. Bison
147should recognize these, and preserve them.