]> git.saurik.com Git - bison.git/blame - TODO
* tests/torture.at (Big triangle): New.
[bison.git] / TODO
CommitLineData
416bd7a9
MA
1-*- outline -*-
2
fa770c86
AD
3* Unit rules
4Maybe we could expand unit rules, i.e., transform
5
6 exp: arith | bool;
7 arith: exp '+' exp;
8 bool: exp '&' exp;
9
10into
11
12 exp: exp '+' exp | exp '&' exp;
13
14when there are no actions. This can significantly speed up some
15grammars.
16
817e9f41
AD
17* Huge Grammars
18Currently, not only is Bison unable to handle huge grammars because of
19internal limitations, but the test `big triangle' also demonstrates
20that it can produce SEGVing executables! Push the limit beyond 124,
21and have a core dump. Be my guest: fix this!
fa770c86
AD
22
23* read_pipe.c
24This is not portable to DOS for instance. Implement a more portable
25scheme. Sources of inspiration include GNU diff, and Free Recode.
26
52d1aeee
MA
27* NEWS
28Sort from 1.31 NEWS.
29
bcb05e75
MA
30* Prologue
31The %union is declared after the user C declarations. It can be
704a47c4 32a problem if YYSTYPE is declared after the user part. []
bcb05e75 33
704a47c4
AD
34Actually, the real problem seems that the %union ought to be output
35where it was defined. For instance, in gettext/intl/plural.y, we
36have:
37
38 %{
39 ...
40 #include "gettextP.h"
41 ...
42 %}
43
44 %union {
45 unsigned long int num;
46 enum operator op;
47 struct expression *exp;
48 }
49
50 %{
51 ...
52 static int yylex PARAMS ((YYSTYPE *lval, const char **pexp));
53 ...
54 %}
55
56Where the first part defines struct expression, the second uses it to
57define YYSTYPE, and the last uses YYSTYPE. Only this order is valid.
bcb05e75
MA
58
59* --graph
60Show reductions. []
61
704a47c4 62* Broken options ?
c3995d99 63** %no-lines [ok]
04a76783 64** %no-parser []
fbbf9b3b 65** %pure-parser []
04a76783
MA
66** %semantic-parser []
67** %token-table []
68** Options which could use parse_dquoted_param ().
69Maybe transfered in lex.c.
70*** %skeleton [ok]
71*** %output []
72*** %file-prefix []
73*** %name-prefix []
ec93a213 74
fbbf9b3b 75** Skeleton strategy. []
c3a8cbaa
MA
76Must we keep %no-parser?
77 %token-table?
fbbf9b3b 78*** New skeletons. []
416bd7a9 79
c111e171 80* src/print_graph.c
31b53af2 81Find the best graph parameters. []
63c2d5de
MA
82
83* doc/bison.texinfo
1a4648ff 84** Update
c3a8cbaa 85informations about ERROR_VERBOSE. []
1a4648ff 86** Add explainations about
c3a8cbaa
MA
87skeleton muscles. []
88%skeleton. []
eeeb962b 89
704a47c4 90* testsuite
c3a8cbaa
MA
91** tests/pure-parser.at []
92New tests.
0f8d586a
AD
93
94* Debugging parsers
95
96From Greg McGary:
97
98akim demaille <akim.demaille@epita.fr> writes:
99
100> With great pleasure! Nonetheless, things which are debatable
101> (or not, but just `big') should be discuss in `public': something
102> like help- or bug-bison@gnu.org is just fine. Jesse and I are there,
103> but there is also Jim and some other people.
104
105I have no idea whether it qualifies as big or controversial, so I'll
106just summarize for you. I proposed this change years ago and was
107surprised that it was met with utter indifference!
108
109This debug feature is for the programs/grammars one develops with
110bison, not for debugging bison itself. I find that the YYDEBUG
111output comes in a very inconvenient format for my purposes.
112When debugging gcc, for instance, what I want is to see a trace of
113the sequence of reductions and the line#s for the semantic actions
114so I can follow what's happening. Single-step in gdb doesn't cut it
115because to move from one semantic action to the next takes you through
116lots of internal machinery of the parser, which is uninteresting.
117
118The change I made was to the format of the debug output, so that it
119comes out in the format of C error messages, digestible by emacs
120compile mode, like so:
121
122grammar.y:1234: foo: bar(0x123456) baz(0x345678)
123
124where "foo: bar baz" is the reduction rule, whose semantic action
125appears on line 1234 of the bison grammar file grammar.y. The hex
126numbers on the rhs tokens are the parse-stack values associated with
127those tokens. Of course, yytype might be something totally
128incompatible with that representation, but for the most part, yytype
129values are single words (scalars or pointers). In the case of gcc,
130they're most often pointers to tree nodes. Come to think of it, the
131right thing to do is to make the printing of stack values be
132user-definable. It would also be useful to include the filename &
133line# of the file being parsed, but the main filename & line# should
134continue to be that of grammar.y
135
136Anyway, this feature has saved my life on numerous occasions. The way
137I customarily use it is to first run bison with the traces on, isolate
138the sequence of reductions that interests me, put those traces in a
139buffer and force it into compile-mode, then visit each of those lines
140in the grammar and set breakpoints with C-x SPACE. Then, I can run
141again under the control of gdb and stop at each semantic action.
142With the hex addresses of tree nodes, I can inspect the values
143associated with any rhs token.
144
145You like?
cd6a695e
AD
146
147* input synclines
148Some users create their foo.y files, and equip them with #line. Bison
149should recognize these, and preserve them.