]> git.saurik.com Git - bison.git/blame_incremental - TODO
Update.
[bison.git] / TODO
... / ...
CommitLineData
1-*- outline -*-
2
3* URGENT: Documenting C++ output
4Write a first documentation for C++ output.
5
6* yyerror, yyprint interface
7It should be improved, in particular when using Bison features such as
8locations, and YYPARSE_PARAMS. For the time being, it is recommended
9to #define yyerror and yyprint to steal internal variables...
10
11* documentation
12Explain $axiom (and maybe change its name: BTYacc names it `goal',
13byacc `$accept', probably based on AT&T Yacc). Complete the glossary
14(item, axiom, ?).
15
16* report documentation
17Extend with error. The hard part will probably be finding the right
18rule so that a single state does not exhibit to many yet undocumented
19``features''. Maybe an empty action ought to be presented too. Shall
20we try to make a single grammar with all these features, or should we
21have several very small grammars?
22
23* documentation
24Some history of Bison and some bibliography would be most welcome.
25Are there any Texinfo standards for bibliography?
26
27* Several %unions
28I think this is a pleasant (but useless currently) feature, but in the
29future, I want a means to %include other bits of grammars, and _then_
30it will be important for the various bits to define their needs in
31%union.
32
33When implementing multiple-%union support, bare the following in mind:
34
35- when --yacc, this must be flagged as an error. Don't make it fatal
36 though.
37
38- The #line must now appear *inside* the definition of yystype.
39 Something like
40
41 {
42 #line 12 "foo.y"
43 int ival;
44 #line 23 "foo.y"
45 char *sval;
46 }
47
48* --report=conflict-path
49Provide better assistance for understanding the conflicts by providing
50a sample text exhibiting the (LALR) ambiguity.
51
52* Coding system independence
53Paul notes:
54
55 Currently Bison assumes 8-bit bytes (i.e. that UCHAR_MAX is
56 255). It also assumes that the 8-bit character encoding is
57 the same for the invocation of 'bison' as it is for the
58 invocation of 'cc', but this is not necessarily true when
59 people run bison on an ASCII host and then use cc on an EBCDIC
60 host. I don't think these topics are worth our time
61 addressing (unless we find a gung-ho volunteer for EBCDIC or
62 PDP-10 ports :-) but they should probably be documented
63 somewhere.
64
65* Output directory
66Akim:
67
68| I consider this to be a bug in bison:
69|
70| /tmp % mkdir src
71| /tmp % cp ~/src/bison/tests/calc.y src
72| /tmp % mkdir build && cd build
73| /tmp/build % bison ../src/calc.y
74| /tmp/build % cd ..
75| /tmp % ls -l build src
76| build:
77| total 0
78|
79| src:
80| total 32
81| -rw-r--r-- 1 akim lrde 27553 oct 2 16:31 calc.tab.c
82| -rw-r--r-- 1 akim lrde 3335 oct 2 16:31 calc.y
83|
84|
85| Would it be safe to change this behavior to something more reasonable?
86| Do you think some people depend upon this?
87
88Jim:
89
90Is it that behavior documented?
91If so, then it's probably not reasonable to change it.
92I've Cc'd the automake list, because some of automake's
93rules use bison through $(YACC) -- though I'll bet they
94all use it in yacc-compatible mode.
95
96Pavel:
97
98Hello, Jim and others!
99
100> Is it that behavior documented?
101> If so, then it's probably not reasonable to change it.
102> I've Cc'd the automake list, because some of automake's
103> rules use bison through $(YACC) -- though I'll bet they
104> all use it in yacc-compatible mode.
105
106Yes, Automake currently used bison in Automake-compatible mode, but it
107would be fair for Automake to switch to the native mode as long as the
108processed files are distributed and "missing" emulates bison.
109
110In any case, the makefiles should specify the output file explicitly
111instead of relying on weird defaults.
112
113> | src:
114> | total 32
115> | -rw-r--r-- 1 akim lrde 27553 oct 2 16:31 calc.tab.c
116> | -rw-r--r-- 1 akim lrde 3335 oct 2 16:31 calc.y
117
118This is not _that_ ugly as it seems - with Automake you want to put
119sources where they belong - to the source directory.
120
121> | This is not _that_ ugly as it seems - with Automake you want to put
122> | sources where they belong - to the source directory.
123>
124> The difference source/build you are referring to is based on Automake
125> concepts. They have no sense at all for tools such as bison or gcc
126> etc. They have input and output. I do not want them to try to grasp
127> source/build. I want them to behave uniformly: output *here*.
128
129I realize that.
130
131It's unfortunate that the native mode of Bison behaves in a less uniform
132way than the yacc mode. I agree with your point. Bison maintainters may
133want to fix it along with the documentation.
134
135
136* Unit rules
137Maybe we could expand unit rules, i.e., transform
138
139 exp: arith | bool;
140 arith: exp '+' exp;
141 bool: exp '&' exp;
142
143into
144
145 exp: exp '+' exp | exp '&' exp;
146
147when there are no actions. This can significantly speed up some
148grammars.
149
150* Stupid error messages
151An example shows it easily:
152
153src/bison/tests % ./testsuite -k calc,location,error-verbose -l
154GNU Bison 1.49a test suite test groups:
155
156 NUM: FILENAME:LINE TEST-GROUP-NAME
157 KEYWORDS
158
159 51: calc.at:440 Calculator --locations --yyerror-verbose
160 52: calc.at:442 Calculator --defines --locations --name-prefix=calc --verbose --yacc --yyerror-verbose
161 54: calc.at:445 Calculator --debug --defines --locations --name-prefix=calc --verbose --yacc --yyerror-verbose
162src/bison/tests % ./testsuite 51 -d
163## --------------------------- ##
164## GNU Bison 1.49a test suite. ##
165## --------------------------- ##
166 51: calc.at:440 ok
167## ---------------------------- ##
168## All 1 tests were successful. ##
169## ---------------------------- ##
170src/bison/tests % cd ./testsuite.dir/51
171tests/testsuite.dir/51 % echo "()" | ./calc
1721.2-1.3: parse error, unexpected ')', expecting error or "number" or '-' or '('
173
174* read_pipe.c
175This is not portable to DOS for instance. Implement a more portable
176scheme. Sources of inspiration include GNU diff, and Free Recode.
177
178* Memory leaks in the generator
179A round of memory leak clean ups would be most welcome. Dmalloc,
180Checker GCC, Electric Fence, or Valgrind: you chose your tool.
181
182* Memory leaks in the parser
183The same applies to the generated parsers. In particular, this is
184critical for user data: when aborting a parsing, when handling the
185error token etc., we often throw away yylval without giving a chance
186of cleaning it up to the user.
187
188* --graph
189Show reductions. []
190
191* Broken options ?
192** %no-lines [ok]
193** %no-parser []
194** %pure-parser []
195** %token-table []
196** Options which could use parse_dquoted_param ().
197Maybe transfered in lex.c.
198*** %skeleton [ok]
199*** %output []
200*** %file-prefix []
201*** %name-prefix []
202
203** Skeleton strategy. []
204Must we keep %no-parser?
205 %token-table?
206*** New skeletons. []
207
208* src/print_graph.c
209Find the best graph parameters. []
210
211* doc/bison.texinfo
212** Update
213informations about ERROR_VERBOSE. []
214** Add explainations about
215skeleton muscles. []
216%skeleton. []
217
218* testsuite
219** tests/pure-parser.at []
220New tests.
221
222* Debugging parsers
223
224From Greg McGary:
225
226akim demaille <akim.demaille@epita.fr> writes:
227
228> With great pleasure! Nonetheless, things which are debatable
229> (or not, but just `big') should be discuss in `public': something
230> like help- or bug-bison@gnu.org is just fine. Jesse and I are there,
231> but there is also Jim and some other people.
232
233I have no idea whether it qualifies as big or controversial, so I'll
234just summarize for you. I proposed this change years ago and was
235surprised that it was met with utter indifference!
236
237This debug feature is for the programs/grammars one develops with
238bison, not for debugging bison itself. I find that the YYDEBUG
239output comes in a very inconvenient format for my purposes.
240When debugging gcc, for instance, what I want is to see a trace of
241the sequence of reductions and the line#s for the semantic actions
242so I can follow what's happening. Single-step in gdb doesn't cut it
243because to move from one semantic action to the next takes you through
244lots of internal machinery of the parser, which is uninteresting.
245
246The change I made was to the format of the debug output, so that it
247comes out in the format of C error messages, digestible by emacs
248compile mode, like so:
249
250grammar.y:1234: foo: bar(0x123456) baz(0x345678)
251
252where "foo: bar baz" is the reduction rule, whose semantic action
253appears on line 1234 of the bison grammar file grammar.y. The hex
254numbers on the rhs tokens are the parse-stack values associated with
255those tokens. Of course, yytype might be something totally
256incompatible with that representation, but for the most part, yytype
257values are single words (scalars or pointers). In the case of gcc,
258they're most often pointers to tree nodes. Come to think of it, the
259right thing to do is to make the printing of stack values be
260user-definable. It would also be useful to include the filename &
261line# of the file being parsed, but the main filename & line# should
262continue to be that of grammar.y
263
264Anyway, this feature has saved my life on numerous occasions. The way
265I customarily use it is to first run bison with the traces on, isolate
266the sequence of reductions that interests me, put those traces in a
267buffer and force it into compile-mode, then visit each of those lines
268in the grammar and set breakpoints with C-x SPACE. Then, I can run
269again under the control of gdb and stop at each semantic action.
270With the hex addresses of tree nodes, I can inspect the values
271associated with any rhs token.
272
273You like?
274
275* input synclines
276Some users create their foo.y files, and equip them with #line. Bison
277should recognize these, and preserve them.
278
279* BTYacc
280See if we can integrate backtracking in Bison. Contact the BTYacc
281maintainers.
282
283* Automaton report
284Display more clearly the lookaheads for each item.
285
286* RR conflicts
287See if we can use precedence between rules to solve RR conflicts. See
288what POSIX says.
289
290* Precedence
291It is unfortunate that there is a total order for precedence. It
292makes it impossible to have modular precedence information. We should
293move to partial orders.
294
295This will be possible with a Bison parser for the grammar, as it will
296make it much easier to extend the grammar.
297
298* Parsing grammars
299Rewrite the reader in Flex/Bison. There will be delicate parts, in
300particular, expect the scanner to be hard to write. Many interesting
301features cannot be implemented without such a new reader.
302
303* $undefined
304From Hans:
305- If the Bison generated parser experiences an undefined number in the
306character range, that character is written out in diagnostic messages, an
307addition to the $undefined value.
308
309Suggest: Change the name $undefined to undefined; looks better in outputs.
310
311* Default Action
312From Hans:
313- For use with my C++ parser, I transported the "switch (yyn)" statement
314that Bison writes to the bison.simple skeleton file. This way, I can remove
315the current default rule $$ = $1 implementation, which causes a double
316assignment to $$ which may not be OK under C++, replacing it with a
317"default:" part within the switch statement.
318
319Note that the default rule $$ = $1, when typed, is perfectly OK under C,
320but in the C++ implementation I made, this rule is different from
321$<type_name>$ = $<type_name>1. I therefore think that one should implement
322a Bison option where every typed default rule is explicitly written out
323(same typed ruled can of course be grouped together).
324
325Note: Robert Anisko handles this. He knows how to do it.
326
327* Warnings
328It would be nice to have warning support. See how Autoconf handles
329them, it is fairly well described there. It would be very nice to
330implement this in such a way that other programs could use
331lib/warnings.[ch].
332
333Don't work on this without first announcing you do, as I already have
334thought about it, and know many of the components that can be used to
335implement it.
336
337* Pre and post actions.
338From: Florian Krohm <florian@edamail.fishkill.ibm.com>
339Subject: YYACT_EPILOGUE
340To: bug-bison@gnu.org
341X-Sent: 1 week, 4 days, 14 hours, 38 minutes, 11 seconds ago
342
343The other day I had the need for explicitly building the parse tree. I
344used %locations for that and defined YYLLOC_DEFAULT to call a function
345that returns the tree node for the production. Easy. But I also needed
346to assign the S-attribute to the tree node. That cannot be done in
347YYLLOC_DEFAULT, because it is invoked before the action is executed.
348The way I solved this was to define a macro YYACT_EPILOGUE that would
349be invoked after the action. For reasons of symmetry I also added
350YYACT_PROLOGUE. Although I had no use for that I can envision how it
351might come in handy for debugging purposes.
352All is needed is to add
353
354#if YYLSP_NEEDED
355 YYACT_EPILOGUE (yyval, (yyvsp - yylen), yylen, yyloc, (yylsp - yylen));
356#else
357 YYACT_EPILOGUE (yyval, (yyvsp - yylen), yylen);
358#endif
359
360at the proper place to bison.simple. Ditto for YYACT_PROLOGUE.
361
362I was wondering what you think about adding YYACT_PROLOGUE/EPILOGUE
363to bison. If you're interested, I'll work on a patch.
364
365-----
366
367Copyright (C) 2001, 2002 Free Software Foundation, Inc.
368
369This file is part of GNU Autoconf.
370
371GNU Autoconf is free software; you can redistribute it and/or modify
372it under the terms of the GNU General Public License as published by
373the Free Software Foundation; either version 2, or (at your option)
374any later version.
375
376GNU Autoconf is distributed in the hope that it will be useful,
377but WITHOUT ANY WARRANTY; without even the implied warranty of
378MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
379GNU General Public License for more details.
380
381You should have received a copy of the GNU General Public License
382along with autoconf; see the file COPYING. If not, write to
383the Free Software Foundation, Inc., 59 Temple Place - Suite 330,
384Boston, MA 02111-1307, USA.