]> git.saurik.com Git - bison.git/blame - TODO
(yylex): Do not pass signed char to isupper; it's not portable.
[bison.git] / TODO
CommitLineData
416bd7a9
MA
1-*- outline -*-
2
efea6231
AD
3* URGENT: Documenting C++ output
4Write a first documentation for C++ output.
5
bc933ef1 6
88bce5a2
AD
7* Documentation
8Before releasing, make sure the documentation refers to the current
9`output' format.
10
11
2ab9a04f
AD
12* Error messages
13Some are really funky. For instance
14
15 type clash (`%s' `%s') on default action
16
17is really weird. Revisit them all.
18
2ab9a04f
AD
19
20* read_pipe.c
21This is not portable to DOS for instance. Implement a more portable
22scheme. Sources of inspiration include GNU diff, and Free Recode.
23
bc933ef1 24
3ae2b51f
AD
25* value_components_used
26Was defined but not used: where was it coming from? It can't be to
27check if %union is used, since the user is free to $<foo>n on her
28union, doesn't she?
29
efea6231 30
2ab9a04f 31* Report
ec3bc396 32
2ab9a04f
AD
33** GLR
34How would Paul like to display the conflicted actions? In particular,
35what when two reductions are possible on a given lookahead, but one is
36part of $default. Should we make the two reductions explicit, or just
37keep $default? See the following point.
d7215705 38
2ab9a04f
AD
39** Disabled Reductions
40See `tests/conflicts.at (Defaulted Conflicted Reduction)', and decide
41what we want to do.
d7215705 42
2ab9a04f 43** Documentation
bc933ef1
AD
44Extend with error productions. The hard part will probably be finding
45the right rule so that a single state does not exhibit too many yet
46undocumented ``features''. Maybe an empty action ought to be
47presented too. Shall we try to make a single grammar with all these
48features, or should we have several very small grammars?
ec3bc396 49
2ab9a04f
AD
50** --report=conflict-path
51Provide better assistance for understanding the conflicts by providing
52a sample text exhibiting the (LALR) ambiguity. See the paper from
53DeRemer and Penello: they already provide the algorithm.
54
ec3bc396 55
2ab9a04f
AD
56* Extentions
57
58** yyerror, yysymprint interface
59It should be improved, in particular when using Bison features such as
60locations, and YYPARSE_PARAMS. For the time being, it is recommended
61to #define yyerror and yyprint to steal internal variables...
62
63** Several %unions
6cbfbcc5
AD
64I think this is a pleasant (but useless currently) feature, but in the
65future, I want a means to %include other bits of grammars, and _then_
66it will be important for the various bits to define their needs in
67%union.
76551463 68
5c0a0514
AD
69When implementing multiple-%union support, bare the following in mind:
70
71- when --yacc, this must be flagged as an error. Don't make it fatal
72 though.
73
74- The #line must now appear *inside* the definition of yystype.
75 Something like
76
77 {
78 #line 12 "foo.y"
79 int ival;
80 #line 23 "foo.y"
81 char *sval;
82 }
83
fa770c86
AD
84* Unit rules
85Maybe we could expand unit rules, i.e., transform
86
87 exp: arith | bool;
88 arith: exp '+' exp;
89 bool: exp '&' exp;
90
91into
92
93 exp: exp '+' exp | exp '&' exp;
94
95when there are no actions. This can significantly speed up some
d7215705
AD
96grammars. I can't find the papers. In particular the book `LR
97parsing: Theory and Practice' is impossible to find, but according to
98`Parsing Techniques: a Practical Guide', it includes information about
99this issue. Does anybody have it?
fa770c86 100
51dec47b 101
51dec47b 102
2ab9a04f 103* Documentation
51dec47b 104
2ab9a04f
AD
105** History/Bibliography
106Some history of Bison and some bibliography would be most welcome.
107Are there any Texinfo standards for bibliography?
108
109
110
111
112* Coding system independence
113Paul notes:
114
115 Currently Bison assumes 8-bit bytes (i.e. that UCHAR_MAX is
116 255). It also assumes that the 8-bit character encoding is
117 the same for the invocation of 'bison' as it is for the
118 invocation of 'cc', but this is not necessarily true when
119 people run bison on an ASCII host and then use cc on an EBCDIC
120 host. I don't think these topics are worth our time
121 addressing (unless we find a gung-ho volunteer for EBCDIC or
122 PDP-10 ports :-) but they should probably be documented
123 somewhere.
fa770c86 124
fa770c86 125
aef1ffd5 126
bcb05e75
MA
127* --graph
128Show reductions. []
129
704a47c4 130* Broken options ?
c3995d99 131** %no-lines [ok]
04a76783 132** %no-parser []
fbbf9b3b 133** %pure-parser []
04a76783
MA
134** %token-table []
135** Options which could use parse_dquoted_param ().
136Maybe transfered in lex.c.
137*** %skeleton [ok]
138*** %output []
139*** %file-prefix []
140*** %name-prefix []
ec93a213 141
fbbf9b3b 142** Skeleton strategy. []
c3a8cbaa
MA
143Must we keep %no-parser?
144 %token-table?
fbbf9b3b 145*** New skeletons. []
416bd7a9 146
c111e171 147* src/print_graph.c
31b53af2 148Find the best graph parameters. []
63c2d5de
MA
149
150* doc/bison.texinfo
1a4648ff 151** Update
c3a8cbaa 152informations about ERROR_VERBOSE. []
1a4648ff 153** Add explainations about
c3a8cbaa
MA
154skeleton muscles. []
155%skeleton. []
eeeb962b 156
704a47c4 157* testsuite
c3a8cbaa
MA
158** tests/pure-parser.at []
159New tests.
0f8d586a 160
cd6a695e
AD
161* input synclines
162Some users create their foo.y files, and equip them with #line. Bison
163should recognize these, and preserve them.
0e95c1dd
AD
164
165* BTYacc
166See if we can integrate backtracking in Bison. Contact the BTYacc
167maintainers.
168
2ab9a04f
AD
169** Keeping the conflicted actions
170First, analyze the differences between byacc and btyacc (I'm referring
171to the executables). Find where the conflicts are preserved.
172
173** Compare with the GLR tables
174See how isomorphic the way BTYacc and the way the GLR adjustements in
175Bison are compatible. *As much as possible* one should try to use the
176same implementation in the Bison executables. I insist: it should be
177very feasible to use the very same conflict tables.
178
179** Adjust the skeletons
180Import the skeletons for C and C++.
181
182** Improve the skeletons
183Have them support yysymprint, yydestruct and so forth.
184
0e95c1dd
AD
185
186* Precedence
2ab9a04f
AD
187
188** Partial order
0e95c1dd
AD
189It is unfortunate that there is a total order for precedence. It
190makes it impossible to have modular precedence information. We should
2ab9a04f 191move to partial orders (sounds like series/parallel orders to me).
0e95c1dd 192
3c9160d9
AD
193This will be possible with a Bison parser for the grammar, as it will
194make it much easier to extend the grammar.
195
2ab9a04f
AD
196** Correlation b/w precedence and associativity
197Also, I fail to understand why we have to assign the same
198associativity to operators with the same precedence. For instance,
199why can't I decide that the precedence of * and / is the same, but the
200latter is nonassoc?
201
202If there is really no profound motivation, we should find a new syntax
203to allow specifying this.
204
205** RR conflicts
206See if we can use precedence between rules to solve RR conflicts. See
207what POSIX says.
208
209
69991a58
AD
210* $undefined
211From Hans:
212- If the Bison generated parser experiences an undefined number in the
213character range, that character is written out in diagnostic messages, an
214addition to the $undefined value.
215
216Suggest: Change the name $undefined to undefined; looks better in outputs.
217
2ab9a04f 218
69991a58
AD
219* Default Action
220From Hans:
221- For use with my C++ parser, I transported the "switch (yyn)" statement
222that Bison writes to the bison.simple skeleton file. This way, I can remove
223the current default rule $$ = $1 implementation, which causes a double
224assignment to $$ which may not be OK under C++, replacing it with a
225"default:" part within the switch statement.
226
227Note that the default rule $$ = $1, when typed, is perfectly OK under C,
228but in the C++ implementation I made, this rule is different from
229$<type_name>$ = $<type_name>1. I therefore think that one should implement
230a Bison option where every typed default rule is explicitly written out
231(same typed ruled can of course be grouped together).
232
3c9160d9
AD
233Note: Robert Anisko handles this. He knows how to do it.
234
2ab9a04f 235
0164db68
AD
236* Warnings
237It would be nice to have warning support. See how Autoconf handles
238them, it is fairly well described there. It would be very nice to
239implement this in such a way that other programs could use
240lib/warnings.[ch].
241
9306c70c
AD
242Don't work on this without first announcing you do, as I already have
243thought about it, and know many of the components that can be used to
244implement it.
245
2ab9a04f 246
69991a58
AD
247* Pre and post actions.
248From: Florian Krohm <florian@edamail.fishkill.ibm.com>
249Subject: YYACT_EPILOGUE
250To: bug-bison@gnu.org
251X-Sent: 1 week, 4 days, 14 hours, 38 minutes, 11 seconds ago
252
253The other day I had the need for explicitly building the parse tree. I
254used %locations for that and defined YYLLOC_DEFAULT to call a function
255that returns the tree node for the production. Easy. But I also needed
256to assign the S-attribute to the tree node. That cannot be done in
257YYLLOC_DEFAULT, because it is invoked before the action is executed.
258The way I solved this was to define a macro YYACT_EPILOGUE that would
259be invoked after the action. For reasons of symmetry I also added
260YYACT_PROLOGUE. Although I had no use for that I can envision how it
261might come in handy for debugging purposes.
76551463 262All is needed is to add
69991a58
AD
263
264#if YYLSP_NEEDED
265 YYACT_EPILOGUE (yyval, (yyvsp - yylen), yylen, yyloc, (yylsp - yylen));
266#else
267 YYACT_EPILOGUE (yyval, (yyvsp - yylen), yylen);
268#endif
269
270at the proper place to bison.simple. Ditto for YYACT_PROLOGUE.
271
272I was wondering what you think about adding YYACT_PROLOGUE/EPILOGUE
273to bison. If you're interested, I'll work on a patch.
274
d7215705
AD
275* Move to Graphviz
276Well, VCG seems really dead. Move to Graphviz instead. Also, equip
277the parser with a means to create the (visual) parse tree.
278
f294a2c2
AD
279-----
280
281Copyright (C) 2001, 2002 Free Software Foundation, Inc.
282
976e6270 283This file is part of GNU Bison.
f294a2c2 284
976e6270 285GNU Bison is free software; you can redistribute it and/or modify
f294a2c2
AD
286it under the terms of the GNU General Public License as published by
287the Free Software Foundation; either version 2, or (at your option)
288any later version.
289
976e6270 290GNU Bison is distributed in the hope that it will be useful,
f294a2c2
AD
291but WITHOUT ANY WARRANTY; without even the implied warranty of
292MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
293GNU General Public License for more details.
294
295You should have received a copy of the GNU General Public License
976e6270 296along with Bison; see the file COPYING. If not, write to
f294a2c2
AD
297the Free Software Foundation, Inc., 59 Temple Place - Suite 330,
298Boston, MA 02111-1307, USA.