]> git.saurik.com Git - bison.git/blame_incremental - NEWS
NEWS: spell check.
[bison.git] / NEWS
... / ...
CommitLineData
1GNU Bison NEWS
2
3* Noteworthy changes in release ?.? (????-??-??) [?]
4
5** Future changes
6
7 The next major release of Bison will drop support for the following
8 deprecated features. Please report disagreements to bug-bison@gnu.org.
9
10*** K&C parsers
11
12 Support for generating parsers in K&R C will be removed. Parsers
13 generated for C support ISO C90, and are tested with ISO C99 and ISO C11
14 compilers.
15
16*** Features deprecated since Bison 1.875
17
18 The definitions of yystype and yyltype will be removed; use YYSTYPE and
19 YYLTYPE.
20
21 YYPARSE_PARAM and YYLEX_PARAM, deprecated in favor of %parse-param and
22 %lex-param, will no longer be supported.
23
24 Support for the preprocessor symbol YYERROR_VERBOSE will be removed, use
25 %error-verbose.
26
27*** The generated header will be included (yacc.c)
28
29 Instead of duplicating the content of the generated header (definition of
30 YYSTYPE, yyparse declaration etc.), the generated parser will include it,
31 as is already the case for GLR or C++ parsers. This change is deferred
32 because existing versions of ylwrap (e.g., Automake 1.12.1) do not support
33 it.
34
35** Headers
36
37*** Guards (yacc.c, glr.c, glr.cc)
38
39 The generated headers are now guarded, as is already the case for C++
40 parsers (lalr1.cc). For instance, with --defines=foo.h:
41
42 #ifndef YY_FOO_H
43 # define YY_FOO_H
44 ...
45 #endif /* !YY_FOO_H */
46
47*** New declarations (yacc.c, glr.c)
48
49 The generated header now declares yydebug and yyparse. Both honor
50 --name-prefix=bar_, and yield
51
52 int bar_parse (void);
53
54 rather than
55
56 #define yyparse bar_parse
57 int yyparse (void);
58
59 in order to facilitate the inclusion of several parser headers inside a
60 single compilation unit.
61
62*** Exported symbols in C++
63
64 The symbols YYTOKEN_TABLE and YYERROR_VERBOSE, which were defined in the
65 header, are removed, as they prevent the possibility of including several
66 generated headers from a single compilation unit.
67
68*** YYLSP_NEEDED
69
70 For the same reasons, the undocumented and unused macro YYLSP_NEEDED is no
71 longer defined.
72
73** New %define variable: api.prefix
74
75 Now that the generated headers are more complete and properly protected
76 against multiple inclusions, constant names, such as YYSTYPE are a
77 problem. While yyparse and others are properly renamed by %name-prefix,
78 YYSTYPE, YYDEBUG and others have never been affected by it. Because it
79 would introduce backward compatibility issues in projects not expecting
80 YYSTYPE to be renamed, instead of changing the behavior of %name-prefix,
81 it is deprecated in favor of a new %define variable: api.prefix.
82
83 The following examples compares both:
84
85 %name-prefix "bar_" | %define api.prefix "bar_"
86 %token <ival> FOO %token <ival> FOO
87 %union { int ival; } %union { int ival; }
88 %% %%
89 exp: 'a'; exp: 'a';
90
91 bison generates:
92
93 #ifndef BAR_FOO_H #ifndef BAR_FOO_H
94 # define BAR_FOO_H # define BAR_FOO_H
95
96 /* Enabling traces. */ /* Enabling traces. */
97 # ifndef YYDEBUG # ifndef YYDEBUG
98 # define YYDEBUG 0 # define YYDEBUG 0
99 # endif # endif
100 # if YYDEBUG # if YYDEBUG
101 extern int bar_debug; extern int bar_debug;
102 # endif # endif
103
104 /* Tokens. */ /* Tokens. */
105 # ifndef YYTOKENTYPE | # ifndef BAR_TOKENTYPE
106 # define YYTOKENTYPE | # define BAR_TOKENTYPE
107 enum yytokentype { | enum bar_tokentype {
108 FOO = 258 FOO = 258
109 }; };
110 # endif # endif
111
112 #if ! defined YYSTYPE \ | #if ! defined BAR_STYPE \
113 && ! defined YYSTYPE_IS_DECLARED | && ! defined BAR_STYPE_IS_DECLARED
114 typedef union YYSTYPE | typedef union BAR_STYPE
115 { {
116 int ival; int ival;
117 } YYSTYPE; | } BAR_STYPE;
118 # define YYSTYPE_IS_DECLARED 1 | # define BAR_STYPE_IS_DECLARED 1
119 #endif #endif
120
121 extern YYSTYPE bar_lval; | extern BAR_STYPE bar_lval;
122
123 int bar_parse (void); int bar_parse (void);
124
125 #endif /* !BAR_FOO_H */ #endif /* !BAR_FOO_H */
126
127* Noteworthy changes in release 2.5.1 (2012-06-05) [stable]
128
129** Future changes:
130
131 The next major release will drop support for generating parsers in K&R C.
132
133** yacc.c: YYBACKUP works as expected.
134
135** glr.c improvements:
136
137*** Location support is eliminated when not requested:
138
139 GLR parsers used to include location-related code even when locations were
140 not requested, and therefore not even usable.
141
142*** __attribute__ is preserved:
143
144 __attribute__ is no longer disabled when __STRICT_ANSI__ is defined (i.e.,
145 when -std is passed to GCC).
146
147** lalr1.java: several fixes:
148
149 The Java parser no longer throws ArrayIndexOutOfBoundsException if the
150 first token leads to a syntax error. Some minor clean ups.
151
152** Changes for C++:
153
154*** C++11 compatibility:
155
156 C and C++ parsers use "nullptr" instead of "0" when __cplusplus is 201103L
157 or higher.
158
159*** Header guards
160
161 The header files such as "parser.hh", "location.hh", etc. used a constant
162 name for preprocessor guards, for instance:
163
164 #ifndef BISON_LOCATION_HH
165 # define BISON_LOCATION_HH
166 ...
167 #endif // !BISON_LOCATION_HH
168
169 The inclusion guard is now computed from "PREFIX/FILE-NAME", where lower
170 case characters are converted to upper case, and series of
171 non-alphanumerical characters are converted to an underscore.
172
173 With "bison -o lang++/parser.cc", "location.hh" would now include:
174
175 #ifndef YY_LANG_LOCATION_HH
176 # define YY_LANG_LOCATION_HH
177 ...
178 #endif // !YY_LANG_LOCATION_HH
179
180*** C++ locations:
181
182 The position and location constructors (and their initialize methods)
183 accept new arguments for line and column. Several issues in the
184 documentation were fixed.
185
186** liby is no longer asking for "rpl_fprintf" on some platforms.
187
188** Changes in the manual:
189
190*** %printer is documented
191
192 The "%printer" directive, supported since at least Bison 1.50, is finally
193 documented. The "mfcalc" example is extended to demonstrate it.
194
195 For consistency with the C skeletons, the C++ parsers now also support
196 "yyoutput" (as an alias to "debug_stream ()").
197
198*** Several improvements have been made:
199
200 The layout for grammar excerpts was changed to a more compact scheme.
201 Named references are motivated. The description of the automaton
202 description file (*.output) is updated to the current format. Incorrect
203 index entries were fixed. Some other errors were fixed.
204
205** Building bison:
206
207*** Conflicting prototypes with recent/modified Flex.
208
209 Fixed build problems with the current, unreleased, version of Flex, and
210 some modified versions of 2.5.35, which have modified function prototypes.
211
212*** Warnings during the build procedure have been eliminated.
213
214*** Several portability problems in the test suite have been fixed:
215
216 This includes warnings with some compilers, unexpected behavior of tools
217 such as diff, warning messages from the test suite itself, etc.
218
219*** The install-pdf target works properly:
220
221 Running "make install-pdf" (or -dvi, -html, -info, and -ps) no longer
222 halts in the middle of its course.
223
224* Changes in version 2.5 (2011-05-14):
225
226** Grammar symbol names can now contain non-initial dashes:
227
228 Consistently with directives (such as %error-verbose) and with
229 %define variables (e.g. push-pull), grammar symbol names may contain
230 dashes in any position except the beginning. This is a GNU
231 extension over POSIX Yacc. Thus, use of this extension is reported
232 by -Wyacc and rejected in Yacc mode (--yacc).
233
234** Named references:
235
236 Historically, Yacc and Bison have supported positional references
237 ($n, $$) to allow access to symbol values from inside of semantic
238 actions code.
239
240 Starting from this version, Bison can also accept named references.
241 When no ambiguity is possible, original symbol names may be used
242 as named references:
243
244 if_stmt : "if" cond_expr "then" then_stmt ';'
245 { $if_stmt = mk_if_stmt($cond_expr, $then_stmt); }
246
247 In the more common case, explicit names may be declared:
248
249 stmt[res] : "if" expr[cond] "then" stmt[then] "else" stmt[else] ';'
250 { $res = mk_if_stmt($cond, $then, $else); }
251
252 Location information is also accessible using @name syntax. When
253 accessing symbol names containing dots or dashes, explicit bracketing
254 ($[sym.1]) must be used.
255
256 These features are experimental in this version. More user feedback
257 will help to stabilize them.
258
259** IELR(1) and canonical LR(1):
260
261 IELR(1) is a minimal LR(1) parser table generation algorithm. That
262 is, given any context-free grammar, IELR(1) generates parser tables
263 with the full language-recognition power of canonical LR(1) but with
264 nearly the same number of parser states as LALR(1). This reduction
265 in parser states is often an order of magnitude. More importantly,
266 because canonical LR(1)'s extra parser states may contain duplicate
267 conflicts in the case of non-LR(1) grammars, the number of conflicts
268 for IELR(1) is often an order of magnitude less as well. This can
269 significantly reduce the complexity of developing of a grammar.
270
271 Bison can now generate IELR(1) and canonical LR(1) parser tables in
272 place of its traditional LALR(1) parser tables, which remain the
273 default. You can specify the type of parser tables in the grammar
274 file with these directives:
275
276 %define lr.type lalr
277 %define lr.type ielr
278 %define lr.type canonical-lr
279
280 The default-reduction optimization in the parser tables can also be
281 adjusted using "%define lr.default-reductions". For details on both
282 of these features, see the new section "Tuning LR" in the Bison
283 manual.
284
285 These features are experimental. More user feedback will help to
286 stabilize them.
287
288** LAC (Lookahead Correction) for syntax error handling:
289
290 Canonical LR, IELR, and LALR can suffer from a couple of problems
291 upon encountering a syntax error. First, the parser might perform
292 additional parser stack reductions before discovering the syntax
293 error. Such reductions can perform user semantic actions that are
294 unexpected because they are based on an invalid token, and they
295 cause error recovery to begin in a different syntactic context than
296 the one in which the invalid token was encountered. Second, when
297 verbose error messages are enabled (with %error-verbose or the
298 obsolete "#define YYERROR_VERBOSE"), the expected token list in the
299 syntax error message can both contain invalid tokens and omit valid
300 tokens.
301
302 The culprits for the above problems are %nonassoc, default
303 reductions in inconsistent states, and parser state merging. Thus,
304 IELR and LALR suffer the most. Canonical LR can suffer only if
305 %nonassoc is used or if default reductions are enabled for
306 inconsistent states.
307
308 LAC is a new mechanism within the parsing algorithm that solves
309 these problems for canonical LR, IELR, and LALR without sacrificing
310 %nonassoc, default reductions, or state merging. When LAC is in
311 use, canonical LR and IELR behave almost exactly the same for both
312 syntactically acceptable and syntactically unacceptable input.
313 While LALR still does not support the full language-recognition
314 power of canonical LR and IELR, LAC at least enables LALR's syntax
315 error handling to correctly reflect LALR's language-recognition
316 power.
317
318 Currently, LAC is only supported for deterministic parsers in C.
319 You can enable LAC with the following directive:
320
321 %define parse.lac full
322
323 See the new section "LAC" in the Bison manual for additional
324 details including a few caveats.
325
326 LAC is an experimental feature. More user feedback will help to
327 stabilize it.
328
329** %define improvements:
330
331*** Can now be invoked via the command line:
332
333 Each of these command-line options
334
335 -D NAME[=VALUE]
336 --define=NAME[=VALUE]
337
338 -F NAME[=VALUE]
339 --force-define=NAME[=VALUE]
340
341 is equivalent to this grammar file declaration
342
343 %define NAME ["VALUE"]
344
345 except that the manner in which Bison processes multiple definitions
346 for the same NAME differs. Most importantly, -F and --force-define
347 quietly override %define, but -D and --define do not. For further
348 details, see the section "Bison Options" in the Bison manual.
349
350*** Variables renamed:
351
352 The following %define variables
353
354 api.push_pull
355 lr.keep_unreachable_states
356
357 have been renamed to
358
359 api.push-pull
360 lr.keep-unreachable-states
361
362 The old names are now deprecated but will be maintained indefinitely
363 for backward compatibility.
364
365*** Values no longer need to be quoted in the grammar file:
366
367 If a %define value is an identifier, it no longer needs to be placed
368 within quotations marks. For example,
369
370 %define api.push-pull "push"
371
372 can be rewritten as
373
374 %define api.push-pull push
375
376*** Unrecognized variables are now errors not warnings.
377
378*** Multiple invocations for any variable is now an error not a warning.
379
380** Unrecognized %code qualifiers are now errors not warnings.
381
382** Character literals not of length one:
383
384 Previously, Bison quietly converted all character literals to length
385 one. For example, without warning, Bison interpreted the operators in
386 the following grammar to be the same token:
387
388 exp: exp '++'
389 | exp '+' exp
390 ;
391
392 Bison now warns when a character literal is not of length one. In
393 some future release, Bison will start reporting an error instead.
394
395** Destructor calls fixed for lookaheads altered in semantic actions:
396
397 Previously for deterministic parsers in C, if a user semantic action
398 altered yychar, the parser in some cases used the old yychar value to
399 determine which destructor to call for the lookahead upon a syntax
400 error or upon parser return. This bug has been fixed.
401
402** C++ parsers use YYRHSLOC:
403
404 Similarly to the C parsers, the C++ parsers now define the YYRHSLOC
405 macro and use it in the default YYLLOC_DEFAULT. You are encouraged
406 to use it. If, for instance, your location structure has "first"
407 and "last" members, instead of
408
409 # define YYLLOC_DEFAULT(Current, Rhs, N) \
410 do \
411 if (N) \
412 { \
413 (Current).first = (Rhs)[1].location.first; \
414 (Current).last = (Rhs)[N].location.last; \
415 } \
416 else \
417 { \
418 (Current).first = (Current).last = (Rhs)[0].location.last; \
419 } \
420 while (false)
421
422 use:
423
424 # define YYLLOC_DEFAULT(Current, Rhs, N) \
425 do \
426 if (N) \
427 { \
428 (Current).first = YYRHSLOC (Rhs, 1).first; \
429 (Current).last = YYRHSLOC (Rhs, N).last; \
430 } \
431 else \
432 { \
433 (Current).first = (Current).last = YYRHSLOC (Rhs, 0).last; \
434 } \
435 while (false)
436
437** YYLLOC_DEFAULT in C++:
438
439 The default implementation of YYLLOC_DEFAULT used to be issued in
440 the header file. It is now output in the implementation file, after
441 the user %code sections so that its #ifndef guard does not try to
442 override the user's YYLLOC_DEFAULT if provided.
443
444** YYFAIL now produces warnings and Java parsers no longer implement it:
445
446 YYFAIL has existed for many years as an undocumented feature of
447 deterministic parsers in C generated by Bison. More recently, it was
448 a documented feature of Bison's experimental Java parsers. As
449 promised in Bison 2.4.2's NEWS entry, any appearance of YYFAIL in a
450 semantic action now produces a deprecation warning, and Java parsers
451 no longer implement YYFAIL at all. For further details, including a
452 discussion of how to suppress C preprocessor warnings about YYFAIL
453 being unused, see the Bison 2.4.2 NEWS entry.
454
455** Temporary hack for adding a semicolon to the user action:
456
457 Previously, Bison appended a semicolon to every user action for
458 reductions when the output language defaulted to C (specifically, when
459 neither %yacc, %language, %skeleton, or equivalent command-line
460 options were specified). This allowed actions such as
461
462 exp: exp "+" exp { $$ = $1 + $3 };
463
464 instead of
465
466 exp: exp "+" exp { $$ = $1 + $3; };
467
468 As a first step in removing this misfeature, Bison now issues a
469 warning when it appends a semicolon. Moreover, in cases where Bison
470 cannot easily determine whether a semicolon is needed (for example, an
471 action ending with a cpp directive or a braced compound initializer),
472 it no longer appends one. Thus, the C compiler might now complain
473 about a missing semicolon where it did not before. Future releases of
474 Bison will cease to append semicolons entirely.
475
476** Verbose syntax error message fixes:
477
478 When %error-verbose or the obsolete "#define YYERROR_VERBOSE" is
479 specified, syntax error messages produced by the generated parser
480 include the unexpected token as well as a list of expected tokens.
481 The effect of %nonassoc on these verbose messages has been corrected
482 in two ways, but a more complete fix requires LAC, described above:
483
484*** When %nonassoc is used, there can exist parser states that accept no
485 tokens, and so the parser does not always require a lookahead token
486 in order to detect a syntax error. Because no unexpected token or
487 expected tokens can then be reported, the verbose syntax error
488 message described above is suppressed, and the parser instead
489 reports the simpler message, "syntax error". Previously, this
490 suppression was sometimes erroneously triggered by %nonassoc when a
491 lookahead was actually required. Now verbose messages are
492 suppressed only when all previous lookaheads have already been
493 shifted or discarded.
494
495*** Previously, the list of expected tokens erroneously included tokens
496 that would actually induce a syntax error because conflicts for them
497 were resolved with %nonassoc in the current parser state. Such
498 tokens are now properly omitted from the list.
499
500*** Expected token lists are still often wrong due to state merging
501 (from LALR or IELR) and default reductions, which can both add
502 invalid tokens and subtract valid tokens. Canonical LR almost
503 completely fixes this problem by eliminating state merging and
504 default reductions. However, there is one minor problem left even
505 when using canonical LR and even after the fixes above. That is,
506 if the resolution of a conflict with %nonassoc appears in a later
507 parser state than the one at which some syntax error is
508 discovered, the conflicted token is still erroneously included in
509 the expected token list. Bison's new LAC implementation,
510 described above, eliminates this problem and the need for
511 canonical LR. However, LAC is still experimental and is disabled
512 by default.
513
514** Java skeleton fixes:
515
516*** A location handling bug has been fixed.
517
518*** The top element of each of the value stack and location stack is now
519 cleared when popped so that it can be garbage collected.
520
521*** Parser traces now print the top element of the stack.
522
523** -W/--warnings fixes:
524
525*** Bison now properly recognizes the "no-" versions of categories:
526
527 For example, given the following command line, Bison now enables all
528 warnings except warnings for incompatibilities with POSIX Yacc:
529
530 bison -Wall,no-yacc gram.y
531
532*** Bison now treats S/R and R/R conflicts like other warnings:
533
534 Previously, conflict reports were independent of Bison's normal
535 warning system. Now, Bison recognizes the warning categories
536 "conflicts-sr" and "conflicts-rr". This change has important
537 consequences for the -W and --warnings command-line options. For
538 example:
539
540 bison -Wno-conflicts-sr gram.y # S/R conflicts not reported
541 bison -Wno-conflicts-rr gram.y # R/R conflicts not reported
542 bison -Wnone gram.y # no conflicts are reported
543 bison -Werror gram.y # any conflict is an error
544
545 However, as before, if the %expect or %expect-rr directive is
546 specified, an unexpected number of conflicts is an error, and an
547 expected number of conflicts is not reported, so -W and --warning
548 then have no effect on the conflict report.
549
550*** The "none" category no longer disables a preceding "error":
551
552 For example, for the following command line, Bison now reports
553 errors instead of warnings for incompatibilities with POSIX Yacc:
554
555 bison -Werror,none,yacc gram.y
556
557*** The "none" category now disables all Bison warnings:
558
559 Previously, the "none" category disabled only Bison warnings for
560 which there existed a specific -W/--warning category. However,
561 given the following command line, Bison is now guaranteed to
562 suppress all warnings:
563
564 bison -Wnone gram.y
565
566** Precedence directives can now assign token number 0:
567
568 Since Bison 2.3b, which restored the ability of precedence
569 directives to assign token numbers, doing so for token number 0 has
570 produced an assertion failure. For example:
571
572 %left END 0
573
574 This bug has been fixed.
575
576* Changes in version 2.4.3 (2010-08-05):
577
578** Bison now obeys -Werror and --warnings=error for warnings about
579 grammar rules that are useless in the parser due to conflicts.
580
581** Problems with spawning M4 on at least FreeBSD 8 and FreeBSD 9 have
582 been fixed.
583
584** Failures in the test suite for GCC 4.5 have been fixed.
585
586** Failures in the test suite for some versions of Sun Studio C++ have
587 been fixed.
588
589** Contrary to Bison 2.4.2's NEWS entry, it has been decided that
590 warnings about undefined %prec identifiers will not be converted to
591 errors in Bison 2.5. They will remain warnings, which should be
592 sufficient for POSIX while avoiding backward compatibility issues.
593
594** Minor documentation fixes.
595
596* Changes in version 2.4.2 (2010-03-20):
597
598** Some portability problems that resulted in failures and livelocks
599 in the test suite on some versions of at least Solaris, AIX, HP-UX,
600 RHEL4, and Tru64 have been addressed. As a result, fatal Bison
601 errors should no longer cause M4 to report a broken pipe on the
602 affected platforms.
603
604** "%prec IDENTIFIER" requires IDENTIFIER to be defined separately.
605
606 POSIX specifies that an error be reported for any identifier that does
607 not appear on the LHS of a grammar rule and that is not defined by
608 %token, %left, %right, or %nonassoc. Bison 2.3b and later lost this
609 error report for the case when an identifier appears only after a
610 %prec directive. It is now restored. However, for backward
611 compatibility with recent Bison releases, it is only a warning for
612 now. In Bison 2.5 and later, it will return to being an error.
613 [Between the 2.4.2 and 2.4.3 releases, it was decided that this
614 warning will not be converted to an error in Bison 2.5.]
615
616** Detection of GNU M4 1.4.6 or newer during configure is improved.
617
618** Warnings from gcc's -Wundef option about undefined YYENABLE_NLS,
619 YYLTYPE_IS_TRIVIAL, and __STRICT_ANSI__ in C/C++ parsers are now
620 avoided.
621
622** %code is now a permanent feature.
623
624 A traditional Yacc prologue directive is written in the form:
625
626 %{CODE%}
627
628 To provide a more flexible alternative, Bison 2.3b introduced the
629 %code directive with the following forms for C/C++:
630
631 %code {CODE}
632 %code requires {CODE}
633 %code provides {CODE}
634 %code top {CODE}
635
636 These forms are now considered permanent features of Bison. See the
637 %code entries in the section "Bison Declaration Summary" in the Bison
638 manual for a summary of their functionality. See the section
639 "Prologue Alternatives" for a detailed discussion including the
640 advantages of %code over the traditional Yacc prologue directive.
641
642 Bison's Java feature as a whole including its current usage of %code
643 is still considered experimental.
644
645** YYFAIL is deprecated and will eventually be removed.
646
647 YYFAIL has existed for many years as an undocumented feature of
648 deterministic parsers in C generated by Bison. Previously, it was
649 documented for Bison's experimental Java parsers. YYFAIL is no longer
650 documented for Java parsers and is formally deprecated in both cases.
651 Users are strongly encouraged to migrate to YYERROR, which is
652 specified by POSIX.
653
654 Like YYERROR, you can invoke YYFAIL from a semantic action in order to
655 induce a syntax error. The most obvious difference from YYERROR is
656 that YYFAIL will automatically invoke yyerror to report the syntax
657 error so that you don't have to. However, there are several other
658 subtle differences between YYERROR and YYFAIL, and YYFAIL suffers from
659 inherent flaws when %error-verbose or "#define YYERROR_VERBOSE" is
660 used. For a more detailed discussion, see:
661
662 http://lists.gnu.org/archive/html/bison-patches/2009-12/msg00024.html
663
664 The upcoming Bison 2.5 will remove YYFAIL from Java parsers, but
665 deterministic parsers in C will continue to implement it. However,
666 because YYFAIL is already flawed, it seems futile to try to make new
667 Bison features compatible with it. Thus, during parser generation,
668 Bison 2.5 will produce a warning whenever it discovers YYFAIL in a
669 rule action. In a later release, YYFAIL will be disabled for
670 %error-verbose and "#define YYERROR_VERBOSE". Eventually, YYFAIL will
671 be removed altogether.
672
673 There exists at least one case where Bison 2.5's YYFAIL warning will
674 be a false positive. Some projects add phony uses of YYFAIL and other
675 Bison-defined macros for the sole purpose of suppressing C
676 preprocessor warnings (from GCC cpp's -Wunused-macros, for example).
677 To avoid Bison's future warning, such YYFAIL uses can be moved to the
678 epilogue (that is, after the second "%%") in the Bison input file. In
679 this release (2.4.2), Bison already generates its own code to suppress
680 C preprocessor warnings for YYFAIL, so projects can remove their own
681 phony uses of YYFAIL if compatibility with Bison releases prior to
682 2.4.2 is not necessary.
683
684** Internationalization.
685
686 Fix a regression introduced in Bison 2.4: Under some circumstances,
687 message translations were not installed although supported by the
688 host system.
689
690* Changes in version 2.4.1 (2008-12-11):
691
692** In the GLR defines file, unexpanded M4 macros in the yylval and yylloc
693 declarations have been fixed.
694
695** Temporary hack for adding a semicolon to the user action.
696
697 Bison used to prepend a trailing semicolon at the end of the user
698 action for reductions. This allowed actions such as
699
700 exp: exp "+" exp { $$ = $1 + $3 };
701
702 instead of
703
704 exp: exp "+" exp { $$ = $1 + $3; };
705
706 Some grammars still depend on this "feature". Bison 2.4.1 restores
707 the previous behavior in the case of C output (specifically, when
708 neither %language or %skeleton or equivalent command-line options
709 are used) to leave more time for grammars depending on the old
710 behavior to be adjusted. Future releases of Bison will disable this
711 feature.
712
713** A few minor improvements to the Bison manual.
714
715* Changes in version 2.4 (2008-11-02):
716
717** %language is an experimental feature.
718
719 We first introduced this feature in test release 2.3b as a cleaner
720 alternative to %skeleton. Since then, we have discussed the possibility of
721 modifying its effect on Bison's output file names. Thus, in this release,
722 we consider %language to be an experimental feature that will likely evolve
723 in future releases.
724
725** Forward compatibility with GNU M4 has been improved.
726
727** Several bugs in the C++ skeleton and the experimental Java skeleton have been
728 fixed.
729
730* Changes in version 2.3b (2008-05-27):
731
732** The quotes around NAME that used to be required in the following directive
733 are now deprecated:
734
735 %define NAME "VALUE"
736
737** The directive "%pure-parser" is now deprecated in favor of:
738
739 %define api.pure
740
741 which has the same effect except that Bison is more careful to warn about
742 unreasonable usage in the latter case.
743
744** Push Parsing
745
746 Bison can now generate an LALR(1) parser in C with a push interface. That
747 is, instead of invoking "yyparse", which pulls tokens from "yylex", you can
748 push one token at a time to the parser using "yypush_parse", which will
749 return to the caller after processing each token. By default, the push
750 interface is disabled. Either of the following directives will enable it:
751
752 %define api.push_pull "push" // Just push; does not require yylex.
753 %define api.push_pull "both" // Push and pull; requires yylex.
754
755 See the new section "A Push Parser" in the Bison manual for details.
756
757 The current push parsing interface is experimental and may evolve. More user
758 feedback will help to stabilize it.
759
760** The -g and --graph options now output graphs in Graphviz DOT format,
761 not VCG format. Like --graph, -g now also takes an optional FILE argument
762 and thus cannot be bundled with other short options.
763
764** Java
765
766 Bison can now generate an LALR(1) parser in Java. The skeleton is
767 "data/lalr1.java". Consider using the new %language directive instead of
768 %skeleton to select it.
769
770 See the new section "Java Parsers" in the Bison manual for details.
771
772 The current Java interface is experimental and may evolve. More user
773 feedback will help to stabilize it.
774
775** %language
776
777 This new directive specifies the programming language of the generated
778 parser, which can be C (the default), C++, or Java. Besides the skeleton
779 that Bison uses, the directive affects the names of the generated files if
780 the grammar file's name ends in ".y".
781
782** XML Automaton Report
783
784 Bison can now generate an XML report of the LALR(1) automaton using the new
785 "--xml" option. The current XML schema is experimental and may evolve. More
786 user feedback will help to stabilize it.
787
788** The grammar file may now specify the name of the parser header file using
789 %defines. For example:
790
791 %defines "parser.h"
792
793** When reporting useless rules, useless nonterminals, and unused terminals,
794 Bison now employs the terms "useless in grammar" instead of "useless",
795 "useless in parser" instead of "never reduced", and "unused in grammar"
796 instead of "unused".
797
798** Unreachable State Removal
799
800 Previously, Bison sometimes generated parser tables containing unreachable
801 states. A state can become unreachable during conflict resolution if Bison
802 disables a shift action leading to it from a predecessor state. Bison now:
803
804 1. Removes unreachable states.
805
806 2. Does not report any conflicts that appeared in unreachable states.
807 WARNING: As a result, you may need to update %expect and %expect-rr
808 directives in existing grammar files.
809
810 3. For any rule used only in such states, Bison now reports the rule as
811 "useless in parser due to conflicts".
812
813 This feature can be disabled with the following directive:
814
815 %define lr.keep_unreachable_states
816
817 See the %define entry in the "Bison Declaration Summary" in the Bison manual
818 for further discussion.
819
820** Lookahead Set Correction in the ".output" Report
821
822 When instructed to generate a ".output" file including lookahead sets
823 (using "--report=lookahead", for example), Bison now prints each reduction's
824 lookahead set only next to the associated state's one item that (1) is
825 associated with the same rule as the reduction and (2) has its dot at the end
826 of its RHS. Previously, Bison also erroneously printed the lookahead set
827 next to all of the state's other items associated with the same rule. This
828 bug affected only the ".output" file and not the generated parser source
829 code.
830
831** --report-file=FILE is a new option to override the default ".output" file
832 name.
833
834** The "=" that used to be required in the following directives is now
835 deprecated:
836
837 %file-prefix "parser"
838 %name-prefix "c_"
839 %output "parser.c"
840
841** An Alternative to "%{...%}" -- "%code QUALIFIER {CODE}"
842
843 Bison 2.3a provided a new set of directives as a more flexible alternative to
844 the traditional Yacc prologue blocks. Those have now been consolidated into
845 a single %code directive with an optional qualifier field, which identifies
846 the purpose of the code and thus the location(s) where Bison should generate
847 it:
848
849 1. "%code {CODE}" replaces "%after-header {CODE}"
850 2. "%code requires {CODE}" replaces "%start-header {CODE}"
851 3. "%code provides {CODE}" replaces "%end-header {CODE}"
852 4. "%code top {CODE}" replaces "%before-header {CODE}"
853
854 See the %code entries in section "Bison Declaration Summary" in the Bison
855 manual for a summary of the new functionality. See the new section "Prologue
856 Alternatives" for a detailed discussion including the advantages of %code
857 over the traditional Yacc prologues.
858
859 The prologue alternatives are experimental. More user feedback will help to
860 determine whether they should become permanent features.
861
862** Revised warning: unset or unused mid-rule values
863
864 Since Bison 2.2, Bison has warned about mid-rule values that are set but not
865 used within any of the actions of the parent rule. For example, Bison warns
866 about unused $2 in:
867
868 exp: '1' { $$ = 1; } '+' exp { $$ = $1 + $4; };
869
870 Now, Bison also warns about mid-rule values that are used but not set. For
871 example, Bison warns about unset $$ in the mid-rule action in:
872
873 exp: '1' { $1 = 1; } '+' exp { $$ = $2 + $4; };
874
875 However, Bison now disables both of these warnings by default since they
876 sometimes prove to be false alarms in existing grammars employing the Yacc
877 constructs $0 or $-N (where N is some positive integer).
878
879 To enable these warnings, specify the option "--warnings=midrule-values" or
880 "-W", which is a synonym for "--warnings=all".
881
882** Default %destructor or %printer with "<*>" or "<>"
883
884 Bison now recognizes two separate kinds of default %destructor's and
885 %printer's:
886
887 1. Place "<*>" in a %destructor/%printer symbol list to define a default
888 %destructor/%printer for all grammar symbols for which you have formally
889 declared semantic type tags.
890
891 2. Place "<>" in a %destructor/%printer symbol list to define a default
892 %destructor/%printer for all grammar symbols without declared semantic
893 type tags.
894
895 Bison no longer supports the "%symbol-default" notation from Bison 2.3a.
896 "<*>" and "<>" combined achieve the same effect with one exception: Bison no
897 longer applies any %destructor to a mid-rule value if that mid-rule value is
898 not actually ever referenced using either $$ or $n in a semantic action.
899
900 The default %destructor's and %printer's are experimental. More user
901 feedback will help to determine whether they should become permanent
902 features.
903
904 See the section "Freeing Discarded Symbols" in the Bison manual for further
905 details.
906
907** %left, %right, and %nonassoc can now declare token numbers. This is required
908 by POSIX. However, see the end of section "Operator Precedence" in the Bison
909 manual for a caveat concerning the treatment of literal strings.
910
911** The nonfunctional --no-parser, -n, and %no-parser options have been
912 completely removed from Bison.
913
914* Changes in version 2.3a, 2006-09-13:
915
916** Instead of %union, you can define and use your own union type
917 YYSTYPE if your grammar contains at least one <type> tag.
918 Your YYSTYPE need not be a macro; it can be a typedef.
919 This change is for compatibility with other Yacc implementations,
920 and is required by POSIX.
921
922** Locations columns and lines start at 1.
923 In accordance with the GNU Coding Standards and Emacs.
924
925** You may now declare per-type and default %destructor's and %printer's:
926
927 For example:
928
929 %union { char *string; }
930 %token <string> STRING1
931 %token <string> STRING2
932 %type <string> string1
933 %type <string> string2
934 %union { char character; }
935 %token <character> CHR
936 %type <character> chr
937 %destructor { free ($$); } %symbol-default
938 %destructor { free ($$); printf ("%d", @$.first_line); } STRING1 string1
939 %destructor { } <character>
940
941 guarantees that, when the parser discards any user-defined symbol that has a
942 semantic type tag other than "<character>", it passes its semantic value to
943 "free". However, when the parser discards a "STRING1" or a "string1", it
944 also prints its line number to "stdout". It performs only the second
945 "%destructor" in this case, so it invokes "free" only once.
946
947 [Although we failed to mention this here in the 2.3a release, the default
948 %destructor's and %printer's were experimental, and they were rewritten in
949 future versions.]
950
951** Except for LALR(1) parsers in C with POSIX Yacc emulation enabled (with "-y",
952 "--yacc", or "%yacc"), Bison no longer generates #define statements for
953 associating token numbers with token names. Removing the #define statements
954 helps to sanitize the global namespace during preprocessing, but POSIX Yacc
955 requires them. Bison still generates an enum for token names in all cases.
956
957** Handling of traditional Yacc prologue blocks is now more consistent but
958 potentially incompatible with previous releases of Bison.
959
960 As before, you declare prologue blocks in your grammar file with the
961 "%{ ... %}" syntax. To generate the pre-prologue, Bison concatenates all
962 prologue blocks that you've declared before the first %union. To generate
963 the post-prologue, Bison concatenates all prologue blocks that you've
964 declared after the first %union.
965
966 Previous releases of Bison inserted the pre-prologue into both the header
967 file and the code file in all cases except for LALR(1) parsers in C. In the
968 latter case, Bison inserted it only into the code file. For parsers in C++,
969 the point of insertion was before any token definitions (which associate
970 token numbers with names). For parsers in C, the point of insertion was
971 after the token definitions.
972
973 Now, Bison never inserts the pre-prologue into the header file. In the code
974 file, it always inserts it before the token definitions.
975
976** Bison now provides a more flexible alternative to the traditional Yacc
977 prologue blocks: %before-header, %start-header, %end-header, and
978 %after-header.
979
980 For example, the following declaration order in the grammar file reflects the
981 order in which Bison will output these code blocks. However, you are free to
982 declare these code blocks in your grammar file in whatever order is most
983 convenient for you:
984
985 %before-header {
986 /* Bison treats this block like a pre-prologue block: it inserts it into
987 * the code file before the contents of the header file. It does *not*
988 * insert it into the header file. This is a good place to put
989 * #include's that you want at the top of your code file. A common
990 * example is '#include "system.h"'. */
991 }
992 %start-header {
993 /* Bison inserts this block into both the header file and the code file.
994 * In both files, the point of insertion is before any Bison-generated
995 * token, semantic type, location type, and class definitions. This is a
996 * good place to define %union dependencies, for example. */
997 }
998 %union {
999 /* Unlike the traditional Yacc prologue blocks, the output order for the
1000 * new %*-header blocks is not affected by their declaration position
1001 * relative to any %union in the grammar file. */
1002 }
1003 %end-header {
1004 /* Bison inserts this block into both the header file and the code file.
1005 * In both files, the point of insertion is after the Bison-generated
1006 * definitions. This is a good place to declare or define public
1007 * functions or data structures that depend on the Bison-generated
1008 * definitions. */
1009 }
1010 %after-header {
1011 /* Bison treats this block like a post-prologue block: it inserts it into
1012 * the code file after the contents of the header file. It does *not*
1013 * insert it into the header file. This is a good place to declare or
1014 * define internal functions or data structures that depend on the
1015 * Bison-generated definitions. */
1016 }
1017
1018 If you have multiple occurrences of any one of the above declarations, Bison
1019 will concatenate the contents in declaration order.
1020
1021 [Although we failed to mention this here in the 2.3a release, the prologue
1022 alternatives were experimental, and they were rewritten in future versions.]
1023
1024** The option "--report=look-ahead" has been changed to "--report=lookahead".
1025 The old spelling still works, but is not documented and may be removed
1026 in a future release.
1027
1028* Changes in version 2.3, 2006-06-05:
1029
1030** GLR grammars should now use "YYRECOVERING ()" instead of "YYRECOVERING",
1031 for compatibility with LALR(1) grammars.
1032
1033** It is now documented that any definition of YYSTYPE or YYLTYPE should
1034 be to a type name that does not contain parentheses or brackets.
1035
1036* Changes in version 2.2, 2006-05-19:
1037
1038** The distribution terms for all Bison-generated parsers now permit
1039 using the parsers in nonfree programs. Previously, this permission
1040 was granted only for Bison-generated LALR(1) parsers in C.
1041
1042** %name-prefix changes the namespace name in C++ outputs.
1043
1044** The C++ parsers export their token_type.
1045
1046** Bison now allows multiple %union declarations, and concatenates
1047 their contents together.
1048
1049** New warning: unused values
1050 Right-hand side symbols whose values are not used are reported,
1051 if the symbols have destructors. For instance:
1052
1053 exp: exp "?" exp ":" exp { $1 ? $1 : $3; }
1054 | exp "+" exp
1055 ;
1056
1057 will trigger a warning about $$ and $5 in the first rule, and $3 in
1058 the second ($1 is copied to $$ by the default rule). This example
1059 most likely contains three errors, and could be rewritten as:
1060
1061 exp: exp "?" exp ":" exp
1062 { $$ = $1 ? $3 : $5; free ($1 ? $5 : $3); free ($1); }
1063 | exp "+" exp
1064 { $$ = $1 ? $1 : $3; if ($1) free ($3); }
1065 ;
1066
1067 However, if the original actions were really intended, memory leaks
1068 and all, the warnings can be suppressed by letting Bison believe the
1069 values are used, e.g.:
1070
1071 exp: exp "?" exp ":" exp { $1 ? $1 : $3; (void) ($$, $5); }
1072 | exp "+" exp { $$ = $1; (void) $3; }
1073 ;
1074
1075 If there are mid-rule actions, the warning is issued if no action
1076 uses it. The following triggers no warning: $1 and $3 are used.
1077
1078 exp: exp { push ($1); } '+' exp { push ($3); sum (); };
1079
1080 The warning is intended to help catching lost values and memory leaks.
1081 If a value is ignored, its associated memory typically is not reclaimed.
1082
1083** %destructor vs. YYABORT, YYACCEPT, and YYERROR.
1084 Destructors are now called when user code invokes YYABORT, YYACCEPT,
1085 and YYERROR, for all objects on the stack, other than objects
1086 corresponding to the right-hand side of the current rule.
1087
1088** %expect, %expect-rr
1089 Incorrect numbers of expected conflicts are now actual errors,
1090 instead of warnings.
1091
1092** GLR, YACC parsers.
1093 The %parse-params are available in the destructors (and the
1094 experimental printers) as per the documentation.
1095
1096** Bison now warns if it finds a stray "$" or "@" in an action.
1097
1098** %require "VERSION"
1099 This specifies that the grammar file depends on features implemented
1100 in Bison version VERSION or higher.
1101
1102** lalr1.cc: The token and value types are now class members.
1103 The tokens were defined as free form enums and cpp macros. YYSTYPE
1104 was defined as a free form union. They are now class members:
1105 tokens are enumerations of the "yy::parser::token" struct, and the
1106 semantic values have the "yy::parser::semantic_type" type.
1107
1108 If you do not want or can update to this scheme, the directive
1109 '%define "global_tokens_and_yystype" "1"' triggers the global
1110 definition of tokens and YYSTYPE. This change is suitable both
1111 for previous releases of Bison, and this one.
1112
1113 If you wish to update, then make sure older version of Bison will
1114 fail using '%require "2.2"'.
1115
1116** DJGPP support added.
1117\f
1118* Changes in version 2.1, 2005-09-16:
1119
1120** The C++ lalr1.cc skeleton supports %lex-param.
1121
1122** Bison-generated parsers now support the translation of diagnostics like
1123 "syntax error" into languages other than English. The default
1124 language is still English. For details, please see the new
1125 Internationalization section of the Bison manual. Software
1126 distributors should also see the new PACKAGING file. Thanks to
1127 Bruno Haible for this new feature.
1128
1129** Wording in the Bison-generated parsers has been changed slightly to
1130 simplify translation. In particular, the message "memory exhausted"
1131 has replaced "parser stack overflow", as the old message was not
1132 always accurate for modern Bison-generated parsers.
1133
1134** Destructors are now called when the parser aborts, for all symbols left
1135 behind on the stack. Also, the start symbol is now destroyed after a
1136 successful parse. In both cases, the behavior was formerly inconsistent.
1137
1138** When generating verbose diagnostics, Bison-generated parsers no longer
1139 quote the literal strings associated with tokens. For example, for
1140 a syntax error associated with '%token NUM "number"' they might
1141 print 'syntax error, unexpected number' instead of 'syntax error,
1142 unexpected "number"'.
1143\f
1144* Changes in version 2.0, 2004-12-25:
1145
1146** Possibly-incompatible changes
1147
1148 - Bison-generated parsers no longer default to using the alloca function
1149 (when available) to extend the parser stack, due to widespread
1150 problems in unchecked stack-overflow detection. You can "#define
1151 YYSTACK_USE_ALLOCA 1" to require the use of alloca, but please read
1152 the manual to determine safe values for YYMAXDEPTH in that case.
1153
1154 - Error token location.
1155 During error recovery, the location of the syntax error is updated
1156 to cover the whole sequence covered by the error token: it includes
1157 the shifted symbols thrown away during the first part of the error
1158 recovery, and the lookahead rejected during the second part.
1159
1160 - Semicolon changes:
1161 . Stray semicolons are no longer allowed at the start of a grammar.
1162 . Semicolons are now required after in-grammar declarations.
1163
1164 - Unescaped newlines are no longer allowed in character constants or
1165 string literals. They were never portable, and GCC 3.4.0 has
1166 dropped support for them. Better diagnostics are now generated if
1167 forget a closing quote.
1168
1169 - NUL bytes are no longer allowed in Bison string literals, unfortunately.
1170
1171** New features
1172
1173 - GLR grammars now support locations.
1174
1175 - New directive: %initial-action.
1176 This directive allows the user to run arbitrary code (including
1177 initializing @$) from yyparse before parsing starts.
1178
1179 - A new directive "%expect-rr N" specifies the expected number of
1180 reduce/reduce conflicts in GLR parsers.
1181
1182 - %token numbers can now be hexadecimal integers, e.g., "%token FOO 0x12d".
1183 This is a GNU extension.
1184
1185 - The option "--report=lookahead" was changed to "--report=look-ahead".
1186 [However, this was changed back after 2.3.]
1187
1188 - Experimental %destructor support has been added to lalr1.cc.
1189
1190 - New configure option --disable-yacc, to disable installation of the
1191 yacc command and -ly library introduced in 1.875 for POSIX conformance.
1192
1193** Bug fixes
1194
1195 - For now, %expect-count violations are now just warnings, not errors.
1196 This is for compatibility with Bison 1.75 and earlier (when there are
1197 reduce/reduce conflicts) and with Bison 1.30 and earlier (when there
1198 are too many or too few shift/reduce conflicts). However, in future
1199 versions of Bison we plan to improve the %expect machinery so that
1200 these violations will become errors again.
1201
1202 - Within Bison itself, numbers (e.g., goto numbers) are no longer
1203 arbitrarily limited to 16-bit counts.
1204
1205 - Semicolons are now allowed before "|" in grammar rules, as POSIX requires.
1206\f
1207* Changes in version 1.875, 2003-01-01:
1208
1209** The documentation license has been upgraded to version 1.2
1210 of the GNU Free Documentation License.
1211
1212** syntax error processing
1213
1214 - In Yacc-style parsers YYLLOC_DEFAULT is now used to compute error
1215 locations too. This fixes bugs in error-location computation.
1216
1217 - %destructor
1218 It is now possible to reclaim the memory associated to symbols
1219 discarded during error recovery. This feature is still experimental.
1220
1221 - %error-verbose
1222 This new directive is preferred over YYERROR_VERBOSE.
1223
1224 - #defining yyerror to steal internal variables is discouraged.
1225 It is not guaranteed to work forever.
1226
1227** POSIX conformance
1228
1229 - Semicolons are once again optional at the end of grammar rules.
1230 This reverts to the behavior of Bison 1.33 and earlier, and improves
1231 compatibility with Yacc.
1232
1233 - "parse error" -> "syntax error"
1234 Bison now uniformly uses the term "syntax error"; formerly, the code
1235 and manual sometimes used the term "parse error" instead. POSIX
1236 requires "syntax error" in diagnostics, and it was thought better to
1237 be consistent.
1238
1239 - The documentation now emphasizes that yylex and yyerror must be
1240 declared before use. C99 requires this.
1241
1242 - Bison now parses C99 lexical constructs like UCNs and
1243 backslash-newline within C escape sequences, as POSIX 1003.1-2001 requires.
1244
1245 - File names are properly escaped in C output. E.g., foo\bar.y is
1246 output as "foo\\bar.y".
1247
1248 - Yacc command and library now available
1249 The Bison distribution now installs a "yacc" command, as POSIX requires.
1250 Also, Bison now installs a small library liby.a containing
1251 implementations of Yacc-compatible yyerror and main functions.
1252 This library is normally not useful, but POSIX requires it.
1253
1254 - Type clashes now generate warnings, not errors.
1255
1256 - If the user does not define YYSTYPE as a macro, Bison now declares it
1257 using typedef instead of defining it as a macro.
1258 For consistency, YYLTYPE is also declared instead of defined.
1259
1260** Other compatibility issues
1261
1262 - %union directives can now have a tag before the "{", e.g., the
1263 directive "%union foo {...}" now generates the C code
1264 "typedef union foo { ... } YYSTYPE;"; this is for Yacc compatibility.
1265 The default union tag is "YYSTYPE", for compatibility with Solaris 9 Yacc.
1266 For consistency, YYLTYPE's struct tag is now "YYLTYPE" not "yyltype".
1267 This is for compatibility with both Yacc and Bison 1.35.
1268
1269 - ";" is output before the terminating "}" of an action, for
1270 compatibility with Bison 1.35.
1271
1272 - Bison now uses a Yacc-style format for conflict reports, e.g.,
1273 "conflicts: 2 shift/reduce, 1 reduce/reduce".
1274
1275 - "yystype" and "yyltype" are now obsolescent macros instead of being
1276 typedefs or tags; they are no longer documented and are planned to be
1277 withdrawn in a future release.
1278
1279** GLR parser notes
1280
1281 - GLR and inline
1282 Users of Bison have to decide how they handle the portability of the
1283 C keyword "inline".
1284
1285 - "parsing stack overflow..." -> "parser stack overflow"
1286 GLR parsers now report "parser stack overflow" as per the Bison manual.
1287
1288** %parse-param and %lex-param
1289 The macros YYPARSE_PARAM and YYLEX_PARAM provide a means to pass
1290 additional context to yyparse and yylex. They suffer from several
1291 shortcomings:
1292
1293 - a single argument only can be added,
1294 - their types are weak (void *),
1295 - this context is not passed to ancillary functions such as yyerror,
1296 - only yacc.c parsers support them.
1297
1298 The new %parse-param/%lex-param directives provide a more precise control.
1299 For instance:
1300
1301 %parse-param {int *nastiness}
1302 %lex-param {int *nastiness}
1303 %parse-param {int *randomness}
1304
1305 results in the following signatures:
1306
1307 int yylex (int *nastiness);
1308 int yyparse (int *nastiness, int *randomness);
1309
1310 or, if both %pure-parser and %locations are used:
1311
1312 int yylex (YYSTYPE *lvalp, YYLTYPE *llocp, int *nastiness);
1313 int yyparse (int *nastiness, int *randomness);
1314
1315** Bison now warns if it detects conflicting outputs to the same file,
1316 e.g., it generates a warning for "bison -d -o foo.h foo.y" since
1317 that command outputs both code and header to foo.h.
1318
1319** #line in output files
1320 - --no-line works properly.
1321
1322** Bison can no longer be built by a K&R C compiler; it requires C89 or
1323 later to be built. This change originally took place a few versions
1324 ago, but nobody noticed until we recently asked someone to try
1325 building Bison with a K&R C compiler.
1326\f
1327* Changes in version 1.75, 2002-10-14:
1328
1329** Bison should now work on 64-bit hosts.
1330
1331** Indonesian translation thanks to Tedi Heriyanto.
1332
1333** GLR parsers
1334 Fix spurious parse errors.
1335
1336** Pure parsers
1337 Some people redefine yyerror to steal yyparse' private variables.
1338 Reenable this trick until an official feature replaces it.
1339
1340** Type Clashes
1341 In agreement with POSIX and with other Yaccs, leaving a default
1342 action is valid when $$ is untyped, and $1 typed:
1343
1344 untyped: ... typed;
1345
1346 but the converse remains an error:
1347
1348 typed: ... untyped;
1349
1350** Values of mid-rule actions
1351 The following code:
1352
1353 foo: { ... } { $$ = $1; } ...
1354
1355 was incorrectly rejected: $1 is defined in the second mid-rule
1356 action, and is equal to the $$ of the first mid-rule action.
1357\f
1358* Changes in version 1.50, 2002-10-04:
1359
1360** GLR parsing
1361 The declaration
1362 %glr-parser
1363 causes Bison to produce a Generalized LR (GLR) parser, capable of handling
1364 almost any context-free grammar, ambiguous or not. The new declarations
1365 %dprec and %merge on grammar rules allow parse-time resolution of
1366 ambiguities. Contributed by Paul Hilfinger.
1367
1368 Unfortunately Bison 1.50 does not work properly on 64-bit hosts
1369 like the Alpha, so please stick to 32-bit hosts for now.
1370
1371** Output Directory
1372 When not in Yacc compatibility mode, when the output file was not
1373 specified, running "bison foo/bar.y" created "foo/bar.c". It
1374 now creates "bar.c".
1375
1376** Undefined token
1377 The undefined token was systematically mapped to 2 which prevented
1378 the use of 2 by the user. This is no longer the case.
1379
1380** Unknown token numbers
1381 If yylex returned an out of range value, yyparse could die. This is
1382 no longer the case.
1383
1384** Error token
1385 According to POSIX, the error token must be 256.
1386 Bison extends this requirement by making it a preference: *if* the
1387 user specified that one of her tokens is numbered 256, then error
1388 will be mapped onto another number.
1389
1390** Verbose error messages
1391 They no longer report "..., expecting error or..." for states where
1392 error recovery is possible.
1393
1394** End token
1395 Defaults to "$end" instead of "$".
1396
1397** Error recovery now conforms to documentation and to POSIX
1398 When a Bison-generated parser encounters a syntax error, it now pops
1399 the stack until it finds a state that allows shifting the error
1400 token. Formerly, it popped the stack until it found a state that
1401 allowed some non-error action other than a default reduction on the
1402 error token. The new behavior has long been the documented behavior,
1403 and has long been required by POSIX. For more details, please see
1404 Paul Eggert, "Reductions during Bison error handling" (2002-05-20)
1405 <http://lists.gnu.org/archive/html/bug-bison/2002-05/msg00038.html>.
1406
1407** Traces
1408 Popped tokens and nonterminals are now reported.
1409
1410** Larger grammars
1411 Larger grammars are now supported (larger token numbers, larger grammar
1412 size (= sum of the LHS and RHS lengths), larger LALR tables).
1413 Formerly, many of these numbers ran afoul of 16-bit limits;
1414 now these limits are 32 bits on most hosts.
1415
1416** Explicit initial rule
1417 Bison used to play hacks with the initial rule, which the user does
1418 not write. It is now explicit, and visible in the reports and
1419 graphs as rule 0.
1420
1421** Useless rules
1422 Before, Bison reported the useless rules, but, although not used,
1423 included them in the parsers. They are now actually removed.
1424
1425** Useless rules, useless nonterminals
1426 They are now reported, as a warning, with their locations.
1427
1428** Rules never reduced
1429 Rules that can never be reduced because of conflicts are now
1430 reported.
1431
1432** Incorrect "Token not used"
1433 On a grammar such as
1434
1435 %token useless useful
1436 %%
1437 exp: '0' %prec useful;
1438
1439 where a token was used to set the precedence of the last rule,
1440 bison reported both "useful" and "useless" as useless tokens.
1441
1442** Revert the C++ namespace changes introduced in 1.31
1443 as they caused too many portability hassles.
1444
1445** Default locations
1446 By an accident of design, the default computation of @$ was
1447 performed after another default computation was performed: @$ = @1.
1448 The latter is now removed: YYLLOC_DEFAULT is fully responsible of
1449 the computation of @$.
1450
1451** Token end-of-file
1452 The token end of file may be specified by the user, in which case,
1453 the user symbol is used in the reports, the graphs, and the verbose
1454 error messages instead of "$end", which remains being the default.
1455 For instance
1456 %token MYEOF 0
1457 or
1458 %token MYEOF 0 "end of file"
1459
1460** Semantic parser
1461 This old option, which has been broken for ages, is removed.
1462
1463** New translations
1464 Brazilian Portuguese, thanks to Alexandre Folle de Menezes.
1465 Croatian, thanks to Denis Lackovic.
1466
1467** Incorrect token definitions
1468 When given
1469 %token 'a' "A"
1470 bison used to output
1471 #define 'a' 65
1472
1473** Token definitions as enums
1474 Tokens are output both as the traditional #define's, and, provided
1475 the compiler supports ANSI C or is a C++ compiler, as enums.
1476 This lets debuggers display names instead of integers.
1477
1478** Reports
1479 In addition to --verbose, bison supports --report=THINGS, which
1480 produces additional information:
1481 - itemset
1482 complete the core item sets with their closure
1483 - lookahead [changed to "look-ahead" in 1.875e through 2.3, but changed back]
1484 explicitly associate lookahead tokens to items
1485 - solved
1486 describe shift/reduce conflicts solving.
1487 Bison used to systematically output this information on top of
1488 the report. Solved conflicts are now attached to their states.
1489
1490** Type clashes
1491 Previous versions don't complain when there is a type clash on
1492 the default action if the rule has a mid-rule action, such as in:
1493
1494 %type <foo> bar
1495 %%
1496 bar: '0' {} '0';
1497
1498 This is fixed.
1499
1500** GNU M4 is now required when using Bison.
1501\f
1502* Changes in version 1.35, 2002-03-25:
1503
1504** C Skeleton
1505 Some projects use Bison's C parser with C++ compilers, and define
1506 YYSTYPE as a class. The recent adjustment of C parsers for data
1507 alignment and 64 bit architectures made this impossible.
1508
1509 Because for the time being no real solution for C++ parser
1510 generation exists, kludges were implemented in the parser to
1511 maintain this use. In the future, when Bison has C++ parsers, this
1512 kludge will be disabled.
1513
1514 This kludge also addresses some C++ problems when the stack was
1515 extended.
1516\f
1517* Changes in version 1.34, 2002-03-12:
1518
1519** File name clashes are detected
1520 $ bison foo.y -d -o foo.x
1521 fatal error: header and parser would both be named "foo.x"
1522
1523** A missing ";" at the end of a rule triggers a warning
1524 In accordance with POSIX, and in agreement with other
1525 Yacc implementations, Bison will mandate this semicolon in the near
1526 future. This eases the implementation of a Bison parser of Bison
1527 grammars by making this grammar LALR(1) instead of LR(2). To
1528 facilitate the transition, this release introduces a warning.
1529
1530** Revert the C++ namespace changes introduced in 1.31, as they caused too
1531 many portability hassles.
1532
1533** DJGPP support added.
1534
1535** Fix test suite portability problems.
1536\f
1537* Changes in version 1.33, 2002-02-07:
1538
1539** Fix C++ issues
1540 Groff could not be compiled for the definition of size_t was lacking
1541 under some conditions.
1542
1543** Catch invalid @n
1544 As is done with $n.
1545\f
1546* Changes in version 1.32, 2002-01-23:
1547
1548** Fix Yacc output file names
1549
1550** Portability fixes
1551
1552** Italian, Dutch translations
1553\f
1554* Changes in version 1.31, 2002-01-14:
1555
1556** Many Bug Fixes
1557
1558** GNU Gettext and %expect
1559 GNU Gettext asserts 10 s/r conflicts, but there are 7. Now that
1560 Bison dies on incorrect %expectations, we fear there will be
1561 too many bug reports for Gettext, so _for the time being_, %expect
1562 does not trigger an error when the input file is named "plural.y".
1563
1564** Use of alloca in parsers
1565 If YYSTACK_USE_ALLOCA is defined to 0, then the parsers will use
1566 malloc exclusively. Since 1.29, but was not NEWS'ed.
1567
1568 alloca is used only when compiled with GCC, to avoid portability
1569 problems as on AIX.
1570
1571** yyparse now returns 2 if memory is exhausted; formerly it dumped core.
1572
1573** When the generated parser lacks debugging code, YYDEBUG is now 0
1574 (as POSIX requires) instead of being undefined.
1575
1576** User Actions
1577 Bison has always permitted actions such as { $$ = $1 }: it adds the
1578 ending semicolon. Now if in Yacc compatibility mode, the semicolon
1579 is no longer output: one has to write { $$ = $1; }.
1580
1581** Better C++ compliance
1582 The output parsers try to respect C++ namespaces.
1583 [This turned out to be a failed experiment, and it was reverted later.]
1584
1585** Reduced Grammars
1586 Fixed bugs when reporting useless nonterminals.
1587
1588** 64 bit hosts
1589 The parsers work properly on 64 bit hosts.
1590
1591** Error messages
1592 Some calls to strerror resulted in scrambled or missing error messages.
1593
1594** %expect
1595 When the number of shift/reduce conflicts is correct, don't issue
1596 any warning.
1597
1598** The verbose report includes the rule line numbers.
1599
1600** Rule line numbers are fixed in traces.
1601
1602** Swedish translation
1603
1604** Parse errors
1605 Verbose parse error messages from the parsers are better looking.
1606 Before: parse error: unexpected `'/'', expecting `"number"' or `'-'' or `'(''
1607 Now: parse error: unexpected '/', expecting "number" or '-' or '('
1608
1609** Fixed parser memory leaks.
1610 When the generated parser was using malloc to extend its stacks, the
1611 previous allocations were not freed.
1612
1613** Fixed verbose output file.
1614 Some newlines were missing.
1615 Some conflicts in state descriptions were missing.
1616
1617** Fixed conflict report.
1618 Option -v was needed to get the result.
1619
1620** %expect
1621 Was not used.
1622 Mismatches are errors, not warnings.
1623
1624** Fixed incorrect processing of some invalid input.
1625
1626** Fixed CPP guards: 9foo.h uses BISON_9FOO_H instead of 9FOO_H.
1627
1628** Fixed some typos in the documentation.
1629
1630** %token MY_EOF 0 is supported.
1631 Before, MY_EOF was silently renumbered as 257.
1632
1633** doc/refcard.tex is updated.
1634
1635** %output, %file-prefix, %name-prefix.
1636 New.
1637
1638** --output
1639 New, aliasing "--output-file".
1640\f
1641* Changes in version 1.30, 2001-10-26:
1642
1643** "--defines" and "--graph" have now an optional argument which is the
1644 output file name. "-d" and "-g" do not change; they do not take any
1645 argument.
1646
1647** "%source_extension" and "%header_extension" are removed, failed
1648 experiment.
1649
1650** Portability fixes.
1651\f
1652* Changes in version 1.29, 2001-09-07:
1653
1654** The output file does not define const, as this caused problems when used
1655 with common autoconfiguration schemes. If you still use ancient compilers
1656 that lack const, compile with the equivalent of the C compiler option
1657 "-Dconst=". Autoconf's AC_C_CONST macro provides one way to do this.
1658
1659** Added "-g" and "--graph".
1660
1661** The Bison manual is now distributed under the terms of the GNU FDL.
1662
1663** The input and the output files has automatically a similar extension.
1664
1665** Russian translation added.
1666
1667** NLS support updated; should hopefully be less troublesome.
1668
1669** Added the old Bison reference card.
1670
1671** Added "--locations" and "%locations".
1672
1673** Added "-S" and "--skeleton".
1674
1675** "%raw", "-r", "--raw" is disabled.
1676
1677** Special characters are escaped when output. This solves the problems
1678 of the #line lines with path names including backslashes.
1679
1680** New directives.
1681 "%yacc", "%fixed_output_files", "%defines", "%no_parser", "%verbose",
1682 "%debug", "%source_extension" and "%header_extension".
1683
1684** @$
1685 Automatic location tracking.
1686\f
1687* Changes in version 1.28, 1999-07-06:
1688
1689** Should compile better now with K&R compilers.
1690
1691** Added NLS.
1692
1693** Fixed a problem with escaping the double quote character.
1694
1695** There is now a FAQ.
1696\f
1697* Changes in version 1.27:
1698
1699** The make rule which prevented bison.simple from being created on
1700 some systems has been fixed.
1701\f
1702* Changes in version 1.26:
1703
1704** Bison now uses Automake.
1705
1706** New mailing lists: <bug-bison@gnu.org> and <help-bison@gnu.org>.
1707
1708** Token numbers now start at 257 as previously documented, not 258.
1709
1710** Bison honors the TMPDIR environment variable.
1711
1712** A couple of buffer overruns have been fixed.
1713
1714** Problems when closing files should now be reported.
1715
1716** Generated parsers should now work even on operating systems which do
1717 not provide alloca().
1718\f
1719* Changes in version 1.25, 1995-10-16:
1720
1721** Errors in the input grammar are not fatal; Bison keeps reading
1722the grammar file, and reports all the errors found in it.
1723
1724** Tokens can now be specified as multiple-character strings: for
1725example, you could use "<=" for a token which looks like <=, instead
1726of choosing a name like LESSEQ.
1727
1728** The %token_table declaration says to write a table of tokens (names
1729and numbers) into the parser file. The yylex function can use this
1730table to recognize multiple-character string tokens, or for other
1731purposes.
1732
1733** The %no_lines declaration says not to generate any #line preprocessor
1734directives in the parser file.
1735
1736** The %raw declaration says to use internal Bison token numbers, not
1737Yacc-compatible token numbers, when token names are defined as macros.
1738
1739** The --no-parser option produces the parser tables without including
1740the parser engine; a project can now use its own parser engine.
1741The actions go into a separate file called NAME.act, in the form of
1742a switch statement body.
1743\f
1744* Changes in version 1.23:
1745
1746The user can define YYPARSE_PARAM as the name of an argument to be
1747passed into yyparse. The argument should have type void *. It should
1748actually point to an object. Grammar actions can access the variable
1749by casting it to the proper pointer type.
1750
1751Line numbers in output file corrected.
1752\f
1753* Changes in version 1.22:
1754
1755--help option added.
1756\f
1757* Changes in version 1.20:
1758
1759Output file does not redefine const for C++.
1760
1761-----
1762
1763Copyright (C) 1995-2012 Free Software Foundation, Inc.
1764
1765This file is part of Bison, the GNU Parser Generator.
1766
1767This program is free software: you can redistribute it and/or modify
1768it under the terms of the GNU General Public License as published by
1769the Free Software Foundation, either version 3 of the License, or
1770(at your option) any later version.
1771
1772This program is distributed in the hope that it will be useful,
1773but WITHOUT ANY WARRANTY; without even the implied warranty of
1774MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
1775GNU General Public License for more details.
1776
1777You should have received a copy of the GNU General Public License
1778along with this program. If not, see <http://www.gnu.org/licenses/>.
1779
1780 LocalWords: yacc YYBACKUP glr GCC lalr ArrayIndexOutOfBoundsException nullptr
1781 LocalWords: cplusplus liby rpl fprintf mfcalc Wyacc stmt cond expr mk sym lr
1782 LocalWords: IELR ielr Lookahead YYERROR nonassoc LALR's api lookaheads yychar
1783 LocalWords: destructor lookahead YYRHSLOC YYLLOC Rhs ifndef YYFAIL cpp sr rr
1784 LocalWords: preprocessor initializer Wno Wnone Werror FreeBSD prec livelocks
1785 LocalWords: Solaris AIX UX RHEL Tru LHS gcc's Wundef YYENABLE NLS YYLTYPE VCG
1786 LocalWords: yyerror cpp's Wunused yylval yylloc prepend yyparse yylex yypush
1787 LocalWords: Graphviz xml nonterminals midrule destructor's YYSTYPE typedef ly
1788 LocalWords: CHR chr printf stdout namespace preprocessing enum pre include's
1789 LocalWords: YYRECOVERING nonfree destructors YYABORT YYACCEPT params enums de
1790 LocalWords: struct yystype DJGPP lex param Haible NUM alloca YYSTACK NUL goto
1791 LocalWords: YYMAXDEPTH Unescaped UCNs YYLTYPE's yyltype typedefs inline Yaccs
1792 LocalWords: Heriyanto Reenable dprec Hilfinger Eggert MYEOF Folle Menezes EOF
1793 LocalWords: Lackovic define's itemset Groff Gettext malloc NEWS'ed YYDEBUG YY
1794 LocalWords: namespaces strerror const autoconfiguration Dconst Autoconf's FDL
1795 LocalWords: Automake TMPDIR LESSEQ ylwrap endif yydebug YYTOKEN YYLSP ival hh
1796 LocalWords: extern YYTOKENTYPE TOKENTYPE yytokentype tokentype STYPE lval pdf
1797 LocalWords: lang yyoutput dvi html ps POSIX lvalp llocp
1798
1799Local Variables:
1800mode: outline
1801fill-column: 76
1802End: