]> git.saurik.com Git - bison.git/blob - NEWS
82044f25139c6b86d7c03b20f7dbeaf39c378103
[bison.git] / NEWS
1 Bison News
2 ----------
3
4 * Changes in version 2.5 (????-??-??):
5
6 ** IELR(1) and Canonical LR(1) Support
7
8 IELR(1) is a minimal LR(1) parser table generation algorithm. That
9 is, given any context-free grammar, IELR(1) generates parser tables
10 with the full language recognition power of canonical LR(1) but with
11 nearly the same number of parser states as LALR(1). This reduction in
12 parser states is often an order of magnitude. More importantly,
13 because canonical LR(1)'s extra parser states may contain duplicate
14 conflicts in the case of non-LR(1) grammars, the number of conflicts
15 for IELR(1) is often an order of magnitude less as well. This can
16 significantly reduce the complexity of developing of a grammar.
17
18 Bison can now generate IELR(1) and canonical LR(1) parser tables in
19 place of its traditional LALR(1) parser tables, which remain the
20 default. You can specify the type of parser tables in the grammar
21 file with these directives:
22
23 %define lr.type lalr
24 %define lr.type ielr
25 %define lr.type canonical-lr
26
27 The default reduction optimization in the parser tables can also be
28 adjusted using `%define lr.default-reductions'. See the documentation
29 for `%define lr.type' and `%define lr.default-reductions' in the
30 section `Bison Declaration Summary' in the Bison manual for the
31 details.
32
33 These features are experimental. More user feedback will help to
34 stabilize them.
35
36 ** %define improvements.
37
38 *** Multiple invocations for any variable is now an error not a warning.
39
40 *** Can now be invoked via the command line.
41
42 Each of these command-line options
43
44 -D NAME[=VALUE]
45 --define=NAME[=VALUE]
46
47 -F NAME[=VALUE]
48 --force-define=NAME[=VALUE]
49
50 is equivalent to this grammar file declaration
51
52 %define NAME ["VALUE"]
53
54 except that the manner in which Bison processes multiple definitions
55 for the same NAME differs. Most importantly, -F and --force-define
56 quietly override %define, but -D and --define do not. For further
57 details, see the section "Bison Options" in the Bison manual.
58
59 *** Variables renamed.
60
61 The following %define variables
62
63 api.push_pull
64 lr.keep_unreachable_states
65
66 have been renamed to
67
68 api.push-pull
69 lr.keep-unreachable-states
70
71 The old names are now deprecated but will be maintained indefinitely
72 for backward compatibility.
73
74 *** Values no longer need to be quoted in grammar file.
75
76 If a %define value is an identifier, it no longer needs to be placed
77 within quotations marks. For example,
78
79 %define api.push-pull "push"
80
81 can be rewritten as
82
83 %define api.push-pull push
84
85 ** Symbol names.
86
87 Consistently with directives (such as %error-verbose) and variables
88 (e.g. push-pull), symbol names may include dashes in any position,
89 similarly to periods and underscores. This is GNU extension over
90 POSIX Yacc whose use is reported by -Wyacc, and rejected in Yacc
91 mode (--yacc).
92
93 ** Temporary hack for adding a semicolon to the user action.
94
95 Previously, Bison appended a semicolon to every user action for
96 reductions when the output language defaulted to C (specifically, when
97 neither %yacc, %language, %skeleton, or equivalent command-line
98 options were specified). This allowed actions such as
99
100 exp: exp "+" exp { $$ = $1 + $3 };
101
102 instead of
103
104 exp: exp "+" exp { $$ = $1 + $3; };
105
106 As a first step in removing this misfeature, Bison now issues a
107 warning when it appends a semicolon. Moreover, in cases where Bison
108 cannot easily determine whether a semicolon is needed (for example, an
109 action ending with a cpp directive or a braced compound initializer),
110 it no longer appends one. Thus, the C compiler might now complain
111 about a missing semicolon where it did not before. Future releases of
112 Bison will cease to append semicolons entirely.
113
114 ** Character literals not of length one.
115
116 Previously, Bison quietly converted all character literals to length
117 one. For example, without warning, Bison interpreted the operators in
118 the following grammar to be the same token:
119
120 exp: exp '++'
121 | exp '+' exp
122 ;
123
124 Bison now warns when a character literal is not of length one. In
125 some future release, Bison will report an error instead.
126
127 ** Verbose error messages fixed for nonassociative tokens.
128
129 When %error-verbose is specified, syntax error messages produced by
130 the generated parser include the unexpected token as well as a list of
131 expected tokens. Previously, this list erroneously included tokens
132 that would actually induce a syntax error because conflicts for them
133 were resolved with %nonassoc. Such tokens are now properly omitted
134 from the list.
135
136 * Changes in version 2.4.2 (????-??-??):
137
138 ** Detection of GNU M4 1.4.6 or newer during configure is improved.
139
140 ** %code is now a permanent feature.
141
142 A traditional Yacc prologue directive is written in the form:
143
144 %{CODE%}
145
146 To provide a more flexible alternative, Bison 2.3b introduced the
147 %code directive with the following forms for C/C++:
148
149 %code {CODE}
150 %code requires {CODE}
151 %code provides {CODE}
152 %code top {CODE}
153
154 These forms are now considered permanent features of Bison. See the
155 %code entries in the section "Bison Declaration Summary" in the Bison
156 manual for a summary of their functionality. See the section
157 "Prologue Alternatives" for a detailed discussion including the
158 advantages of %code over the traditional Yacc prologue directive.
159
160 Bison's Java feature as a whole including its current usage of %code
161 is still considered experimental.
162
163 ** Internationalization.
164
165 Fix a regression introduced in Bison 2.4: Under some circumstances,
166 message translations were not installed although supported by the
167 host system.
168
169 * Changes in version 2.4.1 (2008-12-11):
170
171 ** In the GLR defines file, unexpanded M4 macros in the yylval and yylloc
172 declarations have been fixed.
173
174 ** Temporary hack for adding a semicolon to the user action.
175
176 Bison used to prepend a trailing semicolon at the end of the user
177 action for reductions. This allowed actions such as
178
179 exp: exp "+" exp { $$ = $1 + $3 };
180
181 instead of
182
183 exp: exp "+" exp { $$ = $1 + $3; };
184
185 Some grammars still depend on this `feature'. Bison 2.4.1 restores
186 the previous behavior in the case of C output (specifically, when
187 neither %language or %skeleton or equivalent command-line options
188 are used) to leave more time for grammars depending on the old
189 behavior to be adjusted. Future releases of Bison will disable this
190 feature.
191
192 ** A few minor improvements to the Bison manual.
193
194 * Changes in version 2.4 (2008-11-02):
195
196 ** %language is an experimental feature.
197
198 We first introduced this feature in test release 2.3b as a cleaner
199 alternative to %skeleton. Since then, we have discussed the possibility of
200 modifying its effect on Bison's output file names. Thus, in this release,
201 we consider %language to be an experimental feature that will likely evolve
202 in future releases.
203
204 ** Forward compatibility with GNU M4 has been improved.
205
206 ** Several bugs in the C++ skeleton and the experimental Java skeleton have been
207 fixed.
208
209 * Changes in version 2.3b (2008-05-27):
210
211 ** The quotes around NAME that used to be required in the following directive
212 are now deprecated:
213
214 %define NAME "VALUE"
215
216 ** The directive `%pure-parser' is now deprecated in favor of:
217
218 %define api.pure
219
220 which has the same effect except that Bison is more careful to warn about
221 unreasonable usage in the latter case.
222
223 ** Push Parsing
224
225 Bison can now generate an LALR(1) parser in C with a push interface. That
226 is, instead of invoking `yyparse', which pulls tokens from `yylex', you can
227 push one token at a time to the parser using `yypush_parse', which will
228 return to the caller after processing each token. By default, the push
229 interface is disabled. Either of the following directives will enable it:
230
231 %define api.push_pull "push" // Just push; does not require yylex.
232 %define api.push_pull "both" // Push and pull; requires yylex.
233
234 See the new section `A Push Parser' in the Bison manual for details.
235
236 The current push parsing interface is experimental and may evolve. More user
237 feedback will help to stabilize it.
238
239 ** The -g and --graph options now output graphs in Graphviz DOT format,
240 not VCG format. Like --graph, -g now also takes an optional FILE argument
241 and thus cannot be bundled with other short options.
242
243 ** Java
244
245 Bison can now generate an LALR(1) parser in Java. The skeleton is
246 `data/lalr1.java'. Consider using the new %language directive instead of
247 %skeleton to select it.
248
249 See the new section `Java Parsers' in the Bison manual for details.
250
251 The current Java interface is experimental and may evolve. More user
252 feedback will help to stabilize it.
253
254 ** %language
255
256 This new directive specifies the programming language of the generated
257 parser, which can be C (the default), C++, or Java. Besides the skeleton
258 that Bison uses, the directive affects the names of the generated files if
259 the grammar file's name ends in ".y".
260
261 ** XML Automaton Report
262
263 Bison can now generate an XML report of the LALR(1) automaton using the new
264 `--xml' option. The current XML schema is experimental and may evolve. More
265 user feedback will help to stabilize it.
266
267 ** The grammar file may now specify the name of the parser header file using
268 %defines. For example:
269
270 %defines "parser.h"
271
272 ** When reporting useless rules, useless nonterminals, and unused terminals,
273 Bison now employs the terms "useless in grammar" instead of "useless",
274 "useless in parser" instead of "never reduced", and "unused in grammar"
275 instead of "unused".
276
277 ** Unreachable State Removal
278
279 Previously, Bison sometimes generated parser tables containing unreachable
280 states. A state can become unreachable during conflict resolution if Bison
281 disables a shift action leading to it from a predecessor state. Bison now:
282
283 1. Removes unreachable states.
284
285 2. Does not report any conflicts that appeared in unreachable states.
286 WARNING: As a result, you may need to update %expect and %expect-rr
287 directives in existing grammar files.
288
289 3. For any rule used only in such states, Bison now reports the rule as
290 "useless in parser due to conflicts".
291
292 This feature can be disabled with the following directive:
293
294 %define lr.keep_unreachable_states
295
296 See the %define entry in the `Bison Declaration Summary' in the Bison manual
297 for further discussion.
298
299 ** Lookahead Set Correction in the `.output' Report
300
301 When instructed to generate a `.output' file including lookahead sets
302 (using `--report=lookahead', for example), Bison now prints each reduction's
303 lookahead set only next to the associated state's one item that (1) is
304 associated with the same rule as the reduction and (2) has its dot at the end
305 of its RHS. Previously, Bison also erroneously printed the lookahead set
306 next to all of the state's other items associated with the same rule. This
307 bug affected only the `.output' file and not the generated parser source
308 code.
309
310 ** --report-file=FILE is a new option to override the default `.output' file
311 name.
312
313 ** The `=' that used to be required in the following directives is now
314 deprecated:
315
316 %file-prefix "parser"
317 %name-prefix "c_"
318 %output "parser.c"
319
320 ** An Alternative to `%{...%}' -- `%code QUALIFIER {CODE}'
321
322 Bison 2.3a provided a new set of directives as a more flexible alternative to
323 the traditional Yacc prologue blocks. Those have now been consolidated into
324 a single %code directive with an optional qualifier field, which identifies
325 the purpose of the code and thus the location(s) where Bison should generate
326 it:
327
328 1. `%code {CODE}' replaces `%after-header {CODE}'
329 2. `%code requires {CODE}' replaces `%start-header {CODE}'
330 3. `%code provides {CODE}' replaces `%end-header {CODE}'
331 4. `%code top {CODE}' replaces `%before-header {CODE}'
332
333 See the %code entries in section `Bison Declaration Summary' in the Bison
334 manual for a summary of the new functionality. See the new section `Prologue
335 Alternatives' for a detailed discussion including the advantages of %code
336 over the traditional Yacc prologues.
337
338 The prologue alternatives are experimental. More user feedback will help to
339 determine whether they should become permanent features.
340
341 ** Revised warning: unset or unused mid-rule values
342
343 Since Bison 2.2, Bison has warned about mid-rule values that are set but not
344 used within any of the actions of the parent rule. For example, Bison warns
345 about unused $2 in:
346
347 exp: '1' { $$ = 1; } '+' exp { $$ = $1 + $4; };
348
349 Now, Bison also warns about mid-rule values that are used but not set. For
350 example, Bison warns about unset $$ in the mid-rule action in:
351
352 exp: '1' { $1 = 1; } '+' exp { $$ = $2 + $4; };
353
354 However, Bison now disables both of these warnings by default since they
355 sometimes prove to be false alarms in existing grammars employing the Yacc
356 constructs $0 or $-N (where N is some positive integer).
357
358 To enable these warnings, specify the option `--warnings=midrule-values' or
359 `-W', which is a synonym for `--warnings=all'.
360
361 ** Default %destructor or %printer with `<*>' or `<>'
362
363 Bison now recognizes two separate kinds of default %destructor's and
364 %printer's:
365
366 1. Place `<*>' in a %destructor/%printer symbol list to define a default
367 %destructor/%printer for all grammar symbols for which you have formally
368 declared semantic type tags.
369
370 2. Place `<>' in a %destructor/%printer symbol list to define a default
371 %destructor/%printer for all grammar symbols without declared semantic
372 type tags.
373
374 Bison no longer supports the `%symbol-default' notation from Bison 2.3a.
375 `<*>' and `<>' combined achieve the same effect with one exception: Bison no
376 longer applies any %destructor to a mid-rule value if that mid-rule value is
377 not actually ever referenced using either $$ or $n in a semantic action.
378
379 The default %destructor's and %printer's are experimental. More user
380 feedback will help to determine whether they should become permanent
381 features.
382
383 See the section `Freeing Discarded Symbols' in the Bison manual for further
384 details.
385
386 ** %left, %right, and %nonassoc can now declare token numbers. This is required
387 by POSIX. However, see the end of section `Operator Precedence' in the Bison
388 manual for a caveat concerning the treatment of literal strings.
389
390 ** The nonfunctional --no-parser, -n, and %no-parser options have been
391 completely removed from Bison.
392
393 * Changes in version 2.3a, 2006-09-13:
394
395 ** Instead of %union, you can define and use your own union type
396 YYSTYPE if your grammar contains at least one <type> tag.
397 Your YYSTYPE need not be a macro; it can be a typedef.
398 This change is for compatibility with other Yacc implementations,
399 and is required by POSIX.
400
401 ** Locations columns and lines start at 1.
402 In accordance with the GNU Coding Standards and Emacs.
403
404 ** You may now declare per-type and default %destructor's and %printer's:
405
406 For example:
407
408 %union { char *string; }
409 %token <string> STRING1
410 %token <string> STRING2
411 %type <string> string1
412 %type <string> string2
413 %union { char character; }
414 %token <character> CHR
415 %type <character> chr
416 %destructor { free ($$); } %symbol-default
417 %destructor { free ($$); printf ("%d", @$.first_line); } STRING1 string1
418 %destructor { } <character>
419
420 guarantees that, when the parser discards any user-defined symbol that has a
421 semantic type tag other than `<character>', it passes its semantic value to
422 `free'. However, when the parser discards a `STRING1' or a `string1', it
423 also prints its line number to `stdout'. It performs only the second
424 `%destructor' in this case, so it invokes `free' only once.
425
426 [Although we failed to mention this here in the 2.3a release, the default
427 %destructor's and %printer's were experimental, and they were rewritten in
428 future versions.]
429
430 ** Except for LALR(1) parsers in C with POSIX Yacc emulation enabled (with `-y',
431 `--yacc', or `%yacc'), Bison no longer generates #define statements for
432 associating token numbers with token names. Removing the #define statements
433 helps to sanitize the global namespace during preprocessing, but POSIX Yacc
434 requires them. Bison still generates an enum for token names in all cases.
435
436 ** Handling of traditional Yacc prologue blocks is now more consistent but
437 potentially incompatible with previous releases of Bison.
438
439 As before, you declare prologue blocks in your grammar file with the
440 `%{ ... %}' syntax. To generate the pre-prologue, Bison concatenates all
441 prologue blocks that you've declared before the first %union. To generate
442 the post-prologue, Bison concatenates all prologue blocks that you've
443 declared after the first %union.
444
445 Previous releases of Bison inserted the pre-prologue into both the header
446 file and the code file in all cases except for LALR(1) parsers in C. In the
447 latter case, Bison inserted it only into the code file. For parsers in C++,
448 the point of insertion was before any token definitions (which associate
449 token numbers with names). For parsers in C, the point of insertion was
450 after the token definitions.
451
452 Now, Bison never inserts the pre-prologue into the header file. In the code
453 file, it always inserts it before the token definitions.
454
455 ** Bison now provides a more flexible alternative to the traditional Yacc
456 prologue blocks: %before-header, %start-header, %end-header, and
457 %after-header.
458
459 For example, the following declaration order in the grammar file reflects the
460 order in which Bison will output these code blocks. However, you are free to
461 declare these code blocks in your grammar file in whatever order is most
462 convenient for you:
463
464 %before-header {
465 /* Bison treats this block like a pre-prologue block: it inserts it into
466 * the code file before the contents of the header file. It does *not*
467 * insert it into the header file. This is a good place to put
468 * #include's that you want at the top of your code file. A common
469 * example is `#include "system.h"'. */
470 }
471 %start-header {
472 /* Bison inserts this block into both the header file and the code file.
473 * In both files, the point of insertion is before any Bison-generated
474 * token, semantic type, location type, and class definitions. This is a
475 * good place to define %union dependencies, for example. */
476 }
477 %union {
478 /* Unlike the traditional Yacc prologue blocks, the output order for the
479 * new %*-header blocks is not affected by their declaration position
480 * relative to any %union in the grammar file. */
481 }
482 %end-header {
483 /* Bison inserts this block into both the header file and the code file.
484 * In both files, the point of insertion is after the Bison-generated
485 * definitions. This is a good place to declare or define public
486 * functions or data structures that depend on the Bison-generated
487 * definitions. */
488 }
489 %after-header {
490 /* Bison treats this block like a post-prologue block: it inserts it into
491 * the code file after the contents of the header file. It does *not*
492 * insert it into the header file. This is a good place to declare or
493 * define internal functions or data structures that depend on the
494 * Bison-generated definitions. */
495 }
496
497 If you have multiple occurrences of any one of the above declarations, Bison
498 will concatenate the contents in declaration order.
499
500 [Although we failed to mention this here in the 2.3a release, the prologue
501 alternatives were experimental, and they were rewritten in future versions.]
502
503 ** The option `--report=look-ahead' has been changed to `--report=lookahead'.
504 The old spelling still works, but is not documented and may be removed
505 in a future release.
506
507 * Changes in version 2.3, 2006-06-05:
508
509 ** GLR grammars should now use `YYRECOVERING ()' instead of `YYRECOVERING',
510 for compatibility with LALR(1) grammars.
511
512 ** It is now documented that any definition of YYSTYPE or YYLTYPE should
513 be to a type name that does not contain parentheses or brackets.
514
515 * Changes in version 2.2, 2006-05-19:
516
517 ** The distribution terms for all Bison-generated parsers now permit
518 using the parsers in nonfree programs. Previously, this permission
519 was granted only for Bison-generated LALR(1) parsers in C.
520
521 ** %name-prefix changes the namespace name in C++ outputs.
522
523 ** The C++ parsers export their token_type.
524
525 ** Bison now allows multiple %union declarations, and concatenates
526 their contents together.
527
528 ** New warning: unused values
529 Right-hand side symbols whose values are not used are reported,
530 if the symbols have destructors. For instance:
531
532 exp: exp "?" exp ":" exp { $1 ? $1 : $3; }
533 | exp "+" exp
534 ;
535
536 will trigger a warning about $$ and $5 in the first rule, and $3 in
537 the second ($1 is copied to $$ by the default rule). This example
538 most likely contains three errors, and could be rewritten as:
539
540 exp: exp "?" exp ":" exp
541 { $$ = $1 ? $3 : $5; free ($1 ? $5 : $3); free ($1); }
542 | exp "+" exp
543 { $$ = $1 ? $1 : $3; if ($1) free ($3); }
544 ;
545
546 However, if the original actions were really intended, memory leaks
547 and all, the warnings can be suppressed by letting Bison believe the
548 values are used, e.g.:
549
550 exp: exp "?" exp ":" exp { $1 ? $1 : $3; (void) ($$, $5); }
551 | exp "+" exp { $$ = $1; (void) $3; }
552 ;
553
554 If there are mid-rule actions, the warning is issued if no action
555 uses it. The following triggers no warning: $1 and $3 are used.
556
557 exp: exp { push ($1); } '+' exp { push ($3); sum (); };
558
559 The warning is intended to help catching lost values and memory leaks.
560 If a value is ignored, its associated memory typically is not reclaimed.
561
562 ** %destructor vs. YYABORT, YYACCEPT, and YYERROR.
563 Destructors are now called when user code invokes YYABORT, YYACCEPT,
564 and YYERROR, for all objects on the stack, other than objects
565 corresponding to the right-hand side of the current rule.
566
567 ** %expect, %expect-rr
568 Incorrect numbers of expected conflicts are now actual errors,
569 instead of warnings.
570
571 ** GLR, YACC parsers.
572 The %parse-params are available in the destructors (and the
573 experimental printers) as per the documentation.
574
575 ** Bison now warns if it finds a stray `$' or `@' in an action.
576
577 ** %require "VERSION"
578 This specifies that the grammar file depends on features implemented
579 in Bison version VERSION or higher.
580
581 ** lalr1.cc: The token and value types are now class members.
582 The tokens were defined as free form enums and cpp macros. YYSTYPE
583 was defined as a free form union. They are now class members:
584 tokens are enumerations of the `yy::parser::token' struct, and the
585 semantic values have the `yy::parser::semantic_type' type.
586
587 If you do not want or can update to this scheme, the directive
588 `%define "global_tokens_and_yystype" "1"' triggers the global
589 definition of tokens and YYSTYPE. This change is suitable both
590 for previous releases of Bison, and this one.
591
592 If you wish to update, then make sure older version of Bison will
593 fail using `%require "2.2"'.
594
595 ** DJGPP support added.
596 \f
597 * Changes in version 2.1, 2005-09-16:
598
599 ** The C++ lalr1.cc skeleton supports %lex-param.
600
601 ** Bison-generated parsers now support the translation of diagnostics like
602 "syntax error" into languages other than English. The default
603 language is still English. For details, please see the new
604 Internationalization section of the Bison manual. Software
605 distributors should also see the new PACKAGING file. Thanks to
606 Bruno Haible for this new feature.
607
608 ** Wording in the Bison-generated parsers has been changed slightly to
609 simplify translation. In particular, the message "memory exhausted"
610 has replaced "parser stack overflow", as the old message was not
611 always accurate for modern Bison-generated parsers.
612
613 ** Destructors are now called when the parser aborts, for all symbols left
614 behind on the stack. Also, the start symbol is now destroyed after a
615 successful parse. In both cases, the behavior was formerly inconsistent.
616
617 ** When generating verbose diagnostics, Bison-generated parsers no longer
618 quote the literal strings associated with tokens. For example, for
619 a syntax error associated with '%token NUM "number"' they might
620 print 'syntax error, unexpected number' instead of 'syntax error,
621 unexpected "number"'.
622 \f
623 * Changes in version 2.0, 2004-12-25:
624
625 ** Possibly-incompatible changes
626
627 - Bison-generated parsers no longer default to using the alloca function
628 (when available) to extend the parser stack, due to widespread
629 problems in unchecked stack-overflow detection. You can "#define
630 YYSTACK_USE_ALLOCA 1" to require the use of alloca, but please read
631 the manual to determine safe values for YYMAXDEPTH in that case.
632
633 - Error token location.
634 During error recovery, the location of the syntax error is updated
635 to cover the whole sequence covered by the error token: it includes
636 the shifted symbols thrown away during the first part of the error
637 recovery, and the lookahead rejected during the second part.
638
639 - Semicolon changes:
640 . Stray semicolons are no longer allowed at the start of a grammar.
641 . Semicolons are now required after in-grammar declarations.
642
643 - Unescaped newlines are no longer allowed in character constants or
644 string literals. They were never portable, and GCC 3.4.0 has
645 dropped support for them. Better diagnostics are now generated if
646 forget a closing quote.
647
648 - NUL bytes are no longer allowed in Bison string literals, unfortunately.
649
650 ** New features
651
652 - GLR grammars now support locations.
653
654 - New directive: %initial-action.
655 This directive allows the user to run arbitrary code (including
656 initializing @$) from yyparse before parsing starts.
657
658 - A new directive "%expect-rr N" specifies the expected number of
659 reduce/reduce conflicts in GLR parsers.
660
661 - %token numbers can now be hexadecimal integers, e.g., `%token FOO 0x12d'.
662 This is a GNU extension.
663
664 - The option `--report=lookahead' was changed to `--report=look-ahead'.
665 [However, this was changed back after 2.3.]
666
667 - Experimental %destructor support has been added to lalr1.cc.
668
669 - New configure option --disable-yacc, to disable installation of the
670 yacc command and -ly library introduced in 1.875 for POSIX conformance.
671
672 ** Bug fixes
673
674 - For now, %expect-count violations are now just warnings, not errors.
675 This is for compatibility with Bison 1.75 and earlier (when there are
676 reduce/reduce conflicts) and with Bison 1.30 and earlier (when there
677 are too many or too few shift/reduce conflicts). However, in future
678 versions of Bison we plan to improve the %expect machinery so that
679 these violations will become errors again.
680
681 - Within Bison itself, numbers (e.g., goto numbers) are no longer
682 arbitrarily limited to 16-bit counts.
683
684 - Semicolons are now allowed before "|" in grammar rules, as POSIX requires.
685 \f
686 * Changes in version 1.875, 2003-01-01:
687
688 ** The documentation license has been upgraded to version 1.2
689 of the GNU Free Documentation License.
690
691 ** syntax error processing
692
693 - In Yacc-style parsers YYLLOC_DEFAULT is now used to compute error
694 locations too. This fixes bugs in error-location computation.
695
696 - %destructor
697 It is now possible to reclaim the memory associated to symbols
698 discarded during error recovery. This feature is still experimental.
699
700 - %error-verbose
701 This new directive is preferred over YYERROR_VERBOSE.
702
703 - #defining yyerror to steal internal variables is discouraged.
704 It is not guaranteed to work forever.
705
706 ** POSIX conformance
707
708 - Semicolons are once again optional at the end of grammar rules.
709 This reverts to the behavior of Bison 1.33 and earlier, and improves
710 compatibility with Yacc.
711
712 - `parse error' -> `syntax error'
713 Bison now uniformly uses the term `syntax error'; formerly, the code
714 and manual sometimes used the term `parse error' instead. POSIX
715 requires `syntax error' in diagnostics, and it was thought better to
716 be consistent.
717
718 - The documentation now emphasizes that yylex and yyerror must be
719 declared before use. C99 requires this.
720
721 - Bison now parses C99 lexical constructs like UCNs and
722 backslash-newline within C escape sequences, as POSIX 1003.1-2001 requires.
723
724 - File names are properly escaped in C output. E.g., foo\bar.y is
725 output as "foo\\bar.y".
726
727 - Yacc command and library now available
728 The Bison distribution now installs a `yacc' command, as POSIX requires.
729 Also, Bison now installs a small library liby.a containing
730 implementations of Yacc-compatible yyerror and main functions.
731 This library is normally not useful, but POSIX requires it.
732
733 - Type clashes now generate warnings, not errors.
734
735 - If the user does not define YYSTYPE as a macro, Bison now declares it
736 using typedef instead of defining it as a macro.
737 For consistency, YYLTYPE is also declared instead of defined.
738
739 ** Other compatibility issues
740
741 - %union directives can now have a tag before the `{', e.g., the
742 directive `%union foo {...}' now generates the C code
743 `typedef union foo { ... } YYSTYPE;'; this is for Yacc compatibility.
744 The default union tag is `YYSTYPE', for compatibility with Solaris 9 Yacc.
745 For consistency, YYLTYPE's struct tag is now `YYLTYPE' not `yyltype'.
746 This is for compatibility with both Yacc and Bison 1.35.
747
748 - `;' is output before the terminating `}' of an action, for
749 compatibility with Bison 1.35.
750
751 - Bison now uses a Yacc-style format for conflict reports, e.g.,
752 `conflicts: 2 shift/reduce, 1 reduce/reduce'.
753
754 - `yystype' and `yyltype' are now obsolescent macros instead of being
755 typedefs or tags; they are no longer documented and are planned to be
756 withdrawn in a future release.
757
758 ** GLR parser notes
759
760 - GLR and inline
761 Users of Bison have to decide how they handle the portability of the
762 C keyword `inline'.
763
764 - `parsing stack overflow...' -> `parser stack overflow'
765 GLR parsers now report `parser stack overflow' as per the Bison manual.
766
767 ** Bison now warns if it detects conflicting outputs to the same file,
768 e.g., it generates a warning for `bison -d -o foo.h foo.y' since
769 that command outputs both code and header to foo.h.
770
771 ** #line in output files
772 - --no-line works properly.
773
774 ** Bison can no longer be built by a K&R C compiler; it requires C89 or
775 later to be built. This change originally took place a few versions
776 ago, but nobody noticed until we recently asked someone to try
777 building Bison with a K&R C compiler.
778 \f
779 * Changes in version 1.75, 2002-10-14:
780
781 ** Bison should now work on 64-bit hosts.
782
783 ** Indonesian translation thanks to Tedi Heriyanto.
784
785 ** GLR parsers
786 Fix spurious parse errors.
787
788 ** Pure parsers
789 Some people redefine yyerror to steal yyparse' private variables.
790 Reenable this trick until an official feature replaces it.
791
792 ** Type Clashes
793 In agreement with POSIX and with other Yaccs, leaving a default
794 action is valid when $$ is untyped, and $1 typed:
795
796 untyped: ... typed;
797
798 but the converse remains an error:
799
800 typed: ... untyped;
801
802 ** Values of mid-rule actions
803 The following code:
804
805 foo: { ... } { $$ = $1; } ...
806
807 was incorrectly rejected: $1 is defined in the second mid-rule
808 action, and is equal to the $$ of the first mid-rule action.
809 \f
810 * Changes in version 1.50, 2002-10-04:
811
812 ** GLR parsing
813 The declaration
814 %glr-parser
815 causes Bison to produce a Generalized LR (GLR) parser, capable of handling
816 almost any context-free grammar, ambiguous or not. The new declarations
817 %dprec and %merge on grammar rules allow parse-time resolution of
818 ambiguities. Contributed by Paul Hilfinger.
819
820 Unfortunately Bison 1.50 does not work properly on 64-bit hosts
821 like the Alpha, so please stick to 32-bit hosts for now.
822
823 ** Output Directory
824 When not in Yacc compatibility mode, when the output file was not
825 specified, running `bison foo/bar.y' created `foo/bar.c'. It
826 now creates `bar.c'.
827
828 ** Undefined token
829 The undefined token was systematically mapped to 2 which prevented
830 the use of 2 by the user. This is no longer the case.
831
832 ** Unknown token numbers
833 If yylex returned an out of range value, yyparse could die. This is
834 no longer the case.
835
836 ** Error token
837 According to POSIX, the error token must be 256.
838 Bison extends this requirement by making it a preference: *if* the
839 user specified that one of her tokens is numbered 256, then error
840 will be mapped onto another number.
841
842 ** Verbose error messages
843 They no longer report `..., expecting error or...' for states where
844 error recovery is possible.
845
846 ** End token
847 Defaults to `$end' instead of `$'.
848
849 ** Error recovery now conforms to documentation and to POSIX
850 When a Bison-generated parser encounters a syntax error, it now pops
851 the stack until it finds a state that allows shifting the error
852 token. Formerly, it popped the stack until it found a state that
853 allowed some non-error action other than a default reduction on the
854 error token. The new behavior has long been the documented behavior,
855 and has long been required by POSIX. For more details, please see
856 Paul Eggert, "Reductions during Bison error handling" (2002-05-20)
857 <http://lists.gnu.org/archive/html/bug-bison/2002-05/msg00038.html>.
858
859 ** Traces
860 Popped tokens and nonterminals are now reported.
861
862 ** Larger grammars
863 Larger grammars are now supported (larger token numbers, larger grammar
864 size (= sum of the LHS and RHS lengths), larger LALR tables).
865 Formerly, many of these numbers ran afoul of 16-bit limits;
866 now these limits are 32 bits on most hosts.
867
868 ** Explicit initial rule
869 Bison used to play hacks with the initial rule, which the user does
870 not write. It is now explicit, and visible in the reports and
871 graphs as rule 0.
872
873 ** Useless rules
874 Before, Bison reported the useless rules, but, although not used,
875 included them in the parsers. They are now actually removed.
876
877 ** Useless rules, useless nonterminals
878 They are now reported, as a warning, with their locations.
879
880 ** Rules never reduced
881 Rules that can never be reduced because of conflicts are now
882 reported.
883
884 ** Incorrect `Token not used'
885 On a grammar such as
886
887 %token useless useful
888 %%
889 exp: '0' %prec useful;
890
891 where a token was used to set the precedence of the last rule,
892 bison reported both `useful' and `useless' as useless tokens.
893
894 ** Revert the C++ namespace changes introduced in 1.31
895 as they caused too many portability hassles.
896
897 ** Default locations
898 By an accident of design, the default computation of @$ was
899 performed after another default computation was performed: @$ = @1.
900 The latter is now removed: YYLLOC_DEFAULT is fully responsible of
901 the computation of @$.
902
903 ** Token end-of-file
904 The token end of file may be specified by the user, in which case,
905 the user symbol is used in the reports, the graphs, and the verbose
906 error messages instead of `$end', which remains being the default.
907 For instance
908 %token MYEOF 0
909 or
910 %token MYEOF 0 "end of file"
911
912 ** Semantic parser
913 This old option, which has been broken for ages, is removed.
914
915 ** New translations
916 Brazilian Portuguese, thanks to Alexandre Folle de Menezes.
917 Croatian, thanks to Denis Lackovic.
918
919 ** Incorrect token definitions
920 When given `%token 'a' "A"', Bison used to output `#define 'a' 65'.
921
922 ** Token definitions as enums
923 Tokens are output both as the traditional #define's, and, provided
924 the compiler supports ANSI C or is a C++ compiler, as enums.
925 This lets debuggers display names instead of integers.
926
927 ** Reports
928 In addition to --verbose, bison supports --report=THINGS, which
929 produces additional information:
930 - itemset
931 complete the core item sets with their closure
932 - lookahead [changed to `look-ahead' in 1.875e through 2.3, but changed back]
933 explicitly associate lookahead tokens to items
934 - solved
935 describe shift/reduce conflicts solving.
936 Bison used to systematically output this information on top of
937 the report. Solved conflicts are now attached to their states.
938
939 ** Type clashes
940 Previous versions don't complain when there is a type clash on
941 the default action if the rule has a mid-rule action, such as in:
942
943 %type <foo> bar
944 %%
945 bar: '0' {} '0';
946
947 This is fixed.
948
949 ** GNU M4 is now required when using Bison.
950 \f
951 * Changes in version 1.35, 2002-03-25:
952
953 ** C Skeleton
954 Some projects use Bison's C parser with C++ compilers, and define
955 YYSTYPE as a class. The recent adjustment of C parsers for data
956 alignment and 64 bit architectures made this impossible.
957
958 Because for the time being no real solution for C++ parser
959 generation exists, kludges were implemented in the parser to
960 maintain this use. In the future, when Bison has C++ parsers, this
961 kludge will be disabled.
962
963 This kludge also addresses some C++ problems when the stack was
964 extended.
965 \f
966 * Changes in version 1.34, 2002-03-12:
967
968 ** File name clashes are detected
969 $ bison foo.y -d -o foo.x
970 fatal error: header and parser would both be named `foo.x'
971
972 ** A missing `;' at the end of a rule triggers a warning
973 In accordance with POSIX, and in agreement with other
974 Yacc implementations, Bison will mandate this semicolon in the near
975 future. This eases the implementation of a Bison parser of Bison
976 grammars by making this grammar LALR(1) instead of LR(2). To
977 facilitate the transition, this release introduces a warning.
978
979 ** Revert the C++ namespace changes introduced in 1.31, as they caused too
980 many portability hassles.
981
982 ** DJGPP support added.
983
984 ** Fix test suite portability problems.
985 \f
986 * Changes in version 1.33, 2002-02-07:
987
988 ** Fix C++ issues
989 Groff could not be compiled for the definition of size_t was lacking
990 under some conditions.
991
992 ** Catch invalid @n
993 As is done with $n.
994 \f
995 * Changes in version 1.32, 2002-01-23:
996
997 ** Fix Yacc output file names
998
999 ** Portability fixes
1000
1001 ** Italian, Dutch translations
1002 \f
1003 * Changes in version 1.31, 2002-01-14:
1004
1005 ** Many Bug Fixes
1006
1007 ** GNU Gettext and %expect
1008 GNU Gettext asserts 10 s/r conflicts, but there are 7. Now that
1009 Bison dies on incorrect %expectations, we fear there will be
1010 too many bug reports for Gettext, so _for the time being_, %expect
1011 does not trigger an error when the input file is named `plural.y'.
1012
1013 ** Use of alloca in parsers
1014 If YYSTACK_USE_ALLOCA is defined to 0, then the parsers will use
1015 malloc exclusively. Since 1.29, but was not NEWS'ed.
1016
1017 alloca is used only when compiled with GCC, to avoid portability
1018 problems as on AIX.
1019
1020 ** yyparse now returns 2 if memory is exhausted; formerly it dumped core.
1021
1022 ** When the generated parser lacks debugging code, YYDEBUG is now 0
1023 (as POSIX requires) instead of being undefined.
1024
1025 ** User Actions
1026 Bison has always permitted actions such as { $$ = $1 }: it adds the
1027 ending semicolon. Now if in Yacc compatibility mode, the semicolon
1028 is no longer output: one has to write { $$ = $1; }.
1029
1030 ** Better C++ compliance
1031 The output parsers try to respect C++ namespaces.
1032 [This turned out to be a failed experiment, and it was reverted later.]
1033
1034 ** Reduced Grammars
1035 Fixed bugs when reporting useless nonterminals.
1036
1037 ** 64 bit hosts
1038 The parsers work properly on 64 bit hosts.
1039
1040 ** Error messages
1041 Some calls to strerror resulted in scrambled or missing error messages.
1042
1043 ** %expect
1044 When the number of shift/reduce conflicts is correct, don't issue
1045 any warning.
1046
1047 ** The verbose report includes the rule line numbers.
1048
1049 ** Rule line numbers are fixed in traces.
1050
1051 ** Swedish translation
1052
1053 ** Parse errors
1054 Verbose parse error messages from the parsers are better looking.
1055 Before: parse error: unexpected `'/'', expecting `"number"' or `'-'' or `'(''
1056 Now: parse error: unexpected '/', expecting "number" or '-' or '('
1057
1058 ** Fixed parser memory leaks.
1059 When the generated parser was using malloc to extend its stacks, the
1060 previous allocations were not freed.
1061
1062 ** Fixed verbose output file.
1063 Some newlines were missing.
1064 Some conflicts in state descriptions were missing.
1065
1066 ** Fixed conflict report.
1067 Option -v was needed to get the result.
1068
1069 ** %expect
1070 Was not used.
1071 Mismatches are errors, not warnings.
1072
1073 ** Fixed incorrect processing of some invalid input.
1074
1075 ** Fixed CPP guards: 9foo.h uses BISON_9FOO_H instead of 9FOO_H.
1076
1077 ** Fixed some typos in the documentation.
1078
1079 ** %token MY_EOF 0 is supported.
1080 Before, MY_EOF was silently renumbered as 257.
1081
1082 ** doc/refcard.tex is updated.
1083
1084 ** %output, %file-prefix, %name-prefix.
1085 New.
1086
1087 ** --output
1088 New, aliasing `--output-file'.
1089 \f
1090 * Changes in version 1.30, 2001-10-26:
1091
1092 ** `--defines' and `--graph' have now an optional argument which is the
1093 output file name. `-d' and `-g' do not change; they do not take any
1094 argument.
1095
1096 ** `%source_extension' and `%header_extension' are removed, failed
1097 experiment.
1098
1099 ** Portability fixes.
1100 \f
1101 * Changes in version 1.29, 2001-09-07:
1102
1103 ** The output file does not define const, as this caused problems when used
1104 with common autoconfiguration schemes. If you still use ancient compilers
1105 that lack const, compile with the equivalent of the C compiler option
1106 `-Dconst='. autoconf's AC_C_CONST macro provides one way to do this.
1107
1108 ** Added `-g' and `--graph'.
1109
1110 ** The Bison manual is now distributed under the terms of the GNU FDL.
1111
1112 ** The input and the output files has automatically a similar extension.
1113
1114 ** Russian translation added.
1115
1116 ** NLS support updated; should hopefully be less troublesome.
1117
1118 ** Added the old Bison reference card.
1119
1120 ** Added `--locations' and `%locations'.
1121
1122 ** Added `-S' and `--skeleton'.
1123
1124 ** `%raw', `-r', `--raw' is disabled.
1125
1126 ** Special characters are escaped when output. This solves the problems
1127 of the #line lines with path names including backslashes.
1128
1129 ** New directives.
1130 `%yacc', `%fixed_output_files', `%defines', `%no_parser', `%verbose',
1131 `%debug', `%source_extension' and `%header_extension'.
1132
1133 ** @$
1134 Automatic location tracking.
1135 \f
1136 * Changes in version 1.28, 1999-07-06:
1137
1138 ** Should compile better now with K&R compilers.
1139
1140 ** Added NLS.
1141
1142 ** Fixed a problem with escaping the double quote character.
1143
1144 ** There is now a FAQ.
1145 \f
1146 * Changes in version 1.27:
1147
1148 ** The make rule which prevented bison.simple from being created on
1149 some systems has been fixed.
1150 \f
1151 * Changes in version 1.26:
1152
1153 ** Bison now uses automake.
1154
1155 ** New mailing lists: <bug-bison@gnu.org> and <help-bison@gnu.org>.
1156
1157 ** Token numbers now start at 257 as previously documented, not 258.
1158
1159 ** Bison honors the TMPDIR environment variable.
1160
1161 ** A couple of buffer overruns have been fixed.
1162
1163 ** Problems when closing files should now be reported.
1164
1165 ** Generated parsers should now work even on operating systems which do
1166 not provide alloca().
1167 \f
1168 * Changes in version 1.25, 1995-10-16:
1169
1170 ** Errors in the input grammar are not fatal; Bison keeps reading
1171 the grammar file, and reports all the errors found in it.
1172
1173 ** Tokens can now be specified as multiple-character strings: for
1174 example, you could use "<=" for a token which looks like <=, instead
1175 of chosing a name like LESSEQ.
1176
1177 ** The %token_table declaration says to write a table of tokens (names
1178 and numbers) into the parser file. The yylex function can use this
1179 table to recognize multiple-character string tokens, or for other
1180 purposes.
1181
1182 ** The %no_lines declaration says not to generate any #line preprocessor
1183 directives in the parser file.
1184
1185 ** The %raw declaration says to use internal Bison token numbers, not
1186 Yacc-compatible token numbers, when token names are defined as macros.
1187
1188 ** The --no-parser option produces the parser tables without including
1189 the parser engine; a project can now use its own parser engine.
1190 The actions go into a separate file called NAME.act, in the form of
1191 a switch statement body.
1192 \f
1193 * Changes in version 1.23:
1194
1195 The user can define YYPARSE_PARAM as the name of an argument to be
1196 passed into yyparse. The argument should have type void *. It should
1197 actually point to an object. Grammar actions can access the variable
1198 by casting it to the proper pointer type.
1199
1200 Line numbers in output file corrected.
1201 \f
1202 * Changes in version 1.22:
1203
1204 --help option added.
1205 \f
1206 * Changes in version 1.20:
1207
1208 Output file does not redefine const for C++.
1209
1210 Local Variables:
1211 mode: outline
1212 End:
1213
1214 -----
1215
1216 Copyright (C) 1995, 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003,
1217 2004, 2005, 2006, 2007, 2008, 2009 Free Software Foundation, Inc.
1218
1219 This file is part of Bison, the GNU Parser Generator.
1220
1221 This program is free software: you can redistribute it and/or modify
1222 it under the terms of the GNU General Public License as published by
1223 the Free Software Foundation, either version 3 of the License, or
1224 (at your option) any later version.
1225
1226 This program is distributed in the hope that it will be useful,
1227 but WITHOUT ANY WARRANTY; without even the implied warranty of
1228 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
1229 GNU General Public License for more details.
1230
1231 You should have received a copy of the GNU General Public License
1232 along with this program. If not, see <http://www.gnu.org/licenses/>.