]> git.saurik.com Git - bison.git/blob - NEWS
%define: accept unquoted values.
[bison.git] / NEWS
1 Bison News
2 ----------
3
4 * Changes in version 2.5 (????-??-??):
5
6 ** IELR(1) and Canonical LR(1) Support
7
8 IELR(1) is a minimal LR(1) parser table generation algorithm. That
9 is, given any context-free grammar, IELR(1) generates parser tables
10 with the full language recognition power of canonical LR(1) but with
11 nearly the same number of parser states as LALR(1). This reduction in
12 parser states is often an order of magnitude. More importantly,
13 because canonical LR(1)'s extra parser states may contain duplicate
14 conflicts in the case of non-LR(1) grammars, the number of conflicts
15 for IELR(1) is often an order of magnitude less as well. This can
16 significantly reduce the complexity of developing of a grammar.
17
18 Bison can now generate IELR(1) and canonical LR(1) parser tables in
19 place of its traditional LALR(1) parser tables, which remain the
20 default. You can specify the type of parser tables in the grammar
21 file with these directives:
22
23 %define lr.type lalr
24 %define lr.type ielr
25 %define lr.type canonical-lr
26
27 The default reduction optimization in the parser tables can also be
28 adjusted using `%define lr.default-reductions'. See the documentation
29 for `%define lr.type' and `%define lr.default-reductions' in the
30 section `Bison Declaration Summary' in the Bison manual for the
31 details.
32
33 These features are experimental. More user feedback will help to
34 stabilize them.
35
36 ** %define improvements.
37
38 *** Multiple invocations for any variable is now an error not a warning.
39
40 *** Can now be invoked via the command line.
41
42 Each of these command-line options
43
44 -D NAME[=VALUE]
45 --define=NAME[=VALUE]
46
47 -F NAME[=VALUE]
48 --force-define=NAME[=VALUE]
49
50 is equivalent to this grammar file declaration
51
52 %define NAME ["VALUE"]
53
54 except that the manner in which Bison processes multiple definitions
55 for the same NAME differs. Most importantly, -F and --force-define
56 quietly override %define, but -D and --define do not. For further
57 details, see the section "Bison Options" in the Bison manual.
58
59 *** Variables renamed.
60
61 The following %define variables
62
63 api.push_pull
64 lr.keep_unreachable_states
65
66 have been renamed to
67
68 api.push-pull
69 lr.keep-unreachable-states
70
71 The old names are now deprecated but will be maintained indefinitely
72 for backward compatibility.
73
74 *** Values no longer need to be quoted in grammar file.
75
76 If a %define value is an identifier, it no longer needs to be placed
77 within quotations marks. For example,
78
79 %define api.push-pull "push"
80
81 can be rewritten as
82
83 %define api.push-pull push
84
85 ** Symbol names.
86
87 Consistently with directives (such as %error-verbose) and variables
88 (e.g. push-pull), symbol names may include dashes in any position,
89 similarly to periods and underscores. This is GNU extension over
90 POSIX Yacc whose use is reported by -Wyacc, and rejected in Yacc
91 mode (--yacc).
92
93 ** Temporary hack for adding a semicolon to the user action.
94
95 Previously, Bison appended a semicolon to every user action for
96 reductions when the output language defaulted to C (specifically, when
97 neither %yacc, %language, %skeleton, or equivalent command-line
98 options were specified). This allowed actions such as
99
100 exp: exp "+" exp { $$ = $1 + $3 };
101
102 instead of
103
104 exp: exp "+" exp { $$ = $1 + $3; };
105
106 As a first step in removing this misfeature, Bison now issues a
107 warning when it appends a semicolon. Moreover, in cases where Bison
108 cannot easily determine whether a semicolon is needed (for example, an
109 action ending with a cpp directive or a braced compound initializer),
110 it no longer appends one. Thus, the C compiler might now complain
111 about a missing semicolon where it did not before. Future releases of
112 Bison will cease to append semicolons entirely.
113
114 ** Character literals not of length one.
115
116 Previously, Bison quietly converted all character literals to length
117 one. For example, without warning, Bison interpreted the operators in
118 the following grammar to be the same token:
119
120 exp: exp '++'
121 | exp '+' exp
122 ;
123
124 Bison now warns when a character literal is not of length one. In
125 some future release, Bison will report an error instead.
126
127 ** Verbose error messages fixed for nonassociative tokens.
128
129 When %error-verbose is specified, syntax error messages produced by
130 the generated parser include the unexpected token as well as a list of
131 expected tokens. Previously, this list erroneously included tokens
132 that would actually induce a syntax error because conflicts for them
133 were resolved with %nonassoc. Such tokens are now properly omitted
134 from the list.
135
136 * Changes in version 2.4.2 (????-??-??):
137
138 ** Detection of GNU M4 1.4.6 or newer during configure is improved.
139
140 ** %code is now a permanent feature.
141
142 A traditional Yacc prologue directive is written in the form:
143
144 %{CODE%}
145
146 To provide a more flexible alternative, Bison 2.3b introduced the
147 %code directive with the following forms for C/C++:
148
149 %code {CODE}
150 %code requires {CODE}
151 %code provides {CODE}
152 %code top {CODE}
153
154 These forms are now considered permanent features of Bison. See the
155 %code entries in the section "Bison Declaration Summary" in the Bison
156 manual for a summary of their functionality. See the section
157 "Prologue Alternatives" for a detailed discussion including the
158 advantages of %code over the traditional Yacc prologue directive.
159
160 Bison's Java feature as a whole including its current usage of %code
161 is still considered experimental.
162
163 * Changes in version 2.4.1 (2008-12-11):
164
165 ** In the GLR defines file, unexpanded M4 macros in the yylval and yylloc
166 declarations have been fixed.
167
168 ** Temporary hack for adding a semicolon to the user action.
169
170 Bison used to prepend a trailing semicolon at the end of the user
171 action for reductions. This allowed actions such as
172
173 exp: exp "+" exp { $$ = $1 + $3 };
174
175 instead of
176
177 exp: exp "+" exp { $$ = $1 + $3; };
178
179 Some grammars still depend on this `feature'. Bison 2.4.1 restores
180 the previous behavior in the case of C output (specifically, when
181 neither %language or %skeleton or equivalent command-line options
182 are used) to leave more time for grammars depending on the old
183 behavior to be adjusted. Future releases of Bison will disable this
184 feature.
185
186 ** A few minor improvements to the Bison manual.
187
188 * Changes in version 2.4 (2008-11-02):
189
190 ** %language is an experimental feature.
191
192 We first introduced this feature in test release 2.3b as a cleaner
193 alternative to %skeleton. Since then, we have discussed the possibility of
194 modifying its effect on Bison's output file names. Thus, in this release,
195 we consider %language to be an experimental feature that will likely evolve
196 in future releases.
197
198 ** Forward compatibility with GNU M4 has been improved.
199
200 ** Several bugs in the C++ skeleton and the experimental Java skeleton have been
201 fixed.
202
203 * Changes in version 2.3b (2008-05-27):
204
205 ** The quotes around NAME that used to be required in the following directive
206 are now deprecated:
207
208 %define NAME "VALUE"
209
210 ** The directive `%pure-parser' is now deprecated in favor of:
211
212 %define api.pure
213
214 which has the same effect except that Bison is more careful to warn about
215 unreasonable usage in the latter case.
216
217 ** Push Parsing
218
219 Bison can now generate an LALR(1) parser in C with a push interface. That
220 is, instead of invoking `yyparse', which pulls tokens from `yylex', you can
221 push one token at a time to the parser using `yypush_parse', which will
222 return to the caller after processing each token. By default, the push
223 interface is disabled. Either of the following directives will enable it:
224
225 %define api.push_pull "push" // Just push; does not require yylex.
226 %define api.push_pull "both" // Push and pull; requires yylex.
227
228 See the new section `A Push Parser' in the Bison manual for details.
229
230 The current push parsing interface is experimental and may evolve. More user
231 feedback will help to stabilize it.
232
233 ** The -g and --graph options now output graphs in Graphviz DOT format,
234 not VCG format. Like --graph, -g now also takes an optional FILE argument
235 and thus cannot be bundled with other short options.
236
237 ** Java
238
239 Bison can now generate an LALR(1) parser in Java. The skeleton is
240 `data/lalr1.java'. Consider using the new %language directive instead of
241 %skeleton to select it.
242
243 See the new section `Java Parsers' in the Bison manual for details.
244
245 The current Java interface is experimental and may evolve. More user
246 feedback will help to stabilize it.
247
248 ** %language
249
250 This new directive specifies the programming language of the generated
251 parser, which can be C (the default), C++, or Java. Besides the skeleton
252 that Bison uses, the directive affects the names of the generated files if
253 the grammar file's name ends in ".y".
254
255 ** XML Automaton Report
256
257 Bison can now generate an XML report of the LALR(1) automaton using the new
258 `--xml' option. The current XML schema is experimental and may evolve. More
259 user feedback will help to stabilize it.
260
261 ** The grammar file may now specify the name of the parser header file using
262 %defines. For example:
263
264 %defines "parser.h"
265
266 ** When reporting useless rules, useless nonterminals, and unused terminals,
267 Bison now employs the terms "useless in grammar" instead of "useless",
268 "useless in parser" instead of "never reduced", and "unused in grammar"
269 instead of "unused".
270
271 ** Unreachable State Removal
272
273 Previously, Bison sometimes generated parser tables containing unreachable
274 states. A state can become unreachable during conflict resolution if Bison
275 disables a shift action leading to it from a predecessor state. Bison now:
276
277 1. Removes unreachable states.
278
279 2. Does not report any conflicts that appeared in unreachable states.
280 WARNING: As a result, you may need to update %expect and %expect-rr
281 directives in existing grammar files.
282
283 3. For any rule used only in such states, Bison now reports the rule as
284 "useless in parser due to conflicts".
285
286 This feature can be disabled with the following directive:
287
288 %define lr.keep_unreachable_states
289
290 See the %define entry in the `Bison Declaration Summary' in the Bison manual
291 for further discussion.
292
293 ** Lookahead Set Correction in the `.output' Report
294
295 When instructed to generate a `.output' file including lookahead sets
296 (using `--report=lookahead', for example), Bison now prints each reduction's
297 lookahead set only next to the associated state's one item that (1) is
298 associated with the same rule as the reduction and (2) has its dot at the end
299 of its RHS. Previously, Bison also erroneously printed the lookahead set
300 next to all of the state's other items associated with the same rule. This
301 bug affected only the `.output' file and not the generated parser source
302 code.
303
304 ** --report-file=FILE is a new option to override the default `.output' file
305 name.
306
307 ** The `=' that used to be required in the following directives is now
308 deprecated:
309
310 %file-prefix "parser"
311 %name-prefix "c_"
312 %output "parser.c"
313
314 ** An Alternative to `%{...%}' -- `%code QUALIFIER {CODE}'
315
316 Bison 2.3a provided a new set of directives as a more flexible alternative to
317 the traditional Yacc prologue blocks. Those have now been consolidated into
318 a single %code directive with an optional qualifier field, which identifies
319 the purpose of the code and thus the location(s) where Bison should generate
320 it:
321
322 1. `%code {CODE}' replaces `%after-header {CODE}'
323 2. `%code requires {CODE}' replaces `%start-header {CODE}'
324 3. `%code provides {CODE}' replaces `%end-header {CODE}'
325 4. `%code top {CODE}' replaces `%before-header {CODE}'
326
327 See the %code entries in section `Bison Declaration Summary' in the Bison
328 manual for a summary of the new functionality. See the new section `Prologue
329 Alternatives' for a detailed discussion including the advantages of %code
330 over the traditional Yacc prologues.
331
332 The prologue alternatives are experimental. More user feedback will help to
333 determine whether they should become permanent features.
334
335 ** Revised warning: unset or unused mid-rule values
336
337 Since Bison 2.2, Bison has warned about mid-rule values that are set but not
338 used within any of the actions of the parent rule. For example, Bison warns
339 about unused $2 in:
340
341 exp: '1' { $$ = 1; } '+' exp { $$ = $1 + $4; };
342
343 Now, Bison also warns about mid-rule values that are used but not set. For
344 example, Bison warns about unset $$ in the mid-rule action in:
345
346 exp: '1' { $1 = 1; } '+' exp { $$ = $2 + $4; };
347
348 However, Bison now disables both of these warnings by default since they
349 sometimes prove to be false alarms in existing grammars employing the Yacc
350 constructs $0 or $-N (where N is some positive integer).
351
352 To enable these warnings, specify the option `--warnings=midrule-values' or
353 `-W', which is a synonym for `--warnings=all'.
354
355 ** Default %destructor or %printer with `<*>' or `<>'
356
357 Bison now recognizes two separate kinds of default %destructor's and
358 %printer's:
359
360 1. Place `<*>' in a %destructor/%printer symbol list to define a default
361 %destructor/%printer for all grammar symbols for which you have formally
362 declared semantic type tags.
363
364 2. Place `<>' in a %destructor/%printer symbol list to define a default
365 %destructor/%printer for all grammar symbols without declared semantic
366 type tags.
367
368 Bison no longer supports the `%symbol-default' notation from Bison 2.3a.
369 `<*>' and `<>' combined achieve the same effect with one exception: Bison no
370 longer applies any %destructor to a mid-rule value if that mid-rule value is
371 not actually ever referenced using either $$ or $n in a semantic action.
372
373 The default %destructor's and %printer's are experimental. More user
374 feedback will help to determine whether they should become permanent
375 features.
376
377 See the section `Freeing Discarded Symbols' in the Bison manual for further
378 details.
379
380 ** %left, %right, and %nonassoc can now declare token numbers. This is required
381 by POSIX. However, see the end of section `Operator Precedence' in the Bison
382 manual for a caveat concerning the treatment of literal strings.
383
384 ** The nonfunctional --no-parser, -n, and %no-parser options have been
385 completely removed from Bison.
386
387 * Changes in version 2.3a, 2006-09-13:
388
389 ** Instead of %union, you can define and use your own union type
390 YYSTYPE if your grammar contains at least one <type> tag.
391 Your YYSTYPE need not be a macro; it can be a typedef.
392 This change is for compatibility with other Yacc implementations,
393 and is required by POSIX.
394
395 ** Locations columns and lines start at 1.
396 In accordance with the GNU Coding Standards and Emacs.
397
398 ** You may now declare per-type and default %destructor's and %printer's:
399
400 For example:
401
402 %union { char *string; }
403 %token <string> STRING1
404 %token <string> STRING2
405 %type <string> string1
406 %type <string> string2
407 %union { char character; }
408 %token <character> CHR
409 %type <character> chr
410 %destructor { free ($$); } %symbol-default
411 %destructor { free ($$); printf ("%d", @$.first_line); } STRING1 string1
412 %destructor { } <character>
413
414 guarantees that, when the parser discards any user-defined symbol that has a
415 semantic type tag other than `<character>', it passes its semantic value to
416 `free'. However, when the parser discards a `STRING1' or a `string1', it
417 also prints its line number to `stdout'. It performs only the second
418 `%destructor' in this case, so it invokes `free' only once.
419
420 [Although we failed to mention this here in the 2.3a release, the default
421 %destructor's and %printer's were experimental, and they were rewritten in
422 future versions.]
423
424 ** Except for LALR(1) parsers in C with POSIX Yacc emulation enabled (with `-y',
425 `--yacc', or `%yacc'), Bison no longer generates #define statements for
426 associating token numbers with token names. Removing the #define statements
427 helps to sanitize the global namespace during preprocessing, but POSIX Yacc
428 requires them. Bison still generates an enum for token names in all cases.
429
430 ** Handling of traditional Yacc prologue blocks is now more consistent but
431 potentially incompatible with previous releases of Bison.
432
433 As before, you declare prologue blocks in your grammar file with the
434 `%{ ... %}' syntax. To generate the pre-prologue, Bison concatenates all
435 prologue blocks that you've declared before the first %union. To generate
436 the post-prologue, Bison concatenates all prologue blocks that you've
437 declared after the first %union.
438
439 Previous releases of Bison inserted the pre-prologue into both the header
440 file and the code file in all cases except for LALR(1) parsers in C. In the
441 latter case, Bison inserted it only into the code file. For parsers in C++,
442 the point of insertion was before any token definitions (which associate
443 token numbers with names). For parsers in C, the point of insertion was
444 after the token definitions.
445
446 Now, Bison never inserts the pre-prologue into the header file. In the code
447 file, it always inserts it before the token definitions.
448
449 ** Bison now provides a more flexible alternative to the traditional Yacc
450 prologue blocks: %before-header, %start-header, %end-header, and
451 %after-header.
452
453 For example, the following declaration order in the grammar file reflects the
454 order in which Bison will output these code blocks. However, you are free to
455 declare these code blocks in your grammar file in whatever order is most
456 convenient for you:
457
458 %before-header {
459 /* Bison treats this block like a pre-prologue block: it inserts it into
460 * the code file before the contents of the header file. It does *not*
461 * insert it into the header file. This is a good place to put
462 * #include's that you want at the top of your code file. A common
463 * example is `#include "system.h"'. */
464 }
465 %start-header {
466 /* Bison inserts this block into both the header file and the code file.
467 * In both files, the point of insertion is before any Bison-generated
468 * token, semantic type, location type, and class definitions. This is a
469 * good place to define %union dependencies, for example. */
470 }
471 %union {
472 /* Unlike the traditional Yacc prologue blocks, the output order for the
473 * new %*-header blocks is not affected by their declaration position
474 * relative to any %union in the grammar file. */
475 }
476 %end-header {
477 /* Bison inserts this block into both the header file and the code file.
478 * In both files, the point of insertion is after the Bison-generated
479 * definitions. This is a good place to declare or define public
480 * functions or data structures that depend on the Bison-generated
481 * definitions. */
482 }
483 %after-header {
484 /* Bison treats this block like a post-prologue block: it inserts it into
485 * the code file after the contents of the header file. It does *not*
486 * insert it into the header file. This is a good place to declare or
487 * define internal functions or data structures that depend on the
488 * Bison-generated definitions. */
489 }
490
491 If you have multiple occurrences of any one of the above declarations, Bison
492 will concatenate the contents in declaration order.
493
494 [Although we failed to mention this here in the 2.3a release, the prologue
495 alternatives were experimental, and they were rewritten in future versions.]
496
497 ** The option `--report=look-ahead' has been changed to `--report=lookahead'.
498 The old spelling still works, but is not documented and may be removed
499 in a future release.
500
501 * Changes in version 2.3, 2006-06-05:
502
503 ** GLR grammars should now use `YYRECOVERING ()' instead of `YYRECOVERING',
504 for compatibility with LALR(1) grammars.
505
506 ** It is now documented that any definition of YYSTYPE or YYLTYPE should
507 be to a type name that does not contain parentheses or brackets.
508
509 * Changes in version 2.2, 2006-05-19:
510
511 ** The distribution terms for all Bison-generated parsers now permit
512 using the parsers in nonfree programs. Previously, this permission
513 was granted only for Bison-generated LALR(1) parsers in C.
514
515 ** %name-prefix changes the namespace name in C++ outputs.
516
517 ** The C++ parsers export their token_type.
518
519 ** Bison now allows multiple %union declarations, and concatenates
520 their contents together.
521
522 ** New warning: unused values
523 Right-hand side symbols whose values are not used are reported,
524 if the symbols have destructors. For instance:
525
526 exp: exp "?" exp ":" exp { $1 ? $1 : $3; }
527 | exp "+" exp
528 ;
529
530 will trigger a warning about $$ and $5 in the first rule, and $3 in
531 the second ($1 is copied to $$ by the default rule). This example
532 most likely contains three errors, and could be rewritten as:
533
534 exp: exp "?" exp ":" exp
535 { $$ = $1 ? $3 : $5; free ($1 ? $5 : $3); free ($1); }
536 | exp "+" exp
537 { $$ = $1 ? $1 : $3; if ($1) free ($3); }
538 ;
539
540 However, if the original actions were really intended, memory leaks
541 and all, the warnings can be suppressed by letting Bison believe the
542 values are used, e.g.:
543
544 exp: exp "?" exp ":" exp { $1 ? $1 : $3; (void) ($$, $5); }
545 | exp "+" exp { $$ = $1; (void) $3; }
546 ;
547
548 If there are mid-rule actions, the warning is issued if no action
549 uses it. The following triggers no warning: $1 and $3 are used.
550
551 exp: exp { push ($1); } '+' exp { push ($3); sum (); };
552
553 The warning is intended to help catching lost values and memory leaks.
554 If a value is ignored, its associated memory typically is not reclaimed.
555
556 ** %destructor vs. YYABORT, YYACCEPT, and YYERROR.
557 Destructors are now called when user code invokes YYABORT, YYACCEPT,
558 and YYERROR, for all objects on the stack, other than objects
559 corresponding to the right-hand side of the current rule.
560
561 ** %expect, %expect-rr
562 Incorrect numbers of expected conflicts are now actual errors,
563 instead of warnings.
564
565 ** GLR, YACC parsers.
566 The %parse-params are available in the destructors (and the
567 experimental printers) as per the documentation.
568
569 ** Bison now warns if it finds a stray `$' or `@' in an action.
570
571 ** %require "VERSION"
572 This specifies that the grammar file depends on features implemented
573 in Bison version VERSION or higher.
574
575 ** lalr1.cc: The token and value types are now class members.
576 The tokens were defined as free form enums and cpp macros. YYSTYPE
577 was defined as a free form union. They are now class members:
578 tokens are enumerations of the `yy::parser::token' struct, and the
579 semantic values have the `yy::parser::semantic_type' type.
580
581 If you do not want or can update to this scheme, the directive
582 `%define "global_tokens_and_yystype" "1"' triggers the global
583 definition of tokens and YYSTYPE. This change is suitable both
584 for previous releases of Bison, and this one.
585
586 If you wish to update, then make sure older version of Bison will
587 fail using `%require "2.2"'.
588
589 ** DJGPP support added.
590 \f
591 * Changes in version 2.1, 2005-09-16:
592
593 ** The C++ lalr1.cc skeleton supports %lex-param.
594
595 ** Bison-generated parsers now support the translation of diagnostics like
596 "syntax error" into languages other than English. The default
597 language is still English. For details, please see the new
598 Internationalization section of the Bison manual. Software
599 distributors should also see the new PACKAGING file. Thanks to
600 Bruno Haible for this new feature.
601
602 ** Wording in the Bison-generated parsers has been changed slightly to
603 simplify translation. In particular, the message "memory exhausted"
604 has replaced "parser stack overflow", as the old message was not
605 always accurate for modern Bison-generated parsers.
606
607 ** Destructors are now called when the parser aborts, for all symbols left
608 behind on the stack. Also, the start symbol is now destroyed after a
609 successful parse. In both cases, the behavior was formerly inconsistent.
610
611 ** When generating verbose diagnostics, Bison-generated parsers no longer
612 quote the literal strings associated with tokens. For example, for
613 a syntax error associated with '%token NUM "number"' they might
614 print 'syntax error, unexpected number' instead of 'syntax error,
615 unexpected "number"'.
616 \f
617 * Changes in version 2.0, 2004-12-25:
618
619 ** Possibly-incompatible changes
620
621 - Bison-generated parsers no longer default to using the alloca function
622 (when available) to extend the parser stack, due to widespread
623 problems in unchecked stack-overflow detection. You can "#define
624 YYSTACK_USE_ALLOCA 1" to require the use of alloca, but please read
625 the manual to determine safe values for YYMAXDEPTH in that case.
626
627 - Error token location.
628 During error recovery, the location of the syntax error is updated
629 to cover the whole sequence covered by the error token: it includes
630 the shifted symbols thrown away during the first part of the error
631 recovery, and the lookahead rejected during the second part.
632
633 - Semicolon changes:
634 . Stray semicolons are no longer allowed at the start of a grammar.
635 . Semicolons are now required after in-grammar declarations.
636
637 - Unescaped newlines are no longer allowed in character constants or
638 string literals. They were never portable, and GCC 3.4.0 has
639 dropped support for them. Better diagnostics are now generated if
640 forget a closing quote.
641
642 - NUL bytes are no longer allowed in Bison string literals, unfortunately.
643
644 ** New features
645
646 - GLR grammars now support locations.
647
648 - New directive: %initial-action.
649 This directive allows the user to run arbitrary code (including
650 initializing @$) from yyparse before parsing starts.
651
652 - A new directive "%expect-rr N" specifies the expected number of
653 reduce/reduce conflicts in GLR parsers.
654
655 - %token numbers can now be hexadecimal integers, e.g., `%token FOO 0x12d'.
656 This is a GNU extension.
657
658 - The option `--report=lookahead' was changed to `--report=look-ahead'.
659 [However, this was changed back after 2.3.]
660
661 - Experimental %destructor support has been added to lalr1.cc.
662
663 - New configure option --disable-yacc, to disable installation of the
664 yacc command and -ly library introduced in 1.875 for POSIX conformance.
665
666 ** Bug fixes
667
668 - For now, %expect-count violations are now just warnings, not errors.
669 This is for compatibility with Bison 1.75 and earlier (when there are
670 reduce/reduce conflicts) and with Bison 1.30 and earlier (when there
671 are too many or too few shift/reduce conflicts). However, in future
672 versions of Bison we plan to improve the %expect machinery so that
673 these violations will become errors again.
674
675 - Within Bison itself, numbers (e.g., goto numbers) are no longer
676 arbitrarily limited to 16-bit counts.
677
678 - Semicolons are now allowed before "|" in grammar rules, as POSIX requires.
679 \f
680 * Changes in version 1.875, 2003-01-01:
681
682 ** The documentation license has been upgraded to version 1.2
683 of the GNU Free Documentation License.
684
685 ** syntax error processing
686
687 - In Yacc-style parsers YYLLOC_DEFAULT is now used to compute error
688 locations too. This fixes bugs in error-location computation.
689
690 - %destructor
691 It is now possible to reclaim the memory associated to symbols
692 discarded during error recovery. This feature is still experimental.
693
694 - %error-verbose
695 This new directive is preferred over YYERROR_VERBOSE.
696
697 - #defining yyerror to steal internal variables is discouraged.
698 It is not guaranteed to work forever.
699
700 ** POSIX conformance
701
702 - Semicolons are once again optional at the end of grammar rules.
703 This reverts to the behavior of Bison 1.33 and earlier, and improves
704 compatibility with Yacc.
705
706 - `parse error' -> `syntax error'
707 Bison now uniformly uses the term `syntax error'; formerly, the code
708 and manual sometimes used the term `parse error' instead. POSIX
709 requires `syntax error' in diagnostics, and it was thought better to
710 be consistent.
711
712 - The documentation now emphasizes that yylex and yyerror must be
713 declared before use. C99 requires this.
714
715 - Bison now parses C99 lexical constructs like UCNs and
716 backslash-newline within C escape sequences, as POSIX 1003.1-2001 requires.
717
718 - File names are properly escaped in C output. E.g., foo\bar.y is
719 output as "foo\\bar.y".
720
721 - Yacc command and library now available
722 The Bison distribution now installs a `yacc' command, as POSIX requires.
723 Also, Bison now installs a small library liby.a containing
724 implementations of Yacc-compatible yyerror and main functions.
725 This library is normally not useful, but POSIX requires it.
726
727 - Type clashes now generate warnings, not errors.
728
729 - If the user does not define YYSTYPE as a macro, Bison now declares it
730 using typedef instead of defining it as a macro.
731 For consistency, YYLTYPE is also declared instead of defined.
732
733 ** Other compatibility issues
734
735 - %union directives can now have a tag before the `{', e.g., the
736 directive `%union foo {...}' now generates the C code
737 `typedef union foo { ... } YYSTYPE;'; this is for Yacc compatibility.
738 The default union tag is `YYSTYPE', for compatibility with Solaris 9 Yacc.
739 For consistency, YYLTYPE's struct tag is now `YYLTYPE' not `yyltype'.
740 This is for compatibility with both Yacc and Bison 1.35.
741
742 - `;' is output before the terminating `}' of an action, for
743 compatibility with Bison 1.35.
744
745 - Bison now uses a Yacc-style format for conflict reports, e.g.,
746 `conflicts: 2 shift/reduce, 1 reduce/reduce'.
747
748 - `yystype' and `yyltype' are now obsolescent macros instead of being
749 typedefs or tags; they are no longer documented and are planned to be
750 withdrawn in a future release.
751
752 ** GLR parser notes
753
754 - GLR and inline
755 Users of Bison have to decide how they handle the portability of the
756 C keyword `inline'.
757
758 - `parsing stack overflow...' -> `parser stack overflow'
759 GLR parsers now report `parser stack overflow' as per the Bison manual.
760
761 ** Bison now warns if it detects conflicting outputs to the same file,
762 e.g., it generates a warning for `bison -d -o foo.h foo.y' since
763 that command outputs both code and header to foo.h.
764
765 ** #line in output files
766 - --no-line works properly.
767
768 ** Bison can no longer be built by a K&R C compiler; it requires C89 or
769 later to be built. This change originally took place a few versions
770 ago, but nobody noticed until we recently asked someone to try
771 building Bison with a K&R C compiler.
772 \f
773 * Changes in version 1.75, 2002-10-14:
774
775 ** Bison should now work on 64-bit hosts.
776
777 ** Indonesian translation thanks to Tedi Heriyanto.
778
779 ** GLR parsers
780 Fix spurious parse errors.
781
782 ** Pure parsers
783 Some people redefine yyerror to steal yyparse' private variables.
784 Reenable this trick until an official feature replaces it.
785
786 ** Type Clashes
787 In agreement with POSIX and with other Yaccs, leaving a default
788 action is valid when $$ is untyped, and $1 typed:
789
790 untyped: ... typed;
791
792 but the converse remains an error:
793
794 typed: ... untyped;
795
796 ** Values of mid-rule actions
797 The following code:
798
799 foo: { ... } { $$ = $1; } ...
800
801 was incorrectly rejected: $1 is defined in the second mid-rule
802 action, and is equal to the $$ of the first mid-rule action.
803 \f
804 * Changes in version 1.50, 2002-10-04:
805
806 ** GLR parsing
807 The declaration
808 %glr-parser
809 causes Bison to produce a Generalized LR (GLR) parser, capable of handling
810 almost any context-free grammar, ambiguous or not. The new declarations
811 %dprec and %merge on grammar rules allow parse-time resolution of
812 ambiguities. Contributed by Paul Hilfinger.
813
814 Unfortunately Bison 1.50 does not work properly on 64-bit hosts
815 like the Alpha, so please stick to 32-bit hosts for now.
816
817 ** Output Directory
818 When not in Yacc compatibility mode, when the output file was not
819 specified, running `bison foo/bar.y' created `foo/bar.c'. It
820 now creates `bar.c'.
821
822 ** Undefined token
823 The undefined token was systematically mapped to 2 which prevented
824 the use of 2 by the user. This is no longer the case.
825
826 ** Unknown token numbers
827 If yylex returned an out of range value, yyparse could die. This is
828 no longer the case.
829
830 ** Error token
831 According to POSIX, the error token must be 256.
832 Bison extends this requirement by making it a preference: *if* the
833 user specified that one of her tokens is numbered 256, then error
834 will be mapped onto another number.
835
836 ** Verbose error messages
837 They no longer report `..., expecting error or...' for states where
838 error recovery is possible.
839
840 ** End token
841 Defaults to `$end' instead of `$'.
842
843 ** Error recovery now conforms to documentation and to POSIX
844 When a Bison-generated parser encounters a syntax error, it now pops
845 the stack until it finds a state that allows shifting the error
846 token. Formerly, it popped the stack until it found a state that
847 allowed some non-error action other than a default reduction on the
848 error token. The new behavior has long been the documented behavior,
849 and has long been required by POSIX. For more details, please see
850 Paul Eggert, "Reductions during Bison error handling" (2002-05-20)
851 <http://lists.gnu.org/archive/html/bug-bison/2002-05/msg00038.html>.
852
853 ** Traces
854 Popped tokens and nonterminals are now reported.
855
856 ** Larger grammars
857 Larger grammars are now supported (larger token numbers, larger grammar
858 size (= sum of the LHS and RHS lengths), larger LALR tables).
859 Formerly, many of these numbers ran afoul of 16-bit limits;
860 now these limits are 32 bits on most hosts.
861
862 ** Explicit initial rule
863 Bison used to play hacks with the initial rule, which the user does
864 not write. It is now explicit, and visible in the reports and
865 graphs as rule 0.
866
867 ** Useless rules
868 Before, Bison reported the useless rules, but, although not used,
869 included them in the parsers. They are now actually removed.
870
871 ** Useless rules, useless nonterminals
872 They are now reported, as a warning, with their locations.
873
874 ** Rules never reduced
875 Rules that can never be reduced because of conflicts are now
876 reported.
877
878 ** Incorrect `Token not used'
879 On a grammar such as
880
881 %token useless useful
882 %%
883 exp: '0' %prec useful;
884
885 where a token was used to set the precedence of the last rule,
886 bison reported both `useful' and `useless' as useless tokens.
887
888 ** Revert the C++ namespace changes introduced in 1.31
889 as they caused too many portability hassles.
890
891 ** Default locations
892 By an accident of design, the default computation of @$ was
893 performed after another default computation was performed: @$ = @1.
894 The latter is now removed: YYLLOC_DEFAULT is fully responsible of
895 the computation of @$.
896
897 ** Token end-of-file
898 The token end of file may be specified by the user, in which case,
899 the user symbol is used in the reports, the graphs, and the verbose
900 error messages instead of `$end', which remains being the default.
901 For instance
902 %token MYEOF 0
903 or
904 %token MYEOF 0 "end of file"
905
906 ** Semantic parser
907 This old option, which has been broken for ages, is removed.
908
909 ** New translations
910 Brazilian Portuguese, thanks to Alexandre Folle de Menezes.
911 Croatian, thanks to Denis Lackovic.
912
913 ** Incorrect token definitions
914 When given `%token 'a' "A"', Bison used to output `#define 'a' 65'.
915
916 ** Token definitions as enums
917 Tokens are output both as the traditional #define's, and, provided
918 the compiler supports ANSI C or is a C++ compiler, as enums.
919 This lets debuggers display names instead of integers.
920
921 ** Reports
922 In addition to --verbose, bison supports --report=THINGS, which
923 produces additional information:
924 - itemset
925 complete the core item sets with their closure
926 - lookahead [changed to `look-ahead' in 1.875e through 2.3, but changed back]
927 explicitly associate lookahead tokens to items
928 - solved
929 describe shift/reduce conflicts solving.
930 Bison used to systematically output this information on top of
931 the report. Solved conflicts are now attached to their states.
932
933 ** Type clashes
934 Previous versions don't complain when there is a type clash on
935 the default action if the rule has a mid-rule action, such as in:
936
937 %type <foo> bar
938 %%
939 bar: '0' {} '0';
940
941 This is fixed.
942
943 ** GNU M4 is now required when using Bison.
944 \f
945 * Changes in version 1.35, 2002-03-25:
946
947 ** C Skeleton
948 Some projects use Bison's C parser with C++ compilers, and define
949 YYSTYPE as a class. The recent adjustment of C parsers for data
950 alignment and 64 bit architectures made this impossible.
951
952 Because for the time being no real solution for C++ parser
953 generation exists, kludges were implemented in the parser to
954 maintain this use. In the future, when Bison has C++ parsers, this
955 kludge will be disabled.
956
957 This kludge also addresses some C++ problems when the stack was
958 extended.
959 \f
960 * Changes in version 1.34, 2002-03-12:
961
962 ** File name clashes are detected
963 $ bison foo.y -d -o foo.x
964 fatal error: header and parser would both be named `foo.x'
965
966 ** A missing `;' at the end of a rule triggers a warning
967 In accordance with POSIX, and in agreement with other
968 Yacc implementations, Bison will mandate this semicolon in the near
969 future. This eases the implementation of a Bison parser of Bison
970 grammars by making this grammar LALR(1) instead of LR(2). To
971 facilitate the transition, this release introduces a warning.
972
973 ** Revert the C++ namespace changes introduced in 1.31, as they caused too
974 many portability hassles.
975
976 ** DJGPP support added.
977
978 ** Fix test suite portability problems.
979 \f
980 * Changes in version 1.33, 2002-02-07:
981
982 ** Fix C++ issues
983 Groff could not be compiled for the definition of size_t was lacking
984 under some conditions.
985
986 ** Catch invalid @n
987 As is done with $n.
988 \f
989 * Changes in version 1.32, 2002-01-23:
990
991 ** Fix Yacc output file names
992
993 ** Portability fixes
994
995 ** Italian, Dutch translations
996 \f
997 * Changes in version 1.31, 2002-01-14:
998
999 ** Many Bug Fixes
1000
1001 ** GNU Gettext and %expect
1002 GNU Gettext asserts 10 s/r conflicts, but there are 7. Now that
1003 Bison dies on incorrect %expectations, we fear there will be
1004 too many bug reports for Gettext, so _for the time being_, %expect
1005 does not trigger an error when the input file is named `plural.y'.
1006
1007 ** Use of alloca in parsers
1008 If YYSTACK_USE_ALLOCA is defined to 0, then the parsers will use
1009 malloc exclusively. Since 1.29, but was not NEWS'ed.
1010
1011 alloca is used only when compiled with GCC, to avoid portability
1012 problems as on AIX.
1013
1014 ** yyparse now returns 2 if memory is exhausted; formerly it dumped core.
1015
1016 ** When the generated parser lacks debugging code, YYDEBUG is now 0
1017 (as POSIX requires) instead of being undefined.
1018
1019 ** User Actions
1020 Bison has always permitted actions such as { $$ = $1 }: it adds the
1021 ending semicolon. Now if in Yacc compatibility mode, the semicolon
1022 is no longer output: one has to write { $$ = $1; }.
1023
1024 ** Better C++ compliance
1025 The output parsers try to respect C++ namespaces.
1026 [This turned out to be a failed experiment, and it was reverted later.]
1027
1028 ** Reduced Grammars
1029 Fixed bugs when reporting useless nonterminals.
1030
1031 ** 64 bit hosts
1032 The parsers work properly on 64 bit hosts.
1033
1034 ** Error messages
1035 Some calls to strerror resulted in scrambled or missing error messages.
1036
1037 ** %expect
1038 When the number of shift/reduce conflicts is correct, don't issue
1039 any warning.
1040
1041 ** The verbose report includes the rule line numbers.
1042
1043 ** Rule line numbers are fixed in traces.
1044
1045 ** Swedish translation
1046
1047 ** Parse errors
1048 Verbose parse error messages from the parsers are better looking.
1049 Before: parse error: unexpected `'/'', expecting `"number"' or `'-'' or `'(''
1050 Now: parse error: unexpected '/', expecting "number" or '-' or '('
1051
1052 ** Fixed parser memory leaks.
1053 When the generated parser was using malloc to extend its stacks, the
1054 previous allocations were not freed.
1055
1056 ** Fixed verbose output file.
1057 Some newlines were missing.
1058 Some conflicts in state descriptions were missing.
1059
1060 ** Fixed conflict report.
1061 Option -v was needed to get the result.
1062
1063 ** %expect
1064 Was not used.
1065 Mismatches are errors, not warnings.
1066
1067 ** Fixed incorrect processing of some invalid input.
1068
1069 ** Fixed CPP guards: 9foo.h uses BISON_9FOO_H instead of 9FOO_H.
1070
1071 ** Fixed some typos in the documentation.
1072
1073 ** %token MY_EOF 0 is supported.
1074 Before, MY_EOF was silently renumbered as 257.
1075
1076 ** doc/refcard.tex is updated.
1077
1078 ** %output, %file-prefix, %name-prefix.
1079 New.
1080
1081 ** --output
1082 New, aliasing `--output-file'.
1083 \f
1084 * Changes in version 1.30, 2001-10-26:
1085
1086 ** `--defines' and `--graph' have now an optional argument which is the
1087 output file name. `-d' and `-g' do not change; they do not take any
1088 argument.
1089
1090 ** `%source_extension' and `%header_extension' are removed, failed
1091 experiment.
1092
1093 ** Portability fixes.
1094 \f
1095 * Changes in version 1.29, 2001-09-07:
1096
1097 ** The output file does not define const, as this caused problems when used
1098 with common autoconfiguration schemes. If you still use ancient compilers
1099 that lack const, compile with the equivalent of the C compiler option
1100 `-Dconst='. autoconf's AC_C_CONST macro provides one way to do this.
1101
1102 ** Added `-g' and `--graph'.
1103
1104 ** The Bison manual is now distributed under the terms of the GNU FDL.
1105
1106 ** The input and the output files has automatically a similar extension.
1107
1108 ** Russian translation added.
1109
1110 ** NLS support updated; should hopefully be less troublesome.
1111
1112 ** Added the old Bison reference card.
1113
1114 ** Added `--locations' and `%locations'.
1115
1116 ** Added `-S' and `--skeleton'.
1117
1118 ** `%raw', `-r', `--raw' is disabled.
1119
1120 ** Special characters are escaped when output. This solves the problems
1121 of the #line lines with path names including backslashes.
1122
1123 ** New directives.
1124 `%yacc', `%fixed_output_files', `%defines', `%no_parser', `%verbose',
1125 `%debug', `%source_extension' and `%header_extension'.
1126
1127 ** @$
1128 Automatic location tracking.
1129 \f
1130 * Changes in version 1.28, 1999-07-06:
1131
1132 ** Should compile better now with K&R compilers.
1133
1134 ** Added NLS.
1135
1136 ** Fixed a problem with escaping the double quote character.
1137
1138 ** There is now a FAQ.
1139 \f
1140 * Changes in version 1.27:
1141
1142 ** The make rule which prevented bison.simple from being created on
1143 some systems has been fixed.
1144 \f
1145 * Changes in version 1.26:
1146
1147 ** Bison now uses automake.
1148
1149 ** New mailing lists: <bug-bison@gnu.org> and <help-bison@gnu.org>.
1150
1151 ** Token numbers now start at 257 as previously documented, not 258.
1152
1153 ** Bison honors the TMPDIR environment variable.
1154
1155 ** A couple of buffer overruns have been fixed.
1156
1157 ** Problems when closing files should now be reported.
1158
1159 ** Generated parsers should now work even on operating systems which do
1160 not provide alloca().
1161 \f
1162 * Changes in version 1.25, 1995-10-16:
1163
1164 ** Errors in the input grammar are not fatal; Bison keeps reading
1165 the grammar file, and reports all the errors found in it.
1166
1167 ** Tokens can now be specified as multiple-character strings: for
1168 example, you could use "<=" for a token which looks like <=, instead
1169 of chosing a name like LESSEQ.
1170
1171 ** The %token_table declaration says to write a table of tokens (names
1172 and numbers) into the parser file. The yylex function can use this
1173 table to recognize multiple-character string tokens, or for other
1174 purposes.
1175
1176 ** The %no_lines declaration says not to generate any #line preprocessor
1177 directives in the parser file.
1178
1179 ** The %raw declaration says to use internal Bison token numbers, not
1180 Yacc-compatible token numbers, when token names are defined as macros.
1181
1182 ** The --no-parser option produces the parser tables without including
1183 the parser engine; a project can now use its own parser engine.
1184 The actions go into a separate file called NAME.act, in the form of
1185 a switch statement body.
1186 \f
1187 * Changes in version 1.23:
1188
1189 The user can define YYPARSE_PARAM as the name of an argument to be
1190 passed into yyparse. The argument should have type void *. It should
1191 actually point to an object. Grammar actions can access the variable
1192 by casting it to the proper pointer type.
1193
1194 Line numbers in output file corrected.
1195 \f
1196 * Changes in version 1.22:
1197
1198 --help option added.
1199 \f
1200 * Changes in version 1.20:
1201
1202 Output file does not redefine const for C++.
1203
1204 Local Variables:
1205 mode: outline
1206 End:
1207
1208 -----
1209
1210 Copyright (C) 1995, 1996, 1997, 1998, 1999, 2000, 2001, 2002, 2003,
1211 2004, 2005, 2006, 2007, 2008, 2009 Free Software Foundation, Inc.
1212
1213 This file is part of Bison, the GNU Parser Generator.
1214
1215 This program is free software: you can redistribute it and/or modify
1216 it under the terms of the GNU General Public License as published by
1217 the Free Software Foundation, either version 3 of the License, or
1218 (at your option) any later version.
1219
1220 This program is distributed in the hope that it will be useful,
1221 but WITHOUT ANY WARRANTY; without even the implied warranty of
1222 MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
1223 GNU General Public License for more details.
1224
1225 You should have received a copy of the GNU General Public License
1226 along with this program. If not, see <http://www.gnu.org/licenses/>.