Undo 2005-12-01 tentative license wording change.

[bison.git] / doc / bison.texinfo
diff --git a/doc/bison.texinfo b/doc/bison.texinfo

index 01dccb418bcea8ca898ef758154f4c9234e39512..74e5381265e4471f92d90735720a2aecc8567d82 100644 (file)
--- a/doc/bison.texinfo
+++ b/doc/bison.texinfo
@@ -461,7 +461,7 @@ more information on this.
  @cindex @acronym{GLR} parsing
  @cindex generalized @acronym{LR} (@acronym{GLR}) parsing
  @cindex ambiguous grammars
-@cindex non-deterministic parsing
+@cindex nondeterministic parsing
  
  Parsers for @acronym{LALR}(1) grammars are @dfn{deterministic}, meaning
  roughly that the next grammar rule to apply at any point in the input is
@@ -469,7 +469,7 @@ uniquely determined by the preceding input and a fixed, finite portion
  (called a @dfn{look-ahead}) of the remaining input.  A context-free
  grammar can be @dfn{ambiguous}, meaning that there are multiple ways to
  apply the grammar rules to get the same inputs.  Even unambiguous
-grammars can be @dfn{non-deterministic}, meaning that no fixed
+grammars can be @dfn{nondeterministic}, meaning that no fixed
  look-ahead always suffices to determine the next grammar rule to apply.
  With the proper declarations, Bison is also able to parse these more
  general context-free grammars, using a technique known as @acronym{GLR}
@@ -2126,7 +2126,7 @@ as @code{sin}, @code{cos}, etc.
  
  It is easy to add new operators to the infix calculator as long as they are
  only single-character literals.  The lexical analyzer @code{yylex} passes
-back all nonnumber characters as tokens, so new grammar rules suffice for
+back all nonnumeric characters as tokens, so new grammar rules suffice for
  adding a new operator.  But we want something more flexible: built-in
  functions whose syntax has this form:
  
@@ -2411,7 +2411,7 @@ getsym (char const *sym_name)
  
  The function @code{yylex} must now recognize variables, numeric values, and
  the single-character arithmetic operators.  Strings of alphanumeric
-characters with a leading non-digit are recognized as either variables or
+characters with a leading letter are recognized as either variables or
  functions depending on what the symbol table says about them.
  
  The string is passed to @code{getsym} for look up in the symbol table.  If
@@ -2782,7 +2782,7 @@ into a separate header file @file{@var{name}.tab.h} which you can include
  in the other source files that need it.  @xref{Invocation, ,Invoking Bison}.
  
  If you want to write a grammar that is portable to any Standard C
-host, you must use only non-null character tokens taken from the basic
+host, you must use only nonnull character tokens taken from the basic
  execution character set of Standard C@.  This set consists of the ten
  digits, the 52 lower- and upper-case English letters, and the
  characters in the following C-language string:
@@ -3800,7 +3800,7 @@ For instance, if your locations use a file name, you may use
  %parse-param @{ char const *file_name @};
  %initial-action
  @{
-  @@$.begin.filename = @@$.end.filename = file_name;
+  @@$.initialize (file_name);
  @};
  @end example
  
@@ -3810,20 +3810,20 @@ For instance, if your locations use a file name, you may use
  @cindex freeing discarded symbols
  @findex %destructor
  
-Some symbols can be discarded by the parser.  During error recovery
-(@pxref{Error Recovery}), symbols already pushed on the stack and tokens
-coming from the rest of the file are discarded until the parser falls on
-its feet.  If the parser runs out of memory, all the symbols on the
-stack must be discarded.  Even if the parser succeeds, it must discard
-the start symbol.
+During error recovery (@pxref{Error Recovery}), symbols already pushed
+on the stack and tokens coming from the rest of the file are discarded
+until the parser falls on its feet.  If the parser runs out of memory,
+or if it returns via @code{YYABORT} or @code{YYACCEPT}, all the
+symbols on the stack must be discarded.  Even if the parser succeeds, it
+must discard the start symbol.
  
  When discarded symbols convey heap based information, this memory is
  lost.  While this behavior can be tolerable for batch parsers, such as
  in traditional compilers, it is unacceptable for programs like shells or
  protocol implementations that may parse and execute indefinitely.
  
-The @code{%destructor} directive defines code that
-is called when a symbol is discarded.
+The @code{%destructor} directive defines code that is called when a
+symbol is automatically discarded.
  
  @deffn {Directive} %destructor @{ @var{code} @} @var{symbols}
  @findex %destructor
@@ -3832,10 +3832,6 @@ Within @var{code}, @code{$$} designates the semantic value associated
  with the discarded symbol.  The additional parser parameters are also
  available (@pxref{Parser Function, , The Parser Function
  @code{yyparse}}).
-
-@strong{Warning:} as of Bison 2.1, this feature is still
-experimental, as there has not been enough user feedback.  In particular,
-the syntax might still change.
  @end deffn
  
  For instance:
@@ -3854,24 +3850,6 @@ For instance:
  guarantees that when a @code{STRING} or a @code{string} is discarded,
  its associated memory will be freed.
  
-Note that in the future, Bison might also consider that right hand side
-members that are not mentioned in the action can be destroyed.  For
-instance, in:
-
-@smallexample
-comment: "/*" STRING "*/";
-@end smallexample
-
-@noindent
-the parser is entitled to destroy the semantic value of the
-@code{string}.  Of course, this will not apply to the default action;
-compare:
-
-@smallexample
-typeless: string;  // $$ = $1 does not apply; $1 is destroyed.
-typefull: string;  // $$ = $1 applies, $1 is not destroyed.
-@end smallexample
-
  @sp 1
  
  @cindex discarded symbols
@@ -3883,13 +3861,20 @@ stacked symbols popped during the first phase of error recovery,
  @item
  incoming terminals during the second phase of error recovery,
  @item
-the current look-ahead and the entire stack when the parser aborts
-(either via an explicit call to @code{YYABORT}, or as a consequence of
-a failed error recovery or of memory exhaustion), and
+the current look-ahead and the entire stack (except the current
+right-hand side symbols) when the parser returns immediately, and
  @item
  the start symbol, when the parser succeeds.
  @end itemize
  
+The parser can @dfn{return immediately} because of an explicit call to
+@code{YYABORT} or @code{YYACCEPT}, or failed error recovery, or memory
+exhaustion.
+
+Right-hand size symbols of a rule that explicitly triggers a syntax
+error via @code{YYERROR} are not discarded automatically.  As a rule
+of thumb, destructors are invoked only when user actions cannot manage
+the memory.
  
  @node Expect Decl
  @subsection Suppressing Conflict Warnings
@@ -3913,19 +3898,18 @@ The declaration looks like this:
  %expect @var{n}
  @end example
  
-Here @var{n} is a decimal integer.  The declaration says there should be
-no warning if there are @var{n} shift/reduce conflicts and no
-reduce/reduce conflicts.  The usual warning is
-given if there are either more or fewer conflicts, or if there are any
-reduce/reduce conflicts.
+Here @var{n} is a decimal integer.  The declaration says there should
+be @var{n} shift/reduce conflicts and no reduce/reduce conflicts.
+Bison reports an error if the number of shift/reduce conflicts differs
+from @var{n}, or if there are any reduce/reduce conflicts.
  
-For normal @acronym{LALR}(1) parsers, reduce/reduce conflicts are more serious,
-and should be eliminated entirely.  Bison will always report
-reduce/reduce conflicts for these parsers.  With @acronym{GLR} parsers, however,
-both shift/reduce and reduce/reduce are routine (otherwise, there
-would be no need to use @acronym{GLR} parsing).  Therefore, it is also possible
-to specify an expected number of reduce/reduce conflicts in @acronym{GLR}
-parsers, using the declaration:
+For normal @acronym{LALR}(1) parsers, reduce/reduce conflicts are more
+serious, and should be eliminated entirely.  Bison will always report
+reduce/reduce conflicts for these parsers.  With @acronym{GLR}
+parsers, however, both kinds of conflicts are routine; otherwise,
+there would be no need to use @acronym{GLR} parsing.  Therefore, it is
+also possible to specify an expected number of reduce/reduce conflicts
+in @acronym{GLR} parsers, using the declaration:
  
  @example
  %expect-rr @var{n}
@@ -3946,12 +3930,12 @@ go back to the beginning.
  
  @item
  Add an @code{%expect} declaration, copying the number @var{n} from the
-number which Bison printed.
+number which Bison printed.  With @acronym{GLR} parsers, add an
+@code{%expect-rr} declaration as well.
  @end itemize
  
-Now Bison will stop annoying you if you do not change the number of
-conflicts, but it will warn you again if changes in the grammar result
-in more or fewer conflicts.
+Now Bison will warn you if you introduce an unexpected conflict, but
+will keep silent otherwise.
  
  @node Start Decl
  @subsection The Start-Symbol
@@ -3977,8 +3961,8 @@ may override this restriction with the @code{%start} declaration as follows:
  A @dfn{reentrant} program is one which does not alter in the course of
  execution; in other words, it consists entirely of @dfn{pure} (read-only)
  code.  Reentrancy is important whenever asynchronous execution is possible;
-for example, a non-reentrant program may not be safe to call from a signal
-handler.  In systems with multiple threads of control, a non-reentrant
+for example, a nonreentrant program may not be safe to call from a signal
+handler.  In systems with multiple threads of control, a nonreentrant
  program must be called only within interlocks.
  
  Normally, Bison generates a parser which is not reentrant.  This is
@@ -4468,7 +4452,7 @@ The @code{yytname} table is generated only if you use the
  @subsection Semantic Values of Tokens
  
  @vindex yylval
-In an ordinary (non-reentrant) parser, the semantic value of the token must
+In an ordinary (nonreentrant) parser, the semantic value of the token must
  be stored into the global variable @code{yylval}.  When you are using
  just one data type for semantic values, @code{yylval} has that type.
  Thus, if the type is @code{int} (the default), you might write this in
@@ -5628,7 +5612,7 @@ pp.@: 615--649 @uref{http://doi.acm.org/10.1145/69622.357187}.
  @cindex @acronym{GLR} parsing
  @cindex generalized @acronym{LR} (@acronym{GLR}) parsing
  @cindex ambiguous grammars
-@cindex non-deterministic parsing
+@cindex nondeterministic parsing
  
  Bison produces @emph{deterministic} parsers that choose uniquely
  when to reduce and which reduction to apply
@@ -5693,10 +5677,10 @@ quadratic worst-case time, and any general (possibly ambiguous)
  context-free grammar in cubic worst-case time.  However, Bison currently
  uses a simpler data structure that requires time proportional to the
  length of the input times the maximum number of stacks required for any
-prefix of the input.  Thus, really ambiguous or non-deterministic
+prefix of the input.  Thus, really ambiguous or nondeterministic
  grammars can require exponential time and space to process.  Such badly
  behaving examples, however, are not generally of practical interest.
-Usually, non-determinism in a grammar is local---the parser is ``in
+Usually, nondeterminism in a grammar is local---the parser is ``in
  doubt'' only for a few tokens at a time.  Therefore, the current data
  structure should generally be adequate.  On @acronym{LALR}(1) portions of a
  grammar, in particular, it is only slightly slower than with the default
@@ -7055,7 +7039,7 @@ The output files @file{@var{output}.hh} and @file{@var{output}.cc}
  declare and define the parser class in the namespace @code{yy}.  The
  class name defaults to @code{parser}, but may be changed using
  @samp{%define "parser_class_name" "@var{name}"}.  The interface of
-this class is detailled below.  It can be extended using the
+this class is detailed below.  It can be extended using the
  @code{%parse-param} feature: its semantics is slightly changed since
  it describes an additional member of the parser class, and an
  additional argument for its constructor.
@@ -7083,7 +7067,7 @@ Get or set the stream used for tracing the parsing.  It defaults to
  @deftypemethod {parser} {debug_level_type} debug_level ()
  @deftypemethodx {parser} {void} set_debug_level (debug_level @var{l})
  Get or set the tracing level.  Currently its value is either 0, no trace,
-or non-zero, full tracing.
+or nonzero, full tracing.
  @end deftypemethod
  
  @deftypemethod {parser} {void} error (const location_type& @var{l}, const std::string& @var{m})
@@ -7134,7 +7118,7 @@ actually easier to interface with.
  @subsection Calc++ --- C++ Calculator
  
  Of course the grammar is dedicated to arithmetics, a single
-expression, possibily preceded by variable assignments.  An
+expression, possibly preceded by variable assignments.  An
  environment containing possibly predefined variables such as
  @code{one} and @code{two}, is exchanged with the parser.  An example
  of valid input follows.
@@ -7429,7 +7413,7 @@ The grammar itself is straightforward.
  unit: assignments exp  @{ driver.result = $2; @};
  
  assignments: assignments assignment @{@}
-           | /* Nothing. */         @{@};
+           | /* Nothing.  */        @{@};
  
  assignment: "identifier" ":=" exp @{ driver.variables[*$1] = $3; @};
  
@@ -7498,7 +7482,7 @@ blank [ \t]
  @end example
  
  @noindent
-The following paragraph suffices to track locations acurately.  Each
+The following paragraph suffices to track locations accurately.  Each
  time @code{yylex} is invoked, the begin position is moved onto the end
  position.  Then when a pattern is matched, the end position is
  advanced of its width.  In case it matched ends of lines, the end
@@ -7523,7 +7507,7 @@ preceding tokens.  Comments would be treated equally.
  The rules are simple, just note the use of the driver to report errors.
  It is convenient to use a typedef to shorten
  @code{yy::calcxx_parser::token::identifier} into
-@code{token::identifier} for isntance.
+@code{token::identifier} for instance.
  
  @comment file: calc++-scanner.ll
  @example
@@ -7967,7 +7951,7 @@ parser file.  @xref{Decl Summary}.
  @end deffn
  
  @deffn {Directive} %nonassoc
-Bison declaration to assign non-associativity to token(s).
+Bison declaration to assign nonassociativity to token(s).
  @xref{Precedence Decl, ,Operator Precedence}.
  @end deffn