api.token.prefix: use code values

[bison.git] / doc / bison.texi
diff --git a/doc/bison.texi b/doc/bison.texi

index 9708ad49fbcd697ae0672a351445537ff8b08780..58a14f8bc2fc3a9ee35c440b91ea666f135cc110 100644 (file)
--- a/doc/bison.texi
+++ b/doc/bison.texi
@@ -33,7 +33,7 @@
  This manual (@value{UPDATED}) is for GNU Bison (version
  @value{VERSION}), the GNU parser generator.
  
  This manual (@value{UPDATED}) is for GNU Bison (version
  @value{VERSION}), the GNU parser generator.
  
-Copyright @copyright{} 1988-1993, 1995, 1998-2012 Free Software
+Copyright @copyright{} 1988-1993, 1995, 1998-2013 Free Software
  Foundation, Inc.
  
  @quotation
  Foundation, Inc.
  
  @quotation
@@ -186,7 +186,6 @@ Bison Grammar Files
  * Grammar Outline::    Overall layout of the grammar file.
  * Symbols::            Terminal and nonterminal symbols.
  * Rules::              How to write grammar rules.
  * Grammar Outline::    Overall layout of the grammar file.
  * Symbols::            Terminal and nonterminal symbols.
  * Rules::              How to write grammar rules.
-* Recursion::          Writing recursive rules.
  * Semantics::          Semantic values and actions.
  * Tracking Locations:: Locations and actions.
  * Named References::   Using named references in actions.
  * Semantics::          Semantic values and actions.
  * Tracking Locations:: Locations and actions.
  * Named References::   Using named references in actions.
@@ -201,10 +200,20 @@ Outline of a Bison Grammar
  * Grammar Rules::         Syntax and usage of the grammar rules section.
  * Epilogue::              Syntax and usage of the epilogue.
  
  * Grammar Rules::         Syntax and usage of the grammar rules section.
  * Epilogue::              Syntax and usage of the epilogue.
  
+Grammar Rules
+
+* Rules Syntax::   Syntax of the rules.
+* Empty Rules::    Symbols that can match the empty string.
+* Recursion::      Writing recursive rules.
+
+
  Defining Language Semantics
  
  * Value Type::        Specifying one data type for all semantic values.
  * Multiple Types::    Specifying several alternative data types.
  Defining Language Semantics
  
  * Value Type::        Specifying one data type for all semantic values.
  * Multiple Types::    Specifying several alternative data types.
+* Type Generation::   Generating the semantic value type.
+* Union Decl::        Declaring the set of all semantic value types.
+* Structured Value Type::  Providing a structured semantic value type.
  * Actions::           An action is the semantic definition of a grammar rule.
  * Action Types::      Specifying data types for actions to operate on.
  * Mid-Rule Actions::  Most actions go at the end of a rule.
  * Actions::           An action is the semantic definition of a grammar rule.
  * Action Types::      Specifying data types for actions to operate on.
  * Mid-Rule Actions::  Most actions go at the end of a rule.
@@ -228,7 +237,6 @@ Bison Declarations
  * Require Decl::      Requiring a Bison version.
  * Token Decl::        Declaring terminal symbols.
  * Precedence Decl::   Declaring terminals with precedence and associativity.
  * Require Decl::      Requiring a Bison version.
  * Token Decl::        Declaring terminal symbols.
  * Precedence Decl::   Declaring terminals with precedence and associativity.
-* Union Decl::        Declaring the set of all semantic value types.
  * Type Decl::         Declaring the choice of type for a nonterminal symbol.
  * Initial Action Decl::  Code run before parsing starts.
  * Destructor Decl::   Declaring how symbols are freed.
  * Type Decl::         Declaring the choice of type for a nonterminal symbol.
  * Initial Action Decl::  Code run before parsing starts.
  * Destructor Decl::   Declaring how symbols are freed.
@@ -1007,7 +1015,7 @@ Let's consider an example, vastly simplified from a C++ grammar.
  %%
  
  prog:
  %%
  
  prog:
-  /* Nothing.  */
+  %empty
  | prog stmt   @{ printf ("\n"); @}
  ;
  
  | prog stmt   @{ printf ("\n"); @}
  ;
  
@@ -1538,14 +1546,16 @@ calculator.  As in C, comments are placed between @samp{/*@dots{}*/}.
  @example
  /* Reverse polish notation calculator.  */
  
  @example
  /* Reverse polish notation calculator.  */
  
+@group
  %@{
  %@{
-  #define YYSTYPE double
    #include <stdio.h>
    #include <math.h>
    int yylex (void);
    void yyerror (char const *);
  %@}
    #include <stdio.h>
    #include <math.h>
    int yylex (void);
    void yyerror (char const *);
  %@}
+@end group
  
  
+%define api.value.type @{double@}
  %token NUM
  
  %% /* Grammar rules and actions follow.  */
  %token NUM
  
  %% /* Grammar rules and actions follow.  */
@@ -1554,14 +1564,6 @@ calculator.  As in C, comments are placed between @samp{/*@dots{}*/}.
  The declarations section (@pxref{Prologue, , The prologue}) contains two
  preprocessor directives and two forward declarations.
  
  The declarations section (@pxref{Prologue, , The prologue}) contains two
  preprocessor directives and two forward declarations.
  
-The @code{#define} directive defines the macro @code{YYSTYPE}, thus
-specifying the C data type for semantic values of both tokens and
-groupings (@pxref{Value Type, ,Data Types of Semantic Values}).  The
-Bison parser will use whatever type @code{YYSTYPE} is defined as; if you
-don't define it, @code{int} is the default.  Because we specify
-@code{double}, each token and each expression has an associated value,
-which is a floating point number.
-
  The @code{#include} directive is used to declare the exponentiation
  function @code{pow}.
  
  The @code{#include} directive is used to declare the exponentiation
  function @code{pow}.
  
@@ -1571,14 +1573,24 @@ before they are used.  These functions will be defined in the
  epilogue, but the parser calls them so they must be declared in the
  prologue.
  
  epilogue, but the parser calls them so they must be declared in the
  prologue.
  
-The second section, Bison declarations, provides information to Bison
-about the token types (@pxref{Bison Declarations, ,The Bison
-Declarations Section}).  Each terminal symbol that is not a
-single-character literal must be declared here.  (Single-character
-literals normally don't need to be declared.)  In this example, all the
-arithmetic operators are designated by single-character literals, so the
-only terminal symbol that needs to be declared is @code{NUM}, the token
-type for numeric constants.
+The second section, Bison declarations, provides information to Bison about
+the tokens and their types (@pxref{Bison Declarations, ,The Bison
+Declarations Section}).
+
+The @code{%define} directive defines the variable @code{api.value.type},
+thus specifying the C data type for semantic values of both tokens and
+groupings (@pxref{Value Type, ,Data Types of Semantic Values}).  The Bison
+parser will use whatever type @code{api.value.type} is defined as; if you
+don't define it, @code{int} is the default.  Because we specify
+@samp{@{double@}}, each token and each expression has an associated value,
+which is a floating point number.  C code can use @code{YYSTYPE} to refer to
+the value @code{api.value.type}.
+
+Each terminal symbol that is not a single-character literal must be
+declared.  (Single-character literals normally don't need to be declared.)
+In this example, all the arithmetic operators are designated by
+single-character literals, so the only terminal symbol that needs to be
+declared is @code{NUM}, the token type for numeric constants.
  
  @node Rpcalc Rules
  @subsection Grammar Rules for @code{rpcalc}
  
  @node Rpcalc Rules
  @subsection Grammar Rules for @code{rpcalc}
@@ -1589,7 +1601,7 @@ Here are the grammar rules for the reverse polish notation calculator.
  @example
  @group
  input:
  @example
  @group
  input:
-  /* empty */
+  %empty
  | input line
  ;
  @end group
  | input line
  ;
  @end group
@@ -1646,7 +1658,7 @@ Consider the definition of @code{input}:
  
  @example
  input:
  
  @example
  input:
-  /* empty */
+  %empty
  | input line
  ;
  @end example
  | input line
  ;
  @end example
@@ -1661,8 +1673,9 @@ The first alternative is empty because there are no symbols between the
  colon and the first @samp{|}; this means that @code{input} can match an
  empty string of input (no tokens).  We write the rules this way because it
  is legitimate to type @kbd{Ctrl-d} right after you start the calculator.
  colon and the first @samp{|}; this means that @code{input} can match an
  empty string of input (no tokens).  We write the rules this way because it
  is legitimate to type @kbd{Ctrl-d} right after you start the calculator.
-It's conventional to put an empty alternative first and write the comment
-@samp{/* empty */} in it.
+It's conventional to put an empty alternative first and to use the
+(optional) @code{%empty} directive, or to write the comment @samp{/* empty
+*/} in it (@pxref{Empty Rules}).
  
  The second alternate rule (@code{input line}) handles all nontrivial input.
  It means, ``After reading any number of lines, read one more line if
  
  The second alternate rule (@code{input line}) handles all nontrivial input.
  It means, ``After reading any number of lines, read one more line if
@@ -1791,9 +1804,9 @@ therefore, @code{NUM} becomes a macro for @code{yylex} to use.
  
  The semantic value of the token (if it has one) is stored into the
  global variable @code{yylval}, which is where the Bison parser will look
  
  The semantic value of the token (if it has one) is stored into the
  global variable @code{yylval}, which is where the Bison parser will look
-for it.  (The C data type of @code{yylval} is @code{YYSTYPE}, which was
-defined at the beginning of the grammar; @pxref{Rpcalc Declarations,
-,Declarations for @code{rpcalc}}.)
+for it.  (The C data type of @code{yylval} is @code{YYSTYPE}, whose value
+was defined at the beginning of the grammar via @samp{%define api.value.type
+@{double@}}; @pxref{Rpcalc Declarations,,Declarations for @code{rpcalc}}.)
  
  A token type code of zero is returned if the end-of-input is encountered.
  (Bison recognizes any nonpositive value as indicating end-of-input.)
  
  A token type code of zero is returned if the end-of-input is encountered.
  (Bison recognizes any nonpositive value as indicating end-of-input.)
@@ -1982,7 +1995,6 @@ parentheses nested to arbitrary depth.  Here is the Bison code for
  
  @group
  %@{
  
  @group
  %@{
-  #define YYSTYPE double
    #include <math.h>
    #include <stdio.h>
    int yylex (void);
    #include <math.h>
    #include <stdio.h>
    int yylex (void);
@@ -1992,6 +2004,7 @@ parentheses nested to arbitrary depth.  Here is the Bison code for
  
  @group
  /* Bison declarations.  */
  
  @group
  /* Bison declarations.  */
+%define api.value.type @{double@}
  %token NUM
  %left '-' '+'
  %left '*' '/'
  %token NUM
  %left '-' '+'
  %left '*' '/'
@@ -2002,7 +2015,7 @@ parentheses nested to arbitrary depth.  Here is the Bison code for
  %% /* The grammar follows.  */
  @group
  input:
  %% /* The grammar follows.  */
  @group
  input:
-  /* empty */
+  %empty
  | input line
  ;
  @end group
  | input line
  ;
  @end group
@@ -2141,13 +2154,13 @@ the same as the declarations for the infix notation calculator.
  /* Location tracking calculator.  */
  
  %@{
  /* Location tracking calculator.  */
  
  %@{
-  #define YYSTYPE int
    #include <math.h>
    int yylex (void);
    void yyerror (char const *);
  %@}
  
  /* Bison declarations.  */
    #include <math.h>
    int yylex (void);
    void yyerror (char const *);
  %@}
  
  /* Bison declarations.  */
+%define api.value.type int
  %token NUM
  
  %left '-' '+'
  %token NUM
  
  %left '-' '+'
@@ -2182,7 +2195,7 @@ wrong expressions or subexpressions.
  @example
  @group
  input:
  @example
  @group
  input:
-  /* empty */
+  %empty
  | input line
  ;
  @end group
  | input line
  ;
  @end group
@@ -2394,24 +2407,19 @@ Here are the C and Bison declarations for the multi-function calculator.
  %@{
    #include <stdio.h>  /* For printf, etc. */
    #include <math.h>   /* For pow, used in the grammar.  */
  %@{
    #include <stdio.h>  /* For printf, etc. */
    #include <math.h>   /* For pow, used in the grammar.  */
-  #include "calc.h"   /* Contains definition of `symrec'.  */
+  #include "calc.h"   /* Contains definition of 'symrec'.  */
    int yylex (void);
    void yyerror (char const *);
  %@}
  @end group
  
    int yylex (void);
    void yyerror (char const *);
  %@}
  @end group
  
-@group
-%union @{
-  double    val;   /* For returning numbers.  */
-  symrec  *tptr;   /* For returning symbol-table pointers.  */
-@}
-@end group
-%token <val>  NUM        /* Simple double precision number.  */
-%token <tptr> VAR FNCT   /* Variable and function.  */
-%type  <val>  exp
+%define api.value.type union /* Generate YYSTYPE from these types:  */
+%token <double>  NUM         /* Simple double precision number.  */
+%token <symrec*> VAR FNCT    /* Symbol table pointer: variable and function.  */
+%type  <double>  exp
  
  @group
  
  @group
-%right '='
+%precedence '='
  %left '-' '+'
  %left '*' '/'
  %precedence NEG /* negation--unary minus */
  %left '-' '+'
  %left '*' '/'
  %precedence NEG /* negation--unary minus */
@@ -2423,23 +2431,23 @@ The above grammar introduces only two new features of the Bison language.
  These features allow semantic values to have various data types
  (@pxref{Multiple Types, ,More Than One Value Type}).
  
  These features allow semantic values to have various data types
  (@pxref{Multiple Types, ,More Than One Value Type}).
  
-The @code{%union} declaration specifies the entire list of possible types;
-this is instead of defining @code{YYSTYPE}.  The allowable types are now
-double-floats (for @code{exp} and @code{NUM}) and pointers to entries in
-the symbol table.  @xref{Union Decl, ,The Collection of Value Types}.
-
-Since values can now have various types, it is necessary to associate a
-type with each grammar symbol whose semantic value is used.  These symbols
-are @code{NUM}, @code{VAR}, @code{FNCT}, and @code{exp}.  Their
-declarations are augmented with information about their data type (placed
-between angle brackets).
-
-The Bison construct @code{%type} is used for declaring nonterminal
-symbols, just as @code{%token} is used for declaring token types.  We
-have not used @code{%type} before because nonterminal symbols are
-normally declared implicitly by the rules that define them.  But
-@code{exp} must be declared explicitly so we can specify its value type.
-@xref{Type Decl, ,Nonterminal Symbols}.
+The special @code{union} value assigned to the @code{%define} variable
+@code{api.value.type} specifies that the symbols are defined with their data
+types.  Bison will generate an appropriate definition of @code{YYSTYPE} to
+store these values.
+
+Since values can now have various types, it is necessary to associate a type
+with each grammar symbol whose semantic value is used.  These symbols are
+@code{NUM}, @code{VAR}, @code{FNCT}, and @code{exp}.  Their declarations are
+augmented with their data type (placed between angle brackets).  For
+instance, values of @code{NUM} are stored in @code{double}.
+
+The Bison construct @code{%type} is used for declaring nonterminal symbols,
+just as @code{%token} is used for declaring token types.  Previously we did
+not use @code{%type} before because nonterminal symbols are normally
+declared implicitly by the rules that define them.  But @code{exp} must be
+declared explicitly so we can specify its value type.  @xref{Type Decl,
+,Nonterminal Symbols}.
  
  @node Mfcalc Rules
  @subsection Grammar Rules for @code{mfcalc}
  
  @node Mfcalc Rules
  @subsection Grammar Rules for @code{mfcalc}
@@ -2453,7 +2461,7 @@ those which mention @code{VAR} or @code{FNCT}, are new.
  %% /* The grammar follows.  */
  @group
  input:
  %% /* The grammar follows.  */
  @group
  input:
-  /* empty */
+  %empty
  | input line
  ;
  @end group
  | input line
  ;
  @end group
@@ -2523,7 +2531,7 @@ struct symrec
  @group
  typedef struct symrec symrec;
  
  @group
  typedef struct symrec symrec;
  
-/* The symbol table: a chain of `struct symrec'.  */
+/* The symbol table: a chain of 'struct symrec'.  */
  extern symrec *sym_table;
  
  symrec *putsym (char const *, int);
  extern symrec *sym_table;
  
  symrec *putsym (char const *, int);
@@ -2558,7 +2566,7 @@ struct init const arith_fncts[] =
  @end group
  
  @group
  @end group
  
  @group
-/* The symbol table: a chain of `struct symrec'.  */
+/* The symbol table: a chain of 'struct symrec'.  */
  symrec *sym_table;
  @end group
  
  symrec *sym_table;
  @end group
  
@@ -2663,11 +2671,18 @@ yylex (void)
    if (c == '.' || isdigit (c))
      @{
        ungetc (c, stdin);
    if (c == '.' || isdigit (c))
      @{
        ungetc (c, stdin);
-      scanf ("%lf", &yylval.val);
+      scanf ("%lf", &yylval.NUM);
        return NUM;
      @}
  @end group
        return NUM;
      @}
  @end group
+@end example
+
+@noindent
+Bison generated a definition of @code{YYSTYPE} with a member named
+@code{NUM} to store value of @code{NUM} symbols.
  
  
+@comment file: mfcalc.y: 3
+@example
  @group
    /* Char starts an identifier => read the name.       */
    if (isalpha (c))
  @group
    /* Char starts an identifier => read the name.       */
    if (isalpha (c))
@@ -2709,7 +2724,7 @@ yylex (void)
        s = getsym (symbuf);
        if (s == 0)
          s = putsym (symbuf, VAR);
        s = getsym (symbuf);
        if (s == 0)
          s = putsym (symbuf, VAR);
-      yylval.tptr = s;
+      *((symrec**) &yylval) = s;
        return s->type;
      @}
  
        return s->type;
      @}
  
@@ -2787,7 +2802,6 @@ The Bison grammar file conventionally has a name ending in @samp{.y}.
  * Grammar Outline::    Overall layout of the grammar file.
  * Symbols::            Terminal and nonterminal symbols.
  * Rules::              How to write grammar rules.
  * Grammar Outline::    Overall layout of the grammar file.
  * Symbols::            Terminal and nonterminal symbols.
  * Rules::              How to write grammar rules.
-* Recursion::          Writing recursive rules.
  * Semantics::          Semantic values and actions.
  * Tracking Locations:: Locations and actions.
  * Named References::   Using named references in actions.
  * Semantics::          Semantic values and actions.
  * Tracking Locations:: Locations and actions.
  * Named References::   Using named references in actions.
@@ -2857,21 +2871,27 @@ can be done with two @var{Prologue} blocks, one before and one after the
  @code{%union} declaration.
  
  @example
  @code{%union} declaration.
  
  @example
+@group
  %@{
    #define _GNU_SOURCE
    #include <stdio.h>
    #include "ptypes.h"
  %@}
  %@{
    #define _GNU_SOURCE
    #include <stdio.h>
    #include "ptypes.h"
  %@}
+@end group
  
  
+@group
  %union @{
    long int n;
    tree t;  /* @r{@code{tree} is defined in @file{ptypes.h}.} */
  @}
  %union @{
    long int n;
    tree t;  /* @r{@code{tree} is defined in @file{ptypes.h}.} */
  @}
+@end group
  
  
+@group
  %@{
    static void print_token_value (FILE *, int, YYSTYPE);
    #define YYPRINT(F, N, L) print_token_value (F, N, L)
  %@}
  %@{
    static void print_token_value (FILE *, int, YYSTYPE);
    #define YYPRINT(F, N, L) print_token_value (F, N, L)
  %@}
+@end group
  
  @dots{}
  @end example
  
  @dots{}
  @end example
@@ -2903,21 +2923,27 @@ location, or it can be one of @code{requires}, @code{provides},
  Look again at the example of the previous section:
  
  @example
  Look again at the example of the previous section:
  
  @example
+@group
  %@{
    #define _GNU_SOURCE
    #include <stdio.h>
    #include "ptypes.h"
  %@}
  %@{
    #define _GNU_SOURCE
    #include <stdio.h>
    #include "ptypes.h"
  %@}
+@end group
  
  
+@group
  %union @{
    long int n;
    tree t;  /* @r{@code{tree} is defined in @file{ptypes.h}.} */
  @}
  %union @{
    long int n;
    tree t;  /* @r{@code{tree} is defined in @file{ptypes.h}.} */
  @}
+@end group
  
  
+@group
  %@{
    static void print_token_value (FILE *, int, YYSTYPE);
    #define YYPRINT(F, N, L) print_token_value (F, N, L)
  %@}
  %@{
    static void print_token_value (FILE *, int, YYSTYPE);
    #define YYPRINT(F, N, L) print_token_value (F, N, L)
  %@}
+@end group
  
  @dots{}
  @end example
  
  @dots{}
  @end example
@@ -2955,7 +2981,7 @@ Let's go ahead and add the new @code{YYLTYPE} definition and the
    #include <stdio.h>
  
    /* WARNING: The following code really belongs
    #include <stdio.h>
  
    /* WARNING: The following code really belongs
-   * in a `%code requires'; see below.  */
+   * in a '%code requires'; see below.  */
  
    #include "ptypes.h"
    #define YYLTYPE YYLTYPE
  
    #include "ptypes.h"
    #define YYLTYPE YYLTYPE
@@ -2969,16 +2995,20 @@ Let's go ahead and add the new @code{YYLTYPE} definition and the
    @} YYLTYPE;
  @}
  
    @} YYLTYPE;
  @}
  
+@group
  %union @{
    long int n;
    tree t;  /* @r{@code{tree} is defined in @file{ptypes.h}.} */
  @}
  %union @{
    long int n;
    tree t;  /* @r{@code{tree} is defined in @file{ptypes.h}.} */
  @}
+@end group
  
  
+@group
  %code @{
    static void print_token_value (FILE *, int, YYSTYPE);
    #define YYPRINT(F, N, L) print_token_value (F, N, L)
    static void trace_token (enum yytokentype token, YYLTYPE loc);
  @}
  %code @{
    static void print_token_value (FILE *, int, YYSTYPE);
    #define YYPRINT(F, N, L) print_token_value (F, N, L)
    static void trace_token (enum yytokentype token, YYLTYPE loc);
  @}
+@end group
  
  @dots{}
  @end example
  
  @dots{}
  @end example
@@ -3389,7 +3419,18 @@ value of the error token is 256, unless you explicitly assigned 256 to
  one of your tokens with a @code{%token} declaration.
  
  @node Rules
  one of your tokens with a @code{%token} declaration.
  
  @node Rules
-@section Syntax of Grammar Rules
+@section Grammar Rules
+
+A Bison grammar is a list of rules.
+
+@menu
+* Rules Syntax::   Syntax of the rules.
+* Empty Rules::    Symbols that can match the empty string.
+* Recursion::      Writing recursive rules.
+@end menu
+
+@node Rules Syntax
+@subsection Syntax of Grammar Rules
  @cindex rule syntax
  @cindex grammar rule syntax
  @cindex syntax of grammar rules
  @cindex rule syntax
  @cindex grammar rule syntax
  @cindex syntax of grammar rules
@@ -3463,33 +3504,57 @@ be joined with the vertical-bar character @samp{|} as follows:
  @noindent
  They are still considered distinct rules even when joined in this way.
  
  @noindent
  They are still considered distinct rules even when joined in this way.
  
-If @var{components} in a rule is empty, it means that @var{result} can
-match the empty string.  For example, here is how to define a
-comma-separated sequence of zero or more @code{exp} groupings:
+@node Empty Rules
+@subsection Empty Rules
+@cindex empty rule
+@cindex rule, empty
+@findex %empty
+
+A rule is said to be @dfn{empty} if its right-hand side (@var{components})
+is empty.  It means that @var{result} can match the empty string.  For
+example, here is how to define an optional semicolon:
+
+@example
+semicolon.opt: | ";";
+@end example
+
+@noindent
+It is easy not to see an empty rule, especially when @code{|} is used.  The
+@code{%empty} directive allows to make explicit that a rule is empty on
+purpose:
  
  @example
  @group
  
  @example
  @group
-expseq:
-  /* empty */
-| expseq1
+semicolon.opt:
+  %empty
+| ";"
  ;
  @end group
  ;
  @end group
+@end example
+
+Flagging a non-empty rule with @code{%empty} is an error.  If run with
+@option{-Wempty-rule}, @command{bison} will report empty rules without
+@code{%empty}.  Using @code{%empty} enables this warning, unless
+@option{-Wno-empty-rule} was specified.
+
+The @code{%empty} directive is a Bison extension, it does not work with
+Yacc.  To remain compatible with POSIX Yacc, it is customary to write a
+comment @samp{/* empty */} in each rule with no components:
  
  
+@example
  @group
  @group
-expseq1:
-  exp
-| expseq1 ',' exp
+semicolon.opt:
+  /* empty */
+| ";"
  ;
  @end group
  @end example
  
  ;
  @end group
  @end example
  
-@noindent
-It is customary to write a comment @samp{/* empty */} in each rule
-with no components.
  
  @node Recursion
  
  @node Recursion
-@section Recursive Rules
+@subsection Recursive Rules
  @cindex recursive rule
  @cindex recursive rule
+@cindex rule, recursive
  
  A rule is called @dfn{recursive} when its @var{result} nonterminal
  appears also on its right hand side.  Nearly all Bison grammars need to
  
  A rule is called @dfn{recursive} when its @var{result} nonterminal
  appears also on its right hand side.  Nearly all Bison grammars need to
@@ -3577,6 +3642,9 @@ the numbers associated with @var{x} and @var{y}.
  @menu
  * Value Type::        Specifying one data type for all semantic values.
  * Multiple Types::    Specifying several alternative data types.
  @menu
  * Value Type::        Specifying one data type for all semantic values.
  * Multiple Types::    Specifying several alternative data types.
+* Type Generation::   Generating the semantic value type.
+* Union Decl::        Declaring the set of all semantic value types.
+* Structured Value Type::  Providing a structured semantic value type.
  * Actions::           An action is the semantic definition of a grammar rule.
  * Action Types::      Specifying data types for actions to operate on.
  * Mid-Rule Actions::  Most actions go at the end of a rule.
  * Actions::           An action is the semantic definition of a grammar rule.
  * Action Types::      Specifying data types for actions to operate on.
  * Mid-Rule Actions::  Most actions go at the end of a rule.
@@ -3598,17 +3666,38 @@ Notation Calculator}).
  
  Bison normally uses the type @code{int} for semantic values if your
  program uses the same data type for all language constructs.  To
  
  Bison normally uses the type @code{int} for semantic values if your
  program uses the same data type for all language constructs.  To
-specify some other type, define @code{YYSTYPE} as a macro, like this:
+specify some other type, define the @code{%define} variable
+@code{api.value.type} like this:
+
+@example
+%define api.value.type @{double@}
+@end example
+
+@noindent
+or
+
+@example
+%define api.value.type @{struct semantic_type@}
+@end example
+
+The value of @code{api.value.type} should be a type name that does not
+contain parentheses or square brackets.
+
+Alternatively, instead of relying of Bison's @code{%define} support, you may
+rely on the C/C++ preprocessor and define @code{YYSTYPE} as a macro, like
+this:
  
  @example
  #define YYSTYPE double
  @end example
  
  @noindent
  
  @example
  #define YYSTYPE double
  @end example
  
  @noindent
-@code{YYSTYPE}'s replacement list should be a type name
-that does not contain parentheses or square brackets.
  This macro definition must go in the prologue of the grammar file
  This macro definition must go in the prologue of the grammar file
-(@pxref{Grammar Outline, ,Outline of a Bison Grammar}).
+(@pxref{Grammar Outline, ,Outline of a Bison Grammar}).  If compatibility
+with POSIX Yacc matters to you, use this.  Note however that Bison cannot
+know @code{YYSTYPE}'s value, not even whether it is defined, so there are
+services it cannot provide.  Besides this works only for languages that have
+a preprocessor.
  
  @node Multiple Types
  @subsection More Than One Value Type
  
  @node Multiple Types
  @subsection More Than One Value Type
@@ -3624,11 +3713,25 @@ requires you to do two things:
  
  @itemize @bullet
  @item
  
  @itemize @bullet
  @item
-Specify the entire collection of possible data types, either by using the
-@code{%union} Bison declaration (@pxref{Union Decl, ,The Collection of
-Value Types}), or by using a @code{typedef} or a @code{#define} to
-define @code{YYSTYPE} to be a union type whose member names are
-the type tags.
+Specify the entire collection of possible data types.  There are several
+options:
+@itemize @bullet
+@item
+let Bison compute the union type from the tags you assign to symbols;
+
+@item
+use the @code{%union} Bison declaration (@pxref{Union Decl, ,The Union
+Declaration});
+
+@item
+define the @code{%define} variable @code{api.value.type} to be a union type
+whose members are the type tags (@pxref{Structured Value Type,, Providing a
+Structured Semantic Value Type});
+
+@item
+use a @code{typedef} or a @code{#define} to define @code{YYSTYPE} to be a
+union type whose member names are the type tags.
+@end itemize
  
  @item
  Choose one of those types for each symbol (terminal or nonterminal) for
  
  @item
  Choose one of those types for each symbol (terminal or nonterminal) for
@@ -3638,6 +3741,164 @@ and for groupings with the @code{%type} Bison declaration (@pxref{Type
  Decl, ,Nonterminal Symbols}).
  @end itemize
  
  Decl, ,Nonterminal Symbols}).
  @end itemize
  
+@node Type Generation
+@subsection Generating the Semantic Value Type
+@cindex declaring value types
+@cindex value types, declaring
+@findex %define api.value.type union
+
+The special value @code{union} of the @code{%define} variable
+@code{api.value.type} instructs Bison that the tags used with the
+@code{%token} and @code{%type} directives are genuine types, not names of
+members of @code{YYSTYPE}.
+
+For example:
+
+@example
+%define api.value.type union
+%token <int> INT "integer"
+%token <int> 'n'
+%type <int> expr
+%token <char const *> ID "identifier"
+@end example
+
+@noindent
+generates an appropriate value of @code{YYSTYPE} to support each symbol
+type.  The name of the member of @code{YYSTYPE} for tokens than have a
+declared identifier @var{id} (such as @code{INT} and @code{ID} above, but
+not @code{'n'}) is @code{@var{id}}.  The other symbols have unspecified
+names on which you should not depend; instead, relying on C casts to access
+the semantic value with the appropriate type:
+
+@example
+/* For an "integer".  */
+yylval.INT = 42;
+return INT;
+
+/* For an 'n', also declared as int.  */
+*((int*)&yylval) = 42;
+return 'n';
+
+/* For an "identifier".  */
+yylval.ID = "42";
+return ID;
+@end example
+
+If the @code{%define} variable @code{api.token.prefix} is defined
+(@pxref{%define Summary,,api.token.prefix}), then it is also used to prefix
+the union member names.  For instance, with @samp{%define api.token.prefix
+@{TOK_@}}:
+
+@example
+/* For an "integer".  */
+yylval.TOK_INT = 42;
+return TOK_INT;
+@end example
+
+This Bison extension cannot work if @code{%yacc} (or
+@option{-y}/@option{--yacc}) is enabled, as POSIX mandates that Yacc
+generate tokens as macros (e.g., @samp{#define INT 258}, or @samp{#define
+TOK_INT 258}).
+
+This feature is new, and user feedback would be most welcome.
+
+A similar feature is provided for C++ that in addition overcomes C++
+limitations (that forbid non-trivial objects to be part of a @code{union}):
+@samp{%define api.value.type variant}, see @ref{C++ Variants}.
+
+@node Union Decl
+@subsection The Union Declaration
+@cindex declaring value types
+@cindex value types, declaring
+@findex %union
+
+The @code{%union} declaration specifies the entire collection of possible
+data types for semantic values.  The keyword @code{%union} is followed by
+braced code containing the same thing that goes inside a @code{union} in C@.
+
+For example:
+
+@example
+@group
+%union @{
+  double val;
+  symrec *tptr;
+@}
+@end group
+@end example
+
+@noindent
+This says that the two alternative types are @code{double} and @code{symrec
+*}.  They are given names @code{val} and @code{tptr}; these names are used
+in the @code{%token} and @code{%type} declarations to pick one of the types
+for a terminal or nonterminal symbol (@pxref{Type Decl, ,Nonterminal Symbols}).
+
+As an extension to POSIX, a tag is allowed after the @code{%union}.  For
+example:
+
+@example
+@group
+%union value @{
+  double val;
+  symrec *tptr;
+@}
+@end group
+@end example
+
+@noindent
+specifies the union tag @code{value}, so the corresponding C type is
+@code{union value}.  If you do not specify a tag, it defaults to
+@code{YYSTYPE}.
+
+As another extension to POSIX, you may specify multiple @code{%union}
+declarations; their contents are concatenated.  However, only the first
+@code{%union} declaration can specify a tag.
+
+Note that, unlike making a @code{union} declaration in C, you need not write
+a semicolon after the closing brace.
+
+@node Structured Value Type
+@subsection Providing a Structured Semantic Value Type
+@cindex declaring value types
+@cindex value types, declaring
+@findex %union
+
+Instead of @code{%union}, you can define and use your own union type
+@code{YYSTYPE} if your grammar contains at least one @samp{<@var{type}>}
+tag.  For example, you can put the following into a header file
+@file{parser.h}:
+
+@example
+@group
+union YYSTYPE @{
+  double val;
+  symrec *tptr;
+@};
+@end group
+@end example
+
+@noindent
+and then your grammar can use the following instead of @code{%union}:
+
+@example
+@group
+%@{
+#include "parser.h"
+%@}
+%define api.value.type "union YYSTYPE"
+%type <val> expr
+%token <tptr> ID
+@end group
+@end example
+
+Actually, you may also provide a @code{struct} rather that a @code{union},
+which may be handy if you want to track information for every symbol (such
+as preceding comments).
+
+The type you provide may even be structured and include pointers, in which
+case the type tags you provide may be composite, with @samp{.} and @samp{->}
+operators.
+
  @node Actions
  @subsection Actions
  @cindex action
  @node Actions
  @subsection Actions
  @cindex action
@@ -3739,7 +4000,7 @@ foo:
  
  @group
  bar:
  
  @group
  bar:
-  /* empty */    @{ previous_expr = $0; @}
+  %empty    @{ previous_expr = $0; @}
  ;
  @end group
  @end example
  ;
  @end group
  @end example
@@ -3916,16 +4177,20 @@ declare a destructor for that symbol:
  @group
  %type <context> let
  %destructor @{ pop_context ($$); @} let
  @group
  %type <context> let
  %destructor @{ pop_context ($$); @} let
+@end group
  
  %%
  
  
  %%
  
+@group
  stmt:
    let stmt
      @{
        $$ = $2;
        pop_context ($let);
      @};
  stmt:
    let stmt
      @{
        $$ = $2;
        pop_context ($let);
      @};
+@end group
  
  
+@group
  let:
    "let" '(' var ')'
      @{
  let:
    "let" '(' var ')'
      @{
@@ -3960,9 +4225,9 @@ exp: @{ a(); @} "b" @{ c(); @} @{ d(); @} "e" @{ f(); @};
  is translated into:
  
  @example
  is translated into:
  
  @example
-$@@1: /* empty */ @{ a(); @};
-$@@2: /* empty */ @{ c(); @};
-$@@3: /* empty */ @{ d(); @};
+$@@1: %empty @{ a(); @};
+$@@2: %empty @{ c(); @};
+$@@3: %empty @{ d(); @};
  exp: $@@1 "b" $@@2 $@@3 "e" @{ f(); @};
  @end example
  
  exp: $@@1 "b" $@@2 $@@3 "e" @{ f(); @};
  @end example
  
@@ -3981,9 +4246,9 @@ exp: @{ a(); @} "b" @{ $$ = c(); @} @{ d(); @} "e" @{ f = $1; @};
  is translated into
  
  @example
  is translated into
  
  @example
-@@1: /* empty */ @{ a(); @};
-@@2: /* empty */ @{ $$ = c(); @};
-$@@3: /* empty */ @{ d(); @};
+@@1: %empty @{ a(); @};
+@@2: %empty @{ $$ = c(); @};
+$@@3: %empty @{ d(); @};
  exp: @@1 "b" @@2 $@@3 "e" @{ f = $1; @}
  @end example
  
  exp: @@1 "b" @@2 $@@3 "e" @{ f = $1; @}
  @end example
  
@@ -4092,7 +4357,7 @@ serves as a subroutine:
  @example
  @group
  subroutine:
  @example
  @group
  subroutine:
-  /* empty */  @{ prepare_for_local_variables (); @}
+  %empty  @{ prepare_for_local_variables (); @}
  ;
  @end group
  
  ;
  @end group
  
@@ -4199,8 +4464,7 @@ exp:
        else
          @{
            $$ = 1;
        else
          @{
            $$ = 1;
-          fprintf (stderr,
-                   "Division by zero, l%d,c%d-l%d,c%d",
+          fprintf (stderr, "%d.%d-%d.%d: division by zero",
                     @@3.first_line, @@3.first_column,
                     @@3.last_line, @@3.last_column);
          @}
                     @@3.first_line, @@3.first_column,
                     @@3.last_line, @@3.last_column);
          @}
@@ -4227,8 +4491,7 @@ exp:
        else
          @{
            $$ = 1;
        else
          @{
            $$ = 1;
-          fprintf (stderr,
-                   "Division by zero, l%d,c%d-l%d,c%d",
+          fprintf (stderr, "%d.%d-%d.%d: division by zero",
                     @@3.first_line, @@3.first_column,
                     @@3.last_line, @@3.last_column);
          @}
                     @@3.first_line, @@3.first_column,
                     @@3.last_line, @@3.last_column);
          @}
@@ -4436,7 +4699,6 @@ and Context-Free Grammars}).
  * Require Decl::      Requiring a Bison version.
  * Token Decl::        Declaring terminal symbols.
  * Precedence Decl::   Declaring terminals with precedence and associativity.
  * Require Decl::      Requiring a Bison version.
  * Token Decl::        Declaring terminal symbols.
  * Precedence Decl::   Declaring terminals with precedence and associativity.
-* Union Decl::        Declaring the set of all semantic value types.
  * Type Decl::         Declaring the choice of type for a nonterminal symbol.
  * Initial Action Decl::  Code run before parsing starts.
  * Destructor Decl::   Declaring how symbols are freed.
  * Type Decl::         Declaring the choice of type for a nonterminal symbol.
  * Initial Action Decl::  Code run before parsing starts.
  * Destructor Decl::   Declaring how symbols are freed.
@@ -4619,87 +4881,6 @@ For example:
  %left  OR 134 "<=" 135 // Declares 134 for OR and 135 for "<=".
  @end example
  
  %left  OR 134 "<=" 135 // Declares 134 for OR and 135 for "<=".
  @end example
  
-@node Union Decl
-@subsection The Collection of Value Types
-@cindex declaring value types
-@cindex value types, declaring
-@findex %union
-
-The @code{%union} declaration specifies the entire collection of
-possible data types for semantic values.  The keyword @code{%union} is
-followed by braced code containing the same thing that goes inside a
-@code{union} in C@.
-
-For example:
-
-@example
-@group
-%union @{
-  double val;
-  symrec *tptr;
-@}
-@end group
-@end example
-
-@noindent
-This says that the two alternative types are @code{double} and @code{symrec
-*}.  They are given names @code{val} and @code{tptr}; these names are used
-in the @code{%token} and @code{%type} declarations to pick one of the types
-for a terminal or nonterminal symbol (@pxref{Type Decl, ,Nonterminal Symbols}).
-
-As an extension to POSIX, a tag is allowed after the
-@code{union}.  For example:
-
-@example
-@group
-%union value @{
-  double val;
-  symrec *tptr;
-@}
-@end group
-@end example
-
-@noindent
-specifies the union tag @code{value}, so the corresponding C type is
-@code{union value}.  If you do not specify a tag, it defaults to
-@code{YYSTYPE}.
-
-As another extension to POSIX, you may specify multiple
-@code{%union} declarations; their contents are concatenated.  However,
-only the first @code{%union} declaration can specify a tag.
-
-Note that, unlike making a @code{union} declaration in C, you need not write
-a semicolon after the closing brace.
-
-Instead of @code{%union}, you can define and use your own union type
-@code{YYSTYPE} if your grammar contains at least one
-@samp{<@var{type}>} tag.  For example, you can put the following into
-a header file @file{parser.h}:
-
-@example
-@group
-union YYSTYPE @{
-  double val;
-  symrec *tptr;
-@};
-typedef union YYSTYPE YYSTYPE;
-@end group
-@end example
-
-@noindent
-and then your grammar can use the following
-instead of @code{%union}:
-
-@example
-@group
-%@{
-#include "parser.h"
-%@}
-%type <val> expr
-%token <tptr> ID
-@end group
-@end example
-
  @node Type Decl
  @subsection Nonterminal Symbols
  @cindex declaring value types, nonterminals
  @node Type Decl
  @subsection Nonterminal Symbols
  @cindex declaring value types, nonterminals
@@ -4718,7 +4899,7 @@ used.  This is done with a @code{%type} declaration, like this:
  @noindent
  Here @var{nonterminal} is the name of a nonterminal symbol, and
  @var{type} is the name given in the @code{%union} to the alternative
  @noindent
  Here @var{nonterminal} is the name of a nonterminal symbol, and
  @var{type} is the name given in the @code{%union} to the alternative
-that you want (@pxref{Union Decl, ,The Collection of Value Types}).  You
+that you want (@pxref{Union Decl, ,The Union Declaration}).  You
  can give any number of nonterminal symbols in the same @code{%type}
  declaration, if they have the same value type.  Use spaces to separate
  the symbol names.
  can give any number of nonterminal symbols in the same @code{%type}
  declaration, if they have the same value type.  Use spaces to separate
  the symbol names.
@@ -4814,10 +4995,8 @@ For example:
  
  @example
  %union @{ char *string; @}
  
  @example
  %union @{ char *string; @}
-%token <string> STRING1
-%token <string> STRING2
-%type  <string> string1
-%type  <string> string2
+%token <string> STRING1 STRING2
+%type  <string> string1 string2
  %union @{ char character; @}
  %token <character> CHR
  %type  <character> chr
  %union @{ char character; @}
  %token <character> CHR
  %type  <character> chr
@@ -4943,10 +5122,8 @@ For example:
  
  @example
  %union @{ char *string; @}
  
  @example
  %union @{ char *string; @}
-%token <string> STRING1
-%token <string> STRING2
-%type  <string> string1
-%type  <string> string2
+%token <string> STRING1 STRING2
+%type  <string> string1 string2
  %union @{ char character; @}
  %token <character> CHR
  %type  <character> chr
  %union @{ char character; @}
  %token <character> CHR
  %type  <character> chr
@@ -5205,7 +5382,7 @@ Here is a summary of the declarations used to define a grammar:
  
  @deffn {Directive} %union
  Declare the collection of data types that semantic values may have
  
  @deffn {Directive} %union
  Declare the collection of data types that semantic values may have
-(@pxref{Union Decl, ,The Collection of Value Types}).
+(@pxref{Union Decl, ,The Union Declaration}).
  @end deffn
  
  @deffn {Directive} %token
  @end deffn
  
  @deffn {Directive} %token
@@ -5333,7 +5510,7 @@ For instance with @samp{%define api.prefix "calc"} and @samp{%defines
  @end deffn
  
  @deffn {Directive} %defines @var{defines-file}
  @end deffn
  
  @deffn {Directive} %defines @var{defines-file}
-Same as above, but save in the file @var{defines-file}.
+Same as above, but save in the file @file{@var{defines-file}}.
  @end deffn
  
  @deffn {Directive} %destructor
  @end deffn
  
  @deffn {Directive} %destructor
@@ -5396,7 +5573,7 @@ own right.
  @end deffn
  
  @deffn {Directive} %output "@var{file}"
  @end deffn
  
  @deffn {Directive} %output "@var{file}"
-Specify @var{file} for the parser implementation file.
+Generate the parser implementation in @file{@var{file}}.
  @end deffn
  
  @deffn {Directive} %pure-parser
  @end deffn
  
  @deffn {Directive} %pure-parser
@@ -5523,7 +5700,8 @@ Summary,,%skeleton}).
  Unaccepted @var{variable}s produce an error.
  Some of the accepted @var{variable}s are described below.
  
  Unaccepted @var{variable}s produce an error.
  Some of the accepted @var{variable}s are described below.
  
-@deffn Directive {%define api.namespace} "@var{namespace}"
+@c ================================================== api.namespace
+@deffn Directive {%define api.namespace} @{@var{namespace}@}
  @itemize
  @item Languages(s): C++
  
  @itemize
  @item Languages(s): C++
  
@@ -5531,7 +5709,7 @@ Some of the accepted @var{variable}s are described below.
  For example, if you specify:
  
  @example
  For example, if you specify:
  
  @example
-%define api.namespace "foo::bar"
+%define api.namespace @{foo::bar@}
  @end example
  
  Bison uses @code{foo::bar} verbatim in references such as:
  @end example
  
  Bison uses @code{foo::bar} verbatim in references such as:
@@ -5564,7 +5742,7 @@ api.namespace} so that @code{%name-prefix} @emph{only} affects the
  lexical analyzer function.  For example, if you specify:
  
  @example
  lexical analyzer function.  For example, if you specify:
  
  @example
-%define api.namespace "foo"
+%define api.namespace @{foo@}
  %name-prefix "bar::"
  @end example
  
  %name-prefix "bar::"
  @end example
  
@@ -5706,7 +5884,7 @@ introduced in Bison 2.8
  
  
  @c ================================================== api.token.prefix
  
  
  @c ================================================== api.token.prefix
-@deffn Directive {%define api.token.prefix} @var{prefix}
+@deffn Directive {%define api.token.prefix} @{@var{prefix}@}
  
  @itemize
  @item Languages(s): all
  
  @itemize
  @item Languages(s): all
@@ -5717,7 +5895,7 @@ target language.  For instance
  
  @example
  %token FILE for ERROR
  
  @example
  %token FILE for ERROR
-%define api.token.prefix "TOK_"
+%define api.token.prefix @{TOK_@}
  %%
  start: FILE for ERROR;
  @end example
  %%
  start: FILE for ERROR;
  @end example
@@ -5728,8 +5906,13 @@ and @code{TOK_ERROR} in the generated source files.  In particular, the
  scanner must use these prefixed token names, while the grammar itself
  may still use the short names (as in the sample rule given above).  The
  generated informational files (@file{*.output}, @file{*.xml},
  scanner must use these prefixed token names, while the grammar itself
  may still use the short names (as in the sample rule given above).  The
  generated informational files (@file{*.output}, @file{*.xml},
-@file{*.dot}) are not modified by this prefix.  See @ref{Calc++ Parser}
-and @ref{Calc++ Scanner}, for a complete example.
+@file{*.dot}) are not modified by this prefix.
+
+Bison also prefixes the generated member names of the semantic value union.
+@xref{Type Generation,, Generating the Semantic Value Type}, for more
+details.
+
+See @ref{Calc++ Parser} and @ref{Calc++ Scanner}, for a complete example.
  
  @item Accepted Values:
  Any string.  Should be a valid identifier prefix in the target language,
  
  @item Accepted Values:
  Any string.  Should be a valid identifier prefix in the target language,
@@ -5739,7 +5922,7 @@ letters, underscores, and ---not at the beginning--- digits).
  @item Default Value:
  empty
  @item History:
  @item Default Value:
  empty
  @item History:
-introduced in Bison 2.8
+introduced in Bison 3.0
  @end itemize
  @end deffn
  @c api.token.prefix
  @end itemize
  @end deffn
  @c api.token.prefix
@@ -5749,14 +5932,89 @@ introduced in Bison 2.8
  @deffn Directive {%define api.value.type} @var{type}
  @itemize @bullet
  @item Language(s):
  @deffn Directive {%define api.value.type} @var{type}
  @itemize @bullet
  @item Language(s):
-C++
+all
  
  @item Purpose:
  
  @item Purpose:
-Request variant-based semantic values.
+The type for semantic values.
+
+@item Accepted Values:
+@table @asis
+@item @code{""}
+This grammar has no semantic value at all.  This is not properly supported
+yet.
+@item @code{%union} (C, C++)
+The type is defined thanks to the @code{%union} directive.  You don't have
+to define @code{api.value.type} in that case, using @code{%union} suffices.
+@xref{Union Decl, ,The Union Declaration}.
+For instance:
+@example
+%define api.value.type "%union"
+%union
+@{
+  int ival;
+  char *sval;
+@}
+%token <ival> INT "integer"
+%token <sval> STR "string"
+@end example
+
+@item @code{union} (C, C++)
+The symbols are defined with type names, from which Bison will generate a
+@code{union}.  For instance:
+@example
+%define api.value.type "union"
+%token <int> INT "integer"
+%token <char *> STR "string"
+@end example
+This feature needs user feedback to stabilize.  Note that most C++ objects
+cannot be stored in a @code{union}.
+
+@item @code{variant} (C++)
+This is similar to @code{union}, but special storage techniques are used to
+allow any kind of C++ object to be used. For instance:
+@example
+%define api.value.type "variant"
+%token <int> INT "integer"
+%token <std::string> STR "string"
+@end example
+This feature needs user feedback to stabilize.
  @xref{C++ Variants}.
  
  @xref{C++ Variants}.
  
+@item any other identifier
+Use this name as semantic value.
+@example
+%code requires
+@{
+  struct my_value
+  @{
+    enum
+    @{
+      is_int, is_str
+    @} kind;
+    union
+    @{
+      int ival;
+      char *sval;
+    @} u;
+  @};
+@}
+%define api.value.type "struct my_value"
+%token <u.ival> INT "integer"
+%token <u.sval> STR "string"
+@end example
+@end table
+
  @item Default Value:
  @item Default Value:
-FIXME:
+@itemize @minus
+@item
+@code{%union} if @code{%union} is used, otherwise @dots{}
+@item
+@code{int} if type tags are used (i.e., @samp{%token <@var{type}>@dots{}} or
+@samp{%token <@var{type}>@dots{}} is used), otherwise @dots{}
+@item
+@code{""}
+@end itemize
+
  @item History:
  introduced in Bison 2.8.  Was introduced for Java only in 2.3b as
  @code{stype}.
  @item History:
  introduced in Bison 2.8.  Was introduced for Java only in 2.3b as
  @code{stype}.
@@ -5831,7 +6089,7 @@ More user feedback will help to stabilize it.)
  @end deffn
  
  @c ================================================== namespace
  @end deffn
  
  @c ================================================== namespace
-@deffn Directive %define namespace @var{namespace}
+@deffn Directive %define namespace @{@var{namespace}@}
  Obsoleted by @code{api.namespace}
  @c namespace
  @end deffn
  Obsoleted by @code{api.namespace}
  @c namespace
  @end deffn
@@ -5969,10 +6227,11 @@ qualifiers produce an error.  Some of the accepted qualifiers are:
  @item Language(s): C, C++
  
  @item Purpose: This is the best place to write dependency code required for
  @item Language(s): C, C++
  
  @item Purpose: This is the best place to write dependency code required for
-@code{YYSTYPE} and @code{YYLTYPE}.
-In other words, it's the best place to define types referenced in @code{%union}
-directives, and it's the best place to override Bison's default @code{YYSTYPE}
-and @code{YYLTYPE} definitions.
+@code{YYSTYPE} and @code{YYLTYPE}.  In other words, it's the best place to
+define types referenced in @code{%union} directives.  If you use
+@code{#define} to override Bison's default @code{YYSTYPE} and @code{YYLTYPE}
+definitions, then it is also the best place.  However you should rather
+@code{%define} @code{api.value.type} and @code{api.location.type}.
  
  @item Location(s): The parser header file and the parser implementation file
  before the Bison-generated @code{YYSTYPE} and @code{YYLTYPE}
  
  @item Location(s): The parser header file and the parser implementation file
  before the Bison-generated @code{YYSTYPE} and @code{YYLTYPE}
@@ -6372,7 +6631,7 @@ yylex (void)
      return 0;
    @dots{}
    if (c == '+' || c == '-')
      return 0;
    @dots{}
    if (c == '+' || c == '-')
-    return c;      /* Assume token type for `+' is '+'.  */
+    return c;      /* Assume token type for '+' is '+'.  */
    @dots{}
    return INT;      /* Return the type of the token.  */
    @dots{}
    @dots{}
    return INT;      /* Return the type of the token.  */
    @dots{}
@@ -6444,7 +6703,7 @@ Thus, if the type is @code{int} (the default), you might write this in
  
  When you are using multiple data types, @code{yylval}'s type is a union
  made from the @code{%union} declaration (@pxref{Union Decl, ,The
  
  When you are using multiple data types, @code{yylval}'s type is a union
  made from the @code{%union} declaration (@pxref{Union Decl, ,The
-Collection of Value Types}).  So when you store a token's value, you
+Union Declaration}).  So when you store a token's value, you
  must use the proper member of the union.  If the @code{%union}
  declaration looks like this:
  
  must use the proper member of the union.  If the @code{%union}
  declaration looks like this:
  
@@ -7469,7 +7728,7 @@ of zero or more @code{word} groupings.
  @example
  @group
  sequence:
  @example
  @group
  sequence:
-  /* empty */    @{ printf ("empty sequence\n"); @}
+  %empty         @{ printf ("empty sequence\n"); @}
  | maybeword
  | sequence word  @{ printf ("added word %s\n", $2); @}
  ;
  | maybeword
  | sequence word  @{ printf ("added word %s\n", $2); @}
  ;
@@ -7477,8 +7736,8 @@ sequence:
  
  @group
  maybeword:
  
  @group
  maybeword:
-  /* empty */   @{ printf ("empty maybeword\n"); @}
-| word          @{ printf ("single word %s\n", $1); @}
+  %empty    @{ printf ("empty maybeword\n"); @}
+| word      @{ printf ("single word %s\n", $1); @}
  ;
  @end group
  @end example
  ;
  @end group
  @end example
@@ -7509,7 +7768,7 @@ proper way to define @code{sequence}:
  @example
  @group
  sequence:
  @example
  @group
  sequence:
-  /* empty */    @{ printf ("empty sequence\n"); @}
+  %empty         @{ printf ("empty sequence\n"); @}
  | sequence word  @{ printf ("added word %s\n", $2); @}
  ;
  @end group
  | sequence word  @{ printf ("added word %s\n", $2); @}
  ;
  @end group
@@ -7520,7 +7779,7 @@ Here is another common error that yields a reduce/reduce conflict:
  @example
  @group
  sequence:
  @example
  @group
  sequence:
-  /* empty */
+  %empty
  | sequence words
  | sequence redirects
  ;
  | sequence words
  | sequence redirects
  ;
@@ -7528,14 +7787,14 @@ sequence:
  
  @group
  words:
  
  @group
  words:
-  /* empty */
+  %empty
  | words word
  ;
  @end group
  
  @group
  redirects:
  | words word
  ;
  @end group
  
  @group
  redirects:
-  /* empty */
+  %empty
  | redirects redirect
  ;
  @end group
  | redirects redirect
  ;
  @end group
@@ -7558,7 +7817,7 @@ of sequence:
  
  @example
  sequence:
  
  @example
  sequence:
-  /* empty */
+  %empty
  | sequence word
  | sequence redirect
  ;
  | sequence word
  | sequence redirect
  ;
@@ -7570,7 +7829,7 @@ from being empty:
  @example
  @group
  sequence:
  @example
  @group
  sequence:
-  /* empty */
+  %empty
  | sequence words
  | sequence redirects
  ;
  | sequence words
  | sequence redirects
  ;
@@ -7614,7 +7873,7 @@ rule:
  %%
  @group
  sequence:
  %%
  @group
  sequence:
-  /* empty */
+  %empty
  | sequence word      %prec "sequence"
  | sequence redirect  %prec "sequence"
  ;
  | sequence word      %prec "sequence"
  | sequence redirect  %prec "sequence"
  ;
@@ -7636,7 +7895,7 @@ rule with the same precedence, but make them right-associative:
  %%
  @group
  sequence:
  %%
  @group
  sequence:
-  /* empty */
+  %empty
  | sequence word      %prec "word"
  | sequence redirect  %prec "redirect"
  ;
  | sequence word      %prec "word"
  | sequence redirect  %prec "redirect"
  ;
@@ -8333,7 +8592,7 @@ For example:
  
  @example
  stmts:
  
  @example
  stmts:
-  /* empty string */
+  %empty
  | stmts '\n'
  | stmts exp '\n'
  | stmts error '\n'
  | stmts '\n'
  | stmts exp '\n'
  | stmts error '\n'
@@ -9387,7 +9646,7 @@ prologue:
  /* Formatting semantic values.  */
  %printer @{ fprintf (yyoutput, "%s", $$->name); @} VAR;
  %printer @{ fprintf (yyoutput, "%s()", $$->name); @} FNCT;
  /* Formatting semantic values.  */
  %printer @{ fprintf (yyoutput, "%s", $$->name); @} VAR;
  %printer @{ fprintf (yyoutput, "%s()", $$->name); @} FNCT;
-%printer @{ fprintf (yyoutput, "%g", $$); @} <val>;
+%printer @{ fprintf (yyoutput, "%g", $$); @} <double>;
  @end example
  
  The @code{%define} directive instructs Bison to generate run-time trace
  @end example
  
  The @code{%define} directive instructs Bison to generate run-time trace
@@ -9400,8 +9659,8 @@ ill-named) @code{%verbose} directive.
  The set of @code{%printer} directives demonstrates how to format the
  semantic value in the traces.  Note that the specification can be done
  either on the symbol type (e.g., @code{VAR} or @code{FNCT}), or on the type
  The set of @code{%printer} directives demonstrates how to format the
  semantic value in the traces.  Note that the specification can be done
  either on the symbol type (e.g., @code{VAR} or @code{FNCT}), or on the type
-tag: since @code{<val>} is the type for both @code{NUM} and @code{exp}, this
-printer will be used for them.
+tag: since @code{<double>} is the type for both @code{NUM} and @code{exp},
+this printer will be used for them.
  
  Here is a sample of the information provided by run-time traces.  The traces
  are sent onto standard error.
  
  Here is a sample of the information provided by run-time traces.  The traces
  are sent onto standard error.
@@ -9451,7 +9710,7 @@ Entering state 24
  
  @noindent
  The previous reduction demonstrates the @code{%printer} directive for
  
  @noindent
  The previous reduction demonstrates the @code{%printer} directive for
-@code{<val>}: both the token @code{NUM} and the resulting nonterminal
+@code{<double>}: both the token @code{NUM} and the resulting nonterminal
  @code{exp} have @samp{1} as value.
  
  @example
  @code{exp} have @samp{1} as value.
  
  @example
@@ -9635,7 +9894,7 @@ Here is a list of options that can be used with Bison, alphabetized by
  short option.  It is followed by a cross key alphabetized by long
  option.
  
  short option.  It is followed by a cross key alphabetized by long
  option.
  
-@c Please, keep this ordered as in `bison --help'.
+@c Please, keep this ordered as in 'bison --help'.
  @noindent
  Operations modes:
  @table @option
  @noindent
  Operations modes:
  @table @option
@@ -9716,6 +9975,72 @@ no effect on the conflict report.
  Deprecated constructs whose support will be removed in future versions of
  Bison.
  
  Deprecated constructs whose support will be removed in future versions of
  Bison.
  
+@item empty-rule
+Empty rules without @code{%empty}.  @xref{Empty Rules}.  Disabled by
+default, but enabled by uses of @code{%empty}, unless
+@option{-Wno-empty-rule} was specified.
+
+@item precedence
+Useless precedence and associativity directives.  Disabled by default.
+
+Consider for instance the following grammar:
+
+@example
+@group
+%nonassoc "="
+%left "+"
+%left "*"
+%precedence "("
+@end group
+%%
+@group
+stmt:
+  exp
+| "var" "=" exp
+;
+@end group
+
+@group
+exp:
+  exp "+" exp
+| exp "*" "num"
+| "(" exp ")"
+| "num"
+;
+@end group
+@end example
+
+Bison reports:
+
+@c cannot leave the location and the [-Wprecedence] for lack of
+@c width in PDF.
+@example
+@group
+warning: useless precedence and associativity for "="
+ %nonassoc "="
+           ^^^
+@end group
+@group
+warning: useless associativity for "*", use %precedence
+ %left "*"
+       ^^^
+@end group
+@group
+warning: useless precedence for "("
+ %precedence "("
+             ^^^
+@end group
+@end example
+
+One would get the exact same parser with the following directives instead:
+
+@example
+@group
+%left "+"
+%precedence "*"
+@end group
+@end example
+
  @item other
  All warnings not categorized above.  These warnings are enabled by default.
  
  @item other
  All warnings not categorized above.  These warnings are enabled by default.
  
@@ -9724,9 +10049,11 @@ releases of Bison may move warnings from this category to new, more specific
  categories.
  
  @item all
  categories.
  
  @item all
-All the warnings.
+All the warnings except @code{yacc}.
+
  @item none
  Turn off all the warnings.
  @item none
  Turn off all the warnings.
+
  @item error
  See @option{-Werror}, below.
  @end table
  @item error
  See @option{-Werror}, below.
  @end table
@@ -9774,7 +10101,7 @@ an example, using the following file @file{in.y}:
  exp: exp '+' exp @{ $exp = $1 + $2; @};
  @end example
  
  exp: exp '+' exp @{ $exp = $1 + $2; @};
  @end example
  
-When invoked with @option{-fcaret}, Bison will report:
+When invoked with @option{-fcaret} (or nothing), Bison will report:
  
  @example
  @group
  
  @example
  @group
@@ -9804,6 +10131,20 @@ in.y:3.32-33: error: $2 of 'exp' has no declared type
  @end group
  @end example
  
  @end group
  @end example
  
+Whereas, when invoked with @option{-fno-caret}, Bison will only report:
+
+@example
+@group
+in.y:3.20-23: error: ambiguous reference: ‘$exp’
+in.y:3.1-3:       refers to: $exp at $$
+in.y:3.6-8:       refers to: $exp at $1
+in.y:3.14-16:     refers to: $exp at $3
+in.y:3.32-33: error: $2 of ‘exp’ has no declared type
+@end group
+@end example
+
+This option is activated by default.
+
  @end table
  @end table
  
  @end table
  @end table
  
@@ -10098,7 +10439,7 @@ approach is provided, based on variants (@pxref{C++ Variants}).
  @subsubsection C++ Unions
  
  The @code{%union} directive works as for C, see @ref{Union Decl, ,The
  @subsubsection C++ Unions
  
  The @code{%union} directive works as for C, see @ref{Union Decl, ,The
-Collection of Value Types}.  In particular it produces a genuine
+Union Declaration}.  In particular it produces a genuine
  @code{union}, which have a few specific features in C++.
  @itemize @minus
  @item
  @code{union}, which have a few specific features in C++.
  @itemize @minus
  @item
@@ -10193,14 +10534,6 @@ therefore, since, as far as we know, @code{double} is the most demanding
  type on all platforms, alignments are enforced for @code{double} whatever
  types are actually used.  This may waste space in some cases.
  
  type on all platforms, alignments are enforced for @code{double} whatever
  types are actually used.  This may waste space in some cases.
  
-@item
-Our implementation is not conforming with strict aliasing rules.  Alias
-analysis is a technique used in optimizing compilers to detect when two
-pointers are disjoint (they cannot ``meet'').  Our implementation breaks
-some of the rules that G++ 4.4 uses in its alias analysis, so @emph{strict
-alias analysis must be disabled}.  Use the option
-@option{-fno-strict-aliasing} to compile the generated parser.
-
  @item
  There might be portability issues we are not aware of.
  @end itemize
  @item
  There might be portability issues we are not aware of.
  @end itemize
@@ -10258,16 +10591,18 @@ filename_type "@var{type}"}.
  The line, starting at 1.
  @end deftypeivar
  
  The line, starting at 1.
  @end deftypeivar
  
-@deftypemethod {position} {uint} lines (int @var{height} = 1)
-Advance by @var{height} lines, resetting the column number.
+@deftypemethod {position} {void} lines (int @var{height} = 1)
+If @var{height} is not null, advance by @var{height} lines, resetting the
+column number.  The resulting line number cannot be less than 1.
  @end deftypemethod
  
  @deftypeivar {position} {uint} column
  The column, starting at 1.
  @end deftypeivar
  
  @end deftypemethod
  
  @deftypeivar {position} {uint} column
  The column, starting at 1.
  @end deftypeivar
  
-@deftypemethod {position} {uint} columns (int @var{width} = 1)
-Advance by @var{width} columns, without changing the line number.
+@deftypemethod {position} {void} columns (int @var{width} = 1)
+Advance by @var{width} columns, without changing the line number. The
+resulting column number cannot be less than 1.
  @end deftypemethod
  
  @deftypemethod {position} {position&} operator+= (int @var{width})
  @end deftypemethod
  
  @deftypemethod {position} {position&} operator+= (int @var{width})
@@ -10309,14 +10644,16 @@ Reset the location to an empty range at the given values.
  The first, inclusive, position of the range, and the first beyond.
  @end deftypeivar
  
  The first, inclusive, position of the range, and the first beyond.
  @end deftypeivar
  
-@deftypemethod {location} {uint} columns (int @var{width} = 1)
-@deftypemethodx {location} {uint} lines (int @var{height} = 1)
-Advance the @code{end} position.
+@deftypemethod {location} {void} columns (int @var{width} = 1)
+@deftypemethodx {location} {void} lines (int @var{height} = 1)
+Forwarded to the @code{end} position.
  @end deftypemethod
  
  @deftypemethod {location} {location} operator+ (const location& @var{end})
  @deftypemethodx {location} {location} operator+ (int @var{width})
  @deftypemethodx {location} {location} operator+= (int @var{width})
  @end deftypemethod
  
  @deftypemethod {location} {location} operator+ (const location& @var{end})
  @deftypemethodx {location} {location} operator+ (int @var{width})
  @deftypemethodx {location} {location} operator+= (int @var{width})
+@deftypemethodx {location} {location} operator- (int @var{width})
+@deftypemethodx {location} {location} operator-= (int @var{width})
  Various forms of syntactic sugar.
  @end deftypemethod
  
  Various forms of syntactic sugar.
  @end deftypemethod
  
@@ -10580,7 +10917,7 @@ also pass the @var{location}.
  For instance, given the following declarations:
  
  @example
  For instance, given the following declarations:
  
  @example
-%define api.token.prefix "TOK_"
+%define api.token.prefix @{TOK_@}
  %token <std::string> IDENTIFIER;
  %token <int> INTEGER;
  %token COLON;
  %token <std::string> IDENTIFIER;
  %token <int> INTEGER;
  %token COLON;
@@ -10906,7 +11243,7 @@ tokens with @code{TOK_} (@pxref{%define Summary,,api.token.prefix}).
  
  @comment file: calc++-parser.yy
  @example
  
  @comment file: calc++-parser.yy
  @example
-%define api.token.prefix "TOK_"
+%define api.token.prefix @{TOK_@}
  %token
    END  0  "end of file"
    ASSIGN  ":="
  %token
    END  0  "end of file"
    ASSIGN  ":="
@@ -10953,7 +11290,7 @@ Location Tracking Calculator: @code{ltcalc}}).
  unit: assignments exp  @{ driver.result = $2; @};
  
  assignments:
  unit: assignments exp  @{ driver.result = $2; @};
  
  assignments:
-  /* Nothing.  */        @{@}
+  %empty                 @{@}
  | assignments assignment @{@};
  
  assignment:
  | assignments assignment @{@};
  
  assignment:
@@ -11022,7 +11359,7 @@ Finally, we enable scanner tracing.
  
  @comment file: calc++-scanner.ll
  @example
  
  @comment file: calc++-scanner.ll
  @example
-%option noyywrap nounput batch debug
+%option noyywrap nounput batch debug noinput
  @end example
  
  @noindent
  @end example
  
  @noindent
@@ -12128,7 +12465,7 @@ operating system's name and version and your compiler's name and
  version.  If you have trouble compiling, you should also include a
  transcript of the build session, starting with the invocation of
  `configure'.  Depending on the nature of the bug, you may be asked to
  version.  If you have trouble compiling, you should also include a
  transcript of the build session, starting with the invocation of
  `configure'.  Depending on the nature of the bug, you may be asked to
-send additional files as well (such as `config.h' or `config.cache').
+send additional files as well (such as @file{config.h} or @file{config.cache}).
  
  Patches are most welcome, but not required.  That is, do not hesitate to
  send a bug report just because you cannot provide a fix.
  
  Patches are most welcome, but not required.  That is, do not hesitate to
  send a bug report just because you cannot provide a fix.
@@ -12343,6 +12680,11 @@ time to resolve reduce/reduce conflicts.  @xref{GLR Parsers, ,Writing
  GLR Parsers}.
  @end deffn
  
  GLR Parsers}.
  @end deffn
  
+@deffn {Directive} %empty
+Bison declaration to declare make explicit that a rule has an empty
+right-hand side.  @xref{Empty Rules}.
+@end deffn
+
  @deffn {Symbol} $end
  The predefined token marking the end of the token stream.  It cannot be
  used in the grammar.
  @deffn {Symbol} $end
  The predefined token marking the end of the token stream.  It cannot be
  used in the grammar.
@@ -12513,7 +12855,7 @@ The predefined token onto which all undefined values returned by
  
  @deffn {Directive} %union
  Bison declaration to specify several possible data types for semantic
  
  @deffn {Directive} %union
  Bison declaration to specify several possible data types for semantic
-values.  @xref{Union Decl, ,The Collection of Value Types}.
+values.  @xref{Union Decl, ,The Union Declaration}.
  @end deffn
  
  @deffn {Macro} YYABORT
  @end deffn
  
  @deffn {Macro} YYABORT
@@ -12718,6 +13060,7 @@ require some expertise in low-level implementation details.
  @end deffn
  
  @deffn {Type} YYSTYPE
  @end deffn
  
  @deffn {Type} YYSTYPE
+Deprecated in favor of the @code{%define} variable @code{api.value.type}.
  Data type of semantic values; @code{int} by default.
  @xref{Value Type, ,Data Types of Semantic Values}.
  @end deffn
  Data type of semantic values; @code{int} by default.
  @xref{Value Type, ,Data Types of Semantic Values}.
  @end deffn