]>
git.saurik.com Git - apple/icu.git/blob - icuSources/i18n/unicode/choicfmt.h
2 ********************************************************************************
3 * Copyright (C) 1997-2008, International Business Machines
4 * Corporation and others. All Rights Reserved.
5 ********************************************************************************
9 * Modification History:
11 * Date Name Description
12 * 02/19/97 aliu Converted from java.
13 * 03/20/97 helena Finished first cut of implementation and got rid
14 * of nextDouble/previousDouble and replaced with
16 * 4/10/97 aliu Clean up. Modified to work on AIX.
17 * 8/6/97 nos Removed overloaded constructor, member var 'buffer'.
18 * 07/22/98 stephen Removed operator!= (implemented in Format)
19 ********************************************************************************
25 #include "unicode/utypes.h"
29 * \brief C++ API: Choice Format.
32 #if !UCONFIG_NO_FORMATTING
34 #include "unicode/unistr.h"
35 #include "unicode/numfmt.h"
36 #include "unicode/fieldpos.h"
37 #include "unicode/format.h"
44 * ChoiceFormat converts between ranges of numeric values
45 * and string names for those ranges. A <code>ChoiceFormat</code> splits
46 * the real number line <code>-Inf</code> to <code>+Inf</code> into two
47 * or more contiguous ranges. Each range is mapped to a
48 * string. <code>ChoiceFormat</code> is generally used in a
49 * <code>MessageFormat</code> for displaying grammatically correct
50 * plurals such as "There are 2 files."</p>
52 * <p>There are two methods of defining a <code>ChoiceFormat</code>; both
53 * are equivalent. The first is by using a string pattern. This is the
54 * preferred method in most cases. The second method is through direct
55 * specification of the arrays that make up the
56 * <code>ChoiceFormat</code>.</p>
58 * <p><strong>Patterns</strong></p>
60 * <p>In most cases, the preferred way to define a
61 * <code>ChoiceFormat</code> is with a pattern. Here is an example of a
62 * <code>ChoiceFormat</code> pattern:</p>
64 * \htmlonly<pre> 0≤are no files|1≤is one file|1<are many files</pre>\endhtmlonly
66 * <p>or equivalently,</p>
68 * \htmlonly<pre> 0#are no files|1#is one file|1<are many files</pre>\endhtmlonly
70 * <p>The pattern consists of a number or <em>range specifiers</em>
71 * separated by vertical bars '|' (U+007C). There is no
72 * vertical bar after the last range. Each range specifier is of the
75 * \htmlonly<blockquote><em>Number Separator String</em></blockquote>\endhtmlonly
77 * <p><em>Number</em> is a floating point number that can be parsed by a
78 * default <code>NumberFormat</code> for the US locale. It gives the
79 * lower limit of this range. The lower limit is either inclusive or
80 * exclusive, depending on the <em>separator</em>. The upper limit is
81 * given by the lower limit of the next range. The Unicode infinity
82 * sign \htmlonly∞ \endhtmlonly (U+221E) is recognized for positive infinity. It may be preceded by
83 * '-' (U+002D) to indicate negative infinity.</p>
85 * <p><em>String</em> is the format string for this range, with special
86 * characters enclosed in single quotes (<code>'The #
87 * sign'</code>). Single quotes themselves are indicated by two single
88 * quotes in a row (<code>'o''clock'</code>).</p>
90 * <p><em>Separator</em> is one of the following single characters:
93 * <li>\htmlonly'≤' \endhtmlonly (U+2264) or '#' (U+0023)
94 * indicates that the lower limit given by <em>Number</em> is
95 * inclusive. (The two characters are equivalent to ChoiceFormat.)
96 * This means that the limit value <em>Number</em> belongs to this
97 * range. Another way of saying this is that the corresponding
98 * closure is <code>FALSE</code>.</li>
100 * <li>'<' (U+003C) indicates that the lower limit given by
101 * <em>Number</em> is exclusive. This means that the value
102 * <em>Number</em> belongs to the prior range.</li> Another way of
103 * saying this is that the corresponding closure is
107 * <p>See below for more information about closures.</p>
109 * <p><strong>Arrays</strong></p>
111 * <p>A <code>ChoiceFormat</code> defining <code>n</code> intervals
112 * (<code>n</code> >= 2) is specified by three arrays of
113 * <code>n</code> items:
116 * <li><code>double limits[]</code> gives the start of each
117 * interval. This must be a non-decreasing list of values, none of
118 * which may be <code>NaN</code>.</li>
119 * <li><code>UBool closures[]</code> determines whether each limit
120 * value is contained in the interval below it or in the interval
121 * above it. If <code>closures[i]</code> is <code>FALSE</code>, then
122 * <code>limits[i]</code> is a member of interval
123 * <code>i</code>. Otherwise it is a member of interval
124 * <code>i+1</code>. If no closures array is specified, this is
125 * equivalent to having all closures be <code>FALSE</code>. Closures
126 * allow one to specify half-open, open, or closed intervals.</li>
127 * <li><code>UnicodeString formats[]</code> gives the string label
128 * associated with each interval.</li>
131 * <p><strong>Formatting and Parsing</strong></p>
133 * <p>During formatting, a number is converted to a
134 * string. <code>ChoiceFormat</code> accomplishes this by mapping the
135 * number to an interval using the following rule. Given a number
136 * <code>X</code> and and index value <code>j</code> in the range
137 * <code>0..n-1</code>, where <code>n</code> is the number of ranges:</p>
139 * \htmlonly<blockquote>\endhtmlonly<code>X</code> matches <code>j</code> if and only if
140 * <code>limit[j] <= X < limit[j+1]</code>
141 * \htmlonly</blockquote>\endhtmlonly
143 * <p>(This assumes that all closures are <code>FALSE</code>. If some
144 * closures are <code>TRUE</code> then the relations must be changed to
145 * <code><=</code> or <code><</code> as appropriate.) If there is
146 * no match, then either the first or last index is used, depending on
147 * whether the number is too low or too high. Once a number is mapped to
148 * an interval <code>j</code>, the string <code>formats[j]</code> is
151 * <p>During parsing, a string is converted to a
152 * number. <code>ChoiceFormat</code> finds the element
153 * <code>formats[j]</code> equal to the string, and returns
154 * <code>limits[j]</code> as the parsed value.</p>
156 * <p><strong>Notes</strong></p>
158 * <p>The first limit value does not define a range boundary. For
159 * example, in the pattern \htmlonly"<code>1.0#a|2.0#b</code>"\endhtmlonly, the
160 * intervals are [-Inf, 2.0) and [2.0, +Inf]. It appears that the first
161 * interval should be [1.0, 2.0). However, since all values that are too
162 * small are mapped to range zero, the first interval is effectively
163 * [-Inf, 2.0). However, the first limit value <em>is</em> used during
164 * formatting. In this example, <code>parse("a")</code> returns
167 * <p>There are no gaps between intervals and the entire number line is
168 * covered. A <code>ChoiceFormat</code> maps <em>all</em> possible
169 * double values to a finite set of intervals.</p>
171 * <p>The non-number <code>NaN</code> is mapped to interval zero during
174 * <p><strong>Examples</strong></p>
176 * <p>Here is an example of two arrays that map the number
177 * <code>1..7</code> to the English day of the week abbreviations
178 * <code>Sun..Sat</code>. No closures array is given; this is the same as
179 * specifying all closures to be <code>FALSE</code>.</p>
181 * <pre> {1,2,3,4,5,6,7},
182 * {"Sun","Mon","Tue","Wed","Thur","Fri","Sat"}</pre>
184 * <p>Here is an example that maps the ranges [-Inf, 1), [1, 1], and (1,
185 * +Inf] to three strings. That is, the number line is split into three
186 * ranges: x < 1.0, x = 1.0, and x > 1.0.</p>
189 * {FALSE, FALSE, TRUE},
190 * {"no files", "one file", "many files"}</pre>
192 * <p>Here is a simple example that shows formatting and parsing: </p>
195 * #include <unicode/choicfmt.h>
196 * #include <unicode/unistr.h>
197 * #include <iostream.h>
199 * int main(int argc, char *argv[]) {
200 * double limits[] = {1,2,3,4,5,6,7};
201 * UnicodeString monthNames[] = {
202 * "Sun","Mon","Tue","Wed","Thu","Fri","Sat"};
203 * ChoiceFormat fmt(limits, monthNames, 7);
206 * for (double x = 1.0; x <= 8.0; x += 1.0) {
207 * fmt.format(x, str);
208 * str.extract(0, str.length(), buf, 256, "");
210 * cout << x << " -> "
218 * <p>Here is a more complex example using a <code>ChoiceFormat</code>
219 * constructed from a pattern together with a
220 * <code>MessageFormat</code>.</p>
223 * #include <unicode/choicfmt.h>
224 * #include <unicode/msgfmt.h>
225 * #include <unicode/unistr.h>
226 * #include <iostream.h>
228 * int main(int argc, char *argv[]) {
229 * UErrorCode status = U_ZERO_ERROR;
230 * double filelimits[] = {0,1,2};
231 * UnicodeString filepart[] =
232 * {"are no files","is one file","are {0} files"};
233 * ChoiceFormat* fileform = new ChoiceFormat(filelimits, filepart, 3 );
234 * Format* testFormats[] =
235 * {fileform, NULL, NumberFormat::createInstance(status)};
236 * MessageFormat pattform("There {0} on {1}", status );
237 * pattform.adoptFormats(testFormats, 3);
238 * Formattable testArgs[] = {0L, "Disk A"};
239 * FieldPosition fp(0);
242 * for (int32_t i = 0; i < 4; ++i) {
243 * Formattable fInt(i);
244 * testArgs[0] = fInt;
245 * pattform.format(testArgs, 2, str, fp, status );
246 * str.extract(0, str.length(), buf, "");
248 * cout << "Output for i=" << i << " : " << buf << endl;
255 * <p><em>User subclasses are not supported.</em> While clients may write
256 * subclasses, such code will not necessarily work and will not be
257 * guaranteed to work stably from release to release.
259 class U_I18N_API ChoiceFormat
: public NumberFormat
{
262 * Construct a new ChoiceFormat with the limits and the corresponding formats
263 * based on the pattern.
265 * @param pattern Pattern used to construct object.
266 * @param status Output param to receive success code. If the
267 * pattern cannot be parsed, set to failure code.
270 ChoiceFormat(const UnicodeString
& pattern
,
275 * Construct a new ChoiceFormat with the given limits and formats. Copy
276 * the limits and formats instead of adopting them.
278 * @param limits Array of limit values.
279 * @param formats Array of formats.
280 * @param count Size of 'limits' and 'formats' arrays.
284 ChoiceFormat(const double* limits
,
285 const UnicodeString
* formats
,
289 * Construct a new ChoiceFormat with the given limits and formats.
290 * Copy the limits and formats (instead of adopting them). By
291 * default, each limit in the array specifies the inclusive lower
292 * bound of its range, and the exclusive upper bound of the previous
293 * range. However, if the isLimitOpen element corresponding to a
294 * limit is TRUE, then the limit is the exclusive lower bound of its
295 * range, and the inclusive upper bound of the previous range.
296 * @param limits Array of limit values
297 * @param closures Array of booleans specifying whether each
298 * element of 'limits' is open or closed. If FALSE, then the
299 * corresponding limit is a member of the range above it. If TRUE,
300 * then the limit belongs to the range below it.
301 * @param formats Array of formats
302 * @param count Size of 'limits', 'closures', and 'formats' arrays
305 ChoiceFormat(const double* limits
,
306 const UBool
* closures
,
307 const UnicodeString
* formats
,
313 * @param that ChoiceFormat object to be copied from
316 ChoiceFormat(const ChoiceFormat
& that
);
319 * Assignment operator.
321 * @param that ChoiceFormat object to be copied
324 const ChoiceFormat
& operator=(const ChoiceFormat
& that
);
330 virtual ~ChoiceFormat();
333 * Clone this Format object polymorphically. The caller owns the
334 * result and should delete it when done.
336 * @return a copy of this object
339 virtual Format
* clone(void) const;
342 * Return true if the given Format objects are semantically equal.
343 * Objects of different subclasses are considered unequal.
345 * @param other ChoiceFormat object to be compared
346 * @return true if other is the same as this.
349 virtual UBool
operator==(const Format
& other
) const;
353 * @param pattern The pattern to be applied.
354 * @param status Output param set to success/failure code on
355 * exit. If the pattern is invalid, this will be
356 * set to a failure result.
359 virtual void applyPattern(const UnicodeString
& pattern
,
364 * @param pattern The pattern to be applied.
365 * @param parseError Struct to recieve information on position
366 * of error if an error is encountered
367 * @param status Output param set to success/failure code on
368 * exit. If the pattern is invalid, this will be
369 * set to a failure result.
372 virtual void applyPattern(const UnicodeString
& pattern
,
373 UParseError
& parseError
,
378 * @param pattern Output param which will recieve the pattern
379 * Previous contents are deleted.
380 * @return A reference to 'pattern'
383 virtual UnicodeString
& toPattern(UnicodeString
&pattern
) const;
386 * Set the choices to be used in formatting.
388 * @param limitsToCopy Contains the top value that you want
389 * parsed with that format,and should be in
390 * ascending sorted order. When formatting X,
391 * the choice will be the i, where limit[i]
392 * <= X < limit[i+1].
393 * @param formatsToCopy The format strings you want to use for each limit.
394 * @param count The size of the above arrays.
397 virtual void setChoices(const double* limitsToCopy
,
398 const UnicodeString
* formatsToCopy
,
402 * Set the choices to be used in formatting. See class description
403 * for documenatation of the limits, closures, and formats arrays.
404 * @param limits Array of limits
405 * @param closures Array of limit booleans
406 * @param formats Array of format string
407 * @param count The size of the above arrays
410 virtual void setChoices(const double* limits
,
411 const UBool
* closures
,
412 const UnicodeString
* formats
,
416 * Get the limits passed in the constructor.
418 * @param count The size of the limits arrays
419 * @return the limits.
422 virtual const double* getLimits(int32_t& count
) const;
425 * Get the limit booleans passed in the constructor. The caller
426 * must not delete the result.
428 * @param count The size of the arrays
429 * @return the closures
432 virtual const UBool
* getClosures(int32_t& count
) const;
435 * Get the formats passed in the constructor.
437 * @param count The size of the arrays
438 * @return the formats.
441 virtual const UnicodeString
* getFormats(int32_t& count
) const;
444 * Format a double or long number using this object's choices.
446 * @param number The value to be formatted.
447 * @param appendTo Output parameter to receive result.
448 * Result is appended to existing contents.
449 * @param pos On input: an alignment field, if desired.
450 * On output: the offsets of the alignment field.
451 * @return Reference to 'appendTo' parameter.
454 virtual UnicodeString
& format(double number
,
455 UnicodeString
& appendTo
,
456 FieldPosition
& pos
) const;
458 * Format a int_32t number using this object's choices.
460 * @param number The value to be formatted.
461 * @param appendTo Output parameter to receive result.
462 * Result is appended to existing contents.
463 * @param pos On input: an alignment field, if desired.
464 * On output: the offsets of the alignment field.
465 * @return Reference to 'appendTo' parameter.
468 virtual UnicodeString
& format(int32_t number
,
469 UnicodeString
& appendTo
,
470 FieldPosition
& pos
) const;
473 * Format an int64_t number using this object's choices.
475 * @param number The value to be formatted.
476 * @param appendTo Output parameter to receive result.
477 * Result is appended to existing contents.
478 * @param pos On input: an alignment field, if desired.
479 * On output: the offsets of the alignment field.
480 * @return Reference to 'appendTo' parameter.
483 virtual UnicodeString
& format(int64_t number
,
484 UnicodeString
& appendTo
,
485 FieldPosition
& pos
) const;
488 * Format an array of objects using this object's choices.
490 * @param objs The array of objects to be formatted.
491 * @param cnt The size of objs.
492 * @param appendTo Output parameter to receive result.
493 * Result is appended to existing contents.
494 * @param pos On input: an alignment field, if desired.
495 * On output: the offsets of the alignment field.
496 * @param success Output param set to success/failure code on
498 * @return Reference to 'appendTo' parameter.
501 virtual UnicodeString
& format(const Formattable
* objs
,
503 UnicodeString
& appendTo
,
505 UErrorCode
& success
) const;
507 * Format an object using this object's choices.
510 * @param obj The object to be formatted.
511 * @param appendTo Output parameter to receive result.
512 * Result is appended to existing contents.
513 * @param pos On input: an alignment field, if desired.
514 * On output: the offsets of the alignment field.
515 * @param status Output param set to success/failure code on
517 * @return Reference to 'appendTo' parameter.
520 virtual UnicodeString
& format(const Formattable
& obj
,
521 UnicodeString
& appendTo
,
523 UErrorCode
& status
) const;
526 * Redeclared NumberFormat method.
528 * @param obj The object to be formatted.
529 * @param appendTo Output parameter to receive result.
530 * Result is appended to existing contents.
531 * @param status Output param set to success/failure code on
533 * @return Reference to 'appendTo' parameter.
536 UnicodeString
& format(const Formattable
& obj
,
537 UnicodeString
& appendTo
,
538 UErrorCode
& status
) const;
541 * Redeclared NumberFormat method.
542 * Format a double number. These methods call the NumberFormat
543 * pure virtual format() methods with the default FieldPosition.
545 * @param number The value to be formatted.
546 * @param appendTo Output parameter to receive result.
547 * Result is appended to existing contents.
548 * @return Reference to 'appendTo' parameter.
551 UnicodeString
& format( double number
,
552 UnicodeString
& appendTo
) const;
555 * Redeclared NumberFormat method.
556 * Format a long number. These methods call the NumberFormat
557 * pure virtual format() methods with the default FieldPosition.
559 * @param number The value to be formatted.
560 * @param appendTo Output parameter to receive result.
561 * Result is appended to existing contents.
562 * @return Reference to 'appendTo' parameter.
565 UnicodeString
& format( int32_t number
,
566 UnicodeString
& appendTo
) const;
569 * Return a long if possible (e.g. within range LONG_MAX,
570 * LONG_MAX], and with no decimals), otherwise a double. If
571 * IntegerOnly is set, will stop at a decimal point (or equivalent;
572 * e.g. for rational numbers "1 2/3", will stop after the 1).
574 * If no object can be parsed, parsePosition is unchanged, and NULL is
577 * @param text The text to be parsed.
578 * @param result Formattable to be set to the parse result.
579 * If parse fails, return contents are undefined.
580 * @param parsePosition The position to start parsing at on input.
581 * On output, moved to after the last successfully
582 * parse character. On parse failure, does not change.
583 * @see NumberFormat::isParseIntegerOnly
586 virtual void parse(const UnicodeString
& text
,
588 ParsePosition
& parsePosition
) const;
591 * Return a long if possible (e.g. within range LONG_MAX,
592 * LONG_MAX], and with no decimals), otherwise a double. If
593 * IntegerOnly is set, will stop at a decimal point (or equivalent;
594 * e.g. for rational numbers "1 2/3", will stop after the 1).
596 * If no object can be parsed, parsePosition is unchanged, and NULL is
599 * @param text The text to be parsed.
600 * @param result Formattable to be set to the parse result.
601 * If parse fails, return contents are undefined.
602 * @param status Output param with the formatted string.
603 * @see NumberFormat::isParseIntegerOnly
606 virtual void parse(const UnicodeString
& text
,
608 UErrorCode
& status
) const;
613 * Returns a unique class ID POLYMORPHICALLY. Pure virtual override.
614 * This method is to implement a simple version of RTTI, since not all
615 * C++ compilers support genuine RTTI. Polymorphic operator==() and
616 * clone() methods call this method.
618 * @return The class ID for this object. All objects of a
619 * given class have the same class ID. Objects of
620 * other classes have different class IDs.
623 virtual UClassID
getDynamicClassID(void) const;
626 * Return the class ID for this class. This is useful only for
627 * comparing to a return value from getDynamicClassID(). For example:
629 * . Base* polymorphic_pointer = createPolymorphicObject();
630 * . if (polymorphic_pointer->getDynamicClassID() ==
631 * . Derived::getStaticClassID()) ...
633 * @return The class ID for all objects of this class.
636 static UClassID U_EXPORT2
getStaticClassID(void);
639 // static cache management (thread-safe)
640 // static NumberFormat* getNumberFormat(UErrorCode &status); // call this function to 'check out' a numberformat from the cache.
641 // static void releaseNumberFormat(NumberFormat *adopt); // call this function to 'return' the number format to the cache.
644 * Converts a string to a double value using a default NumberFormat object
645 * which is static (shared by all ChoiceFormat instances).
646 * @param string the string to be converted with.
647 * @return the converted double number.
649 static double stod(const UnicodeString
& string
);
652 * Converts a double value to a string using a default NumberFormat object
653 * which is static (shared by all ChoiceFormat instances).
654 * @param value the double number to be converted with.
655 * @param string the result string.
656 * @return the converted string.
658 static UnicodeString
& dtos(double value
, UnicodeString
& string
);
660 ChoiceFormat(); // default constructor not implemented
663 * Construct a new ChoiceFormat with the limits and the corresponding formats
664 * based on the pattern.
666 * @param newPattern Pattern used to construct object.
667 * @param parseError Struct to recieve information on position
668 * of error if an error is encountered.
669 * @param status Output param to receive success code. If the
670 * pattern cannot be parsed, set to failure code.
673 ChoiceFormat(const UnicodeString
& newPattern
,
674 UParseError
& parseError
,
677 friend class MessageFormat
;
679 * Each ChoiceFormat divides the range -Inf..+Inf into fCount
680 * intervals. The intervals are:
682 * 0: fChoiceLimits[0]..fChoiceLimits[1]
683 * 1: fChoiceLimits[1]..fChoiceLimits[2]
685 * fCount-2: fChoiceLimits[fCount-2]..fChoiceLimits[fCount-1]
686 * fCount-1: fChoiceLimits[fCount-1]..+Inf
688 * Interval 0 is special; during formatting (mapping numbers to
689 * strings), it also contains all numbers less than
690 * fChoiceLimits[0], as well as NaN values.
692 * Interval i maps to and from string fChoiceFormats[i]. When
693 * parsing (mapping strings to numbers), then intervals map to
694 * their lower limit, that is, interval i maps to fChoiceLimit[i].
696 * The intervals may be closed, half open, or open. This affects
697 * formatting but does not affect parsing. Interval i is affected
698 * by fClosures[i] and fClosures[i+1]. If fClosures[i]
699 * is FALSE, then the value fChoiceLimits[i] is in interval i.
700 * That is, intervals i and i are:
702 * i-1: ... x < fChoiceLimits[i]
703 * i: fChoiceLimits[i] <= x ...
705 * If fClosures[i] is TRUE, then the value fChoiceLimits[i] is
706 * in interval i-1. That is, intervals i-1 and i are:
708 * i-1: ... x <= fChoiceLimits[i]
709 * i: fChoiceLimits[i] < x ...
711 * Because of the nature of interval 0, fClosures[0] has no
715 double* fChoiceLimits
;
717 UnicodeString
* fChoiceFormats
;
721 inline UnicodeString
&
722 ChoiceFormat::format(const Formattable
& obj
,
723 UnicodeString
& appendTo
,
724 UErrorCode
& status
) const {
725 // Don't use Format:: - use immediate base class only,
726 // in case immediate base modifies behavior later.
727 return NumberFormat::format(obj
, appendTo
, status
);
730 inline UnicodeString
&
731 ChoiceFormat::format(double number
,
732 UnicodeString
& appendTo
) const {
733 return NumberFormat::format(number
, appendTo
);
736 inline UnicodeString
&
737 ChoiceFormat::format(int32_t number
,
738 UnicodeString
& appendTo
) const {
739 return NumberFormat::format(number
, appendTo
);
743 #endif /* #if !UCONFIG_NO_FORMATTING */