]> git.saurik.com Git - wxWidgets.git/blame - docs/doxygen/overviews/html.h
compilation fix for pch-less build
[wxWidgets.git] / docs / doxygen / overviews / html.h
CommitLineData
15b6757b 1/////////////////////////////////////////////////////////////////////////////
c33e257b 2// Name: html.h
15b6757b
FM
3// Purpose: topic overview
4// Author: wxWidgets team
5// RCS-ID: $Id$
6// Licence: wxWindows license
7/////////////////////////////////////////////////////////////////////////////
8
9/*!
36c9828f 10
c33e257b 11 @page overview_html wxHTML overview
36c9828f 12
15b6757b
FM
13 The wxHTML library provides classes for parsing and displaying HTML.
14 It is not intended to be a high-end HTML browser. If you are looking for
c33e257b
FM
15 something like that try http://www.mozilla.org.
16
36c9828f 17 wxHTML can be used as a generic rich text viewer - for example to display
15b6757b 18 a nice About Box (like those of GNOME apps) or to display the result of
c33e257b
FM
19 database searching. There is a #wxFileSystem class which allows you to use
20 your own virtual file systems.
21
15b6757b
FM
22 wxHtmlWindow supports tag handlers. This means that you can easily
23 extend wxHtml library with new, unsupported tags. Not only that,
24 you can even use your own application-specific tags!
c33e257b 25
15b6757b 26 See @c src/html/m_*.cpp files for details.
c33e257b
FM
27
28 There is a generic wxHtmlParser class, independent of wxHtmlWindow.
29
30 @li @ref overview_html_quickstart
31 @li @ref overview_html_printing
32 @li @ref overview_html_helpformats
33 @li @ref overview_html_filters
34 @li @ref overview_html_cells
35 @li @ref overview_html_handlers
36 @li @ref overview_html_supptags
37
38
39 <hr>
40
41
42 @section overview_html_quickstart wxHTML quick start
43
44 @subsection overview_html_quickstart_disphtml Displaying HTML
45
46 First of all, you must include @c wx/wxhtml.h.
47
48 Class #wxHtmlWindow (derived from wxScrolledWindow) is used to display HTML documents.
49
50 It has two important methods: wxHtmlWindow::LoadPage and wxHtmlWindow::SetPage.
15b6757b
FM
51 LoadPage loads and displays HTML file while SetPage displays directly the
52 passed @b string. See the example:
36c9828f 53
15b6757b 54 @code
c33e257b 55 mywin - LoadPage("test.htm");
15b6757b
FM
56 mywin - SetPage("htmlbody"
57 "h1Error/h1"
58 "Some error occurred :-H)"
59 "/body/hmtl");
60 @endcode
36c9828f 61
c33e257b
FM
62 @subsection overview_html_quickstart_disphelp Displaying Help
63
15b6757b 64 See #wxHtmlHelpController.
c33e257b
FM
65
66 @subsection overview_html_quickstart_settingup Setting up wxHtmlWindow
67
15b6757b
FM
68 Because wxHtmlWindow is derived from wxScrolledWindow and not from
69 wxFrame, it doesn't have visible frame. But the user usually wants to see
36c9828f 70 the title of HTML page displayed somewhere and the frame's titlebar is
15b6757b 71 the ideal place for it.
c33e257b 72
36c9828f 73 wxHtmlWindow provides 2 methods in order to handle this:
c33e257b 74 wxHtmlWindow::SetRelatedFrame and wxHtmlWindow::SetRelatedStatusBar.
15b6757b 75 See the example:
36c9828f 76
15b6757b 77 @code
c33e257b 78 html = new wxHtmlWindow(this);
15b6757b
FM
79 html - SetRelatedFrame(this, "HTML : %%s");
80 html - SetRelatedStatusBar(0);
81 @endcode
36c9828f 82
15b6757b
FM
83 The first command associates the HTML object with its parent frame
84 (this points to wxFrame object there) and sets the format of the title.
85 Page title "Hello, world!" will be displayed as "HTML : Hello, world!"
86 in this example.
c33e257b 87
15b6757b
FM
88 The second command sets which frame's status bar should be used to display
89 browser's messages (such as "Loading..." or "Done" or hypertext links).
36c9828f 90
c33e257b 91 @subsection overview_html_quickstart_custom Customizing wxHtmlWindow
36c9828f 92
c33e257b
FM
93 You can customize wxHtmlWindow by setting font size, font face and
94 borders (space between border of window and displayed HTML). Related functions:
36c9828f 95
c33e257b
FM
96 @li wxHtmlWindow::SetFonts
97 @li wxHtmlWindow::SetBorders
98 @li wxHtmlWindow::ReadCustomization
99 @li wxHtmlWindow::WriteCustomization
36c9828f 100
15b6757b
FM
101 The last two functions are used to store user customization info wxConfig stuff
102 (for example in the registry under Windows, or in a dotfile under Unix).
36c9828f 103
c33e257b
FM
104
105
106 @section overview_html_printing HTML Printing
36c9828f
FM
107
108 The wxHTML library provides printing facilities with several levels of complexity.
c33e257b
FM
109 The easiest way to print an HTML document is to use @ref htmleasyprinting_overview.
110
111 It lets you print HTML documents with only one command and you don't have to worry
112 about deriving from the wxPrintout class at all. It is only a simple wrapper around the
15b6757b 113 #wxHtmlPrintout, normal wxWidgets printout class.
c33e257b 114
15b6757b 115 And finally there is the low level class #wxHtmlDCRenderer which you can use to
c33e257b 116 render HTML into a rectangular area on any DC.
36c9828f 117
c33e257b
FM
118 It supports rendering into multiple rectangles with the same
119 width. (The most common use of this is placing one rectangle on each page or
120 printing into two columns.)
121
122
123 @section overview_html_helpformats Help Files Format
36c9828f 124
15b6757b 125 wxHTML library uses a reduced version of MS HTML Workshop format.
c33e257b
FM
126 Tex2RTF can produce these files when generating HTML, if you set
127 @b htmlWorkshopFiles to @true in your tex2rtf.ini file.
15b6757b 128 (See #wxHtmlHelpController for help controller description.)
c33e257b
FM
129
130 A @b book consists of three files: the header file, the contents file
131 and the index file.
132
133 You can make a regular zip archive of these files, plus the HTML and any
134 image files, for wxHTML (or helpview) to read; and the @c .zip file can
135 optionally be renamed to @c .htb.
136
137 @subsection overview_html_helpformats_hhp Header file (.hhp)
138
139 The header file must contain these lines (and may contain additional lines
140 which are ignored):
36c9828f 141
15b6757b
FM
142 @code
143 Contents file=filename.hhc
144 Index file=filename.hhk
145 Title=title of your book
146 Default topic=default page to be displayed.htm
147 @endcode
36c9828f
FM
148
149 All filenames (including the Default topic) are relative to the
c33e257b 150 location of the @c .hhp file.
36c9828f 151
c33e257b
FM
152 @note For localization, in addition the @c .hhp file may contain the line
153 @code
154 Charset=rfc_charset
155 @endcode
156 which specifies what charset (e.g. "iso8859_1") was used in contents
157 and index files. Please note that this line is incompatible with
158 MS HTML Help Workshop and it would either silently remove it or complain
159 with some error. See also @ref overview_nonenglish.
36c9828f 160
c33e257b
FM
161 @subsection overview_html_helpformats_hhc Contents file (.hhc)
162
163 Contents file has HTML syntax and it can be parsed by regular HTML parser.
164 It contains exactly one list (@c &lt;ul&gt;....@c &lt;/ul&gt; statement):
36c9828f 165
15b6757b 166 @code
c33e257b
FM
167 <ul>
168
169 <li><object type="text/sitemap">
170 <param name="Name" value="@topic name@">
171 <param name="ID" value=@numeric_id@>
172 <param name="Local" value="@filename.htm@">
173 </object>
174 <li><object type="text/sitemap">
175 <param name="Name" value="@topic name@">
176 <param name="ID" value=@numeric_id@>
177 <param name="Local" value="@filename.htm@">
178 </object>
36c9828f 179 ...
c33e257b 180 </ul>
15b6757b 181 @endcode
36c9828f 182
c33e257b
FM
183 You can modify value attributes of param tags.
184 The <em>topic name</em> is name of chapter/topic as is displayed in
185 contents, <em>filename.htm</em> is the HTML page name (relative to the @c .hhp file)
186 and <em>numeric_id</em> is optional - it is used only when you use wxHtmlHelpController::Display(int).
36c9828f 187
c33e257b 188 Items in the list may be nested - one @c &lt;li&gt; statement may contain a @c &lt;ul&gt; sub-statement:
36c9828f 189
c33e257b
FM
190 @code
191 <ul>
192
193 <li><object type="text/sitemap">
194 <param name="Name" value="Top node">
195 <param name="Local" value="top.htm">
196 </object>
197 <ul>
198 <li><object type="text/sitemap">
199 <param name="Name" value="subnode in topnode">
200 <param name="Local" value="subnode1.htm">
201 </object>
202 ...
203 </ul>
204
205 <li><object type="text/sitemap">
206 <param name="Name" value="Another Top">
207 <param name="Local" value="top2.htm">
208 </object>
209 ...
210
211 </ul>
15b6757b 212 @endcode
36c9828f 213
c33e257b
FM
214 @subsection overview_html_helpformats_hhk Index file (.hhk)
215
216 Index files have same format as contents file except that ID params are ignored
217 and sublists are @b not allowed.
218
36c9828f 219
c33e257b 220 @section overview_html_filters Input Filters
36c9828f 221
15b6757b 222 The wxHTML library provides a mechanism for reading and displaying
36c9828f 223 files of many different file formats.
c33e257b
FM
224
225 wxHtmlWindow::LoadPage can load not only HTML files but any known file.
226 To make a file type known to wxHtmlWindow you must create a #wxHtmlFilter filter and
15b6757b 227 register it using wxHtmlWindow::AddFilter.
36c9828f 228
36c9828f 229
c33e257b
FM
230 @section overview_html_cells Cells and Containers
231
232 This article describes mechanism used by #wxHtmlWinParser and
15b6757b 233 #wxHtmlWindow to parse and display HTML documents.
c33e257b
FM
234
235 @subsection overview_html_cells_cells Cells
236
15b6757b
FM
237 You can divide any text (or HTML) into small fragments. Let's call these
238 fragments @b cells. Cell is for example one word, horizontal line, image
239 or any other part of document. Each cell has width and height (except special
240 "magic" cells with zero dimensions - e.g. colour changers or font changers).
241 See #wxHtmlCell.
c33e257b
FM
242
243 @subsection overview_html_cells_containers Containers
244
15b6757b 245 Container is kind of cell that may contain sub-cells. Its size depends
36c9828f 246 on number and sizes of its sub-cells (and also depends on width of window).
c33e257b
FM
247 See #wxHtmlContainerCell, wxHtmlCell::Layout.
248 This image shows the cells and containers: @image html contbox.bmp
249
250 @subsection overview_html_cells_conttaghandler Using Containers in Tag Handler
251
252 #wxHtmlWinParser provides a user-friendly way of managing containers.
253 It is based on the idea of opening and closing containers.
254
255 Use #OpenContainer to open new a container @e within an already opened container.
256 This new container is a @e sub-container of the old one. (If you want to create a
257 new container with the same depth level you can call @c CloseContainer(); OpenContainer();.)
258
259 Use #CloseContainer to close the container. This doesn't create a new container
260 with same depth level but it returns "control" to the parent container.
261 See explanation: @image html cont.bmp
36c9828f 262
15b6757b
FM
263 There clearly must be same number of calls to OpenContainer as to
264 CloseContainer.
c33e257b
FM
265
266 @subsubsection overview_html_cells_conttaghandler_example Example
267
15b6757b
FM
268 This code creates a new paragraph (container at same depth level)
269 with "Hello, world!":
36c9828f 270
15b6757b 271 @code
c33e257b
FM
272 m_WParser -> CloseContainer();
273 c = m_WParser -> OpenContainer();
36c9828f 274
c33e257b
FM
275 m_WParser -> AddText("Hello, ");
276 m_WParser -> AddText("world!");
36c9828f 277
c33e257b
FM
278 m_WParser -> CloseContainer();
279 m_WParser -> OpenContainer();
15b6757b 280 @endcode
36c9828f 281
c33e257b 282 and here is image of the situation: @image html hello.bmp
36c9828f 283
15b6757b
FM
284 You can see that there was an opened container before the code was executed.
285 We closed it, created our own container, then closed our container and opened
c33e257b
FM
286 new container.
287
288 The result was that we had @e same depth level after executing.
289 This is general rule that should be followed by tag handlers:
15b6757b
FM
290 leave depth level of containers unmodified (in other words, number of
291 OpenContainer and CloseContainer calls should be same within #HandleTag's body).
c33e257b
FM
292
293 Notice that it would be usually better to use wxHtmlContainerCell::InsertCell instead
15b6757b 294 of adding text to the parser directly.
36c9828f 295
c33e257b
FM
296
297 @section overview_html_handlers Tag Handlers
36c9828f 298
15b6757b
FM
299 The wxHTML library provides architecture of pluggable @e tag handlers.
300 Tag handler is class that understands particular HTML tag (or tags) and is
301 able to interpret it.
c33e257b
FM
302
303 #wxHtmlWinParser has a static table of @b modules.
15b6757b
FM
304 Each module contains one or more tag handlers. Each time a new wxHtmlWinParser
305 object is constructed all modules are scanned and handlers are added
306 to wxHtmlParser's list of available handlers (note: wxHtmlParser's list
307 is non-static).
36c9828f 308
c33e257b
FM
309 @subsection overview_html_handlers_howworks How it works
310
311 Common tag handler's #HandleTag method works in four steps:
36c9828f 312
c33e257b
FM
313 @li Save state of parent parser into local variables
314 @li Change parser state according to tag's params
315 @li Parse text between the tag and paired ending tag (if present)
316 @li Restore original parser state
36c9828f 317
c33e257b
FM
318 See #wxHtmlWinParser for methods for modifying parser's state.
319 In general you can do things like opening/closing containers, changing colors, fonts etc.
36c9828f 320
c33e257b
FM
321 @subsection overview_html_handlers_custom Providing own tag handlers
322
323 You should create a new .cpp file and place the following lines into it:
36c9828f 324
15b6757b 325 @code
c33e257b
FM
326 #include <mod_templ.h>
327 #include <forcelink.h>
15b6757b
FM
328 FORCE_LINK_ME(yourmodulefilenamewithoutcpp)
329 @endcode
36c9828f 330
15b6757b 331 Then you must define handlers and one module.
36c9828f 332
c33e257b 333 @subsection overview_html_handlers_tag Tag handlers
36c9828f 334
c33e257b 335 The handler is derived from #wxHtmlWinTagHandler (or directly from #wxHtmlTagHandler).
36c9828f 336
c33e257b
FM
337 You can use set of macros to define the handler (see src/html/m_*.cpp files
338 for details). Handler definition must start with @b TAG_HANDLER_BEGIN macro
339 and end with @b TAG_HANDLER_END macro.
36c9828f 340
c33e257b
FM
341 I strongly recommend to have a look at @e include/wxhtml/mod_templ.h file.
342 Otherwise you won't understand the structure of macros.
36c9828f 343
c33e257b
FM
344 See macros reference:
345 @li @b TAG_HANDLER_BEGIN(@e name, @e tags):
346 Starts handler definition. @e name is handler identifier (in fact
347 part of class name), @e tags is string containing list of tags
348 supported by this handler (in uppercase). This macro derives new class from
349 wxHtmlWinTagHandler and implements it is #GetSupportedTags method.
350 Example: TAG_HANDLER_BEGIN(FONTS, "B,I,U,T")
351
352 @li @b TAG_HANDLER_VARS:
353 This macro starts block of variables definitions. (Variables are identical
354 to class attributes.) Example:
355
356 @code
357 TAG_HANDLER_BEGIN(VARS_ONLY, "CRAZYTAG")
358 TAG_HANDLER_VARS
359 int my_int_var;
360 wxString something_else;
361 TAG_HANDLER_END(VARS_ONLY)
362 @endcode
363
364 This macro is used only in rare cases.
365
366 @li @b TAG_HANDLER_CONSTR(@e name):
367 This macro supplies object constructor. @e name is same name as the one
368 from TAG_HANDLER_BEGIN macro. Body of constructor follow after
369 this macro (you must use { and } ). Example:
370
371 @code
372 TAG_HANDLER_BEGIN(VARS2, "CRAZYTAG")
373 TAG_HANDLER_VARS
374 int my_int_var;
375 TAG_HANDLER_CONSTR(vars2)
376 { // !!!!!!
377 my_int_var = 666;
378 } // !!!!!!
379 TAG_HANDLER_END(VARS2)
380 @endcode
381
382 Never used in wxHTML :-)
383
384 @li @b TAG_HANDLER_PROC(@e varib):
385 This is very important macro. It defines #HandleTag
386 method. @e varib is name of parameter passed to the method, usually
387 @e tag. Body of method follows after this macro.
388 Note than you must use { and } !
389 Example:
390
391 @code
392 TAG_HANDLER_BEGIN(TITLE, "TITLE")
393 TAG_HANDLER_PROC(tag)
394 {
395 printf("TITLE found...\n");
396 }
397 TAG_HANDLER_END(TITLE)
398 @endcode
399
400 @li @b TAG_HANDLER_END(@e name):
401 Ends definition of tag handler @e name.
402
403 @subsection overview_html_handlers_modules Tags Modules
36c9828f 404
36c9828f 405 You can use set of 3 macros TAGS_MODULE_BEGIN, TAGS_MODULE_ADD and
15b6757b
FM
406 TAGS_MODULE_END to inherit new module from
407 #wxHtmlTagsModule and to create instance of it.
36c9828f 408
c33e257b 409 See macros reference:
36c9828f 410
c33e257b
FM
411 @li @b TAGS_MODULE_BEGIN(@e modname):
412 Begins module definition. @e modname is part of class name and must be unique.
413 @li @b TAGS_MODULE_ADD(@e name):
414 Adds the handler to this module. @e name is the identifier from TAG_HANDLER_BEGIN.
415 @li @b TAGS_MODULE_END(@e modname):
416 Ends the definition of module.
417 Example:
418
419 @code
420 TAGS_MODULE_BEGIN(Examples)
421 TAGS_MODULE_ADD(VARS_ONLY)
422 TAGS_MODULE_ADD(VARS2)
423 TAGS_MODULE_ADD(TITLE)
424 TAGS_MODULE_END(Examples)
425 @endcode
426
427
428 @section overview_html_supptags Tags supported by wxHTML
429
430 wxHTML is not full implementation of HTML standard. Instead, it supports most
431 common tags so that it is possible to display @e simple HTML documents with it.
432 (For example it works fine with pages created in Netscape Composer or generated by tex2rtf).
36c9828f 433
15b6757b 434 Following tables list all tags known to wxHTML, together with supported parameters.
c33e257b 435
15b6757b 436 A tag has general form of @c tagname param_1 param_2 ... param_n where param_i is
c33e257b
FM
437 either @c paramname="paramvalue" or @c paramname=paramvalue - these two are equivalent.
438 Unless stated otherwise, wxHTML is case-insensitive.
439
440 @subsection overview_html_supptags_commonvalues Table of common parameter values
441
15b6757b 442 We will use these substitutions in tags descriptions:
36c9828f 443
15b6757b
FM
444 @code
445 [alignment] CENTER
446 LEFT
447 RIGHT
448 JUSTIFY
36c9828f 449
15b6757b
FM
450 [v_alignment] TOP
451 BOTTOM
452 CENTER
36c9828f 453
15b6757b 454 [color] HTML 4.0-compliant colour specification
36c9828f 455
15b6757b
FM
456 [fontsize] -2
457 -1
458 +0
459 +1
460 +2
461 +3
462 +4
463 1
464 2
465 3
466 4
467 5
468 6
469 7
36c9828f 470
15b6757b 471 [pixels] integer value that represents dimension in pixels
36c9828f
FM
472
473 [percent] i%
15b6757b 474 where i is integer
36c9828f
FM
475
476 [url] an URL
477
15b6757b 478 [string] text string
36c9828f 479
15b6757b
FM
480 [coords] c(1),c(2),c(3),...,c(n)
481 where c(i) is integer
482 @endcode
36c9828f
FM
483
484
c33e257b 485 @subsection overview_html_supptags_list List of supported tags
36c9828f 486
15b6757b
FM
487 @code
488 A NAME=[string]
489 HREF=[url]
490 TARGET=[target window spec]
491 ADDRESS
492 AREA SHAPE=POLY
493 SHAPE=CIRCLE
494 SHAPE=RECT
495 COORDS=[coords]
496 HREF=[url]
497 B
498 BIG
499 BLOCKQUOTE
500 BODY TEXT=[color]
501 LINK=[color]
502 BGCOLOR=[color]
503 BR ALIGN=[alignment]
504 CENTER
505 CITE
506 CODE
507 DD
508 DIV ALIGN=[alignment]
509 DL
510 DT
511 EM
512 FONT COLOR=[color]
513 SIZE=[fontsize]
514 FACE=[comma-separated list of facenames]
515 HR ALIGN=[alignment]
516 SIZE=[pixels]
517 WIDTH=[percent|pixels]
518 NOSHADE
519 H1
520 H2
521 H3
522 H4
523 H5
524 H6
525 I
526 IMG SRC=[url]
527 WIDTH=[pixels]
528 HEIGHT=[pixels]
529 ALIGN=TEXTTOP
530 ALIGN=CENTER
531 ALIGN=ABSCENTER
532 ALIGN=BOTTOM
533 USEMAP=[url]
534 KBD
535 LI
536 MAP NAME=[string]
537 META HTTP-EQUIV="Content-Type"
538 CONTENT=[string]
539 OL
540 P ALIGN=[alignment]
541 PRE
542 SAMP
543 SMALL
544 STRIKE
545 STRONG
546 SUB
547 SUP
548 TABLE ALIGN=[alignment]
549 WIDTH=[percent|pixels]
550 BORDER=[pixels]
551 VALIGN=[v_alignment]
552 BGCOLOR=[color]
553 CELLSPACING=[pixels]
554 CELLPADDING=[pixels]
555 TD ALIGN=[alignment]
556 VALIGN=[v_alignment]
557 BGCOLOR=[color]
558 WIDTH=[percent|pixels]
559 COLSPAN=[pixels]
560 ROWSPAN=[pixels]
561 NOWRAP
562 TH ALIGN=[alignment]
563 VALIGN=[v_alignment]
564 BGCOLOR=[color]
565 WIDTH=[percent|pixels]
566 COLSPAN=[pixels]
567 ROWSPAN=[pixels]
568 TITLE
569 TR ALIGN=[alignment]
570 VALIGN=[v_alignment]
571 BGCOLOR=[color]
572 TT
573 U
574 UL
575 @endcode
36c9828f 576
c33e257b 577*/
36c9828f 578