1 .\" Copyright (c) 1989, 1991, 1993
2 .\" The Regents of the University of California. All rights reserved.
4 .\" Redistribution and use in source and binary forms, with or without
5 .\" modification, are permitted provided that the following conditions
7 .\" 1. Redistributions of source code must retain the above copyright
8 .\" notice, this list of conditions and the following disclaimer.
9 .\" 2. Redistributions in binary form must reproduce the above copyright
10 .\" notice, this list of conditions and the following disclaimer in the
11 .\" documentation and/or other materials provided with the distribution.
12 .\" 3. All advertising materials mentioning features or use of this software
13 .\" must display the following acknowledgement:
14 .\" This product includes software developed by the University of
15 .\" California, Berkeley and its contributors.
16 .\" 4. Neither the name of the University nor the names of its contributors
17 .\" may be used to endorse or promote products derived from this software
18 .\" without specific prior written permission.
20 .\" THIS SOFTWARE IS PROVIDED BY THE REGENTS AND CONTRIBUTORS ``AS IS'' AND
21 .\" ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
22 .\" IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR PURPOSE
23 .\" ARE DISCLAIMED. IN NO EVENT SHALL THE REGENTS OR CONTRIBUTORS BE LIABLE
24 .\" FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL, EXEMPLARY, OR CONSEQUENTIAL
25 .\" DAMAGES (INCLUDING, BUT NOT LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS
26 .\" OR SERVICES; LOSS OF USE, DATA, OR PROFITS; OR BUSINESS INTERRUPTION)
27 .\" HOWEVER CAUSED AND ON ANY THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT
28 .\" LIABILITY, OR TORT (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY
29 .\" OUT OF THE USE OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF
32 .\" From: @(#)vis.3 8.1 (Berkeley) 6/9/93
33 .\" $FreeBSD: src/lib/libc/gen/vis.3,v 1.26 2004/07/17 12:27:25 tjr Exp $
40 .Nd visually encode characters
46 .Fn vis "char *dst" "int c" "int flag" "int nextc"
48 .Fn strvis "char *dst" "const char *src" "int flag"
50 .Fn strvisx "char *dst" "const char *src" "size_t len" "int flag"
57 a string which represents the character
61 needs no encoding, it is copied in unaltered.
63 null terminated, and a pointer to the end of the string is
65 The maximum length of any encoding is four
66 characters (not including the trailing
69 encoding a set of characters into a buffer, the size of the buffer should
70 be four times the number of characters encoded, plus one for the trailing
74 argument is used for altering the default range of
75 characters considered for encoding and for altering the visual
77 The additional character,
79 is only used when selecting the
81 encoding format (explained below).
89 a visual representation of
94 function encodes characters from
101 function encodes exactly
106 is useful for encoding a block of data that may contain
114 must be four times the number
115 of characters encoded from
120 forms return the number of characters in dst (not including
124 The encoding is a unique, invertible representation composed entirely of
125 graphic characters; it can be decoded back into the original form using
132 There are two parameters that can be controlled: the range of
133 characters that are encoded, and the type
134 of representation used.
135 By default, all non-graphic characters
136 except space, tab, and newline are encoded.
141 .Bl -tag -width VIS_WHITEX
143 Also encode magic characters
165 Only encode "unsafe" characters.
167 characters which may cause common terminals to perform
168 unexpected functions.
169 Currently this form allows space,
170 tab, newline, backspace, bell, and return - in addition
171 to all graphic characters - unencoded.
174 There are four forms of encoding.
175 Most forms use the backslash character
177 to introduce a special
178 sequence; two backslashes are used to represent a real backslash.
179 These are the visual formats:
180 .Bl -tag -width VIS_HTTPSTYLE
184 to represent meta characters (characters with the 8th
185 bit set), and use caret
187 to represent control characters see
188 .Pf ( Xr iscntrl 3 ) .
189 The following formats are used:
190 .Bl -tag -width xxxxx
192 Represents the control character
205 with the 8th bit set.
211 Represents control character
213 with the 8th bit set.
227 Represents Meta-space.
231 Use C-style backslash sequences to represent standard non-printable
233 The following sequences are used to represent the indicated characters:
234 .Bd -unfilled -offset indent
235 .Li \ea Tn - BEL No (007)
236 .Li \eb Tn - BS No (010)
237 .Li \ef Tn - NP No (014)
238 .Li \en Tn - NL No (012)
239 .Li \er Tn - CR No (015)
240 .Li \et Tn - HT No (011)
241 .Li \ev Tn - VT No (013)
242 .Li \e0 Tn - NUL No (000)
245 When using this format, the
247 argument is looked at to determine
250 character can be encoded as
256 is an octal digit, the latter representation is used to
259 Use URI encoding as described in RFC 1808.
264 represents a hexadecimal digit.
266 Use a three digit octal sequence.
271 represents an octal digit.
274 There is one additional flag,
277 doubling of backslashes and the backslash before the default
278 format (that is, control characters are represented by
283 With this flag set, the encoding is
284 ambiguous and non-invertible.
290 .%T Relative Uniform Resource Locators
294 These functions first appeared in
299 family of functions do not recognize multibyte characters, and thus
300 may consider them to be non-printable when they are in fact printable