1 <?xml version=
"1.0" encoding=
"UTF-8"?>
2 <!DOCTYPE book PUBLIC
"-//OASIS//DTD DocBook XML V4.5//EN"
3 "http://www.oasis-open.org/docbook/xml/4.5/docbookx.dtd" [
4 <!ENTITY % aptent SYSTEM
"apt.ent"> %aptent;
5 <!ENTITY % aptverbatiment SYSTEM
"apt-verbatim.ent"> %aptverbatiment;
6 <!ENTITY % aptvendor SYSTEM
"apt-vendor.ent"> %aptvendor;
11 <title>dpkg technical manual
</title>
17 <personname>Tom Lees
</personname><email>tom@lpsg.demon.co.uk
</email>
21 <releaseinfo>Version &apt-product-version;
</releaseinfo>
25 This document describes the minimum necessary workings for the APT dselect
26 replacement. It gives an overall specification of what its external interface
27 must look like for compatibility, and also gives details of some internal
32 <copyright><year>1997</year><holder>Tom Lees
</holder></copyright>
35 <title>License Notice
</title>
37 APT and this document are free software; you can redistribute them and/or
38 modify them under the terms of the GNU General Public License as published by
39 the Free Software Foundation; either version
2 of the License, or (at your
40 option) any later version.
43 For more details, on Debian systems, see the file
44 /usr/share/common-licenses/GPL for the full license.
50 <chapter id=
"ch1"><title>Quick summary of dpkg's external interface
</title>
52 <section id=
"control"><title>Control files
</title>
54 The basic dpkg package control file supports the following major features:-
59 5 types of dependencies:-
64 Pre-Depends, which must be satisfied before a package may be unpacked
69 Depends, which must be satisfied before a package may be configured
74 Recommends, to specify a package which if not installed may severely limit the
75 usefulness of the package
80 Suggests, to specify a package which may increase the productivity of the
86 Conflicts, to specify a package which must NOT be installed in order for the
87 package to be configured
92 Breaks, to specify a package which is broken by the package and which should
93 therefore not be configured while broken
98 Each of these dependencies can specify a version and a depedency on that
99 version, for example "
<=
0.5-
1", "==
2.7.2-
1", etc. The comparators
105 "
<<" - less than
110 "
<=" - less than or equal to
115 "
>>" - greater than
120 "
>=" - greater than or equal to
132 The concept of "virtual packages", which many other packages may provide,
133 using the Provides mechanism. An example of this is the "httpd" virtual
134 package, which all web servers should provide. Virtual package names may be
135 used in dependency headers. However, current policy is that virtual packages
136 do not support version numbers, so dependencies on virtual packages with
137 versions will always fail.
142 Several other control fields, such as Package, Version, Description, Section,
143 Priority, etc., which are mainly for classification purposes. The package
144 name must consist entirely of lowercase characters, plus the characters '+',
145 '-', and '.'. Fields can extend across multiple lines - on the second and
146 subsequent lines, there is a space at the beginning instead of a field name
147 and a ':'. Empty lines must consist of the text " .", which will be ignored,
148 as will the initial space for other continuation lines. This feature is
149 usually only used in the Description field.
155 <section id=
"s1.2"><title>The dpkg status area
</title>
157 The "dpkg status area" is the term used to refer to the directory where dpkg
158 keeps its various status files (GNU would have you call it the dpkg shared
159 state directory). This is always, on Debian systems, /var/lib/dpkg. However,
160 the default directory name should not be hard-coded, but #define'd, so that
161 alteration is possible (it is available via configure in dpkg
1.4.0.9 and
162 above). Of course, in a library, code should be allowed to override the
163 default directory, but the default should be part of the library (so that
164 the user may change the dpkg admin dir simply by replacing the library).
167 Dpkg keeps a variety of files in its status area. These are discussed later
168 on in this document, but a quick summary of the files is here:-
173 available - this file contains a concatenation of control information from all
174 the packages which dpkg knows about. This is updated using the dpkg commands
175 "--update-avail
<file
>", "--merge-avail
<file
>", and
181 status - this file contains information on the following things for every
187 Whether it is installed, not installed, unpacked, removed, failed
188 configuration, or half-installed (deconfigured in favour of another package).
193 Whether it is selected as install, hold, remove, or purge.
198 If it is "ok" (no installation problems), or "not-ok".
203 It usually also contains the section and priority (so that dselect may classify
204 packages not in available)
209 For packages which did not initially appear in the "available" file when they
210 were installed, the other control information for them.
215 The exact format for the "Status:" field is:
218 Status: Want Flag Status
221 Where
<replaceable>Want
</replaceable> may be one of
222 <emphasis>unknown
</emphasis>,
<emphasis>install
</emphasis>,
223 <emphasis>hold
</emphasis>,
<emphasis>deinstall
</emphasis>,
224 <emphasis>purge
</emphasis>.
<replaceable>Flag
</replaceable> may
225 be one of
<emphasis>ok
</emphasis>,
<emphasis>reinstreq
</emphasis>.
226 <replaceable>Status
</replaceable> may
227 be one of
<emphasis>not-installed
</emphasis>,
<emphasis>config-files
</emphasis>,
228 <emphasis>half-installed
</emphasis>,
<emphasis>unpacked
</emphasis>,
229 <emphasis>half-configured
</emphasis> and
<emphasis>installed
</emphasis>.
230 The states are as follows:-
234 <term>not-installed
</term>
237 No files are installed from the package, it has no config files left, it
238 uninstalled cleanly if it ever was installed.
243 <term>unpacked
</term>
246 The basic files have been unpacked (and are listed in
247 /var/lib/dpkg/info/[package].list. There are config files present, but the
248 postinst script has _NOT_ been run.
253 <term>half-configured
</term>
256 The package was installed and unpacked, but the postinst script failed in some
262 <term>installed
</term>
265 All files for the package are installed, and the configuration was also
271 <term>half-installed
</term>
274 An attempt was made to remove the packagem but there was a failure in the
280 <term>config-files
</term>
283 The package was "removed", not "purged". The config files are left, but
290 The two last items are only left in dpkg for compatibility - they are
291 understood by it, but never written out in this form.
294 Please see the dpkg source code,
<literal>lib/parshelp.c
</literal>,
295 <emphasis>statusinfos
</emphasis>,
<emphasis>eflaginfos
</emphasis> and
296 <emphasis>wantinfos
</emphasis> for more details.
301 info - this directory contains files from the control archive of every
302 package currently installed. They are installed with a prefix of
303 "
<packagename
>.". In addition to this, it also contains a file
304 called
<package
>.list for every package, which contains a list
305 of files. Note also that the control file is not copied into here; it
306 is instead found as part of status or available.
311 methods - this directory is reserved for "method"-specific files - each
312 "method" has a subdirectory underneath this directory (or at least,
313 it can have). In addition, there is another subdirectory "mnt", where
314 misc. filesystems (floppies, CD-ROMs, etc.) are mounted.
319 alternatives - directory used by the "update-alternatives" program. It
320 contains one file for each "alternatives" interface, which contains
321 information about all the needed symlinked files for each alternative.
326 diversions - file used by the "dpkg-divert" program. Each diversion takes
327 three lines. The first is the package name (or ":" for user diversion), the
328 second the original filename, and the third the diverted filename.
333 updates - directory used internally by dpkg. This is discussed later, in the
334 section
<xref linkend=
"updates"/>.
339 parts - temporary directory used by dpkg-split
345 <section id=
"s1.3"><title>The dpkg library files
</title>
347 These files are installed under /usr/lib/dpkg (usually), but
348 /usr/local/lib/dpkg is also a possibility (as Debian policy dictates). Under
349 this directory, there is a "methods" subdirectory. The methods subdirectory in
350 turn contains any number of subdirectories for each general method processor
351 (note that one set of method scripts can, and is, used for more than one of
352 the methods listed under dselect).
355 The following files may be found in each of these subdirectories:-
360 names - One line per method, two-digit priority to appear on menu at
361 beginning, followed by a space, the name, and then another space and
362 the short description.
367 desc.
<name
> - Contains the long description displayed by dselect
368 when the cursor is put over the
<name
> method.
373 setup - Script or program which sets up the initial values to be used
374 by this method. Called with first argument as the status area directory
375 (/var/lib/dpkg), second argument as the name of the method (as in the
376 directory name), and the third argument as the option (as in the names file).
381 install - Script/program called when the "install" option of dselect is run
382 with this method. Same arguments as for setup.
387 update - Script/program called when the "update" option of dselect is
388 run. Same arguments as for setup/install.
394 <section id=
"s1.4"><title>The "dpkg" command-line utility
</title>
396 <section id=
"s1.4.1"><title>"Documented" command-line interfaces
</title>
398 As yet unwritten. You can refer to the other manuals for now. See
399 <citerefentry><refentrytitle>dpkg
</refentrytitle><manvolnum>8</manvolnum></citerefentry>.
403 <section id=
"s1.4.2"><title>Environment variables which dpkg responds to
</title>
407 DPKG_NO_TSTP - if set to a non-null value, this variable causes dpkg to run a
408 child shell process instead of sending itself a SIGTSTP, when the user selects
409 to background the dpkg process when it asks about conffiles.
414 SHELL - used to determine which shell to run in the case when DPKG_NO_TSTP
420 CC - used as the C compiler to call to determine the target architecture. The
426 PATH - dpkg checks that it can find at least the following files in the path
427 when it wants to run package installation scripts, and gives an error if it
428 cannot find all of them:-
456 <section id=
"s1.4.3"><title>Assertions
</title>
458 The dpkg utility itself is required for quite a number of packages, even if
459 they have been installed with a tool totally separate from dpkg. The reason
460 for this is that some packages, in their pre-installation scripts, check that
461 your version of dpkg supports certain features. This was broken from the
462 start, and it should have actually been a control file header "Dpkg-requires",
463 or similar. What happens is that the configuration scripts will abort or
464 continue according to the exit code of a call to dpkg, which will stop them
465 from being wrongly configured.
468 These special command-line options, which simply return as true or false are
469 all prefixed with "--assert-". Here is a list of them (without the prefix):-
474 support-predepends - Returns success or failure according to whether a version
475 of dpkg which supports predepends properly (
1.1.0 or above) is installed,
476 according to the database.
481 working-epoch - Return success or failure according to whether a version of
482 dpkg which supports epochs in version properly (
1.4.0.7 or above) is installed,
483 according to the database.
488 Both these options check the status database to see what version of the
489 "dpkg" package is installed, and check it against a known working version.
493 <section id=
"s1.4.4"><title>--predep-package
</title>
495 This strange option is described as follows in the source code:
498 /* Print a single package which:
499 * (a) is the target of one or more relevant predependencies.
500 * (b) has itself no unsatisfied pre-dependencies.
501 * If such a package is present output is the Packages file entry,
502 * which can be massaged as appropriate.
504 *
0 = a package printed, OK
505 *
1 = no suitable package available
510 On further inspection of the source code, it appears that what is does is
516 Looks at the packages in the database which are selected as "install",
522 It then looks at the Pre-Depends information for each of these packages
523 from the available file. When it find a package for which any of the
524 pre-dependencies are not satisfied, it breaks from the loop through the
530 It then looks through the unsatisfied pre-dependencies, and looks for
531 packages which would satisfy this pre-dependency, stopping on the first
532 it finds. If it finds none, it bombs out with an error.
537 It then continues this for every dependency of the initial package.
542 Eventually, it writes out the record of all the packages to satisfy the
543 pre-dependencies. This is used by the disk method to make sure that its
544 dependency ordering is correct. What happens is that all pre-depending
545 packages are first installed, then it runs dpkg -iGROEB on the directory,
546 which installs in the order package files are found. Since pre-dependencies
547 mean that a package may not even be unpacked unless they are satisfied, it
548 is necessary to do this (usually, since all the package files are unpacked
549 in one phase, the configured in another, this is not needed).
557 <chapter id=
"ch2"><title>dpkg-deb and .deb file internals
</title>
559 This chapter describes the internals to the "dpkg-deb" tool, which is used by
560 "dpkg" as a back-end. dpkg-deb has its own tar extraction functions, which is
561 the source of many problems, as it does not support long filenames, using
565 <section id=
"s2.1"><title>The .deb archive format
</title>
567 The main principal of the new-format Debian archive (I won't describe the old
568 format - for that have a look at deb-old
.5), is that the archive really is an
569 archive - as used by "ar" and friends. However, dpkg-deb uses this format
570 internally, rather than calling "ar". Inside this archive, there are usually
571 the following members:-
591 The debian-binary member consists simply of the string "
2.0", indicating
592 the format version. control.tar.gz contains the control files (and scripts),
593 and the data.tar.gz contains the actual files to populate the filesystem
594 with. Both tarfiles extract straight into the current directory. Information
595 on the tar formats can be found in the GNU tar info page. Since dpkg-deb
596 calls "tar -cf" to build packages, the Debian packages use the GNU extensions.
600 <section id=
"s2.2"><title>The dpkg-deb command-line
</title>
602 dpkg-deb documents itself thoroughly with its '--help' command-line
603 option. However, I am including a reference to these for
604 completeness. dpkg-deb supports the following options:-
609 --build (-b)
<dir
> - builds a .deb archive, takes a directory which
610 contains all the files as an argument. Note that the directory
611 <dir
>/DEBIAN will be packed separately into the control archive.
616 --contents (-c)
<debfile
> - Lists the contents of the "data.tar.gz"
622 --control (-e)
<debfile
> - Extracts the control archive into a directory
623 called DEBIAN. Alternatively, with another argument, it will extract it into a
629 --info (-I)
<debfile
> - Prints the contents of the "control" file in the
630 control archive to stdout. Alternatively, giving it other arguments will cause
631 it to print the contents of those files instead.
636 --field (-f)
<debfile
> <field
> ... - Prints any number of fields
637 from the "control" file. Giving it extra arguments limits the fields it prints
638 to only those specified. With no command-line arguments other than a filename,
639 it is equivalent to -I and just the .deb filename.
644 --extract (-x)
<debfile
> <dir
> - Extracts the data archive of a
645 debian package under the directory
<dir
>.
650 --vextract (-X)
<debfile
> <dir
> - Same as --extract, except it
651 is equivalent of giving tar the '-v' option - it prints the filenames as it
657 --fsys-tarfile
<debfile
> - This option outputs a gunzip'd version of
658 data.tar.gz to stdout.
663 --new - sets the archive format to be used to the new Debian format
668 --old - sets the archive format to be used to the old Debian format
673 --debug - Tells dpkg-deb to produce debugging output
678 --nocheck - Tells dpkg-deb not to check the sanity of the control file
683 --help (-h) - Gives a help message
688 --version - Shows the version number
693 --licence/--license (UK/US spellings) - Shows a brief outline of the GPL
698 <section id=
"s2.2.1"><title>Internal checks used by dpkg-deb when building packages
</title>
700 Here is a list of the internal checks used by dpkg-deb when building
701 packages. It is in the order they are done.
706 First, the output Debian archive argument, if it is given, is checked using
707 stat. If it is a directory, an internal flag is set. This check is only made
708 if the archive name is specified explicitly on the command-line. If the
709 argument was not given, the default is the directory name, with ".deb"
715 Next, the control file is checked, unless the --nocheck flag was specified on
716 the command-line. dpkg-deb will bomb out if the second argument to --build was
717 a directory, and --nocheck was specified. Note that dpkg-deb will not be able
718 to determine the name of the package in this case. In the control file, the
719 following things are checked:-
724 The package name is checked to see if it contains any invalid characters (see
725 <xref linkend=
"control"/> for this).
730 The priority field is checked to see if it uses standard values, and
731 user-defined values are warned against. However, note that this check is now
732 redundant, since the control file no longer contains the priority - the
733 changes file now does this.
738 The control file fields are then checked against the standard list of fields
739 which appear in control files, and any "user-defined" fields are reported as
745 dpkg-deb then checks that the control file contains a valid version number.
752 After this, in the case where a directory was specified to build the .deb file
753 in, the filename is created as "directory/pkg_ver.deb" or
754 "directory/pkg_ver_arch.deb", depending on whether the control file contains
755 an architecture field.
760 Next, dpkg-deb checks for the
<dir
>/DEBIAN directory. It complains if it
761 doesn't exist, or if it has permissions
< 0755, or
> 0775.
766 It then checks that all the files in this subdir are either symlinks or plain
767 files, and have permissions between
0555 and
0775.
772 The conffiles file is then checked to see if the filenames are too
773 long. Warnings are produced for each that is. After this, it checks
774 that the package provides initial copies of each of these conffiles,
775 and that they are all plain files.
785 <chapter id=
"ch3"><title>dpkg internals
</title>
787 This chapter describes the internals of dpkg itself. Although the low-level
788 formats are quite simple, what dpkg does in certain cases often does not make
792 <section id=
"updates"><title>Updates
</title>
794 This describes the /var/lib/dpkg/updates directory. The function of this
795 directory is somewhat strange, and seems only to be used internally. A
796 function called cleanupdates is called whenever the database is scanned. This
797 function in turn uses
798 <citerefentry><refentrytitle>scandir
</refentrytitle><manvolnum>3</manvolnum></citerefentry>,
799 to sort the files in this directory. Files who names do not consist entirely
800 of digits are discarded. dpkg also causes a fatal error if any of the
801 filenames are different lengths.
804 After having scanned the directory, dpkg in turn parses each file the same way
805 it parses the status file (they are sorted by the scandir to be in numerical
806 order). After having done this, it then writes the status information back to
807 the "status" file, and removes all the "updates" files.
810 These files are created internally by dpkg's "checkpoint" function, and are
811 cleaned up when dpkg exits cleanly.
814 Juding by the use of the updates directory I would call it a Journal. Inorder
815 to efficiently ensure the complete integrity of the status file dpkg will
816 "checkpoint" or journal all of it's activities in the updates directory. By
817 merging the contents of the updates directory (in order!!) against the original
818 status file it can get the precise current state of the system, even in the
819 event of a system failure while dpkg is running.
822 The other option would be to sync-rewrite the status file after each operation,
823 which would kill performance.
826 It is very important that any program that uses the status file abort if the
827 updates directory is not empty! The user should be informed to run dpkg
828 manually (what options though??) to correct the situation.
832 <section id=
"s3.2"><title>What happens when dpkg reads the database
</title>
834 First, the status file is read. This gives dpkg an initial idea of the
835 packages that are there. Next, the updates files are read in, overriding the
836 status file, and if necessary, the status file is re-written, and updates files
837 are removed. Finally, the available file is read. The available file is read
838 with flags which preclude dpkg from updating any status information from it,
839 though - installed version, etc., and is also told to record that the packages
840 it reads this time are available, not installed.
843 More information on updates is given above.
847 <section id=
"s3.3"><title>How dpkg compares version numbers
</title>
849 Version numbers consist of three parts: the epoch, the upstream version, and
850 the Debian revision. Dpkg compares these parts in that order. If the epochs
851 are different, it returns immediately, and so on.
854 However, the important part is how it compares the versions which are
855 essentially stored as just strings. These are compared in two distinct
856 parts: those consisting of numerical characters (which are evaluated, and
857 then compared), and those consisting of other characters. When comparing
858 non-numerical parts, they are compared as the character values (ASCII),
859 but non-alphabetical characters are considered "greater than" alphabetical
860 ones. Also note that longer strings (after excluding differences where
861 numerical values are equal) are considered "greater than" shorter ones.
864 Here are a few examples of how these rules apply:-