hgbook

annotate en/mq.tex @ 1:04e469de601e

Early content for chapter on MQ.
author Bryan O'Sullivan <bos@serpentine.com>
date Fri Jun 23 12:15:38 2006 -0700 (2006-06-23)
parents
children 379a802c0210
rev   line source
bos@1 1 \chapter{Managing change with Mercurial Queues}
bos@1 2 \label{chap:mq}
bos@1 3
bos@1 4 \section{The patch management problem}
bos@1 5 \label{sec:mq:patch-mgmt}
bos@1 6
bos@1 7 Here is a common scenario: you need to install a software package from
bos@1 8 source, but you find a bug that you must fix in the source before you
bos@1 9 can start using the package. You make your changes, forget about the
bos@1 10 package for a while, and a few months later you need to upgrade to a
bos@1 11 newer version of the package. If the newer version of the package
bos@1 12 still has the bug, you must extract your fix from the older source
bos@1 13 tree and apply it against the newer version. This is a tedious task,
bos@1 14 and it's easy to make mistakes.
bos@1 15
bos@1 16 This is a simple case of the ``patch management'' problem. You have
bos@1 17 an ``upstream'' source tree that you can't change; you need to make
bos@1 18 some local changes on top of the upstream tree; and you'd like to be
bos@1 19 able to keep those changes separate, so that you can apply them to
bos@1 20 newer versions of the upstream source.
bos@1 21
bos@1 22 The patch management problem arises in many situations. Probably the
bos@1 23 most visible is that a user of an open source software project will
bos@1 24 contribute a bugfix or new feature to the project's maintainers in the
bos@1 25 form of a patch.
bos@1 26
bos@1 27 Distributors of operating systems that include open source software
bos@1 28 often need to make changes to the packages they distribute so that
bos@1 29 they will build properly in their environments.
bos@1 30
bos@1 31 When you have few changes to maintain, it is easy to manage a single
bos@1 32 patch using the standard \texttt{diff} and \texttt{patch} programs.
bos@1 33 Once the number of changes grows, it starts to makes sense to maintain
bos@1 34 patches as discrete ``chunks of work,'' so that for example a single
bos@1 35 patch will contain only one bug fix (the patch might modify several
bos@1 36 files, but it's doing ``only one thing''), and you may have a number
bos@1 37 of such patches for different bugs you need fixed and local changes
bos@1 38 you require. In this situation, if you submit a bugfix patch to the
bos@1 39 upstream maintainers of a package and they include your fix in a
bos@1 40 subsequent release, you can simply drop that single patch when you're
bos@1 41 updating to the newer release.
bos@1 42
bos@1 43 Maintaining a single patch against an upstream tree is a little
bos@1 44 tedious and error-prone, but not difficult. However, the complexity
bos@1 45 of the problem grows rapidly as the number of patches you have to
bos@1 46 maintain increases. With more than a tiny number of patches in hand,
bos@1 47 understanding which ones you have applied and maintaining them moves
bos@1 48 from messy to overwhelming.
bos@1 49
bos@1 50 Fortunately, Mercurial includes a powerful extension, Mercurial Queues
bos@1 51 (or simply ``MQ''), that massively simplifies the patch management
bos@1 52 problem.
bos@1 53
bos@1 54 \section{The prehistory of Mercurial Queues}
bos@1 55 \label{sec:mq:history}
bos@1 56
bos@1 57 During the late 1990s, several Linux kernel developers started to
bos@1 58 maintain ``patch series'' that modified the behaviour of the Linux
bos@1 59 kernel. Some of these series were focused on stability, some on
bos@1 60 feature coverage, and others were more speculative.
bos@1 61
bos@1 62 The sizes of these patch series grew rapidly. In 2002, Andrew Morton
bos@1 63 published some shell scripts he had been using to automate the task of
bos@1 64 managing his patch queues. Andrew was successfully using these
bos@1 65 scripts to manage hundreds (sometimes thousands) of patches on top of
bos@1 66 the Linux kernel.
bos@1 67
bos@1 68 \subsection{A patchwork quilt}
bos@1 69 \label{sec:mq:quilt}
bos@1 70
bos@1 71
bos@1 72 In early 2003, Andreas Gruenbacher and Martin Quinson borrowed the
bos@1 73 approach of Andrew's scripts and published a tool called
bos@1 74 \href{http://savannah.nongnu.org/projects/quilt}{``patchwork quilt''},
bos@1 75 or simply ``quilt''. Because quilt substantially automated patch
bos@1 76 management, it rapidly gained a large following among open source
bos@1 77 software developers.
bos@1 78
bos@1 79 Quilt manages a \emph{stack of patches} on top of a directory tree.
bos@1 80 To begin, you tell quilt to manage a directory tree; it stores away
bos@1 81 the names and contents of all files in the tree. To fix a bug, you
bos@1 82 create a new patch (using a single command), edit the files you need
bos@1 83 to fix, then ``refresh'' the patch.
bos@1 84
bos@1 85 The refresh step causes quilt to scan the directory tree; it updates
bos@1 86 the patch with all of the changes you have made. You can create
bos@1 87 another patch on top of the first, which will track the changes
bos@1 88 required to modify the tree from ``tree with one patch applied'' to
bos@1 89 ``tree with two patches applied''.
bos@1 90
bos@1 91 You can \emph{change} which patches are applied to the tree. If you
bos@1 92 ``pop'' a patch, the changes made by that patch will vanish from the
bos@1 93 directory tree. Quilt remembers which patches you have popped,
bos@1 94 though, so you can ``push'' a popped patch again, and the directory
bos@1 95 tree will be restored to contain the modifications in the patch. Most
bos@1 96 importantly, you can run the ``refresh'' command at any time, and the
bos@1 97 topmost applied patch will be updated. This means that you can, at
bos@1 98 any time, change both which patches are applied and what
bos@1 99 modifications those patches make.
bos@1 100
bos@1 101 Quilt knows nothing about revision control tools, so it works equally
bos@1 102 well on top of an unpacked tarball or a Suversion repository.
bos@1 103
bos@1 104 \subsection{From patchwork quilt to Mercurial Queues}
bos@1 105 \label{sec:mq:quilt-mq}
bos@1 106
bos@1 107 In mid-2005, Chris Mason took the features of quilt and wrote an
bos@1 108 extension that he called Mercurial Queues, which added quilt-like
bos@1 109 behaviour to Mercurial.
bos@1 110
bos@1 111 The key difference between quilt and MQ is that quilt knows nothing
bos@1 112 about revision control systems, while MQ is \emph{integrated} into
bos@1 113 Mercurial. Each patch that you push is represented as a Mercurial
bos@1 114 changeset. Pop a patch, and the changeset goes away.
bos@1 115
bos@1 116 This integration makes understanding patches and debugging their
bos@1 117 effects \emph{enormously} easier. Since every applied patch has an
bos@1 118 associated changeset, you can use \hgcmdargs{log}{\emph{filename}} to
bos@1 119 see which changesets and patches affected a file. You can use the
bos@1 120 \hgext{bisect} extension to binary-search through all changesets and
bos@1 121 applied patches to see where a bug got introduced or fixed. You can
bos@1 122 use the \hgcmd{annotate} command to see which changeset or patch
bos@1 123 modified a particular line of a source file. And so on.
bos@1 124
bos@1 125 Because quilt does not care about revision control tools, it is still
bos@1 126 a tremendously useful piece of software to know about for situations
bos@1 127 where you cannot use Mercurial and MQ.
bos@1 128 \section{Section!}
bos@1 129 \label{sec:sec}
bos@1 130
bos@1 131 Section!
bos@1 132
bos@1 133 %%% Local Variables:
bos@1 134 %%% mode: latex
bos@1 135 %%% TeX-master: "00book"
bos@1 136 %%% End: