hgbook: en/hook.tex annotate

hgbook

annotate en/hook.tex @ 54:e94202d88199

Tix fypos.

author	Bryan O'Sullivan <bos@serpentine.com>
date	Wed Aug 02 13:08:56 2006 -0700 (2006-08-02)
parents	497aa3c9d4ce
children	8806b2875f10

rev	line source
bos@34	1 \chapter{Handling repository events with hooks}
bos@34	2 \label{chap:hook}
bos@34	3
bos@34	4 Mercurial offers a powerful mechanism to let you perform automated
bos@34	5 actions in response to events that occur in a repository. In some
bos@34	6 cases, you can even control Mercurial's response to those events.
bos@34	7
bos@34	8 The name Mercurial uses for one of these actions is a \emph{hook}.
bos@34	9 Hooks are called ``triggers'' in some revision control systems, but
bos@34	10 the two names refer to the same idea.
bos@34	11
bos@38	12 \section{An overview of hooks in Mercurial}
bos@38	13
bos@41	14 Here is a brief list of the hooks that Mercurial supports. We will
bos@41	15 revisit each of these hooks in more detail later, in
bos@41	16 section~\ref{sec:hook:ref}.
bos@41	17
bos@38	18 \begin{itemize}
bos@38	19 \item[\small\hook{changegroup}] This is run after a group of
bos@41	20 changesets has been brought into the repository from elsewhere.
bos@38	21 \item[\small\hook{commit}] This is run after a new changeset has been
bos@41	22 created in the local repository.
bos@38	23 \item[\small\hook{incoming}] This is run once for each new changeset
bos@38	24 that is brought into the repository from elsewhere. Notice the
bos@38	25 difference from \hook{changegroup}, which is run once per
bos@41	26 \emph{group} of changesets brought in.
bos@38	27 \item[\small\hook{outgoing}] This is run after a group of changesets
bos@41	28 has been transmitted from this repository.
bos@38	29 \item[\small\hook{prechangegroup}] This is run before starting to
bos@41	30 bring a group of changesets into the repository.
bos@41	31 \item[\small\hook{precommit}] Controlling. This is run before starting
bos@41	32 a commit.
bos@41	33 \item[\small\hook{preoutgoing}] Controlling. This is run before
bos@41	34 starting to transmit a group of changesets from this repository.
bos@41	35 \item[\small\hook{pretag}] Controlling. This is run before creating a tag.
bos@41	36 \item[\small\hook{pretxnchangegroup}] Controlling. This is run after a
bos@41	37 group of changesets has been brought into the local repository from
bos@41	38 another, but before the transaction completes that will make the
bos@41	39 changes permanent in the repository.
bos@41	40 \item[\small\hook{pretxncommit}] Controlling. This is run after a new
bos@41	41 changeset has been created in the local repository, but before the
bos@41	42 transaction completes that will make it permanent.
bos@41	43 \item[\small\hook{preupdate}] Controlling. This is run before starting
bos@41	44 an update or merge of the working directory.
bos@38	45 \item[\small\hook{tag}] This is run after a tag is created.
bos@38	46 \item[\small\hook{update}] This is run after an update or merge of the
bos@38	47 working directory has finished.
bos@38	48 \end{itemize}
bos@41	49 Each of the hooks whose description begins with the word
bos@41	50 ``Controlling'' has the ability to determine whether an activity can
bos@41	51 proceed. If the hook succeeds, the activity may proceed; if it fails,
bos@41	52 the activity is either not permitted or undone, depending on the hook.
bos@38	53
bos@38	54 \section{Hooks and security}
bos@38	55
bos@38	56 \subsection{Hooks are run with your privileges}
bos@38	57
bos@38	58 When you run a Mercurial command in a repository, and the command
bos@41	59 causes a hook to run, that hook runs on \emph{your} system, under
bos@41	60 \emph{your} user account, with \emph{your} privilege level. Since
bos@41	61 hooks are arbitrary pieces of executable code, you should treat them
bos@41	62 with an appropriate level of suspicion. Do not install a hook unless
bos@41	63 you are confident that you know who created it and what it does.
bos@38	64
bos@38	65 In some cases, you may be exposed to hooks that you did not install
bos@38	66 yourself. If you work with Mercurial on an unfamiliar system,
bos@38	67 Mercurial will run hooks defined in that system's global \hgrc\ file.
bos@38	68
bos@38	69 If you are working with a repository owned by another user, Mercurial
bos@41	70 can run hooks defined in that user's repository, but it will still run
bos@41	71 them as ``you''. For example, if you \hgcmd{pull} from that
bos@41	72 repository, and its \sfilename{.hg/hgrc} defines a local
bos@41	73 \hook{outgoing} hook, that hook will run under your user account, even
bos@41	74 though you don't own that repository.
bos@38	75
bos@38	76 \begin{note}
bos@38	77 This only applies if you are pulling from a repository on a local or
bos@38	78 network filesystem. If you're pulling over http or ssh, any
bos@41	79 \hook{outgoing} hook will run under whatever account is executing
bos@41	80 the server process, on the server.
bos@38	81 \end{note}
bos@38	82
bos@38	83 XXX To see what hooks are defined in a repository, use the
bos@38	84 \hgcmdargs{config}{hooks} command. If you are working in one
bos@38	85 repository, but talking to another that you do not own (e.g.~using
bos@38	86 \hgcmd{pull} or \hgcmd{incoming}), remember that it is the other
bos@38	87 repository's hooks you should be checking, not your own.
bos@38	88
bos@38	89 \subsection{Hooks do not propagate}
bos@38	90
bos@38	91 In Mercurial, hooks are not revision controlled, and do not propagate
bos@38	92 when you clone, or pull from, a repository. The reason for this is
bos@38	93 simple: a hook is a completely arbitrary piece of executable code. It
bos@38	94 runs under your user identity, with your privilege level, on your
bos@38	95 machine.
bos@38	96
bos@38	97 It would be extremely reckless for any distributed revision control
bos@38	98 system to implement revision-controlled hooks, as this would offer an
bos@38	99 easily exploitable way to subvert the accounts of users of the
bos@38	100 revision control system.
bos@38	101
bos@38	102 Since Mercurial does not propagate hooks, if you are collaborating
bos@38	103 with other people on a common project, you should not assume that they
bos@38	104 are using the same Mercurial hooks as you are, or that theirs are
bos@38	105 correctly configured. You should document the hooks you expect people
bos@38	106 to use.
bos@38	107
bos@38	108 In a corporate intranet, this is somewhat easier to control, as you
bos@38	109 can for example provide a ``standard'' installation of Mercurial on an
bos@38	110 NFS filesystem, and use a site-wide \hgrc\ file to define hooks that
bos@38	111 all users will see. However, this too has its limits; see below.
bos@38	112
bos@38	113 \subsection{Hooks can be overridden}
bos@38	114
bos@38	115 Mercurial allows you to override a hook definition by redefining the
bos@38	116 hook. You can disable it by setting its value to the empty string, or
bos@38	117 change its behaviour as you wish.
bos@38	118
bos@38	119 If you deploy a system-~or site-wide \hgrc\ file that defines some
bos@38	120 hooks, you should thus understand that your users can disable or
bos@38	121 override those hooks.
bos@38	122
bos@38	123 \subsection{Ensuring that critical hooks are run}
bos@38	124
bos@38	125 Sometimes you may want to enforce a policy that you do not want others
bos@38	126 to be able to work around. For example, you may have a requirement
bos@38	127 that every changeset must pass a rigorous set of tests. Defining this
bos@38	128 requirement via a hook in a site-wide \hgrc\ won't work for remote
bos@38	129 users on laptops, and of course local users can subvert it at will by
bos@38	130 overriding the hook.
bos@38	131
bos@38	132 Instead, you can set up your policies for use of Mercurial so that
bos@38	133 people are expected to propagate changes through a well-known
bos@38	134 ``canonical'' server that you have locked down and configured
bos@38	135 appropriately.
bos@38	136
bos@38	137 One way to do this is via a combination of social engineering and
bos@38	138 technology. Set up a restricted-access account; users can push
bos@38	139 changes over the network to repositories managed by this account, but
bos@38	140 they cannot log into the account and run normal shell commands. In
bos@38	141 this scenario, a user can commit a changeset that contains any old
bos@38	142 garbage they want.
bos@38	143
bos@38	144 When someone pushes a changeset to the server that everyone pulls
bos@38	145 from, the server will test the changeset before it accepts it as
bos@38	146 permanent, and reject it if it fails to pass the test suite. If
bos@38	147 people only pull changes from this filtering server, it will serve to
bos@38	148 ensure that all changes that people pull have been automatically
bos@38	149 vetted.
bos@38	150
bos@41	151 \section{Using hooks with shared access to a repository}
bos@41	152
bos@41	153 If you want to use hooks to so some automated work in a repository
bos@54	154 that a number of people have shared access to, you need to be careful
bos@41	155 in how you do this.
bos@41	156
bos@41	157 Mercurial only locks a repository when it is writing to the
bos@41	158 repository, and only the parts of Mercurial that write to the
bos@41	159 repository pay attention to locks. Write locks are necessary to
bos@41	160 prevent multiple simultaneous writers from scribbling on each other's
bos@41	161 work, corrupting the repository.
bos@41	162
bos@41	163 Because Mercurial is careful with the order in which it reads and
bos@41	164 writes data, it does not need to acquire a lock when it wants to read
bos@41	165 data from the repository. The parts of Mercurial that read from the
bos@41	166 repository never pay attention to locks. This lockless reading scheme
bos@41	167 greatly increases performance and concurrency.
bos@41	168
bos@41	169 With great performance comes a trade-off, though, one which has the
bos@41	170 potential to cause you trouble unless you're aware of it. To describe
bos@41	171 this requires a little detail about how Mercurial adds changesets to a
bos@41	172 repository and reads those changes.
bos@41	173
bos@41	174 When Mercurial \emph{writes} metadata, it writes it straight into the
bos@41	175 destination file. It writes file data first, then manifest data
bos@41	176 (which contains pointers to the new file data), then changelog data
bos@41	177 (which contains pointers to the new manifest data). Before the first
bos@41	178 write to each file, it stores a record of where the end of the file
bos@41	179 was in its transaction log. If the transaction must be rolled back,
bos@54	180 Mercurial simply truncates each file back to the size it was before the
bos@41	181 transaction began.
bos@41	182
bos@41	183 When Mercurial \emph{reads} metadata, it reads the changelog first,
bos@41	184 then everything else. Since a reader will only access parts of the
bos@41	185 manifest or file metadata that it can see in the changelog, it can
bos@41	186 never see partially written data.
bos@41	187
bos@41	188 Some controlling hooks (\hook{pretxncommit} and
bos@41	189 \hook{pretxnchangegroup}) run when a transaction is almost complete.
bos@41	190 All of the metadata has been written, but Mercurial can still roll the
bos@41	191 transaction back and cause the newly-written data to disappear.
bos@41	192
bos@41	193 If one of these hooks runs for long, it opens a window in which a
bos@41	194 reader can see the metadata for changesets that are, strictly
bos@41	195 speaking, not yet permanent. The longer the hook runs, the bigger the
bos@41	196 window.
bos@41	197
bos@41	198 A good use for the \hook{pretxnchangegroup} hook would be to
bos@41	199 automatically build and test incoming changes before they are accepted
bos@41	200 into the repository, so that you can guarantee that nobody can push
bos@41	201 changes to this repository that ``break the build''. But if a client
bos@41	202 can pull changes while they're being tested, the usefulness of the
bos@41	203 test is zero; someone can pull untested changes.
bos@41	204
bos@41	205 The safest answer to this challenge is to set up such a ``gatekeeper''
bos@41	206 repository as \emph{unidirectional}. It can take changes pushed in
bos@41	207 from the outside, but nobody can pull changes from it. Use the
bos@41	208 \hook{preoutgoing} hook to lock it down. Configure a
bos@41	209 \hook{changegroup} hook so that if a build or test succeeds, the hook
bos@41	210 will push the new changes out to another repository that people
bos@41	211 \emph{can} pull from.
bos@41	212
bos@34	213 \section{A short tutorial on using hooks}
bos@34	214 \label{sec:hook:simple}
bos@34	215
bos@34	216 It is easy to write a Mercurial hook. Let's start with a hook that
bos@34	217 runs when you finish a \hgcmd{commit}, and simply prints the hash of
bos@34	218 the changeset you just created. The hook is called \hook{commit}.
bos@34	219
bos@34	220 \begin{figure}[ht]
bos@34	221 \interaction{hook.simple.init}
bos@34	222 \caption{A simple hook that runs when a changeset is committed}
bos@34	223 \label{ex:hook:init}
bos@34	224 \end{figure}
bos@34	225
bos@34	226 All hooks follow the pattern in example~\ref{ex:hook:init}. You add
bos@34	227 an entry to the \rcsection{hooks} section of your \hgrc\. On the left
bos@34	228 is the name of the event to trigger on; on the right is the action to
bos@34	229 take. As you can see, you can run an arbitrary shell command in a
bos@34	230 hook. Mercurial passes extra information to the hook using
bos@34	231 environment variables (look for \envar{HG\_NODE} in the example).
bos@34	232
bos@34	233 \subsection{Performing multiple actions per event}
bos@34	234
bos@34	235 Quite often, you will want to define more than one hook for a
bos@34	236 particular kind of event, as shown in example~\ref{ex:hook:ext}.
bos@34	237 Mercurial lets you do this by adding an \emph{extension} to the end of
bos@34	238 a hook's name. You extend a hook's name by giving the name of the
bos@34	239 hook, followed by a full stop (the ``\texttt{.}'' character), followed
bos@34	240 by some more text of your choosing. For example, Mercurial will run
bos@34	241 both \texttt{commit.foo} and \texttt{commit.bar} when the
bos@34	242 \texttt{commit} event occurs.
bos@34	243
bos@34	244 \begin{figure}[ht]
bos@34	245 \interaction{hook.simple.ext}
bos@34	246 \caption{Defining a second \hook{commit} hook}
bos@34	247 \label{ex:hook:ext}
bos@34	248 \end{figure}
bos@34	249
bos@34	250 To give a well-defined order of execution when there are multiple
bos@34	251 hooks defined for an event, Mercurial sorts hooks by extension, and
bos@34	252 executes the hook commands in this sorted order. In the above
bos@34	253 example, it will execute \texttt{commit.bar} before
bos@34	254 \texttt{commit.foo}, and \texttt{commit} before both.
bos@34	255
bos@34	256 It is a good idea to use a somewhat descriptive extension when you
bos@34	257 define a new hook. This will help you to remember what the hook was
bos@34	258 for. If the hook fails, you'll get an error message that contains the
bos@34	259 hook name and extension, so using a descriptive extension could give
bos@34	260 you an immediate hint as to why the hook failed (see
bos@34	261 section~\ref{sec:hook:perm} for an example).
bos@34	262
bos@34	263 \subsection{Controlling whether an activity can proceed}
bos@34	264 \label{sec:hook:perm}
bos@34	265
bos@34	266 In our earlier examples, we used the \hook{commit} hook, which is
bos@34	267 run after a commit has completed. This is one of several Mercurial
bos@34	268 hooks that run after an activity finishes. Such hooks have no way of
bos@34	269 influencing the activity itself.
bos@34	270
bos@34	271 Mercurial defines a number of events that occur before an activity
bos@34	272 starts; or after it starts, but before it finishes. Hooks that
bos@34	273 trigger on these events have the added ability to choose whether the
bos@34	274 activity can continue, or will abort.
bos@34	275
bos@34	276 The \hook{pretxncommit} hook runs after a commit has all but
bos@34	277 completed. In other words, the metadata representing the changeset
bos@34	278 has been written out to disk, but the transaction has not yet been
bos@34	279 allowed to complete. The \hook{pretxncommit} hook has the ability to
bos@34	280 decide whether the transaction can complete, or must be rolled back.
bos@34	281
bos@34	282 If the \hook{pretxncommit} hook exits with a status code of zero, the
bos@34	283 transaction is allowed to complete; the commit finishes; and the
bos@34	284 \hook{commit} hook is run. If the \hook{pretxncommit} hook exits with
bos@34	285 a non-zero status code, the transaction is rolled back; the metadata
bos@34	286 representing the changeset is erased; and the \hook{commit} hook is
bos@34	287 not run.
bos@34	288
bos@34	289 \begin{figure}[ht]
bos@34	290 \interaction{hook.simple.pretxncommit}
bos@34	291 \caption{Using the \hook{pretxncommit} hook to control commits}
bos@34	292 \label{ex:hook:pretxncommit}
bos@34	293 \end{figure}
bos@34	294
bos@34	295 The hook in example~\ref{ex:hook:pretxncommit} checks that a commit
bos@34	296 comment contains a bug ID. If it does, the commit can complete. If
bos@34	297 not, the commit is rolled back.
bos@34	298
bos@37	299 \section{Writing your own hooks}
bos@37	300
bos@37	301 When you are writing a hook, you might find it useful to run Mercurial
bos@37	302 either with the \hggopt{-v} option, or the \rcitem{ui}{verbose} config
bos@37	303 item set to ``true''. When you do so, Mercurial will print a message
bos@37	304 before it calls each hook.
bos@37	305
bos@37	306 \subsection{Choosing how your hook should run}
bos@37	307 \label{sec:hook:lang}
bos@34	308
bos@34	309 You can write a hook either as a normal program---typically a shell
bos@37	310 script---or as a Python function that is executed within the Mercurial
bos@34	311 process.
bos@34	312
bos@34	313 Writing a hook as an external program has the advantage that it
bos@34	314 requires no knowledge of Mercurial's internals. You can call normal
bos@34	315 Mercurial commands to get any added information you need. The
bos@34	316 trade-off is that external hooks are slower than in-process hooks.
bos@34	317
bos@34	318 An in-process Python hook has complete access to the Mercurial API,
bos@34	319 and does not ``shell out'' to another process, so it is inherently
bos@34	320 faster than an external hook. It is also easier to obtain much of the
bos@34	321 information that a hook requires by using the Mercurial API than by
bos@34	322 running Mercurial commands.
bos@34	323
bos@34	324 If you are comfortable with Python, or require high performance,
bos@34	325 writing your hooks in Python may be a good choice. However, when you
bos@34	326 have a straightforward hook to write and you don't need to care about
bos@34	327 performance (probably the majority of hooks), a shell script is
bos@34	328 perfectly fine.
bos@34	329
bos@37	330 \subsection{Hook parameters}
bos@34	331 \label{sec:hook:param}
bos@34	332
bos@34	333 Mercurial calls each hook with a set of well-defined parameters. In
bos@34	334 Python, a parameter is passed as a keyword argument to your hook
bos@34	335 function. For an external program, a parameter is passed as an
bos@34	336 environment variable.
bos@34	337
bos@34	338 Whether your hook is written in Python or as a shell script, the
bos@37	339 hook-specific parameter names and values will be the same. A boolean
bos@37	340 parameter will be represented as a boolean value in Python, but as the
bos@37	341 number 1 (for ``true'') or 0 (for ``false'') as an environment
bos@37	342 variable for an external hook. If a hook parameter is named
bos@37	343 \texttt{foo}, the keyword argument for a Python hook will also be
bos@51	344 named \texttt{foo}, while the environment variable for an external
bos@51	345 hook will be named \texttt{HG\_FOO}.
bos@37	346
bos@37	347 \subsection{Hook return values and activity control}
bos@37	348
bos@37	349 A hook that executes successfully must exit with a status of zero if
bos@37	350 external, or return boolean ``false'' if in-process. Failure is
bos@37	351 indicated with a non-zero exit status from an external hook, or an
bos@37	352 in-process hook returning boolean ``true''. If an in-process hook
bos@37	353 raises an exception, the hook is considered to have failed.
bos@37	354
bos@37	355 For a hook that controls whether an activity can proceed, zero/false
bos@37	356 means ``allow'', while non-zero/true/exception means ``deny''.
bos@37	357
bos@37	358 \subsection{Writing an external hook}
bos@37	359
bos@37	360 When you define an external hook in your \hgrc\ and the hook is run,
bos@37	361 its value is passed to your shell, which interprets it. This means
bos@37	362 that you can use normal shell constructs in the body of the hook.
bos@37	363
bos@37	364 An executable hook is always run with its current directory set to a
bos@37	365 repository's root directory.
bos@37	366
bos@37	367 Each hook parameter is passed in as an environment variable; the name
bos@37	368 is upper-cased, and prefixed with the string ``\texttt{HG\_}''.
bos@37	369
bos@37	370 With the exception of hook parameters, Mercurial does not set or
bos@37	371 modify any environment variables when running a hook. This is useful
bos@37	372 to remember if you are writing a site-wide hook that may be run by a
bos@37	373 number of different users with differing environment variables set.
bos@37	374 In multi-user situations, you should not rely on environment variables
bos@37	375 being set to the values you have in your environment when testing the
bos@37	376 hook.
bos@37	377
bos@37	378 \subsection{Telling Mercurial to use an in-process hook}
bos@37	379
bos@37	380 The \hgrc\ syntax for defining an in-process hook is slightly
bos@37	381 different than for an executable hook. The value of the hook must
bos@37	382 start with the text ``\texttt{python:}'', and continue with the
bos@37	383 fully-qualified name of a callable object to use as the hook's value.
bos@37	384
bos@37	385 The module in which a hook lives is automatically imported when a hook
bos@37	386 is run. So long as you have the module name and \envar{PYTHONPATH}
bos@37	387 right, it should ``just work''.
bos@37	388
bos@37	389 The following \hgrc\ example snippet illustrates the syntax and
bos@37	390 meaning of the notions we just described.
bos@37	391 \begin{codesample2}
bos@37	392 [hooks]
bos@37	393 commit.example = python:mymodule.submodule.myhook
bos@37	394 \end{codesample2}
bos@37	395 When Mercurial runs the \texttt{commit.example} hook, it imports
bos@37	396 \texttt{mymodule.submodule}, looks for the callable object named
bos@37	397 \texttt{myhook}, and calls it.
bos@37	398
bos@37	399 \subsection{Writing an in-process hook}
bos@37	400
bos@37	401 The simplest in-process hook does nothing, but illustrates the basic
bos@37	402 shape of the hook API:
bos@37	403 \begin{codesample2}
bos@37	404 def myhook(ui, repo, **kwargs):
bos@37	405 pass
bos@37	406 \end{codesample2}
bos@37	407 The first argument to a Python hook is always a
bos@37	408 \pymodclass{mercurial.ui}{ui} object. The second is a repository object;
bos@37	409 at the moment, it is always an instance of
bos@37	410 \pymodclass{mercurial.localrepo}{localrepository}. Following these two
bos@37	411 arguments are other keyword arguments. Which ones are passed in
bos@37	412 depends on the hook being called, but a hook can ignore arguments it
bos@37	413 doesn't care about by dropping them into a keyword argument dict, as
bos@37	414 with \texttt{**kwargs} above.
bos@34	415
bos@44	416 \section{Some hook examples}
bos@44	417
bos@49	418 \subsection{Writing meaningful commit messages}
bos@49	419
bos@49	420 It's hard to imagine a useful commit message being very short. The
bos@49	421 simple \hook{pretxncommit} hook of figure~\ref{ex:hook:msglen.run}
bos@49	422 will prevent you from committing a changeset with a message that is
bos@49	423 less than ten bytes long.
bos@49	424
bos@49	425 \begin{figure}[ht]
bos@49	426 \interaction{hook.msglen.run}
bos@49	427 \caption{A hook that forbids overly short commit messages}
bos@49	428 \label{ex:hook:msglen.run}
bos@49	429 \end{figure}
bos@49	430
bos@49	431 \subsection{Checking for trailing whitespace}
bos@44	432
bos@44	433 An interesting use of a commit-related hook is to help you to write
bos@44	434 cleaner code. A simple example of ``cleaner code'' is the dictum that
bos@44	435 a change should not add any new lines of text that contain ``trailing
bos@44	436 whitespace''. Trailing whitespace is a series of space and tab
bos@44	437 characters at the end of a line of text. In most cases, trailing
bos@44	438 whitespace is unnecessary, invisible noise, but it is occasionally
bos@49	439 problematic, and people often prefer to get rid of it.
bos@44	440
bos@44	441 You can use either the \hook{precommit} or \hook{pretxncommit} hook to
bos@44	442 tell whether you have a trailing whitespace problem. If you use the
bos@44	443 \hook{precommit} hook, the hook will not know which files you are
bos@44	444 committing, so it will have to check every modified file in the
bos@44	445 repository for trailing white space. If you want to commit a change
bos@44	446 to just the file \filename{foo}, but the file \filename{bar} contains
bos@44	447 trailing whitespace, doing a check in the \hook{precommit} hook will
bos@44	448 prevent you from committing \filename{foo} due to the problem with
bos@44	449 \filename{bar}. This doesn't seem right.
bos@44	450
bos@44	451 Should you choose the \hook{pretxncommit} hook, the check won't occur
bos@44	452 until just before the transaction for the commit completes. This will
bos@44	453 allow you to check for problems only the exact files that are being
bos@44	454 committed. However, if you entered the commit message interactively
bos@44	455 and the hook fails, the transaction will roll back; you'll have to
bos@44	456 re-enter the commit message after you fix the trailing whitespace and
bos@44	457 run \hgcmd{commit} again.
bos@44	458
bos@44	459 \begin{figure}[ht]
bos@44	460 \interaction{hook.ws.simple}
bos@44	461 \caption{A simple hook that checks for trailing whitespace}
bos@44	462 \label{ex:hook:ws.simple}
bos@44	463 \end{figure}
bos@44	464
bos@44	465 Figure~\ref{ex:hook:ws.simple} introduces a simple \hook{pretxncommit}
bos@44	466 hook that checks for trailing whitespace. This hook is short, but not
bos@44	467 very helpful. It exits with an error status if a change adds a line
bos@44	468 with trailing whitespace to any file, but does not print any
bos@49	469 information that might help us to identify the offending file or
bos@49	470 line. It also has the nice property of not paying attention to
bos@49	471 unmodified lines; only lines that introduce new trailing whitespace
bos@49	472 cause problems.
bos@49	473
bos@49	474 \begin{figure}[ht]
bos@49	475 \interaction{hook.ws.better}
bos@49	476 \caption{A better trailing whitespace hook}
bos@49	477 \label{ex:hook:ws.better}
bos@49	478 \end{figure}
bos@49	479
bos@49	480 The example of figure~\ref{ex:hook:ws.better} is much more complex,
bos@49	481 but also more useful. It parses a unified diff to see if any lines
bos@49	482 add trailing whitespace, and prints the name of the file and the line
bos@49	483 number of each such occurrence. Even better, if the change adds
bos@49	484 trailing whitespace, this hook saves the commit comment and prints the
bos@49	485 name of the save file before exiting and telling Mercurial to roll the
bos@49	486 transaction back, so you can use
bos@49	487 \hgcmdargs{commit}{\hgopt{commit}{-l}~\emph{filename}} to reuse the
bos@49	488 saved commit message once you've corrected the problem.
bos@49	489
bos@49	490 As a final aside, note in figure~\ref{ex:hook:ws.better} the use of
bos@49	491 \command{perl}'s in-place editing feature to get rid of trailing
bos@49	492 whitespace from a file. This is concise and useful enough that I will
bos@49	493 reproduce it here.
bos@49	494 \begin{codesample2}
bos@54	495 perl -pi -e 's,\\s+\$,,' filename
bos@49	496 \end{codesample2}
bos@49	497
bos@49	498 \section{Bundled hooks}
bos@49	499
bos@49	500 Mercurial ships with several bundled hooks. You can find them in the
bos@49	501 \dirname{hgext} directory of a Mercurial source tree. If you are
bos@49	502 using a Mercurial binary package, the hooks will be located in the
bos@49	503 \dirname{hgext} directory of wherever your package installer put
bos@49	504 Mercurial.
bos@49	505
bos@49	506 \subsection{\hgext{acl}---access control for parts of a repository}
bos@49	507
bos@49	508 The \hgext{acl} extension lets you control which remote users are
bos@49	509 allowed to push changesets to a networked server. You can protect any
bos@49	510 portion of a repository (including the entire repo), so that a
bos@49	511 specific remote user can push changes that do not affect the protected
bos@49	512 portion.
bos@49	513
bos@49	514 This extension implements access control based on the identity of the
bos@49	515 user performing a push, \emph{not} on who committed the changesets
bos@49	516 they're pushing. (If access control based on committer was to work
bos@49	517 properly, it would require commits to be cryptographically signed,
bos@49	518 which is an onerous and hence unusual policy to enforce.)
bos@49	519
bos@49	520 XXX More help.
bos@44	521
bos@39	522 \section{Hook reference}
bos@41	523 \label{sec:hook:ref}
bos@39	524
bos@39	525 \subsection{In-process hook execution}
bos@39	526
bos@39	527 An in-process hook is called with arguments of the following form:
bos@39	528 \begin{codesample2}
bos@39	529 def myhook(ui, repo, **kwargs):
bos@39	530 pass
bos@39	531 \end{codesample2}
bos@39	532 The \texttt{ui} parameter is a \pymodclass{mercurial.ui}{ui} object.
bos@39	533 The \texttt{repo} parameter is a
bos@39	534 \pymodclass{mercurial.localrepo}{localrepository} object. The
bos@39	535 names and values of the \texttt{**kwargs} parameters depend on the
bos@39	536 hook being invoked, with the following common features:
bos@39	537 \begin{itemize}
bos@39	538 \item If a parameter is named \texttt{node} or
bos@39	539 \texttt{parent\emph{N}}, it will contain a hexadecimal changeset ID.
bos@39	540 The empty string is used to represent ``null changeset ID'' instead
bos@39	541 of a string of zeroes.
bos@39	542 \item Boolean-valued parameters are represented as Python
bos@39	543 \texttt{bool} objects.
bos@39	544 \end{itemize}
bos@39	545
bos@39	546 An in-process hook is called without a change to the process's working
bos@39	547 directory (unlike external hooks, which are run in the root of the
bos@39	548 repository). It must not change the process's working directory. If
bos@39	549 it were to do so, it would probably cause calls to the Mercurial API,
bos@39	550 or operations after the hook finishes, to fail.
bos@39	551
bos@39	552 If a hook returns a boolean ``false'' value, it is considered to
bos@39	553 have succeeded. If it returns a boolean ``true'' value or raises an
bos@39	554 exception, it is considered to have failed.
bos@39	555
bos@39	556 \subsection{External hook execution}
bos@39	557
bos@39	558 An external hook is passed to the user's shell for execution, so
bos@39	559 features of that shell, such as variable substitution and command
bos@39	560 redirection, are available. The hook is run in the root directory of
bos@39	561 the repository.
bos@39	562
bos@39	563 Hook parameters are passed to the hook as environment variables. Each
bos@39	564 environment variable's name is converted in upper case and prefixed
bos@39	565 with the string ``\texttt{HG\_}''. For example, if the name of a
bos@39	566 parameter is ``\texttt{node}'', the name of the environment variable
bos@39	567 representing that parameter will be ``\texttt{HG\_NODE}''.
bos@39	568
bos@39	569 A boolean parameter is represented as the string ``\texttt{1}'' for
bos@39	570 ``true'', ``\texttt{0}'' for ``false''. If an environment variable is
bos@39	571 named \envar{HG\_NODE}, \envar{HG\_PARENT1} or \envar{HG\_PARENT2}, it
bos@39	572 contains a changeset ID represented as a hexadecimal string. The
bos@39	573 empty string is used to represent ``null changeset ID'' instead of a
bos@39	574 string of zeroes.
bos@39	575
bos@39	576 If a hook exits with a status of zero, it is considered to have
bos@39	577 succeeded. If it exits with a non-zero status, it is considered to
bos@39	578 have failed.
bos@39	579
bos@39	580 \subsection{The \hook{changegroup} hook}
bos@39	581 \label{sec:hook:changegroup}
bos@39	582
bos@40	583 This hook is run after a group of pre-existing changesets has been
bos@40	584 added to the repository, for example via a \hgcmd{pull} or
bos@40	585 \hgcmd{unbundle}. This hook is run once per operation that added one
bos@41	586 or more changesets. This is in contrast to the \hook{incoming} hook,
bos@41	587 which is run once per changeset, regardless of whether the changesets
bos@41	588 arrive in a group.
bos@41	589
bos@41	590 Some possible uses for this hook include kicking off an automated
bos@41	591 build or test of the added changesets, updating a bug database, or
bos@41	592 notifying subscribers that a repository contains new changes.
bos@40	593
bos@40	594 Parameters to this hook:
bos@40	595 \begin{itemize}
bos@40	596 \item[\texttt{node}] A changeset ID. The changeset ID of the first
bos@40	597 changeset in the group that was added. All changesets between this
bos@40	598 and \index{tags!\texttt{tip}}\texttt{tip}, inclusive, were added by
bos@40	599 a single \hgcmd{pull}, \hgcmd{push} or \hgcmd{unbundle}.
bos@40	600 \end{itemize}
bos@40	601
bos@40	602 See also: \hook{incoming} (section~\ref{sec:hook:incoming}),
bos@40	603 \hook{prechangegroup} (section~\ref{sec:hook:prechangegroup}),
bos@40	604 \hook{pretxnchangegroup} (section~\ref{sec:hook:pretxnchangegroup})
bos@39	605
bos@39	606 \subsection{The \hook{commit} hook}
bos@39	607 \label{sec:hook:commit}
bos@39	608
bos@40	609 This hook is run after a new changeset has been created.
bos@40	610
bos@40	611 Parameters to this hook:
bos@40	612 \begin{itemize}
bos@40	613 \item[\texttt{node}] A changeset ID. The changeset ID of the newly
bos@40	614 committed changeset.
bos@40	615 \item[\texttt{parent1}] A changeset ID. The changeset ID of the first
bos@40	616 parent of the newly committed changeset.
bos@40	617 \item[\texttt{parent2}] A changeset ID. The changeset ID of the second
bos@40	618 parent of the newly committed changeset.
bos@40	619 \end{itemize}
bos@40	620
bos@40	621 See also: \hook{precommit} (section~\ref{sec:hook:precommit}),
bos@40	622 \hook{pretxncommit} (section~\ref{sec:hook:pretxncommit})
bos@40	623
bos@40	624 \subsection{The \hook{incoming} hook}
bos@40	625 \label{sec:hook:incoming}
bos@40	626
bos@40	627 This hook is run after a pre-existing changeset has been added to the
bos@40	628 repository, for example via a \hgcmd{push}. If a group of changesets
bos@40	629 was added in a single operation, this hook is called once for each
bos@40	630 added changeset.
bos@40	631
bos@41	632 You can use this hook for the same purposes as the \hook{changegroup}
bos@41	633 hook (section~\ref{sec:hook:changegroup}); it's simply more convenient
bos@54	634 sometimes to run a hook once per group of changesets, while other
bos@41	635 times it's handier once per changeset.
bos@41	636
bos@40	637 Parameters to this hook:
bos@40	638 \begin{itemize}
bos@40	639 \item[\texttt{node}] A changeset ID. The ID of the newly added
bos@39	640 changeset.
bos@40	641 \end{itemize}
bos@40	642
bos@40	643 See also: \hook{changegroup} (section~\ref{sec:hook:changegroup}) \hook{prechangegroup} (section~\ref{sec:hook:prechangegroup}), \hook{pretxnchangegroup} (section~\ref{sec:hook:pretxnchangegroup})
bos@40	644
bos@40	645 \subsection{The \hook{outgoing} hook}
bos@40	646 \label{sec:hook:outgoing}
bos@40	647
bos@40	648 This hook is run after a group of changesets has been propagated out
bos@40	649 of this repository, for example by a \hgcmd{push} or \hgcmd{bundle}
bos@40	650 command.
bos@40	651
bos@41	652 One possible use for this hook is to notify administrators that
bos@41	653 changes have been pulled.
bos@41	654
bos@40	655 Parameters to this hook:
bos@40	656 \begin{itemize}
bos@40	657 \item[\texttt{node}] A changeset ID. The changeset ID of the first
bos@40	658 changeset of the group that was sent.
bos@40	659 \item[\texttt{source}] A string. The source of the of the operation.
bos@40	660 If a remote client pulled changes from this repository,
bos@40	661 \texttt{source} will be \texttt{serve}. If the client that obtained
bos@40	662 changes from this repository was local, \texttt{source} will be
bos@40	663 \texttt{bundle}, \texttt{pull}, or \texttt{push}, depending on the
bos@40	664 operation the client performed.
bos@40	665 \end{itemize}
bos@40	666
bos@40	667 See also: \hook{preoutgoing} (section~\ref{sec:hook:preoutgoing})
bos@40	668
bos@40	669 \subsection{The \hook{prechangegroup} hook}
bos@40	670 \label{sec:hook:prechangegroup}
bos@40	671
bos@41	672 This controlling hook is run before Mercurial begins to add a group of
bos@41	673 changesets from another repository.
bos@41	674
bos@41	675 This hook does not have any information about the changesets to be
bos@41	676 added, because it is run before transmission of those changesets is
bos@41	677 allowed to begin. If this hook fails, the changesets will not be
bos@41	678 transmitted.
bos@41	679
bos@41	680 One use for this hook is to prevent external changes from being added
bos@41	681 to a repository, for example to ``freeze'' a server-hosted branch
bos@41	682 temporarily or permanently.
bos@41	683
bos@40	684 This hook is not passed any parameters.
bos@40	685
bos@40	686 See also: \hook{changegroup} (section~\ref{sec:hook:changegroup}),
bos@40	687 \hook{incoming} (section~\ref{sec:hook:incoming}), ,
bos@40	688 \hook{pretxnchangegroup} (section~\ref{sec:hook:pretxnchangegroup})
bos@40	689
bos@40	690 \subsection{The \hook{precommit} hook}
bos@40	691 \label{sec:hook:precommit}
bos@40	692
bos@41	693 This hook is run before Mercurial begins to commit a new changeset.
bos@41	694 It is run before Mercurial has any of the metadata for the commit,
bos@41	695 such as the files to be committed, the commit message, or the commit
bos@41	696 date.
bos@41	697
bos@41	698 One use for this hook is to disable the ability to commit new
bos@41	699 changesets, while still allowing incoming changesets. Another is to
bos@41	700 run a build or test, and only allow the commit to begin if the build
bos@41	701 or test succeeds.
bos@40	702
bos@40	703 Parameters to this hook:
bos@40	704 \begin{itemize}
bos@40	705 \item[\texttt{parent1}] A changeset ID. The changeset ID of the first
bos@40	706 parent of the working directory.
bos@40	707 \item[\texttt{parent2}] A changeset ID. The changeset ID of the second
bos@40	708 parent of the working directory.
bos@40	709 \end{itemize}
bos@40	710 If the commit proceeds, the parents of the working directory will
bos@40	711 become the parents of the new changeset.
bos@40	712
bos@40	713 See also: \hook{commit} (section~\ref{sec:hook:commit}),
bos@40	714 \hook{pretxncommit} (section~\ref{sec:hook:pretxncommit})
bos@40	715
bos@40	716 \subsection{The \hook{preoutgoing} hook}
bos@40	717 \label{sec:hook:preoutgoing}
bos@40	718
bos@40	719 This hook is invoked before Mercurial knows the identities of the
bos@40	720 changesets to be transmitted.
bos@40	721
bos@41	722 One use for this hook is to prevent changes from being transmitted to
bos@41	723 another repository.
bos@41	724
bos@40	725 Parameters to this hook:
bos@40	726 \begin{itemize}
bos@40	727 \item[\texttt{source}] A string. The source of the operation that is
bos@40	728 attempting to obtain changes from this repository. See the
bos@40	729 documentation for the \texttt{source} parameter to the
bos@40	730 \hook{outgoing} hook, in section~\ref{sec:hook:outgoing}, for
bos@40	731 possible values of this parameter..
bos@40	732 \end{itemize}
bos@40	733
bos@40	734 See also: \hook{outgoing} (section~\ref{sec:hook:outgoing})
bos@40	735
bos@40	736 \subsection{The \hook{pretag} hook}
bos@40	737 \label{sec:hook:pretag}
bos@40	738
bos@41	739 This controlling hook is run before a tag is created. If the hook
bos@41	740 succeeds, creation of the tag proceeds. If the hook fails, the tag is
bos@41	741 not created.
bos@41	742
bos@40	743 Parameters to this hook:
bos@40	744 \begin{itemize}
bos@40	745 \item[\texttt{local}] A boolean. Whether the tag is local to this
bos@40	746 repository instance (i.e.~stored in \sfilename{.hg/tags}) or managed
bos@40	747 by Mercurial (stored in \sfilename{.hgtags}).
bos@40	748 \item[\texttt{node}] A changeset ID. The ID of the changeset to be tagged.
bos@40	749 \item[\texttt{tag}] A string. The name of the tag to be created.
bos@40	750 \end{itemize}
bos@40	751
bos@40	752 If the tag to be created is revision-controlled, the \hook{precommit}
bos@40	753 and \hook{pretxncommit} hooks (sections~\ref{sec:hook:commit}
bos@40	754 and~\ref{sec:hook:pretxncommit}) will also be run.
bos@40	755
bos@40	756 See also: \hook{tag} (section~\ref{sec:hook:tag})
bos@40	757
bos@40	758 \subsection{The \hook{pretxnchangegroup} hook}
bos@40	759 \label{sec:hook:pretxnchangegroup}
bos@40	760
bos@41	761 This controlling hook is run before a transaction---that manages the
bos@41	762 addition of a group of new changesets from outside the
bos@41	763 repository---completes. If the hook succeeds, the transaction
bos@41	764 completes, and all of the changesets become permanent within this
bos@41	765 repository. If the hook fails, the transaction is rolled back, and
bos@41	766 the data for the changesets is erased.
bos@41	767
bos@41	768 This hook can access the metadata associated with the almost-added
bos@41	769 changesets, but it should not do anything permanent with this data.
bos@41	770 It must also not modify the working directory.
bos@41	771
bos@41	772 While this hook is running, if other Mercurial processes access this
bos@41	773 repository, they will be able to see the almost-added changesets as if
bos@41	774 they are permanent. This may lead to race conditions if you do not
bos@41	775 take steps to avoid them.
bos@41	776
bos@41	777 This hook can be used to automatically vet a group of changesets. If
bos@41	778 the hook fails, all of the changesets are ``rejected'' when the
bos@41	779 transaction rolls back.
bos@41	780
bos@40	781 Parameters to this hook are the same as for the \hook{changegroup}
bos@40	782 hook; see section~\ref{sec:hook:changegroup} for details.
bos@40	783
bos@40	784 See also: \hook{changegroup} (section~\ref{sec:hook:changegroup}),
bos@40	785 \hook{incoming} (section~\ref{sec:hook:incoming}),
bos@40	786 \hook{prechangegroup} (section~\ref{sec:hook:prechangegroup})
bos@40	787
bos@40	788 \subsection{The \hook{pretxncommit} hook}
bos@40	789 \label{sec:hook:pretxncommit}
bos@40	790
bos@41	791 This controlling hook is run before a transaction---that manages a new
bos@41	792 commit---completes. If the hook succeeds, the transaction completes
bos@41	793 and the changeset becomes permanent within this repository. If the
bos@41	794 hook fails, the transaction is rolled back, and the commit data is
bos@41	795 erased.
bos@41	796
bos@41	797 This hook can access the metadata associated with the almost-new
bos@41	798 changeset, but it should not do anything permanent with this data. It
bos@41	799 must also not modify the working directory.
bos@41	800
bos@41	801 While this hook is running, if other Mercurial processes access this
bos@41	802 repository, they will be able to see the almost-new changeset as if it
bos@41	803 is permanent. This may lead to race conditions if you do not take
bos@41	804 steps to avoid them.
bos@41	805
bos@40	806 Parameters to this hook are the same as for the \hook{commit} hook;
bos@40	807 see section~\ref{sec:hook:commit} for details.
bos@40	808
bos@40	809 See also: \hook{precommit} (section~\ref{sec:hook:precommit})
bos@40	810
bos@40	811 \subsection{The \hook{preupdate} hook}
bos@40	812 \label{sec:hook:preupdate}
bos@40	813
bos@41	814 This controlling hook is run before an update or merge of the working
bos@41	815 directory begins. It is run only if Mercurial's normal pre-update
bos@41	816 checks determine that the update or merge can proceed. If the hook
bos@41	817 succeeds, the update or merge may proceed; if it fails, the update or
bos@41	818 merge does not start.
bos@41	819
bos@40	820 Parameters to this hook:
bos@40	821 \begin{itemize}
bos@40	822 \item[\texttt{parent1}] A changeset ID. The ID of the parent that the
bos@40	823 working directory is to be updated to. If the working directory is
bos@40	824 being merged, it will not change this parent.
bos@40	825 \item[\texttt{parent2}] A changeset ID. Only set if the working
bos@40	826 directory is being merged. The ID of the revision that the working
bos@40	827 directory is being merged with.
bos@40	828 \end{itemize}
bos@40	829
bos@40	830 See also: \hook{update} (section~\ref{sec:hook:update})
bos@40	831
bos@40	832 \subsection{The \hook{tag} hook}
bos@40	833 \label{sec:hook:tag}
bos@40	834
bos@41	835 This hook is run after a tag has been created.
bos@41	836
bos@40	837 Parameters to this hook are the same as for the \hook{pretag} hook;
bos@40	838 see section~\ref{sec:hook:pretag} for details.
bos@40	839
bos@40	840 If the created tag is revision-controlled, the \hook{commit} hook
bos@41	841 (section~\ref{sec:hook:commit}) is run before this hook.
bos@40	842
bos@40	843 See also: \hook{pretag} (section~\ref{sec:hook:pretag})
bos@40	844
bos@40	845 \subsection{The \hook{update} hook}
bos@40	846 \label{sec:hook:update}
bos@40	847
bos@41	848 This hook is run after an update or merge of the working directory
bos@41	849 completes. Since a merge can fail (if the external \command{hgmerge}
bos@41	850 command fails to resolve conflicts in a file), this hook communicates
bos@41	851 whether the update or merge completed cleanly.
bos@41	852
bos@40	853 \begin{itemize}
bos@40	854 \item[\texttt{error}] A boolean. Indicates whether the update or
bos@40	855 merge completed successfully.
bos@40	856 \item[\texttt{parent1}] A changeset ID. The ID of the parent that the
bos@40	857 working directory was updated to. If the working directory was
bos@40	858 merged, it will not have changed this parent.
bos@40	859 \item[\texttt{parent2}] A changeset ID. Only set if the working
bos@40	860 directory was merged. The ID of the revision that the working
bos@40	861 directory was merged with.
bos@40	862 \end{itemize}
bos@40	863
bos@40	864 See also: \hook{preupdate} (section~\ref{sec:hook:preupdate})
bos@34	865
bos@34	866 %%% Local Variables:
bos@34	867 %%% mode: latex
bos@34	868 %%% TeX-master: "00book"
bos@34	869 %%% End: