hgbook

view en/ch06-collab.xml @ 647:d0160b0b1a9e

Merge with http://hg.serpentine.com/mercurial/book
author Dongsheng Song <dongsheng.song@gmail.com>
date Wed Mar 18 20:32:37 2009 +0800 (2009-03-18)
parents a13813534ccd 8366882f67f2
children e0ac2341a861
line source
1 <!-- vim: set filetype=docbkxml shiftwidth=2 autoindent expandtab tw=77 : -->
3 <chapter id="cha.collab">
4 <?dbhtml filename="collaborating-with-other-people.html"?>
5 <title>Collaborating with other people</title>
7 <para>As a completely decentralised tool, Mercurial doesn't impose
8 any policy on how people ought to work with each other. However,
9 if you're new to distributed revision control, it helps to have
10 some tools and examples in mind when you're thinking about
11 possible workflow models.</para>
13 <sect1>
14 <title>Mercurial's web interface</title>
16 <para>Mercurial has a powerful web interface that provides several
17 useful capabilities.</para>
19 <para>For interactive use, the web interface lets you browse a
20 single repository or a collection of repositories. You can view
21 the history of a repository, examine each change (comments and
22 diffs), and view the contents of each directory and file.</para>
24 <para>Also for human consumption, the web interface provides an
25 RSS feed of the changes in a repository. This lets you
26 <quote>subscribe</quote> to a repository using your favourite
27 feed reader, and be automatically notified of activity in that
28 repository as soon as it happens. I find this capability much
29 more convenient than the model of subscribing to a mailing list
30 to which notifications are sent, as it requires no additional
31 configuration on the part of whoever is serving the
32 repository.</para>
34 <para>The web interface also lets remote users clone a repository,
35 pull changes from it, and (when the server is configured to
36 permit it) push changes back to it. Mercurial's HTTP tunneling
37 protocol aggressively compresses data, so that it works
38 efficiently even over low-bandwidth network connections.</para>
40 <para>The easiest way to get started with the web interface is to
41 use your web browser to visit an existing repository, such as
42 the master Mercurial repository at <ulink
43 url="http://www.selenic.com/repo/hg?style=gitweb">http://www.selenic.com/repo/hg?style=gitweb</ulink>.</para>
45 <para>If you're interested in providing a web interface to your
46 own repositories, Mercurial provides two ways to do this. The
47 first is using the <command role="hg-cmd">hg serve</command>
48 command, which is best suited to short-term
49 <quote>lightweight</quote> serving. See section <xref
50 linkend="sec.collab.serve"/> below for details of how to use
51 this command. If you have a long-lived repository that you'd
52 like to make permanently available, Mercurial has built-in
53 support for the CGI (Common Gateway Interface) standard, which
54 all common web servers support. See section <xref
55 linkend="sec.collab.cgi"/> for details of CGI
56 configuration.</para>
58 </sect1>
59 <sect1>
60 <title>Collaboration models</title>
62 <para>With a suitably flexible tool, making decisions about
63 workflow is much more of a social engineering challenge than a
64 technical one. Mercurial imposes few limitations on how you can
65 structure the flow of work in a project, so it's up to you and
66 your group to set up and live with a model that matches your own
67 particular needs.</para>
69 <sect2>
70 <title>Factors to keep in mind</title>
72 <para>The most important aspect of any model that you must keep
73 in mind is how well it matches the needs and capabilities of
74 the people who will be using it. This might seem
75 self-evident; even so, you still can't afford to forget it for
76 a moment.</para>
78 <para>I once put together a workflow model that seemed to make
79 perfect sense to me, but that caused a considerable amount of
80 consternation and strife within my development team. In spite
81 of my attempts to explain why we needed a complex set of
82 branches, and how changes ought to flow between them, a few
83 team members revolted. Even though they were smart people,
84 they didn't want to pay attention to the constraints we were
85 operating under, or face the consequences of those constraints
86 in the details of the model that I was advocating.</para>
88 <para>Don't sweep foreseeable social or technical problems under
89 the rug. Whatever scheme you put into effect, you should plan
90 for mistakes and problem scenarios. Consider adding automated
91 machinery to prevent, or quickly recover from, trouble that
92 you can anticipate. As an example, if you intend to have a
93 branch with not-for-release changes in it, you'd do well to
94 think early about the possibility that someone might
95 accidentally merge those changes into a release branch. You
96 could avoid this particular problem by writing a hook that
97 prevents changes from being merged from an inappropriate
98 branch.</para>
100 </sect2>
101 <sect2>
102 <title>Informal anarchy</title>
104 <para>I wouldn't suggest an <quote>anything goes</quote>
105 approach as something sustainable, but it's a model that's
106 easy to grasp, and it works perfectly well in a few unusual
107 situations.</para>
109 <para>As one example, many projects have a loose-knit group of
110 collaborators who rarely physically meet each other. Some
111 groups like to overcome the isolation of working at a distance
112 by organising occasional <quote>sprints</quote>. In a sprint,
113 a number of people get together in a single location (a
114 company's conference room, a hotel meeting room, that kind of
115 place) and spend several days more or less locked in there,
116 hacking intensely on a handful of projects.</para>
118 <para>A sprint is the perfect place to use the <command
119 role="hg-cmd">hg serve</command> command, since <command
120 role="hg-cmd">hg serve</command> does not require any fancy
121 server infrastructure. You can get started with <command
122 role="hg-cmd">hg serve</command> in moments, by reading
123 section <xref linkend="sec.collab.serve"/> below. Then simply
124 tell
125 the person next to you that you're running a server, send the
126 URL to them in an instant message, and you immediately have a
127 quick-turnaround way to work together. They can type your URL
128 into their web browser and quickly review your changes; or
129 they can pull a bugfix from you and verify it; or they can
130 clone a branch containing a new feature and try it out.</para>
132 <para>The charm, and the problem, with doing things in an ad hoc
133 fashion like this is that only people who know about your
134 changes, and where they are, can see them. Such an informal
135 approach simply doesn't scale beyond a handful people, because
136 each individual needs to know about $n$ different repositories
137 to pull from.</para>
139 </sect2>
140 <sect2>
141 <title>A single central repository</title>
143 <para>For smaller projects migrating from a centralised revision
144 control tool, perhaps the easiest way to get started is to
145 have changes flow through a single shared central repository.
146 This is also the most common <quote>building block</quote> for
147 more ambitious workflow schemes.</para>
149 <para>Contributors start by cloning a copy of this repository.
150 They can pull changes from it whenever they need to, and some
151 (perhaps all) developers have permission to push a change back
152 when they're ready for other people to see it.</para>
154 <para>Under this model, it can still often make sense for people
155 to pull changes directly from each other, without going
156 through the central repository. Consider a case in which I
157 have a tentative bug fix, but I am worried that if I were to
158 publish it to the central repository, it might subsequently
159 break everyone else's trees as they pull it. To reduce the
160 potential for damage, I can ask you to clone my repository
161 into a temporary repository of your own and test it. This
162 lets us put off publishing the potentially unsafe change until
163 it has had a little testing.</para>
165 <para>In this kind of scenario, people usually use the
166 <command>ssh</command> protocol to securely push changes to
167 the central repository, as documented in section <xref
168 linkend="sec.collab.ssh"/>. It's also
169 usual to publish a read-only copy of the repository over HTTP
170 using CGI, as in section <xref linkend="sec.collab.cgi"/>.
171 Publishing over HTTP
172 satisfies the needs of people who don't have push access, and
173 those who want to use web browsers to browse the repository's
174 history.</para>
176 </sect2>
177 <sect2>
178 <title>Working with multiple branches</title>
180 <para>Projects of any significant size naturally tend to make
181 progress on several fronts simultaneously. In the case of
182 software, it's common for a project to go through periodic
183 official releases. A release might then go into
184 <quote>maintenance mode</quote> for a while after its first
185 publication; maintenance releases tend to contain only bug
186 fixes, not new features. In parallel with these maintenance
187 releases, one or more future releases may be under
188 development. People normally use the word
189 <quote>branch</quote> to refer to one of these many slightly
190 different directions in which development is
191 proceeding.</para>
193 <para>Mercurial is particularly well suited to managing a number
194 of simultaneous, but not identical, branches. Each
195 <quote>development direction</quote> can live in its own
196 central repository, and you can merge changes from one to
197 another as the need arises. Because repositories are
198 independent of each other, unstable changes in a development
199 branch will never affect a stable branch unless someone
200 explicitly merges those changes in.</para>
202 <para>Here's an example of how this can work in practice. Let's
203 say you have one <quote>main branch</quote> on a central
204 server.</para>
206 &interaction.branching.init;
208 <para>People clone it, make changes locally, test them, and push
209 them back.</para>
211 <para>Once the main branch reaches a release milestone, you can
212 use the <command role="hg-cmd">hg tag</command> command to
213 give a permanent name to the milestone revision.</para>
215 &interaction.branching.tag;
217 <para>Let's say some ongoing
218 development occurs on the main branch.</para>
220 &interaction.branching.main;
222 <para>Using the tag that was recorded at the milestone, people
223 who clone that repository at any time in the future can use
224 <command role="hg-cmd">hg update</command> to get a copy of
225 the working directory exactly as it was when that tagged
226 revision was committed.</para>
228 &interaction.branching.update;
230 <para>In addition, immediately after the main branch is tagged,
231 someone can then clone the main branch on the server to a new
232 <quote>stable</quote> branch, also on the server.</para>
234 &interaction.branching.clone;
236 <para>Someone who needs to make a change to the stable branch
237 can then clone <emphasis>that</emphasis> repository, make
238 their changes, commit, and push their changes back there.</para>
240 &interaction.branching.stable;
242 <para>Because Mercurial repositories are independent, and
243 Mercurial doesn't move changes around automatically, the
244 stable and main branches are <emphasis>isolated</emphasis>
245 from each other. The changes that you made on the main branch
246 don't <quote>leak</quote> to the stable branch, and vice
247 versa.</para>
249 <para>You'll often want all of your bugfixes on the stable
250 branch to show up on the main branch, too. Rather than
251 rewrite a bugfix on the main branch, you can simply pull and
252 merge changes from the stable to the main branch, and
253 Mercurial will bring those bugfixes in for you.</para>
255 &interaction.branching.merge;
257 <para>The main branch will still contain changes that are not on
258 the stable branch, but it will also contain all of the
259 bugfixes from the stable branch. The stable branch remains
260 unaffected by these changes.</para>
262 </sect2>
263 <sect2>
264 <title>Feature branches</title>
266 <para>For larger projects, an effective way to manage change is
267 to break up a team into smaller groups. Each group has a
268 shared branch of its own, cloned from a single
269 <quote>master</quote> branch used by the entire project.
270 People working on an individual branch are typically quite
271 isolated from developments on other branches.</para>
273 <informalfigure id="fig.collab.feature-branches">
274 <mediaobject>
275 <imageobject><imagedata fileref="images/feature-branches.png"/>
276 </imageobject>
277 <textobject><phrase>XXX add text</phrase></textobject>
278 <caption><para id="fig.collab.feature-branches.caption">Feature
279 branches</para></caption>
280 </mediaobject>
281 </informalfigure>
283 <para>When a particular feature is deemed to be in suitable
284 shape, someone on that feature team pulls and merges from the
285 master branch into the feature branch, then pushes back up to
286 the master branch.</para>
288 </sect2>
289 <sect2>
290 <title>The release train</title>
292 <para>Some projects are organised on a <quote>train</quote>
293 basis: a release is scheduled to happen every few months, and
294 whatever features are ready when the <quote>train</quote> is
295 ready to leave are allowed in.</para>
297 <para>This model resembles working with feature branches. The
298 difference is that when a feature branch misses a train,
299 someone on the feature team pulls and merges the changes that
300 went out on that train release into the feature branch, and
301 the team continues its work on top of that release so that
302 their feature can make the next release.</para>
304 </sect2>
305 <sect2>
306 <title>The Linux kernel model</title>
308 <para>The development of the Linux kernel has a shallow
309 hierarchical structure, surrounded by a cloud of apparent
310 chaos. Because most Linux developers use
311 <command>git</command>, a distributed revision control tool
312 with capabilities similar to Mercurial, it's useful to
313 describe the way work flows in that environment; if you like
314 the ideas, the approach translates well across tools.</para>
316 <para>At the center of the community sits Linus Torvalds, the
317 creator of Linux. He publishes a single source repository
318 that is considered the <quote>authoritative</quote> current
319 tree by the entire developer community. Anyone can clone
320 Linus's tree, but he is very choosy about whose trees he pulls
321 from.</para>
323 <para>Linus has a number of <quote>trusted lieutenants</quote>.
324 As a general rule, he pulls whatever changes they publish, in
325 most cases without even reviewing those changes. Some of
326 those lieutenants are generally agreed to be
327 <quote>maintainers</quote>, responsible for specific
328 subsystems within the kernel. If a random kernel hacker wants
329 to make a change to a subsystem that they want to end up in
330 Linus's tree, they must find out who the subsystem's
331 maintainer is, and ask that maintainer to take their change.
332 If the maintainer reviews their changes and agrees to take
333 them, they'll pass them along to Linus in due course.</para>
335 <para>Individual lieutenants have their own approaches to
336 reviewing, accepting, and publishing changes; and for deciding
337 when to feed them to Linus. In addition, there are several
338 well known branches that people use for different purposes.
339 For example, a few people maintain <quote>stable</quote>
340 repositories of older versions of the kernel, to which they
341 apply critical fixes as needed. Some maintainers publish
342 multiple trees: one for experimental changes; one for changes
343 that they are about to feed upstream; and so on. Others just
344 publish a single tree.</para>
346 <para>This model has two notable features. The first is that
347 it's <quote>pull only</quote>. You have to ask, convince, or
348 beg another developer to take a change from you, because there
349 are almost no trees to which more than one person can push,
350 and there's no way to push changes into a tree that someone
351 else controls.</para>
353 <para>The second is that it's based on reputation and acclaim.
354 If you're an unknown, Linus will probably ignore changes from
355 you without even responding. But a subsystem maintainer will
356 probably review them, and will likely take them if they pass
357 their criteria for suitability. The more <quote>good</quote>
358 changes you contribute to a maintainer, the more likely they
359 are to trust your judgment and accept your changes. If you're
360 well-known and maintain a long-lived branch for something
361 Linus hasn't yet accepted, people with similar interests may
362 pull your changes regularly to keep up with your work.</para>
364 <para>Reputation and acclaim don't necessarily cross subsystem
365 or <quote>people</quote> boundaries. If you're a respected
366 but specialised storage hacker, and you try to fix a
367 networking bug, that change will receive a level of scrutiny
368 from a network maintainer comparable to a change from a
369 complete stranger.</para>
371 <para>To people who come from more orderly project backgrounds,
372 the comparatively chaotic Linux kernel development process
373 often seems completely insane. It's subject to the whims of
374 individuals; people make sweeping changes whenever they deem
375 it appropriate; and the pace of development is astounding.
376 And yet Linux is a highly successful, well-regarded piece of
377 software.</para>
379 </sect2>
380 <sect2>
381 <title>Pull-only versus shared-push collaboration</title>
383 <para>A perpetual source of heat in the open source community is
384 whether a development model in which people only ever pull
385 changes from others is <quote>better than</quote> one in which
386 multiple people can push changes to a shared
387 repository.</para>
389 <para>Typically, the backers of the shared-push model use tools
390 that actively enforce this approach. If you're using a
391 centralised revision control tool such as Subversion, there's
392 no way to make a choice over which model you'll use: the tool
393 gives you shared-push, and if you want to do anything else,
394 you'll have to roll your own approach on top (such as applying
395 a patch by hand).</para>
397 <para>A good distributed revision control tool, such as
398 Mercurial, will support both models. You and your
399 collaborators can then structure how you work together based
400 on your own needs and preferences, not on what contortions
401 your tools force you into.</para>
403 </sect2>
404 <sect2>
405 <title>Where collaboration meets branch management</title>
407 <para>Once you and your team set up some shared repositories and
408 start propagating changes back and forth between local and
409 shared repos, you begin to face a related, but slightly
410 different challenge: that of managing the multiple directions
411 in which your team may be moving at once. Even though this
412 subject is intimately related to how your team collaborates,
413 it's dense enough to merit treatment of its own, in chapter
414 <xref linkend="chap.branch"/>.</para>
416 </sect2>
417 </sect1>
418 <sect1>
419 <title>The technical side of sharing</title>
421 <para>The remainder of this chapter is devoted to the question of
422 serving data to your collaborators.</para>
424 </sect1>
425 <sect1 id="sec.collab.serve">
426 <title>Informal sharing with <command role="hg-cmd">hg
427 serve</command></title>
429 <para>Mercurial's <command role="hg-cmd">hg serve</command>
430 command is wonderfully suited to small, tight-knit, and
431 fast-paced group environments. It also provides a great way to
432 get a feel for using Mercurial commands over a network.</para>
434 <para>Run <command role="hg-cmd">hg serve</command> inside a
435 repository, and in under a second it will bring up a specialised
436 HTTP server; this will accept connections from any client, and
437 serve up data for that repository until you terminate it.
438 Anyone who knows the URL of the server you just started, and can
439 talk to your computer over the network, can then use a web
440 browser or Mercurial to read data from that repository. A URL
441 for a <command role="hg-cmd">hg serve</command> instance running
442 on a laptop is likely to look something like
443 <literal>http://my-laptop.local:8000/</literal>.</para>
445 <para>The <command role="hg-cmd">hg serve</command> command is
446 <emphasis>not</emphasis> a general-purpose web server. It can do
447 only two things:</para>
448 <itemizedlist>
449 <listitem><para>Allow people to browse the history of the
450 repository it's serving, from their normal web
451 browsers.</para>
452 </listitem>
453 <listitem><para>Speak Mercurial's wire protocol, so that people
454 can <command role="hg-cmd">hg clone</command> or <command
455 role="hg-cmd">hg pull</command> changes from that
456 repository.</para>
457 </listitem></itemizedlist>
458 <para>In particular, <command role="hg-cmd">hg serve</command>
459 won't allow remote users to <emphasis>modify</emphasis> your
460 repository. It's intended for read-only use.</para>
462 <para>If you're getting started with Mercurial, there's nothing to
463 prevent you from using <command role="hg-cmd">hg serve</command>
464 to serve up a repository on your own computer, then use commands
465 like <command role="hg-cmd">hg clone</command>, <command
466 role="hg-cmd">hg incoming</command>, and so on to talk to that
467 server as if the repository was hosted remotely. This can help
468 you to quickly get acquainted with using commands on
469 network-hosted repositories.</para>
471 <sect2>
472 <title>A few things to keep in mind</title>
474 <para>Because it provides unauthenticated read access to all
475 clients, you should only use <command role="hg-cmd">hg
476 serve</command> in an environment where you either don't
477 care, or have complete control over, who can access your
478 network and pull data from your repository.</para>
480 <para>The <command role="hg-cmd">hg serve</command> command
481 knows nothing about any firewall software you might have
482 installed on your system or network. It cannot detect or
483 control your firewall software. If other people are unable to
484 talk to a running <command role="hg-cmd">hg serve</command>
485 instance, the second thing you should do
486 (<emphasis>after</emphasis> you make sure that they're using
487 the correct URL) is check your firewall configuration.</para>
489 <para>By default, <command role="hg-cmd">hg serve</command>
490 listens for incoming connections on port 8000. If another
491 process is already listening on the port you want to use, you
492 can specify a different port to listen on using the <option
493 role="hg-opt-serve">-p</option> option.</para>
495 <para>Normally, when <command role="hg-cmd">hg serve</command>
496 starts, it prints no output, which can be a bit unnerving. If
497 you'd like to confirm that it is indeed running correctly, and
498 find out what URL you should send to your collaborators, start
499 it with the <option role="hg-opt-global">-v</option>
500 option.</para>
502 </sect2>
503 </sect1>
504 <sect1 id="sec.collab.ssh">
505 <title>Using the Secure Shell (ssh) protocol</title>
507 <para>You can pull and push changes securely over a network
508 connection using the Secure Shell (<literal>ssh</literal>)
509 protocol. To use this successfully, you may have to do a little
510 bit of configuration on the client or server sides.</para>
512 <para>If you're not familiar with ssh, it's a network protocol
513 that lets you securely communicate with another computer. To
514 use it with Mercurial, you'll be setting up one or more user
515 accounts on a server so that remote users can log in and execute
516 commands.</para>
518 <para>(If you <emphasis>are</emphasis> familiar with ssh, you'll
519 probably find some of the material that follows to be elementary
520 in nature.)</para>
522 <sect2>
523 <title>How to read and write ssh URLs</title>
525 <para>An ssh URL tends to look like this:</para>
526 <programlisting>ssh://bos@hg.serpentine.com:22/hg/hgbook</programlisting>
527 <orderedlist>
528 <listitem><para>The <quote><literal>ssh://</literal></quote>
529 part tells Mercurial to use the ssh protocol.</para>
530 </listitem>
531 <listitem><para>The <quote><literal>bos@</literal></quote>
532 component indicates what username to log into the server
533 as. You can leave this out if the remote username is the
534 same as your local username.</para>
535 </listitem>
536 <listitem><para>The
537 <quote><literal>hg.serpentine.com</literal></quote> gives
538 the hostname of the server to log into.</para>
539 </listitem>
540 <listitem><para>The <quote>:22</quote> identifies the port
541 number to connect to the server on. The default port is
542 22, so you only need to specify a colon and port number if
543 you're <emphasis>not</emphasis> using port 22.</para>
544 </listitem>
545 <listitem><para>The remainder of the URL is the local path to
546 the repository on the server.</para>
547 </listitem></orderedlist>
549 <para>There's plenty of scope for confusion with the path
550 component of ssh URLs, as there is no standard way for tools
551 to interpret it. Some programs behave differently than others
552 when dealing with these paths. This isn't an ideal situation,
553 but it's unlikely to change. Please read the following
554 paragraphs carefully.</para>
556 <para>Mercurial treats the path to a repository on the server as
557 relative to the remote user's home directory. For example, if
558 user <literal>foo</literal> on the server has a home directory
559 of <filename class="directory">/home/foo</filename>, then an
560 ssh URL that contains a path component of <filename
561 class="directory">bar</filename> <emphasis>really</emphasis>
562 refers to the directory <filename
563 class="directory">/home/foo/bar</filename>.</para>
565 <para>If you want to specify a path relative to another user's
566 home directory, you can use a path that starts with a tilde
567 character followed by the user's name (let's call them
568 <literal>otheruser</literal>), like this.</para>
569 <programlisting>ssh://server/~otheruser/hg/repo</programlisting>
571 <para>And if you really want to specify an
572 <emphasis>absolute</emphasis> path on the server, begin the
573 path component with two slashes, as in this example.</para>
574 <programlisting>ssh://server//absolute/path</programlisting>
576 </sect2>
577 <sect2>
578 <title>Finding an ssh client for your system</title>
580 <para>Almost every Unix-like system comes with OpenSSH
581 preinstalled. If you're using such a system, run
582 <literal>which ssh</literal> to find out if the
583 <command>ssh</command> command is installed (it's usually in
584 <filename class="directory">/usr/bin</filename>). In the
585 unlikely event that it isn't present, take a look at your
586 system documentation to figure out how to install it.</para>
588 <para>On Windows, you'll first need to download a suitable ssh
589 client. There are two alternatives.</para>
590 <itemizedlist>
591 <listitem><para>Simon Tatham's excellent PuTTY package
592 <citation>web:putty</citation> provides a complete suite
593 of ssh client commands.</para>
594 </listitem>
595 <listitem><para>If you have a high tolerance for pain, you can
596 use the Cygwin port of OpenSSH.</para>
597 </listitem></itemizedlist>
598 <para>In either case, you'll need to edit your <filename
599 role="special">hg.ini</filename> file to
600 tell Mercurial where to find the actual client command. For
601 example, if you're using PuTTY, you'll need to use the
602 <command>plink</command> command as a command-line ssh
603 client.</para>
604 <programlisting>[ui]
605 ssh = C:/path/to/plink.exe -ssh -i "C:/path/to/my/private/key"</programlisting>
607 <note>
608 <para> The path to <command>plink</command> shouldn't contain
609 any whitespace characters, or Mercurial may not be able to
610 run it correctly (so putting it in <filename
611 class="directory">C:\Program Files</filename> is probably
612 not a good idea).</para>
613 </note>
615 </sect2>
616 <sect2>
617 <title>Generating a key pair</title>
619 <para>To avoid the need to repetitively type a password every
620 time you need to use your ssh client, I recommend generating a
621 key pair. On a Unix-like system, the
622 <command>ssh-keygen</command> command will do the trick. On
623 Windows, if you're using PuTTY, the
624 <command>puttygen</command> command is what you'll
625 need.</para>
627 <para>When you generate a key pair, it's usually
628 <emphasis>highly</emphasis> advisable to protect it with a
629 passphrase. (The only time that you might not want to do this
630 is when you're using the ssh protocol for automated tasks on a
631 secure network.)</para>
633 <para>Simply generating a key pair isn't enough, however.
634 You'll need to add the public key to the set of authorised
635 keys for whatever user you're logging in remotely as. For
636 servers using OpenSSH (the vast majority), this will mean
637 adding the public key to a list in a file called <filename
638 role="special">authorized_keys</filename> in their <filename
639 role="special" class="directory">.ssh</filename>
640 directory.</para>
642 <para>On a Unix-like system, your public key will have a
643 <filename>.pub</filename> extension. If you're using
644 <command>puttygen</command> on Windows, you can save the
645 public key to a file of your choosing, or paste it from the
646 window it's displayed in straight into the <filename
647 role="special">authorized_keys</filename> file.</para>
649 </sect2>
650 <sect2>
651 <title>Using an authentication agent</title>
653 <para>An authentication agent is a daemon that stores
654 passphrases in memory (so it will forget passphrases if you
655 log out and log back in again). An ssh client will notice if
656 it's running, and query it for a passphrase. If there's no
657 authentication agent running, or the agent doesn't store the
658 necessary passphrase, you'll have to type your passphrase
659 every time Mercurial tries to communicate with a server on
660 your behalf (e.g. whenever you pull or push changes).</para>
662 <para>The downside of storing passphrases in an agent is that
663 it's possible for a well-prepared attacker to recover the
664 plain text of your passphrases, in some cases even if your
665 system has been power-cycled. You should make your own
666 judgment as to whether this is an acceptable risk. It
667 certainly saves a lot of repeated typing.</para>
669 <para>On Unix-like systems, the agent is called
670 <command>ssh-agent</command>, and it's often run automatically
671 for you when you log in. You'll need to use the
672 <command>ssh-add</command> command to add passphrases to the
673 agent's store. On Windows, if you're using PuTTY, the
674 <command>pageant</command> command acts as the agent. It adds
675 an icon to your system tray that will let you manage stored
676 passphrases.</para>
678 </sect2>
679 <sect2>
680 <title>Configuring the server side properly</title>
682 <para>Because ssh can be fiddly to set up if you're new to it,
683 there's a variety of things that can go wrong. Add Mercurial
684 on top, and there's plenty more scope for head-scratching.
685 Most of these potential problems occur on the server side, not
686 the client side. The good news is that once you've gotten a
687 configuration working, it will usually continue to work
688 indefinitely.</para>
690 <para>Before you try using Mercurial to talk to an ssh server,
691 it's best to make sure that you can use the normal
692 <command>ssh</command> or <command>putty</command> command to
693 talk to the server first. If you run into problems with using
694 these commands directly, Mercurial surely won't work. Worse,
695 it will obscure the underlying problem. Any time you want to
696 debug ssh-related Mercurial problems, you should drop back to
697 making sure that plain ssh client commands work first,
698 <emphasis>before</emphasis> you worry about whether there's a
699 problem with Mercurial.</para>
701 <para>The first thing to be sure of on the server side is that
702 you can actually log in from another machine at all. If you
703 can't use <command>ssh</command> or <command>putty</command>
704 to log in, the error message you get may give you a few hints
705 as to what's wrong. The most common problems are as
706 follows.</para>
707 <itemizedlist>
708 <listitem><para>If you get a <quote>connection refused</quote>
709 error, either there isn't an SSH daemon running on the
710 server at all, or it's inaccessible due to firewall
711 configuration.</para>
712 </listitem>
713 <listitem><para>If you get a <quote>no route to host</quote>
714 error, you either have an incorrect address for the server
715 or a seriously locked down firewall that won't admit its
716 existence at all.</para>
717 </listitem>
718 <listitem><para>If you get a <quote>permission denied</quote>
719 error, you may have mistyped the username on the server,
720 or you could have mistyped your key's passphrase or the
721 remote user's password.</para>
722 </listitem></itemizedlist>
723 <para>In summary, if you're having trouble talking to the
724 server's ssh daemon, first make sure that one is running at
725 all. On many systems it will be installed, but disabled, by
726 default. Once you're done with this step, you should then
727 check that the server's firewall is configured to allow
728 incoming connections on the port the ssh daemon is listening
729 on (usually 22). Don't worry about more exotic possibilities
730 for misconfiguration until you've checked these two
731 first.</para>
733 <para>If you're using an authentication agent on the client side
734 to store passphrases for your keys, you ought to be able to
735 log into the server without being prompted for a passphrase or
736 a password. If you're prompted for a passphrase, there are a
737 few possible culprits.</para>
738 <itemizedlist>
739 <listitem><para>You might have forgotten to use
740 <command>ssh-add</command> or <command>pageant</command>
741 to store the passphrase.</para>
742 </listitem>
743 <listitem><para>You might have stored the passphrase for the
744 wrong key.</para>
745 </listitem></itemizedlist>
746 <para>If you're being prompted for the remote user's password,
747 there are another few possible problems to check.</para>
748 <itemizedlist>
749 <listitem><para>Either the user's home directory or their
750 <filename role="special" class="directory">.ssh</filename>
751 directory might have excessively liberal permissions. As
752 a result, the ssh daemon will not trust or read their
753 <filename role="special">authorized_keys</filename> file.
754 For example, a group-writable home or <filename
755 role="special" class="directory">.ssh</filename>
756 directory will often cause this symptom.</para>
757 </listitem>
758 <listitem><para>The user's <filename
759 role="special">authorized_keys</filename> file may have
760 a problem. If anyone other than the user owns or can write
761 to that file, the ssh daemon will not trust or read
762 it.</para>
763 </listitem></itemizedlist>
765 <para>In the ideal world, you should be able to run the
766 following command successfully, and it should print exactly
767 one line of output, the current date and time.</para>
768 <programlisting>ssh myserver date</programlisting>
770 <para>If, on your server, you have login scripts that print
771 banners or other junk even when running non-interactive
772 commands like this, you should fix them before you continue,
773 so that they only print output if they're run interactively.
774 Otherwise these banners will at least clutter up Mercurial's
775 output. Worse, they could potentially cause problems with
776 running Mercurial commands remotely. Mercurial makes tries to
777 detect and ignore banners in non-interactive
778 <command>ssh</command> sessions, but it is not foolproof. (If
779 you're editing your login scripts on your server, the usual
780 way to see if a login script is running in an interactive
781 shell is to check the return code from the command
782 <literal>tty -s</literal>.)</para>
784 <para>Once you've verified that plain old ssh is working with
785 your server, the next step is to ensure that Mercurial runs on
786 the server. The following command should run
787 successfully:</para>
789 <programlisting>ssh myserver hg version</programlisting>
791 <para>If you see an error message instead of normal <command
792 role="hg-cmd">hg version</command> output, this is usually
793 because you haven't installed Mercurial to <filename
794 class="directory">/usr/bin</filename>. Don't worry if this
795 is the case; you don't need to do that. But you should check
796 for a few possible problems.</para>
797 <itemizedlist>
798 <listitem><para>Is Mercurial really installed on the server at
799 all? I know this sounds trivial, but it's worth
800 checking!</para>
801 </listitem>
802 <listitem><para>Maybe your shell's search path (usually set
803 via the <envar>PATH</envar> environment variable) is
804 simply misconfigured.</para>
805 </listitem>
806 <listitem><para>Perhaps your <envar>PATH</envar> environment
807 variable is only being set to point to the location of the
808 <command>hg</command> executable if the login session is
809 interactive. This can happen if you're setting the path
810 in the wrong shell login script. See your shell's
811 documentation for details.</para>
812 </listitem>
813 <listitem><para>The <envar>PYTHONPATH</envar> environment
814 variable may need to contain the path to the Mercurial
815 Python modules. It might not be set at all; it could be
816 incorrect; or it may be set only if the login is
817 interactive.</para>
818 </listitem></itemizedlist>
820 <para>If you can run <command role="hg-cmd">hg version</command>
821 over an ssh connection, well done! You've got the server and
822 client sorted out. You should now be able to use Mercurial to
823 access repositories hosted by that username on that server.
824 If you run into problems with Mercurial and ssh at this point,
825 try using the <option role="hg-opt-global">--debug</option>
826 option to get a clearer picture of what's going on.</para>
828 </sect2>
829 <sect2>
830 <title>Using compression with ssh</title>
832 <para>Mercurial does not compress data when it uses the ssh
833 protocol, because the ssh protocol can transparently compress
834 data. However, the default behaviour of ssh clients is
835 <emphasis>not</emphasis> to request compression.</para>
837 <para>Over any network other than a fast LAN (even a wireless
838 network), using compression is likely to significantly speed
839 up Mercurial's network operations. For example, over a WAN,
840 someone measured compression as reducing the amount of time
841 required to clone a particularly large repository from 51
842 minutes to 17 minutes.</para>
844 <para>Both <command>ssh</command> and <command>plink</command>
845 accept a <option role="cmd-opt-ssh">-C</option> option which
846 turns on compression. You can easily edit your <filename
847 role="special">~/.hgrc</filename> to enable compression for
848 all of Mercurial's uses of the ssh protocol.</para>
849 <programlisting>[ui]
850 ssh = ssh -C</programlisting>
852 <para>If you use <command>ssh</command>, you can configure it to
853 always use compression when talking to your server. To do
854 this, edit your <filename
855 role="special">.ssh/config</filename> file (which may not
856 yet exist), as follows.</para>
857 <programlisting>Host hg
858 Compression yes
859 HostName hg.example.com</programlisting>
860 <para>This defines an alias, <literal>hg</literal>. When you
861 use it on the <command>ssh</command> command line or in a
862 Mercurial <literal>ssh</literal>-protocol URL, it will cause
863 <command>ssh</command> to connect to
864 <literal>hg.example.com</literal> and use compression. This
865 gives you both a shorter name to type and compression, each of
866 which is a good thing in its own right.</para>
868 </sect2>
869 </sect1>
870 <sect1 id="sec.collab.cgi">
871 <title>Serving over HTTP using CGI</title>
873 <para>Depending on how ambitious you are, configuring Mercurial's
874 CGI interface can take anything from a few moments to several
875 hours.</para>
877 <para>We'll begin with the simplest of examples, and work our way
878 towards a more complex configuration. Even for the most basic
879 case, you're almost certainly going to need to read and modify
880 your web server's configuration.</para>
882 <note>
883 <para> Configuring a web server is a complex, fiddly, and
884 highly system-dependent activity. I can't possibly give you
885 instructions that will cover anything like all of the cases
886 you will encounter. Please use your discretion and judgment in
887 following the sections below. Be prepared to make plenty of
888 mistakes, and to spend a lot of time reading your server's
889 error logs.</para>
890 </note>
892 <sect2>
893 <title>Web server configuration checklist</title>
895 <para>Before you continue, do take a few moments to check a few
896 aspects of your system's setup.</para>
898 <orderedlist>
899 <listitem><para>Do you have a web server installed at all?
900 Mac OS X ships with Apache, but many other systems may not
901 have a web server installed.</para>
902 </listitem>
903 <listitem><para>If you have a web server installed, is it
904 actually running? On most systems, even if one is
905 present, it will be disabled by default.</para>
906 </listitem>
907 <listitem><para>Is your server configured to allow you to run
908 CGI programs in the directory where you plan to do so?
909 Most servers default to explicitly disabling the ability
910 to run CGI programs.</para>
911 </listitem></orderedlist>
913 <para>If you don't have a web server installed, and don't have
914 substantial experience configuring Apache, you should consider
915 using the <literal>lighttpd</literal> web server instead of
916 Apache. Apache has a well-deserved reputation for baroque and
917 confusing configuration. While <literal>lighttpd</literal> is
918 less capable in some ways than Apache, most of these
919 capabilities are not relevant to serving Mercurial
920 repositories. And <literal>lighttpd</literal> is undeniably
921 <emphasis>much</emphasis> easier to get started with than
922 Apache.</para>
924 </sect2>
925 <sect2>
926 <title>Basic CGI configuration</title>
928 <para>On Unix-like systems, it's common for users to have a
929 subdirectory named something like <filename
930 class="directory">public_html</filename> in their home
931 directory, from which they can serve up web pages. A file
932 named <filename>foo</filename> in this directory will be
933 accessible at a URL of the form
934 <literal>http://www.example.com/username/foo</literal>.</para>
936 <para>To get started, find the <filename
937 role="special">hgweb.cgi</filename> script that should be
938 present in your Mercurial installation. If you can't quickly
939 find a local copy on your system, simply download one from the
940 master Mercurial repository at <ulink
941 url="http://www.selenic.com/repo/hg/raw-file/tip/hgweb.cgi">http://www.selenic.com/repo/hg/raw-file/tip/hgweb.cgi</ulink>.</para>
943 <para>You'll need to copy this script into your <filename
944 class="directory">public_html</filename> directory, and
945 ensure that it's executable.</para>
946 <programlisting>cp .../hgweb.cgi ~/public_html
947 chmod 755 ~/public_html/hgweb.cgi</programlisting>
948 <para>The <literal>755</literal> argument to
949 <command>chmod</command> is a little more general than just
950 making the script executable: it ensures that the script is
951 executable by anyone, and that <quote>group</quote> and
952 <quote>other</quote> write permissions are
953 <emphasis>not</emphasis> set. If you were to leave those
954 write permissions enabled, Apache's <literal>suexec</literal>
955 subsystem would likely refuse to execute the script. In fact,
956 <literal>suexec</literal> also insists that the
957 <emphasis>directory</emphasis> in which the script resides
958 must not be writable by others.</para>
959 <programlisting>chmod 755 ~/public_html</programlisting>
961 <sect3 id="sec.collab.wtf">
962 <title>What could <emphasis>possibly</emphasis> go
963 wrong?</title>
965 <para>Once you've copied the CGI script into place, go into a
966 web browser, and try to open the URL <ulink
967 url="http://myhostname/
968 myuser/hgweb.cgi">http://myhostname/
969 myuser/hgweb.cgi</ulink>, <emphasis>but</emphasis> brace
970 yourself for instant failure. There's a high probability
971 that trying to visit this URL will fail, and there are many
972 possible reasons for this. In fact, you're likely to
973 stumble over almost every one of the possible errors below,
974 so please read carefully. The following are all of the
975 problems I ran into on a system running Fedora 7, with a
976 fresh installation of Apache, and a user account that I
977 created specially to perform this exercise.</para>
979 <para>Your web server may have per-user directories disabled.
980 If you're using Apache, search your config file for a
981 <literal>UserDir</literal> directive. If there's none
982 present, per-user directories will be disabled. If one
983 exists, but its value is <literal>disabled</literal>, then
984 per-user directories will be disabled. Otherwise, the
985 string after <literal>UserDir</literal> gives the name of
986 the subdirectory that Apache will look in under your home
987 directory, for example <filename
988 class="directory">public_html</filename>.</para>
990 <para>Your file access permissions may be too restrictive.
991 The web server must be able to traverse your home directory
992 and directories under your <filename
993 class="directory">public_html</filename> directory, and
994 read files under the latter too. Here's a quick recipe to
995 help you to make your permissions more appropriate.</para>
996 <programlisting>chmod 755 ~
997 find ~/public_html -type d -print0 | xargs -0r chmod 755
998 find ~/public_html -type f -print0 | xargs -0r chmod 644</programlisting>
1000 <para>The other possibility with permissions is that you might
1001 get a completely empty window when you try to load the
1002 script. In this case, it's likely that your access
1003 permissions are <emphasis>too permissive</emphasis>. Apache's
1004 <literal>suexec</literal> subsystem won't execute a script
1005 that's group- or world-writable, for example.</para>
1007 <para>Your web server may be configured to disallow execution
1008 of CGI programs in your per-user web directory. Here's
1009 Apache's default per-user configuration from my Fedora
1010 system.</para>
1012 &ch06-apache-config.lst;
1014 <para>If you find a similar-looking
1015 <literal>Directory</literal> group in your Apache
1016 configuration, the directive to look at inside it is
1017 <literal>Options</literal>. Add <literal>ExecCGI</literal>
1018 to the end of this list if it's missing, and restart the web
1019 server.</para>
1021 <para>If you find that Apache serves you the text of the CGI
1022 script instead of executing it, you may need to either
1023 uncomment (if already present) or add a directive like
1024 this.</para>
1025 <programlisting>AddHandler cgi-script .cgi</programlisting>
1027 <para>The next possibility is that you might be served with a
1028 colourful Python backtrace claiming that it can't import a
1029 <literal>mercurial</literal>-related module. This is
1030 actually progress! The server is now capable of executing
1031 your CGI script. This error is only likely to occur if
1032 you're running a private installation of Mercurial, instead
1033 of a system-wide version. Remember that the web server runs
1034 the CGI program without any of the environment variables
1035 that you take for granted in an interactive session. If
1036 this error happens to you, edit your copy of <filename
1037 role="special">hgweb.cgi</filename> and follow the
1038 directions inside it to correctly set your
1039 <envar>PYTHONPATH</envar> environment variable.</para>
1041 <para>Finally, you are <emphasis>certain</emphasis> to by
1042 served with another colourful Python backtrace: this one
1043 will complain that it can't find <filename
1044 class="directory">/path/to/repository</filename>. Edit
1045 your <filename role="special">hgweb.cgi</filename> script
1046 and replace the <filename
1047 class="directory">/path/to/repository</filename> string
1048 with the complete path to the repository you want to serve
1049 up.</para>
1051 <para>At this point, when you try to reload the page, you
1052 should be presented with a nice HTML view of your
1053 repository's history. Whew!</para>
1055 </sect3>
1056 <sect3>
1057 <title>Configuring lighttpd</title>
1059 <para>To be exhaustive in my experiments, I tried configuring
1060 the increasingly popular <literal>lighttpd</literal> web
1061 server to serve the same repository as I described with
1062 Apache above. I had already overcome all of the problems I
1063 outlined with Apache, many of which are not server-specific.
1064 As a result, I was fairly sure that my file and directory
1065 permissions were good, and that my <filename
1066 role="special">hgweb.cgi</filename> script was properly
1067 edited.</para>
1069 <para>Once I had Apache running, getting
1070 <literal>lighttpd</literal> to serve the repository was a
1071 snap (in other words, even if you're trying to use
1072 <literal>lighttpd</literal>, you should read the Apache
1073 section). I first had to edit the
1074 <literal>mod_access</literal> section of its config file to
1075 enable <literal>mod_cgi</literal> and
1076 <literal>mod_userdir</literal>, both of which were disabled
1077 by default on my system. I then added a few lines to the
1078 end of the config file, to configure these modules.</para>
1079 <programlisting>userdir.path = "public_html"
1080 cgi.assign = (".cgi" =&gt; "" )</programlisting>
1081 <para>With this done, <literal>lighttpd</literal> ran
1082 immediately for me. If I had configured
1083 <literal>lighttpd</literal> before Apache, I'd almost
1084 certainly have run into many of the same system-level
1085 configuration problems as I did with Apache. However, I
1086 found <literal>lighttpd</literal> to be noticeably easier to
1087 configure than Apache, even though I've used Apache for over
1088 a decade, and this was my first exposure to
1089 <literal>lighttpd</literal>.</para>
1091 </sect3>
1092 </sect2>
1093 <sect2>
1094 <title>Sharing multiple repositories with one CGI script</title>
1096 <para>The <filename role="special">hgweb.cgi</filename> script
1097 only lets you publish a single repository, which is an
1098 annoying restriction. If you want to publish more than one
1099 without wracking yourself with multiple copies of the same
1100 script, each with different names, a better choice is to use
1101 the <filename role="special">hgwebdir.cgi</filename>
1102 script.</para>
1104 <para>The procedure to configure <filename
1105 role="special">hgwebdir.cgi</filename> is only a little more
1106 involved than for <filename
1107 role="special">hgweb.cgi</filename>. First, you must obtain
1108 a copy of the script. If you don't have one handy, you can
1109 download a copy from the master Mercurial repository at <ulink
1110 url="http://www.selenic.com/repo/hg/raw-file/tip/hgwebdir.cgi">http://www.selenic.com/repo/hg/raw-file/tip/hgwebdir.cgi</ulink>.</para>
1112 <para>You'll need to copy this script into your <filename
1113 class="directory">public_html</filename> directory, and
1114 ensure that it's executable.</para>
1115 <programlisting>cp .../hgwebdir.cgi ~/public_html
1116 chmod 755 ~/public_html ~/public_html/hgwebdir.cgi</programlisting>
1117 <para>With basic configuration out of the way, try to visit
1118 <ulink url="http://myhostname/
1119 myuser/hgwebdir.cgi">http://myhostname/
1120 myuser/hgwebdir.cgi</ulink> in your browser. It should
1121 display an empty list of repositories. If you get a blank
1122 window or error message, try walking through the list of
1123 potential problems in section <xref
1124 linkend="sec.collab.wtf"/>.</para>
1126 <para>The <filename role="special">hgwebdir.cgi</filename>
1127 script relies on an external configuration file. By default,
1128 it searches for a file named <filename
1129 role="special">hgweb.config</filename> in the same directory
1130 as itself. You'll need to create this file, and make it
1131 world-readable. The format of the file is similar to a
1132 Windows <quote>ini</quote> file, as understood by Python's
1133 <literal>ConfigParser</literal>
1134 <citation>web:configparser</citation> module.</para>
1136 <para>The easiest way to configure <filename
1137 role="special">hgwebdir.cgi</filename> is with a section
1138 named <literal>collections</literal>. This will automatically
1139 publish <emphasis>every</emphasis> repository under the
1140 directories you name. The section should look like
1141 this:</para>
1142 <programlisting>[collections]
1143 /my/root = /my/root</programlisting>
1144 <para>Mercurial interprets this by looking at the directory name
1145 on the <emphasis>right</emphasis> hand side of the
1146 <quote><literal>=</literal></quote> sign; finding repositories
1147 in that directory hierarchy; and using the text on the
1148 <emphasis>left</emphasis> to strip off matching text from the
1149 names it will actually list in the web interface. The
1150 remaining component of a path after this stripping has
1151 occurred is called a <quote>virtual path</quote>.</para>
1153 <para>Given the example above, if we have a repository whose
1154 local path is <filename
1155 class="directory">/my/root/this/repo</filename>, the CGI
1156 script will strip the leading <filename
1157 class="directory">/my/root</filename> from the name, and
1158 publish the repository with a virtual path of <filename
1159 class="directory">this/repo</filename>. If the base URL for
1160 our CGI script is <ulink url="http://myhostname/
1161 myuser/hgwebdir.cgi">http://myhostname/
1162 myuser/hgwebdir.cgi</ulink>, the complete URL for that
1163 repository will be <ulink url="http://myhostname/
1164 myuser/hgwebdir.cgi/this/repo">http://myhostname/
1165 myuser/hgwebdir.cgi/this/repo</ulink>.</para>
1167 <para>If we replace <filename
1168 class="directory">/my/root</filename> on the left hand side
1169 of this example with <filename
1170 class="directory">/my</filename>, then <filename
1171 role="special">hgwebdir.cgi</filename> will only strip off
1172 <filename class="directory">/my</filename> from the repository
1173 name, and will give us a virtual path of <filename
1174 class="directory">root/this/repo</filename> instead of
1175 <filename class="directory">this/repo</filename>.</para>
1177 <para>The <filename role="special">hgwebdir.cgi</filename>
1178 script will recursively search each directory listed in the
1179 <literal>collections</literal> section of its configuration
1180 file, but it will <literal>not</literal> recurse into the
1181 repositories it finds.</para>
1183 <para>The <literal>collections</literal> mechanism makes it easy
1184 to publish many repositories in a <quote>fire and
1185 forget</quote> manner. You only need to set up the CGI
1186 script and configuration file one time. Afterwards, you can
1187 publish or unpublish a repository at any time by simply moving
1188 it into, or out of, the directory hierarchy in which you've
1189 configured <filename role="special">hgwebdir.cgi</filename> to
1190 look.</para>
1192 <sect3>
1193 <title>Explicitly specifying which repositories to
1194 publish</title>
1196 <para>In addition to the <literal>collections</literal>
1197 mechanism, the <filename
1198 role="special">hgwebdir.cgi</filename> script allows you
1199 to publish a specific list of repositories. To do so,
1200 create a <literal>paths</literal> section, with contents of
1201 the following form.</para>
1202 <programlisting>[paths]
1203 repo1 = /my/path/to/some/repo
1204 repo2 = /some/path/to/another</programlisting>
1205 <para>In this case, the virtual path (the component that will
1206 appear in a URL) is on the left hand side of each
1207 definition, while the path to the repository is on the
1208 right. Notice that there does not need to be any
1209 relationship between the virtual path you choose and the
1210 location of a repository in your filesystem.</para>
1212 <para>If you wish, you can use both the
1213 <literal>collections</literal> and <literal>paths</literal>
1214 mechanisms simultaneously in a single configuration
1215 file.</para>
1217 <note>
1218 <para> If multiple repositories have the same virtual path,
1219 <filename role="special">hgwebdir.cgi</filename> will not
1220 report an error. Instead, it will behave
1221 unpredictably.</para>
1222 </note>
1224 </sect3>
1225 </sect2>
1226 <sect2>
1227 <title>Downloading source archives</title>
1229 <para>Mercurial's web interface lets users download an archive
1230 of any revision. This archive will contain a snapshot of the
1231 working directory as of that revision, but it will not contain
1232 a copy of the repository data.</para>
1234 <para>By default, this feature is not enabled. To enable it,
1235 you'll need to add an <envar
1236 role="rc-item-web">allow_archive</envar> item to the
1237 <literal role="rc-web">web</literal> section of your <filename
1238 role="special">~/.hgrc</filename>.</para>
1240 </sect2>
1241 <sect2>
1242 <title>Web configuration options</title>
1244 <para>Mercurial's web interfaces (the <command role="hg-cmd">hg
1245 serve</command> command, and the <filename
1246 role="special">hgweb.cgi</filename> and <filename
1247 role="special">hgwebdir.cgi</filename> scripts) have a
1248 number of configuration options that you can set. These
1249 belong in a section named <literal
1250 role="rc-web">web</literal>.</para>
1251 <itemizedlist>
1252 <listitem><para><envar
1253 role="rc-item-web">allow_archive</envar>: Determines
1254 which (if any) archive download mechanisms Mercurial
1255 supports. If you enable this feature, users of the web
1256 interface will be able to download an archive of whatever
1257 revision of a repository they are viewing. To enable the
1258 archive feature, this item must take the form of a
1259 sequence of words drawn from the list below.</para>
1260 <itemizedlist>
1261 <listitem><para><literal>bz2</literal>: A
1262 <command>tar</command> archive, compressed using
1263 <literal>bzip2</literal> compression. This has the
1264 best compression ratio, but uses the most CPU time on
1265 the server.</para>
1266 </listitem>
1267 <listitem><para><literal>gz</literal>: A
1268 <command>tar</command> archive, compressed using
1269 <literal>gzip</literal> compression.</para>
1270 </listitem>
1271 <listitem><para><literal>zip</literal>: A
1272 <command>zip</command> archive, compressed using LZW
1273 compression. This format has the worst compression
1274 ratio, but is widely used in the Windows world.</para>
1275 </listitem>
1276 </itemizedlist>
1277 <para> If you provide an empty list, or don't have an
1278 <envar role="rc-item-web">allow_archive</envar> entry at
1279 all, this feature will be disabled. Here is an example of
1280 how to enable all three supported formats.</para>
1281 <programlisting>[web]
1282 allow_archive = bz2 gz zip</programlisting>
1283 </listitem>
1284 <listitem><para><envar role="rc-item-web">allowpull</envar>:
1285 Boolean. Determines whether the web interface allows
1286 remote users to <command role="hg-cmd">hg pull</command>
1287 and <command role="hg-cmd">hg clone</command> this
1288 repository over HTTP. If set to <literal>no</literal> or
1289 <literal>false</literal>, only the
1290 <quote>human-oriented</quote> portion of the web interface
1291 is available.</para>
1292 </listitem>
1293 <listitem><para><envar role="rc-item-web">contact</envar>:
1294 String. A free-form (but preferably brief) string
1295 identifying the person or group in charge of the
1296 repository. This often contains the name and email
1297 address of a person or mailing list. It often makes sense
1298 to place this entry in a repository's own <filename
1299 role="special">.hg/hgrc</filename> file, but it can make
1300 sense to use in a global <filename
1301 role="special">~/.hgrc</filename> if every repository
1302 has a single maintainer.</para>
1303 </listitem>
1304 <listitem><para><envar role="rc-item-web">maxchanges</envar>:
1305 Integer. The default maximum number of changesets to
1306 display in a single page of output.</para>
1307 </listitem>
1308 <listitem><para><envar role="rc-item-web">maxfiles</envar>:
1309 Integer. The default maximum number of modified files to
1310 display in a single page of output.</para>
1311 </listitem>
1312 <listitem><para><envar role="rc-item-web">stripes</envar>:
1313 Integer. If the web interface displays alternating
1314 <quote>stripes</quote> to make it easier to visually align
1315 rows when you are looking at a table, this number controls
1316 the number of rows in each stripe.</para>
1317 </listitem>
1318 <listitem><para><envar role="rc-item-web">style</envar>:
1319 Controls the template Mercurial uses to display the web
1320 interface. Mercurial ships with two web templates, named
1321 <literal>default</literal> and <literal>gitweb</literal>
1322 (the latter is much more visually attractive). You can
1323 also specify a custom template of your own; see chapter
1324 <xref linkend="chap.template"/> for details.
1325 Here, you can see how to enable the
1326 <literal>gitweb</literal> style.</para>
1327 <programlisting>[web]
1328 style = gitweb</programlisting>
1329 </listitem>
1330 <listitem><para><envar role="rc-item-web">templates</envar>:
1331 Path. The directory in which to search for template
1332 files. By default, Mercurial searches in the directory in
1333 which it was installed.</para>
1334 </listitem></itemizedlist>
1335 <para>If you are using <filename
1336 role="special">hgwebdir.cgi</filename>, you can place a few
1337 configuration items in a <literal role="rc-web">web</literal>
1338 section of the <filename
1339 role="special">hgweb.config</filename> file instead of a
1340 <filename role="special">~/.hgrc</filename> file, for
1341 convenience. These items are <envar
1342 role="rc-item-web">motd</envar> and <envar
1343 role="rc-item-web">style</envar>.</para>
1345 <sect3>
1346 <title>Options specific to an individual repository</title>
1348 <para>A few <literal role="rc-web">web</literal> configuration
1349 items ought to be placed in a repository's local <filename
1350 role="special">.hg/hgrc</filename>, rather than a user's
1351 or global <filename role="special">~/.hgrc</filename>.</para>
1352 <itemizedlist>
1353 <listitem><para><envar
1354 role="rc-item-web">description</envar>: String. A
1355 free-form (but preferably brief) string that describes
1356 the contents or purpose of the repository.</para>
1357 </listitem>
1358 <listitem><para><envar role="rc-item-web">name</envar>:
1359 String. The name to use for the repository in the web
1360 interface. This overrides the default name, which is
1361 the last component of the repository's path.</para>
1362 </listitem></itemizedlist>
1364 </sect3>
1365 <sect3>
1366 <title>Options specific to the <command role="hg-cmd">hg
1367 serve</command> command</title>
1369 <para>Some of the items in the <literal
1370 role="rc-web">web</literal> section of a <filename
1371 role="special">~/.hgrc</filename> file are only for use
1372 with the <command role="hg-cmd">hg serve</command>
1373 command.</para>
1374 <itemizedlist>
1375 <listitem><para><envar role="rc-item-web">accesslog</envar>:
1376 Path. The name of a file into which to write an access
1377 log. By default, the <command role="hg-cmd">hg
1378 serve</command> command writes this information to
1379 standard output, not to a file. Log entries are written
1380 in the standard <quote>combined</quote> file format used
1381 by almost all web servers.</para>
1382 </listitem>
1383 <listitem><para><envar role="rc-item-web">address</envar>:
1384 String. The local address on which the server should
1385 listen for incoming connections. By default, the server
1386 listens on all addresses.</para>
1387 </listitem>
1388 <listitem><para><envar role="rc-item-web">errorlog</envar>:
1389 Path. The name of a file into which to write an error
1390 log. By default, the <command role="hg-cmd">hg
1391 serve</command> command writes this information to
1392 standard error, not to a file.</para>
1393 </listitem>
1394 <listitem><para><envar role="rc-item-web">ipv6</envar>:
1395 Boolean. Whether to use the IPv6 protocol. By default,
1396 IPv6 is not used.</para>
1397 </listitem>
1398 <listitem><para><envar role="rc-item-web">port</envar>:
1399 Integer. The TCP port number on which the server should
1400 listen. The default port number used is 8000.</para>
1401 </listitem></itemizedlist>
1403 </sect3>
1404 <sect3>
1405 <title>Choosing the right <filename
1406 role="special">~/.hgrc</filename> file to add <literal
1407 role="rc-web">web</literal> items to</title>
1409 <para>It is important to remember that a web server like
1410 Apache or <literal>lighttpd</literal> will run under a user
1411 ID that is different to yours. CGI scripts run by your
1412 server, such as <filename
1413 role="special">hgweb.cgi</filename>, will usually also run
1414 under that user ID.</para>
1416 <para>If you add <literal role="rc-web">web</literal> items to
1417 your own personal <filename role="special">~/.hgrc</filename> file, CGI scripts won't read that
1418 <filename role="special">~/.hgrc</filename> file. Those
1419 settings will thus only affect the behaviour of the <command
1420 role="hg-cmd">hg serve</command> command when you run it.
1421 To cause CGI scripts to see your settings, either create a
1422 <filename role="special">~/.hgrc</filename> file in the
1423 home directory of the user ID that runs your web server, or
1424 add those settings to a system-wide <filename
1425 role="special">~/.hgrc</filename> file.</para>
1428 </sect3>
1429 </sect2>
1430 </sect1>
1431 </chapter>
1433 <!--
1434 local variables:
1435 sgml-parent-document: ("00book.xml" "book" "chapter")
1436 end:
1437 -->