hgbook: d0160b0b1a9e en/ch06-collab.xml

hgbook

view en/ch06-collab.xml @ 647:d0160b0b1a9e

Merge with http://hg.serpentine.com/mercurial/book

author	Dongsheng Song <dongsheng.song@gmail.com>
date	Wed Mar 18 20:32:37 2009 +0800 (2009-03-18)
parents	a13813534ccd 8366882f67f2
children	e0ac2341a861

line source

1

3 <chapter id="cha.collab">

4 <?dbhtml filename="collaborating-with-other-people.html"?>

5 <title>Collaborating with other people</title>

7 <para>As a completely decentralised tool, Mercurial doesn't impose

8 any policy on how people ought to work with each other. However,

9 if you're new to distributed revision control, it helps to have

10 some tools and examples in mind when you're thinking about

11 possible workflow models.</para>

13 <sect1>

14 <title>Mercurial's web interface</title>

16 <para>Mercurial has a powerful web interface that provides several

17 useful capabilities.</para>

19 <para>For interactive use, the web interface lets you browse a

20 single repository or a collection of repositories. You can view

21 the history of a repository, examine each change (comments and

22 diffs), and view the contents of each directory and file.</para>

24 <para>Also for human consumption, the web interface provides an

25 RSS feed of the changes in a repository. This lets you

26 <quote>subscribe</quote> to a repository using your favourite

27 feed reader, and be automatically notified of activity in that

28 repository as soon as it happens. I find this capability much

29 more convenient than the model of subscribing to a mailing list

30 to which notifications are sent, as it requires no additional

31 configuration on the part of whoever is serving the

32 repository.</para>

34 <para>The web interface also lets remote users clone a repository,

35 pull changes from it, and (when the server is configured to

36 permit it) push changes back to it. Mercurial's HTTP tunneling

37 protocol aggressively compresses data, so that it works

38 efficiently even over low-bandwidth network connections.</para>

40 <para>The easiest way to get started with the web interface is to

41 use your web browser to visit an existing repository, such as

42 the master Mercurial repository at <ulink

43 url="http://www.selenic.com/repo/hg?style=gitweb">http://www.selenic.com/repo/hg?style=gitweb</ulink>.</para>

45 <para>If you're interested in providing a web interface to your

46 own repositories, Mercurial provides two ways to do this. The

47 first is using the <command role="hg-cmd">hg serve</command>

48 command, which is best suited to short-term

49 <quote>lightweight</quote> serving. See section <xref

50 linkend="sec.collab.serve"/> below for details of how to use

51 this command. If you have a long-lived repository that you'd

52 like to make permanently available, Mercurial has built-in

53 support for the CGI (Common Gateway Interface) standard, which

54 all common web servers support. See section <xref

55 linkend="sec.collab.cgi"/> for details of CGI

56 configuration.</para>

58 </sect1>

59 <sect1>

60 <title>Collaboration models</title>

62 <para>With a suitably flexible tool, making decisions about

63 workflow is much more of a social engineering challenge than a

64 technical one. Mercurial imposes few limitations on how you can

65 structure the flow of work in a project, so it's up to you and

66 your group to set up and live with a model that matches your own

67 particular needs.</para>

69 <sect2>

70 <title>Factors to keep in mind</title>

72 <para>The most important aspect of any model that you must keep

73 in mind is how well it matches the needs and capabilities of

74 the people who will be using it. This might seem

75 self-evident; even so, you still can't afford to forget it for

76 a moment.</para>

78 <para>I once put together a workflow model that seemed to make

79 perfect sense to me, but that caused a considerable amount of

80 consternation and strife within my development team. In spite

81 of my attempts to explain why we needed a complex set of

82 branches, and how changes ought to flow between them, a few

83 team members revolted. Even though they were smart people,

84 they didn't want to pay attention to the constraints we were

85 operating under, or face the consequences of those constraints

86 in the details of the model that I was advocating.</para>

88 <para>Don't sweep foreseeable social or technical problems under

89 the rug. Whatever scheme you put into effect, you should plan

90 for mistakes and problem scenarios. Consider adding automated

91 machinery to prevent, or quickly recover from, trouble that

92 you can anticipate. As an example, if you intend to have a

93 branch with not-for-release changes in it, you'd do well to

94 think early about the possibility that someone might

95 accidentally merge those changes into a release branch. You

96 could avoid this particular problem by writing a hook that

97 prevents changes from being merged from an inappropriate

98 branch.</para>

100 </sect2>

101 <sect2>

102 <title>Informal anarchy</title>

103

104 <para>I wouldn't suggest an <quote>anything goes</quote>

105 approach as something sustainable, but it's a model that's

106 easy to grasp, and it works perfectly well in a few unusual

107 situations.</para>

108

109 <para>As one example, many projects have a loose-knit group of

110 collaborators who rarely physically meet each other. Some

111 groups like to overcome the isolation of working at a distance

112 by organising occasional <quote>sprints</quote>. In a sprint,

113 a number of people get together in a single location (a

114 company's conference room, a hotel meeting room, that kind of

115 place) and spend several days more or less locked in there,

116 hacking intensely on a handful of projects.</para>

117

118 <para>A sprint is the perfect place to use the <command

119 role="hg-cmd">hg serve</command> command, since <command

120 role="hg-cmd">hg serve</command> does not require any fancy

121 server infrastructure. You can get started with <command

122 role="hg-cmd">hg serve</command> in moments, by reading

123 section <xref linkend="sec.collab.serve"/> below. Then simply

124 tell

125 the person next to you that you're running a server, send the

126 URL to them in an instant message, and you immediately have a

127 quick-turnaround way to work together. They can type your URL

128 into their web browser and quickly review your changes; or

129 they can pull a bugfix from you and verify it; or they can

130 clone a branch containing a new feature and try it out.</para>

131

132 <para>The charm, and the problem, with doing things in an ad hoc

133 fashion like this is that only people who know about your

134 changes, and where they are, can see them. Such an informal

135 approach simply doesn't scale beyond a handful people, because

136 each individual needs to know about $n$ different repositories

137 to pull from.</para>

138

139 </sect2>

140 <sect2>

141 <title>A single central repository</title>

142

143 <para>For smaller projects migrating from a centralised revision

144 control tool, perhaps the easiest way to get started is to

145 have changes flow through a single shared central repository.

146 This is also the most common <quote>building block</quote> for

147 more ambitious workflow schemes.</para>

148

149 <para>Contributors start by cloning a copy of this repository.

150 They can pull changes from it whenever they need to, and some

151 (perhaps all) developers have permission to push a change back

152 when they're ready for other people to see it.</para>

153

154 <para>Under this model, it can still often make sense for people

155 to pull changes directly from each other, without going

156 through the central repository. Consider a case in which I

157 have a tentative bug fix, but I am worried that if I were to

158 publish it to the central repository, it might subsequently

159 break everyone else's trees as they pull it. To reduce the

160 potential for damage, I can ask you to clone my repository

161 into a temporary repository of your own and test it. This

162 lets us put off publishing the potentially unsafe change until

163 it has had a little testing.</para>

164

165 <para>In this kind of scenario, people usually use the

166 <command>ssh</command> protocol to securely push changes to

167 the central repository, as documented in section <xref

168 linkend="sec.collab.ssh"/>. It's also

169 usual to publish a read-only copy of the repository over HTTP

170 using CGI, as in section <xref linkend="sec.collab.cgi"/>.

171 Publishing over HTTP

172 satisfies the needs of people who don't have push access, and

173 those who want to use web browsers to browse the repository's

174 history.</para>

175

176 </sect2>

177 <sect2>

178 <title>Working with multiple branches</title>

179

180 <para>Projects of any significant size naturally tend to make

181 progress on several fronts simultaneously. In the case of

182 software, it's common for a project to go through periodic

183 official releases. A release might then go into

184 <quote>maintenance mode</quote> for a while after its first

185 publication; maintenance releases tend to contain only bug

186 fixes, not new features. In parallel with these maintenance

187 releases, one or more future releases may be under

188 development. People normally use the word

189 <quote>branch</quote> to refer to one of these many slightly

190 different directions in which development is

191 proceeding.</para>

192

193 <para>Mercurial is particularly well suited to managing a number

194 of simultaneous, but not identical, branches. Each

195 <quote>development direction</quote> can live in its own

196 central repository, and you can merge changes from one to

197 another as the need arises. Because repositories are

198 independent of each other, unstable changes in a development

199 branch will never affect a stable branch unless someone

200 explicitly merges those changes in.</para>

201

202 <para>Here's an example of how this can work in practice. Let's

203 say you have one <quote>main branch</quote> on a central

204 server.</para>

205

206 &interaction.branching.init;

207

208 <para>People clone it, make changes locally, test them, and push

209 them back.</para>

210

211 <para>Once the main branch reaches a release milestone, you can

212 use the <command role="hg-cmd">hg tag</command> command to

213 give a permanent name to the milestone revision.</para>

214

215 &interaction.branching.tag;

216

217 <para>Let's say some ongoing

218 development occurs on the main branch.</para>

219

220 &interaction.branching.main;

221

222 <para>Using the tag that was recorded at the milestone, people

223 who clone that repository at any time in the future can use

224 <command role="hg-cmd">hg update</command> to get a copy of

225 the working directory exactly as it was when that tagged

226 revision was committed.</para>

227

228 &interaction.branching.update;

229

230 <para>In addition, immediately after the main branch is tagged,

231 someone can then clone the main branch on the server to a new

232 <quote>stable</quote> branch, also on the server.</para>

233

234 &interaction.branching.clone;

235

236 <para>Someone who needs to make a change to the stable branch

237 can then clone <emphasis>that</emphasis> repository, make

238 their changes, commit, and push their changes back there.</para>

239

240 &interaction.branching.stable;

241

242 <para>Because Mercurial repositories are independent, and

243 Mercurial doesn't move changes around automatically, the

244 stable and main branches are <emphasis>isolated</emphasis>

245 from each other. The changes that you made on the main branch

246 don't <quote>leak</quote> to the stable branch, and vice

247 versa.</para>

248

249 <para>You'll often want all of your bugfixes on the stable

250 branch to show up on the main branch, too. Rather than

251 rewrite a bugfix on the main branch, you can simply pull and

252 merge changes from the stable to the main branch, and

253 Mercurial will bring those bugfixes in for you.</para>

254

255 &interaction.branching.merge;

256

257 <para>The main branch will still contain changes that are not on

258 the stable branch, but it will also contain all of the

259 bugfixes from the stable branch. The stable branch remains

260 unaffected by these changes.</para>

261

262 </sect2>

263 <sect2>

264 <title>Feature branches</title>

265

266 <para>For larger projects, an effective way to manage change is

267 to break up a team into smaller groups. Each group has a

268 shared branch of its own, cloned from a single

269 <quote>master</quote> branch used by the entire project.

270 People working on an individual branch are typically quite

271 isolated from developments on other branches.</para>

272

273 <informalfigure id="fig.collab.feature-branches">

274 <mediaobject>

275 <imageobject><imagedata fileref="images/feature-branches.png"/>

276 </imageobject>

277 <textobject><phrase>XXX add text</phrase></textobject>

278 <caption><para id="fig.collab.feature-branches.caption">Feature

279 branches</para></caption>

280 </mediaobject>

281 </informalfigure>

282

283 <para>When a particular feature is deemed to be in suitable

284 shape, someone on that feature team pulls and merges from the

285 master branch into the feature branch, then pushes back up to

286 the master branch.</para>

287

288 </sect2>

289 <sect2>

290 <title>The release train</title>

291

292 <para>Some projects are organised on a <quote>train</quote>

293 basis: a release is scheduled to happen every few months, and

294 whatever features are ready when the <quote>train</quote> is

295 ready to leave are allowed in.</para>

296

297 <para>This model resembles working with feature branches. The

298 difference is that when a feature branch misses a train,

299 someone on the feature team pulls and merges the changes that

300 went out on that train release into the feature branch, and

301 the team continues its work on top of that release so that

302 their feature can make the next release.</para>

303

304 </sect2>

305 <sect2>

306 <title>The Linux kernel model</title>

307

308 <para>The development of the Linux kernel has a shallow

309 hierarchical structure, surrounded by a cloud of apparent

310 chaos. Because most Linux developers use

311 <command>git</command>, a distributed revision control tool

312 with capabilities similar to Mercurial, it's useful to

313 describe the way work flows in that environment; if you like

314 the ideas, the approach translates well across tools.</para>

315

316 <para>At the center of the community sits Linus Torvalds, the

317 creator of Linux. He publishes a single source repository

318 that is considered the <quote>authoritative</quote> current

319 tree by the entire developer community. Anyone can clone

320 Linus's tree, but he is very choosy about whose trees he pulls

321 from.</para>

322

323 <para>Linus has a number of <quote>trusted lieutenants</quote>.

324 As a general rule, he pulls whatever changes they publish, in

325 most cases without even reviewing those changes. Some of

326 those lieutenants are generally agreed to be

327 <quote>maintainers</quote>, responsible for specific

328 subsystems within the kernel. If a random kernel hacker wants

329 to make a change to a subsystem that they want to end up in

330 Linus's tree, they must find out who the subsystem's

331 maintainer is, and ask that maintainer to take their change.

332 If the maintainer reviews their changes and agrees to take

333 them, they'll pass them along to Linus in due course.</para>

334

335 <para>Individual lieutenants have their own approaches to

336 reviewing, accepting, and publishing changes; and for deciding

337 when to feed them to Linus. In addition, there are several

338 well known branches that people use for different purposes.

339 For example, a few people maintain <quote>stable</quote>

340 repositories of older versions of the kernel, to which they

341 apply critical fixes as needed. Some maintainers publish

342 multiple trees: one for experimental changes; one for changes

343 that they are about to feed upstream; and so on. Others just

344 publish a single tree.</para>

345

346 <para>This model has two notable features. The first is that

347 it's <quote>pull only</quote>. You have to ask, convince, or

348 beg another developer to take a change from you, because there

349 are almost no trees to which more than one person can push,

350 and there's no way to push changes into a tree that someone

351 else controls.</para>

352

353 <para>The second is that it's based on reputation and acclaim.

354 If you're an unknown, Linus will probably ignore changes from

355 you without even responding. But a subsystem maintainer will

356 probably review them, and will likely take them if they pass

357 their criteria for suitability. The more <quote>good</quote>

358 changes you contribute to a maintainer, the more likely they

359 are to trust your judgment and accept your changes. If you're

360 well-known and maintain a long-lived branch for something

361 Linus hasn't yet accepted, people with similar interests may

362 pull your changes regularly to keep up with your work.</para>

363

364 <para>Reputation and acclaim don't necessarily cross subsystem

365 or <quote>people</quote> boundaries. If you're a respected

366 but specialised storage hacker, and you try to fix a

367 networking bug, that change will receive a level of scrutiny

368 from a network maintainer comparable to a change from a

369 complete stranger.</para>

370

371 <para>To people who come from more orderly project backgrounds,

372 the comparatively chaotic Linux kernel development process

373 often seems completely insane. It's subject to the whims of

374 individuals; people make sweeping changes whenever they deem

375 it appropriate; and the pace of development is astounding.

376 And yet Linux is a highly successful, well-regarded piece of

377 software.</para>

378

379 </sect2>

380 <sect2>

381 <title>Pull-only versus shared-push collaboration</title>

382

383 <para>A perpetual source of heat in the open source community is

384 whether a development model in which people only ever pull

385 changes from others is <quote>better than</quote> one in which

386 multiple people can push changes to a shared

387 repository.</para>

388

389 <para>Typically, the backers of the shared-push model use tools

390 that actively enforce this approach. If you're using a

391 centralised revision control tool such as Subversion, there's

392 no way to make a choice over which model you'll use: the tool

393 gives you shared-push, and if you want to do anything else,

394 you'll have to roll your own approach on top (such as applying

395 a patch by hand).</para>

396

397 <para>A good distributed revision control tool, such as

398 Mercurial, will support both models. You and your

399 collaborators can then structure how you work together based

400 on your own needs and preferences, not on what contortions

401 your tools force you into.</para>

402

403 </sect2>

404 <sect2>

405 <title>Where collaboration meets branch management</title>

406

407 <para>Once you and your team set up some shared repositories and

408 start propagating changes back and forth between local and

409 shared repos, you begin to face a related, but slightly

410 different challenge: that of managing the multiple directions

411 in which your team may be moving at once. Even though this

412 subject is intimately related to how your team collaborates,

413 it's dense enough to merit treatment of its own, in chapter

414 <xref linkend="chap.branch"/>.</para>

415

416 </sect2>

417 </sect1>

418 <sect1>

419 <title>The technical side of sharing</title>

420

421 <para>The remainder of this chapter is devoted to the question of

422 serving data to your collaborators.</para>

423

424 </sect1>

425 <sect1 id="sec.collab.serve">

426 <title>Informal sharing with <command role="hg-cmd">hg

427 serve</command></title>

428

429 <para>Mercurial's <command role="hg-cmd">hg serve</command>

430 command is wonderfully suited to small, tight-knit, and

431 fast-paced group environments. It also provides a great way to

432 get a feel for using Mercurial commands over a network.</para>

433

434 <para>Run <command role="hg-cmd">hg serve</command> inside a

435 repository, and in under a second it will bring up a specialised

436 HTTP server; this will accept connections from any client, and

437 serve up data for that repository until you terminate it.

438 Anyone who knows the URL of the server you just started, and can

439 talk to your computer over the network, can then use a web

440 browser or Mercurial to read data from that repository. A URL

441 for a <command role="hg-cmd">hg serve</command> instance running

442 on a laptop is likely to look something like

443 <literal>http://my-laptop.local:8000/</literal>.</para>

444

445 <para>The <command role="hg-cmd">hg serve</command> command is

446 <emphasis>not</emphasis> a general-purpose web server. It can do

447 only two things:</para>

448 <itemizedlist>

449 <listitem><para>Allow people to browse the history of the

450 repository it's serving, from their normal web

451 browsers.</para>

452 </listitem>

453 <listitem><para>Speak Mercurial's wire protocol, so that people

454 can <command role="hg-cmd">hg clone</command> or <command

455 role="hg-cmd">hg pull</command> changes from that

456 repository.</para>

457 </listitem></itemizedlist>

458 <para>In particular, <command role="hg-cmd">hg serve</command>

459 won't allow remote users to <emphasis>modify</emphasis> your

460 repository. It's intended for read-only use.</para>

461

462 <para>If you're getting started with Mercurial, there's nothing to

463 prevent you from using <command role="hg-cmd">hg serve</command>

464 to serve up a repository on your own computer, then use commands

465 like <command role="hg-cmd">hg clone</command>, <command

466 role="hg-cmd">hg incoming</command>, and so on to talk to that

467 server as if the repository was hosted remotely. This can help

468 you to quickly get acquainted with using commands on

469 network-hosted repositories.</para>

470

471 <sect2>

472 <title>A few things to keep in mind</title>

473

474 <para>Because it provides unauthenticated read access to all

475 clients, you should only use <command role="hg-cmd">hg

476 serve</command> in an environment where you either don't

477 care, or have complete control over, who can access your

478 network and pull data from your repository.</para>

479

480 <para>The <command role="hg-cmd">hg serve</command> command

481 knows nothing about any firewall software you might have

482 installed on your system or network. It cannot detect or

483 control your firewall software. If other people are unable to

484 talk to a running <command role="hg-cmd">hg serve</command>

485 instance, the second thing you should do

486 (<emphasis>after</emphasis> you make sure that they're using

487 the correct URL) is check your firewall configuration.</para>

488

489 <para>By default, <command role="hg-cmd">hg serve</command>

490 listens for incoming connections on port 8000. If another

491 process is already listening on the port you want to use, you

492 can specify a different port to listen on using the <option

493 role="hg-opt-serve">-p</option> option.</para>

494

495 <para>Normally, when <command role="hg-cmd">hg serve</command>

496 starts, it prints no output, which can be a bit unnerving. If

497 you'd like to confirm that it is indeed running correctly, and

498 find out what URL you should send to your collaborators, start

499 it with the <option role="hg-opt-global">-v</option>

500 option.</para>

501

502 </sect2>

503 </sect1>

504 <sect1 id="sec.collab.ssh">

505 <title>Using the Secure Shell (ssh) protocol</title>

506

507 <para>You can pull and push changes securely over a network

508 connection using the Secure Shell (<literal>ssh</literal>)

509 protocol. To use this successfully, you may have to do a little

510 bit of configuration on the client or server sides.</para>

511

512 <para>If you're not familiar with ssh, it's a network protocol

513 that lets you securely communicate with another computer. To

514 use it with Mercurial, you'll be setting up one or more user

515 accounts on a server so that remote users can log in and execute

516 commands.</para>

517

518 <para>(If you <emphasis>are</emphasis> familiar with ssh, you'll

519 probably find some of the material that follows to be elementary

520 in nature.)</para>

521

522 <sect2>

523 <title>How to read and write ssh URLs</title>

524

525 <para>An ssh URL tends to look like this:</para>

526 <programlisting>ssh://bos@hg.serpentine.com:22/hg/hgbook</programlisting>

527 <orderedlist>

528 <listitem><para>The <quote><literal>ssh://</literal></quote>

529 part tells Mercurial to use the ssh protocol.</para>

530 </listitem>

531 <listitem><para>The <quote><literal>bos@</literal></quote>

532 component indicates what username to log into the server

533 as. You can leave this out if the remote username is the

534 same as your local username.</para>

535 </listitem>

536 <listitem><para>The

537 <quote><literal>hg.serpentine.com</literal></quote> gives

538 the hostname of the server to log into.</para>

539 </listitem>

540 <listitem><para>The <quote>:22</quote> identifies the port

541 number to connect to the server on. The default port is

542 22, so you only need to specify a colon and port number if

543 you're <emphasis>not</emphasis> using port 22.</para>

544 </listitem>

545 <listitem><para>The remainder of the URL is the local path to

546 the repository on the server.</para>

547 </listitem></orderedlist>

548

549 <para>There's plenty of scope for confusion with the path

550 component of ssh URLs, as there is no standard way for tools

551 to interpret it. Some programs behave differently than others

552 when dealing with these paths. This isn't an ideal situation,

553 but it's unlikely to change. Please read the following

554 paragraphs carefully.</para>

555

556 <para>Mercurial treats the path to a repository on the server as

557 relative to the remote user's home directory. For example, if

558 user <literal>foo</literal> on the server has a home directory

559 of <filename class="directory">/home/foo</filename>, then an

560 ssh URL that contains a path component of <filename

561 class="directory">bar</filename> <emphasis>really</emphasis>

562 refers to the directory <filename

563 class="directory">/home/foo/bar</filename>.</para>

564

565 <para>If you want to specify a path relative to another user's

566 home directory, you can use a path that starts with a tilde

567 character followed by the user's name (let's call them

568 <literal>otheruser</literal>), like this.</para>

569 <programlisting>ssh://server/~otheruser/hg/repo</programlisting>

570

571 <para>And if you really want to specify an

572 <emphasis>absolute</emphasis> path on the server, begin the

573 path component with two slashes, as in this example.</para>

574 <programlisting>ssh://server//absolute/path</programlisting>

575

576 </sect2>

577 <sect2>

578 <title>Finding an ssh client for your system</title>

579

580 <para>Almost every Unix-like system comes with OpenSSH

581 preinstalled. If you're using such a system, run

582 <literal>which ssh</literal> to find out if the

583 <command>ssh</command> command is installed (it's usually in

584 <filename class="directory">/usr/bin</filename>). In the

585 unlikely event that it isn't present, take a look at your

586 system documentation to figure out how to install it.</para>

587

588 <para>On Windows, you'll first need to download a suitable ssh

589 client. There are two alternatives.</para>

590 <itemizedlist>

591 <listitem><para>Simon Tatham's excellent PuTTY package

592 <citation>web:putty</citation> provides a complete suite

593 of ssh client commands.</para>

594 </listitem>

595 <listitem><para>If you have a high tolerance for pain, you can

596 use the Cygwin port of OpenSSH.</para>

597 </listitem></itemizedlist>

598 <para>In either case, you'll need to edit your <filename

599 role="special">hg.ini</filename> file to

600 tell Mercurial where to find the actual client command. For

601 example, if you're using PuTTY, you'll need to use the

602 <command>plink</command> command as a command-line ssh

603 client.</para>

604 <programlisting>[ui]

605 ssh = C:/path/to/plink.exe -ssh -i "C:/path/to/my/private/key"</programlisting>

606

607 <note>

608 <para> The path to <command>plink</command> shouldn't contain

609 any whitespace characters, or Mercurial may not be able to

610 run it correctly (so putting it in <filename

611 class="directory">C:\Program Files</filename> is probably

612 not a good idea).</para>

613 </note>

614

615 </sect2>

616 <sect2>

617 <title>Generating a key pair</title>

618

619 <para>To avoid the need to repetitively type a password every

620 time you need to use your ssh client, I recommend generating a

621 key pair. On a Unix-like system, the

622 <command>ssh-keygen</command> command will do the trick. On

623 Windows, if you're using PuTTY, the

624 <command>puttygen</command> command is what you'll

625 need.</para>

626

627 <para>When you generate a key pair, it's usually

628 <emphasis>highly</emphasis> advisable to protect it with a

629 passphrase. (The only time that you might not want to do this

630 is when you're using the ssh protocol for automated tasks on a

631 secure network.)</para>

632

633 <para>Simply generating a key pair isn't enough, however.

634 You'll need to add the public key to the set of authorised

635 keys for whatever user you're logging in remotely as. For

636 servers using OpenSSH (the vast majority), this will mean

637 adding the public key to a list in a file called <filename

638 role="special">authorized_keys</filename> in their <filename

639 role="special" class="directory">.ssh</filename>

640 directory.</para>

641

642 <para>On a Unix-like system, your public key will have a

643 <filename>.pub</filename> extension. If you're using

644 <command>puttygen</command> on Windows, you can save the

645 public key to a file of your choosing, or paste it from the

646 window it's displayed in straight into the <filename

647 role="special">authorized_keys</filename> file.</para>

648

649 </sect2>

650 <sect2>

651 <title>Using an authentication agent</title>

652

653 <para>An authentication agent is a daemon that stores

654 passphrases in memory (so it will forget passphrases if you

655 log out and log back in again). An ssh client will notice if

656 it's running, and query it for a passphrase. If there's no

657 authentication agent running, or the agent doesn't store the

658 necessary passphrase, you'll have to type your passphrase

659 every time Mercurial tries to communicate with a server on

660 your behalf (e.g. whenever you pull or push changes).</para>

661

662 <para>The downside of storing passphrases in an agent is that

663 it's possible for a well-prepared attacker to recover the

664 plain text of your passphrases, in some cases even if your

665 system has been power-cycled. You should make your own

666 judgment as to whether this is an acceptable risk. It

667 certainly saves a lot of repeated typing.</para>

668

669 <para>On Unix-like systems, the agent is called

670 <command>ssh-agent</command>, and it's often run automatically

671 for you when you log in. You'll need to use the

672 <command>ssh-add</command> command to add passphrases to the

673 agent's store. On Windows, if you're using PuTTY, the

674 <command>pageant</command> command acts as the agent. It adds

675 an icon to your system tray that will let you manage stored

676 passphrases.</para>

677

678 </sect2>

679 <sect2>

680 <title>Configuring the server side properly</title>

681

682 <para>Because ssh can be fiddly to set up if you're new to it,

683 there's a variety of things that can go wrong. Add Mercurial

684 on top, and there's plenty more scope for head-scratching.

685 Most of these potential problems occur on the server side, not

686 the client side. The good news is that once you've gotten a

687 configuration working, it will usually continue to work

688 indefinitely.</para>

689

690 <para>Before you try using Mercurial to talk to an ssh server,

691 it's best to make sure that you can use the normal

692 <command>ssh</command> or <command>putty</command> command to

693 talk to the server first. If you run into problems with using

694 these commands directly, Mercurial surely won't work. Worse,

695 it will obscure the underlying problem. Any time you want to

696 debug ssh-related Mercurial problems, you should drop back to

697 making sure that plain ssh client commands work first,

698 <emphasis>before</emphasis> you worry about whether there's a

699 problem with Mercurial.</para>

700

701 <para>The first thing to be sure of on the server side is that

702 you can actually log in from another machine at all. If you

703 can't use <command>ssh</command> or <command>putty</command>

704 to log in, the error message you get may give you a few hints

705 as to what's wrong. The most common problems are as

706 follows.</para>

707 <itemizedlist>

708 <listitem><para>If you get a <quote>connection refused</quote>

709 error, either there isn't an SSH daemon running on the

710 server at all, or it's inaccessible due to firewall

711 configuration.</para>

712 </listitem>

713 <listitem><para>If you get a <quote>no route to host</quote>

714 error, you either have an incorrect address for the server

715 or a seriously locked down firewall that won't admit its

716 existence at all.</para>

717 </listitem>

718 <listitem><para>If you get a <quote>permission denied</quote>

719 error, you may have mistyped the username on the server,

720 or you could have mistyped your key's passphrase or the

721 remote user's password.</para>

722 </listitem></itemizedlist>

723 <para>In summary, if you're having trouble talking to the

724 server's ssh daemon, first make sure that one is running at

725 all. On many systems it will be installed, but disabled, by

726 default. Once you're done with this step, you should then

727 check that the server's firewall is configured to allow

728 incoming connections on the port the ssh daemon is listening

729 on (usually 22). Don't worry about more exotic possibilities

730 for misconfiguration until you've checked these two

731 first.</para>

732

733 <para>If you're using an authentication agent on the client side

734 to store passphrases for your keys, you ought to be able to

735 log into the server without being prompted for a passphrase or

736 a password. If you're prompted for a passphrase, there are a

737 few possible culprits.</para>

738 <itemizedlist>

739 <listitem><para>You might have forgotten to use

740 <command>ssh-add</command> or <command>pageant</command>

741 to store the passphrase.</para>

742 </listitem>

743 <listitem><para>You might have stored the passphrase for the

744 wrong key.</para>

745 </listitem></itemizedlist>

746 <para>If you're being prompted for the remote user's password,

747 there are another few possible problems to check.</para>

748 <itemizedlist>

749 <listitem><para>Either the user's home directory or their

750 <filename role="special" class="directory">.ssh</filename>

751 directory might have excessively liberal permissions. As

752 a result, the ssh daemon will not trust or read their

753 <filename role="special">authorized_keys</filename> file.

754 For example, a group-writable home or <filename

755 role="special" class="directory">.ssh</filename>

756 directory will often cause this symptom.</para>

757 </listitem>

758 <listitem><para>The user's <filename

759 role="special">authorized_keys</filename> file may have

760 a problem. If anyone other than the user owns or can write

761 to that file, the ssh daemon will not trust or read

762 it.</para>

763 </listitem></itemizedlist>

764

765 <para>In the ideal world, you should be able to run the

766 following command successfully, and it should print exactly

767 one line of output, the current date and time.</para>

768 <programlisting>ssh myserver date</programlisting>

769

770 <para>If, on your server, you have login scripts that print

771 banners or other junk even when running non-interactive

772 commands like this, you should fix them before you continue,

773 so that they only print output if they're run interactively.

774 Otherwise these banners will at least clutter up Mercurial's

775 output. Worse, they could potentially cause problems with

776 running Mercurial commands remotely. Mercurial makes tries to

777 detect and ignore banners in non-interactive

778 <command>ssh</command> sessions, but it is not foolproof. (If

779 you're editing your login scripts on your server, the usual

780 way to see if a login script is running in an interactive

781 shell is to check the return code from the command

782 <literal>tty -s</literal>.)</para>

783

784 <para>Once you've verified that plain old ssh is working with

785 your server, the next step is to ensure that Mercurial runs on

786 the server. The following command should run

787 successfully:</para>

788

789 <programlisting>ssh myserver hg version</programlisting>

790

791 <para>If you see an error message instead of normal <command

792 role="hg-cmd">hg version</command> output, this is usually

793 because you haven't installed Mercurial to <filename

794 class="directory">/usr/bin</filename>. Don't worry if this

795 is the case; you don't need to do that. But you should check

796 for a few possible problems.</para>

797 <itemizedlist>

798 <listitem><para>Is Mercurial really installed on the server at

799 all? I know this sounds trivial, but it's worth

800 checking!</para>

801 </listitem>

802 <listitem><para>Maybe your shell's search path (usually set

803 via the <envar>PATH</envar> environment variable) is

804 simply misconfigured.</para>

805 </listitem>

806 <listitem><para>Perhaps your <envar>PATH</envar> environment

807 variable is only being set to point to the location of the

808 <command>hg</command> executable if the login session is

809 interactive. This can happen if you're setting the path

810 in the wrong shell login script. See your shell's

811 documentation for details.</para>

812 </listitem>

813 <listitem><para>The <envar>PYTHONPATH</envar> environment

814 variable may need to contain the path to the Mercurial

815 Python modules. It might not be set at all; it could be

816 incorrect; or it may be set only if the login is

817 interactive.</para>

818 </listitem></itemizedlist>

819

820 <para>If you can run <command role="hg-cmd">hg version</command>

821 over an ssh connection, well done! You've got the server and

822 client sorted out. You should now be able to use Mercurial to

823 access repositories hosted by that username on that server.

824 If you run into problems with Mercurial and ssh at this point,

825 try using the <option role="hg-opt-global">--debug</option>

826 option to get a clearer picture of what's going on.</para>

827

828 </sect2>

829 <sect2>

830 <title>Using compression with ssh</title>

831

832 <para>Mercurial does not compress data when it uses the ssh

833 protocol, because the ssh protocol can transparently compress

834 data. However, the default behaviour of ssh clients is

835 <emphasis>not</emphasis> to request compression.</para>

836

837 <para>Over any network other than a fast LAN (even a wireless

838 network), using compression is likely to significantly speed

839 up Mercurial's network operations. For example, over a WAN,

840 someone measured compression as reducing the amount of time

841 required to clone a particularly large repository from 51

842 minutes to 17 minutes.</para>

843

844 <para>Both <command>ssh</command> and <command>plink</command>

845 accept a <option role="cmd-opt-ssh">-C</option> option which

846 turns on compression. You can easily edit your <filename

847 role="special">~/.hgrc</filename> to enable compression for

848 all of Mercurial's uses of the ssh protocol.</para>

849 <programlisting>[ui]

850 ssh = ssh -C</programlisting>

851

852 <para>If you use <command>ssh</command>, you can configure it to

853 always use compression when talking to your server. To do

854 this, edit your <filename

855 role="special">.ssh/config</filename> file (which may not

856 yet exist), as follows.</para>

857 <programlisting>Host hg

858 Compression yes

859 HostName hg.example.com</programlisting>

860 <para>This defines an alias, <literal>hg</literal>. When you

861 use it on the <command>ssh</command> command line or in a

862 Mercurial <literal>ssh</literal>-protocol URL, it will cause

863 <command>ssh</command> to connect to

864 <literal>hg.example.com</literal> and use compression. This

865 gives you both a shorter name to type and compression, each of

866 which is a good thing in its own right.</para>

867

868 </sect2>

869 </sect1>

870 <sect1 id="sec.collab.cgi">

871 <title>Serving over HTTP using CGI</title>

872

873 <para>Depending on how ambitious you are, configuring Mercurial's

874 CGI interface can take anything from a few moments to several

875 hours.</para>

876

877 <para>We'll begin with the simplest of examples, and work our way

878 towards a more complex configuration. Even for the most basic

879 case, you're almost certainly going to need to read and modify

880 your web server's configuration.</para>

881

882 <note>

883 <para> Configuring a web server is a complex, fiddly, and

884 highly system-dependent activity. I can't possibly give you

885 instructions that will cover anything like all of the cases

886 you will encounter. Please use your discretion and judgment in

887 following the sections below. Be prepared to make plenty of

888 mistakes, and to spend a lot of time reading your server's

889 error logs.</para>

890 </note>

891

892 <sect2>

893 <title>Web server configuration checklist</title>

894

895 <para>Before you continue, do take a few moments to check a few

896 aspects of your system's setup.</para>

897

898 <orderedlist>

899 <listitem><para>Do you have a web server installed at all?

900 Mac OS X ships with Apache, but many other systems may not

901 have a web server installed.</para>

902 </listitem>

903 <listitem><para>If you have a web server installed, is it

904 actually running? On most systems, even if one is

905 present, it will be disabled by default.</para>

906 </listitem>

907 <listitem><para>Is your server configured to allow you to run

908 CGI programs in the directory where you plan to do so?

909 Most servers default to explicitly disabling the ability

910 to run CGI programs.</para>

911 </listitem></orderedlist>

912

913 <para>If you don't have a web server installed, and don't have

914 substantial experience configuring Apache, you should consider

915 using the <literal>lighttpd</literal> web server instead of

916 Apache. Apache has a well-deserved reputation for baroque and

917 confusing configuration. While <literal>lighttpd</literal> is

918 less capable in some ways than Apache, most of these

919 capabilities are not relevant to serving Mercurial

920 repositories. And <literal>lighttpd</literal> is undeniably

921 <emphasis>much</emphasis> easier to get started with than

922 Apache.</para>

923

924 </sect2>

925 <sect2>

926 <title>Basic CGI configuration</title>

927

928 <para>On Unix-like systems, it's common for users to have a

929 subdirectory named something like <filename

930 class="directory">public_html</filename> in their home

931 directory, from which they can serve up web pages. A file

932 named <filename>foo</filename> in this directory will be

933 accessible at a URL of the form

934 <literal>http://www.example.com/username/foo</literal>.</para>

935

936 <para>To get started, find the <filename

937 role="special">hgweb.cgi</filename> script that should be

938 present in your Mercurial installation. If you can't quickly

939 find a local copy on your system, simply download one from the

940 master Mercurial repository at <ulink

941 url="http://www.selenic.com/repo/hg/raw-file/tip/hgweb.cgi">http://www.selenic.com/repo/hg/raw-file/tip/hgweb.cgi</ulink>.</para>

942

943 <para>You'll need to copy this script into your <filename

944 class="directory">public_html</filename> directory, and

945 ensure that it's executable.</para>

946 <programlisting>cp .../hgweb.cgi ~/public_html

947 chmod 755 ~/public_html/hgweb.cgi</programlisting>

948 <para>The <literal>755</literal> argument to

949 <command>chmod</command> is a little more general than just

950 making the script executable: it ensures that the script is

951 executable by anyone, and that <quote>group</quote> and

952 <quote>other</quote> write permissions are

953 <emphasis>not</emphasis> set. If you were to leave those

954 write permissions enabled, Apache's <literal>suexec</literal>

955 subsystem would likely refuse to execute the script. In fact,

956 <literal>suexec</literal> also insists that the

957 <emphasis>directory</emphasis> in which the script resides

958 must not be writable by others.</para>

959 <programlisting>chmod 755 ~/public_html</programlisting>

960

961 <sect3 id="sec.collab.wtf">

962 <title>What could <emphasis>possibly</emphasis> go

963 wrong?</title>

964

965 <para>Once you've copied the CGI script into place, go into a

966 web browser, and try to open the URL <ulink

967 url="http://myhostname/

968 myuser/hgweb.cgi">http://myhostname/

969 myuser/hgweb.cgi</ulink>, <emphasis>but</emphasis> brace

970 yourself for instant failure. There's a high probability

971 that trying to visit this URL will fail, and there are many

972 possible reasons for this. In fact, you're likely to

973 stumble over almost every one of the possible errors below,

974 so please read carefully. The following are all of the

975 problems I ran into on a system running Fedora 7, with a

976 fresh installation of Apache, and a user account that I

977 created specially to perform this exercise.</para>

978

979 <para>Your web server may have per-user directories disabled.

980 If you're using Apache, search your config file for a

981 <literal>UserDir</literal> directive. If there's none

982 present, per-user directories will be disabled. If one

983 exists, but its value is <literal>disabled</literal>, then

984 per-user directories will be disabled. Otherwise, the

985 string after <literal>UserDir</literal> gives the name of

986 the subdirectory that Apache will look in under your home

987 directory, for example <filename

988 class="directory">public_html</filename>.</para>

989

990 <para>Your file access permissions may be too restrictive.

991 The web server must be able to traverse your home directory

992 and directories under your <filename

993 class="directory">public_html</filename> directory, and

994 read files under the latter too. Here's a quick recipe to

995 help you to make your permissions more appropriate.</para>

996 <programlisting>chmod 755 ~

997 find ~/public_html -type d -print0 | xargs -0r chmod 755

998 find ~/public_html -type f -print0 | xargs -0r chmod 644</programlisting>

999

1000 <para>The other possibility with permissions is that you might

1001 get a completely empty window when you try to load the

1002 script. In this case, it's likely that your access

1003 permissions are <emphasis>too permissive</emphasis>. Apache's

1004 <literal>suexec</literal> subsystem won't execute a script

1005 that's group- or world-writable, for example.</para>

1006

1007 <para>Your web server may be configured to disallow execution

1008 of CGI programs in your per-user web directory. Here's

1009 Apache's default per-user configuration from my Fedora

1010 system.</para>

1011

1012 &ch06-apache-config.lst;

1013

1014 <para>If you find a similar-looking

1015 <literal>Directory</literal> group in your Apache

1016 configuration, the directive to look at inside it is

1017 <literal>Options</literal>. Add <literal>ExecCGI</literal>

1018 to the end of this list if it's missing, and restart the web

1019 server.</para>

1020

1021 <para>If you find that Apache serves you the text of the CGI

1022 script instead of executing it, you may need to either

1023 uncomment (if already present) or add a directive like

1024 this.</para>

1025 <programlisting>AddHandler cgi-script .cgi</programlisting>

1026

1027 <para>The next possibility is that you might be served with a

1028 colourful Python backtrace claiming that it can't import a

1029 <literal>mercurial</literal>-related module. This is

1030 actually progress! The server is now capable of executing

1031 your CGI script. This error is only likely to occur if

1032 you're running a private installation of Mercurial, instead

1033 of a system-wide version. Remember that the web server runs

1034 the CGI program without any of the environment variables

1035 that you take for granted in an interactive session. If

1036 this error happens to you, edit your copy of <filename

1037 role="special">hgweb.cgi</filename> and follow the

1038 directions inside it to correctly set your

1039 <envar>PYTHONPATH</envar> environment variable.</para>

1040

1041 <para>Finally, you are <emphasis>certain</emphasis> to by

1042 served with another colourful Python backtrace: this one

1043 will complain that it can't find <filename

1044 class="directory">/path/to/repository</filename>. Edit

1045 your <filename role="special">hgweb.cgi</filename> script

1046 and replace the <filename

1047 class="directory">/path/to/repository</filename> string

1048 with the complete path to the repository you want to serve

1049 up.</para>

1050

1051 <para>At this point, when you try to reload the page, you

1052 should be presented with a nice HTML view of your

1053 repository's history. Whew!</para>

1054

1055 </sect3>

1056 <sect3>

1057 <title>Configuring lighttpd</title>

1058

1059 <para>To be exhaustive in my experiments, I tried configuring

1060 the increasingly popular <literal>lighttpd</literal> web

1061 server to serve the same repository as I described with

1062 Apache above. I had already overcome all of the problems I

1063 outlined with Apache, many of which are not server-specific.

1064 As a result, I was fairly sure that my file and directory

1065 permissions were good, and that my <filename

1066 role="special">hgweb.cgi</filename> script was properly

1067 edited.</para>

1068

1069 <para>Once I had Apache running, getting

1070 <literal>lighttpd</literal> to serve the repository was a

1071 snap (in other words, even if you're trying to use

1072 <literal>lighttpd</literal>, you should read the Apache

1073 section). I first had to edit the

1074 <literal>mod_access</literal> section of its config file to

1075 enable <literal>mod_cgi</literal> and

1076 <literal>mod_userdir</literal>, both of which were disabled

1077 by default on my system. I then added a few lines to the

1078 end of the config file, to configure these modules.</para>

1079 <programlisting>userdir.path = "public_html"

1080 cgi.assign = (".cgi" => "" )</programlisting>

1081 <para>With this done, <literal>lighttpd</literal> ran

1082 immediately for me. If I had configured

1083 <literal>lighttpd</literal> before Apache, I'd almost

1084 certainly have run into many of the same system-level

1085 configuration problems as I did with Apache. However, I

1086 found <literal>lighttpd</literal> to be noticeably easier to

1087 configure than Apache, even though I've used Apache for over

1088 a decade, and this was my first exposure to

1089 <literal>lighttpd</literal>.</para>

1090

1091 </sect3>

1092 </sect2>

1093 <sect2>

1094 <title>Sharing multiple repositories with one CGI script</title>

1095

1096 <para>The <filename role="special">hgweb.cgi</filename> script

1097 only lets you publish a single repository, which is an

1098 annoying restriction. If you want to publish more than one

1099 without wracking yourself with multiple copies of the same

1100 script, each with different names, a better choice is to use

1101 the <filename role="special">hgwebdir.cgi</filename>

1102 script.</para>

1103

1104 <para>The procedure to configure <filename

1105 role="special">hgwebdir.cgi</filename> is only a little more

1106 involved than for <filename

1107 role="special">hgweb.cgi</filename>. First, you must obtain

1108 a copy of the script. If you don't have one handy, you can

1109 download a copy from the master Mercurial repository at <ulink

1110 url="http://www.selenic.com/repo/hg/raw-file/tip/hgwebdir.cgi">http://www.selenic.com/repo/hg/raw-file/tip/hgwebdir.cgi</ulink>.</para>

1111

1112 <para>You'll need to copy this script into your <filename

1113 class="directory">public_html</filename> directory, and

1114 ensure that it's executable.</para>

1115 <programlisting>cp .../hgwebdir.cgi ~/public_html

1116 chmod 755 ~/public_html ~/public_html/hgwebdir.cgi</programlisting>

1117 <para>With basic configuration out of the way, try to visit

1118 <ulink url="http://myhostname/

1119 myuser/hgwebdir.cgi">http://myhostname/

1120 myuser/hgwebdir.cgi</ulink> in your browser. It should

1121 display an empty list of repositories. If you get a blank

1122 window or error message, try walking through the list of

1123 potential problems in section <xref

1124 linkend="sec.collab.wtf"/>.</para>

1125

1126 <para>The <filename role="special">hgwebdir.cgi</filename>

1127 script relies on an external configuration file. By default,

1128 it searches for a file named <filename

1129 role="special">hgweb.config</filename> in the same directory

1130 as itself. You'll need to create this file, and make it

1131 world-readable. The format of the file is similar to a

1132 Windows <quote>ini</quote> file, as understood by Python's

1133 <literal>ConfigParser</literal>

1134 <citation>web:configparser</citation> module.</para>

1135

1136 <para>The easiest way to configure <filename

1137 role="special">hgwebdir.cgi</filename> is with a section

1138 named <literal>collections</literal>. This will automatically

1139 publish <emphasis>every</emphasis> repository under the

1140 directories you name. The section should look like

1141 this:</para>

1142 <programlisting>[collections]

1143 /my/root = /my/root</programlisting>

1144 <para>Mercurial interprets this by looking at the directory name

1145 on the <emphasis>right</emphasis> hand side of the

1146 <quote><literal>=</literal></quote> sign; finding repositories

1147 in that directory hierarchy; and using the text on the

1148 <emphasis>left</emphasis> to strip off matching text from the

1149 names it will actually list in the web interface. The

1150 remaining component of a path after this stripping has

1151 occurred is called a <quote>virtual path</quote>.</para>

1152

1153 <para>Given the example above, if we have a repository whose

1154 local path is <filename

1155 class="directory">/my/root/this/repo</filename>, the CGI

1156 script will strip the leading <filename

1157 class="directory">/my/root</filename> from the name, and

1158 publish the repository with a virtual path of <filename

1159 class="directory">this/repo</filename>. If the base URL for

1160 our CGI script is <ulink url="http://myhostname/

1161 myuser/hgwebdir.cgi">http://myhostname/

1162 myuser/hgwebdir.cgi</ulink>, the complete URL for that

1163 repository will be <ulink url="http://myhostname/

1164 myuser/hgwebdir.cgi/this/repo">http://myhostname/

1165 myuser/hgwebdir.cgi/this/repo</ulink>.</para>

1166

1167 <para>If we replace <filename

1168 class="directory">/my/root</filename> on the left hand side

1169 of this example with <filename

1170 class="directory">/my</filename>, then <filename

1171 role="special">hgwebdir.cgi</filename> will only strip off

1172 <filename class="directory">/my</filename> from the repository

1173 name, and will give us a virtual path of <filename

1174 class="directory">root/this/repo</filename> instead of

1175 <filename class="directory">this/repo</filename>.</para>

1176

1177 <para>The <filename role="special">hgwebdir.cgi</filename>

1178 script will recursively search each directory listed in the

1179 <literal>collections</literal> section of its configuration

1180 file, but it will <literal>not</literal> recurse into the

1181 repositories it finds.</para>

1182

1183 <para>The <literal>collections</literal> mechanism makes it easy

1184 to publish many repositories in a <quote>fire and

1185 forget</quote> manner. You only need to set up the CGI

1186 script and configuration file one time. Afterwards, you can

1187 publish or unpublish a repository at any time by simply moving

1188 it into, or out of, the directory hierarchy in which you've

1189 configured <filename role="special">hgwebdir.cgi</filename> to

1190 look.</para>

1191

1192 <sect3>

1193 <title>Explicitly specifying which repositories to

1194 publish</title>

1195

1196 <para>In addition to the <literal>collections</literal>

1197 mechanism, the <filename

1198 role="special">hgwebdir.cgi</filename> script allows you

1199 to publish a specific list of repositories. To do so,

1200 create a <literal>paths</literal> section, with contents of

1201 the following form.</para>

1202 <programlisting>[paths]

1203 repo1 = /my/path/to/some/repo

1204 repo2 = /some/path/to/another</programlisting>

1205 <para>In this case, the virtual path (the component that will

1206 appear in a URL) is on the left hand side of each

1207 definition, while the path to the repository is on the

1208 right. Notice that there does not need to be any

1209 relationship between the virtual path you choose and the

1210 location of a repository in your filesystem.</para>

1211

1212 <para>If you wish, you can use both the

1213 <literal>collections</literal> and <literal>paths</literal>

1214 mechanisms simultaneously in a single configuration

1215 file.</para>

1216

1217 <note>

1218 <para> If multiple repositories have the same virtual path,

1219 <filename role="special">hgwebdir.cgi</filename> will not

1220 report an error. Instead, it will behave

1221 unpredictably.</para>

1222 </note>

1223

1224 </sect3>

1225 </sect2>

1226 <sect2>

1227 <title>Downloading source archives</title>

1228

1229 <para>Mercurial's web interface lets users download an archive

1230 of any revision. This archive will contain a snapshot of the

1231 working directory as of that revision, but it will not contain

1232 a copy of the repository data.</para>

1233

1234 <para>By default, this feature is not enabled. To enable it,

1235 you'll need to add an <envar

1236 role="rc-item-web">allow_archive</envar> item to the

1237 <literal role="rc-web">web</literal> section of your <filename

1238 role="special">~/.hgrc</filename>.</para>

1239

1240 </sect2>

1241 <sect2>

1242 <title>Web configuration options</title>

1243

1244 <para>Mercurial's web interfaces (the <command role="hg-cmd">hg

1245 serve</command> command, and the <filename

1246 role="special">hgweb.cgi</filename> and <filename

1247 role="special">hgwebdir.cgi</filename> scripts) have a

1248 number of configuration options that you can set. These

1249 belong in a section named <literal

1250 role="rc-web">web</literal>.</para>

1251 <itemizedlist>

1252 <listitem><para><envar

1253 role="rc-item-web">allow_archive</envar>: Determines

1254 which (if any) archive download mechanisms Mercurial

1255 supports. If you enable this feature, users of the web

1256 interface will be able to download an archive of whatever

1257 revision of a repository they are viewing. To enable the

1258 archive feature, this item must take the form of a

1259 sequence of words drawn from the list below.</para>

1260 <itemizedlist>

1261 <listitem><para><literal>bz2</literal>: A

1262 <command>tar</command> archive, compressed using

1263 <literal>bzip2</literal> compression. This has the

1264 best compression ratio, but uses the most CPU time on

1265 the server.</para>

1266 </listitem>

1267 <listitem><para><literal>gz</literal>: A

1268 <command>tar</command> archive, compressed using

1269 <literal>gzip</literal> compression.</para>

1270 </listitem>

1271 <listitem><para><literal>zip</literal>: A

1272 <command>zip</command> archive, compressed using LZW

1273 compression. This format has the worst compression

1274 ratio, but is widely used in the Windows world.</para>

1275 </listitem>

1276 </itemizedlist>

1277 <para> If you provide an empty list, or don't have an

1278 <envar role="rc-item-web">allow_archive</envar> entry at

1279 all, this feature will be disabled. Here is an example of

1280 how to enable all three supported formats.</para>

1281 <programlisting>[web]

1282 allow_archive = bz2 gz zip</programlisting>

1283 </listitem>

1284 <listitem><para><envar role="rc-item-web">allowpull</envar>:

1285 Boolean. Determines whether the web interface allows

1286 remote users to <command role="hg-cmd">hg pull</command>

1287 and <command role="hg-cmd">hg clone</command> this

1288 repository over HTTP. If set to <literal>no</literal> or

1289 <literal>false</literal>, only the

1290 <quote>human-oriented</quote> portion of the web interface

1291 is available.</para>

1292 </listitem>

1293 <listitem><para><envar role="rc-item-web">contact</envar>:

1294 String. A free-form (but preferably brief) string

1295 identifying the person or group in charge of the

1296 repository. This often contains the name and email

1297 address of a person or mailing list. It often makes sense

1298 to place this entry in a repository's own <filename

1299 role="special">.hg/hgrc</filename> file, but it can make

1300 sense to use in a global <filename

1301 role="special">~/.hgrc</filename> if every repository

1302 has a single maintainer.</para>

1303 </listitem>

1304 <listitem><para><envar role="rc-item-web">maxchanges</envar>:

1305 Integer. The default maximum number of changesets to

1306 display in a single page of output.</para>

1307 </listitem>

1308 <listitem><para><envar role="rc-item-web">maxfiles</envar>:

1309 Integer. The default maximum number of modified files to

1310 display in a single page of output.</para>

1311 </listitem>

1312 <listitem><para><envar role="rc-item-web">stripes</envar>:

1313 Integer. If the web interface displays alternating

1314 <quote>stripes</quote> to make it easier to visually align

1315 rows when you are looking at a table, this number controls

1316 the number of rows in each stripe.</para>

1317 </listitem>

1318 <listitem><para><envar role="rc-item-web">style</envar>:

1319 Controls the template Mercurial uses to display the web

1320 interface. Mercurial ships with two web templates, named

1321 <literal>default</literal> and <literal>gitweb</literal>

1322 (the latter is much more visually attractive). You can

1323 also specify a custom template of your own; see chapter

1324 <xref linkend="chap.template"/> for details.

1325 Here, you can see how to enable the

1326 <literal>gitweb</literal> style.</para>

1327 <programlisting>[web]

1328 style = gitweb</programlisting>

1329 </listitem>

1330 <listitem><para><envar role="rc-item-web">templates</envar>:

1331 Path. The directory in which to search for template

1332 files. By default, Mercurial searches in the directory in

1333 which it was installed.</para>

1334 </listitem></itemizedlist>

1335 <para>If you are using <filename

1336 role="special">hgwebdir.cgi</filename>, you can place a few

1337 configuration items in a <literal role="rc-web">web</literal>

1338 section of the <filename

1339 role="special">hgweb.config</filename> file instead of a

1340 <filename role="special">~/.hgrc</filename> file, for

1341 convenience. These items are <envar

1342 role="rc-item-web">motd</envar> and <envar

1343 role="rc-item-web">style</envar>.</para>

1344

1345 <sect3>

1346 <title>Options specific to an individual repository</title>

1347

1348 <para>A few <literal role="rc-web">web</literal> configuration

1349 items ought to be placed in a repository's local <filename

1350 role="special">.hg/hgrc</filename>, rather than a user's

1351 or global <filename role="special">~/.hgrc</filename>.</para>

1352 <itemizedlist>

1353 <listitem><para><envar

1354 role="rc-item-web">description</envar>: String. A

1355 free-form (but preferably brief) string that describes

1356 the contents or purpose of the repository.</para>

1357 </listitem>

1358 <listitem><para><envar role="rc-item-web">name</envar>:

1359 String. The name to use for the repository in the web

1360 interface. This overrides the default name, which is

1361 the last component of the repository's path.</para>

1362 </listitem></itemizedlist>

1363

1364 </sect3>

1365 <sect3>

1366 <title>Options specific to the <command role="hg-cmd">hg

1367 serve</command> command</title>

1368

1369 <para>Some of the items in the <literal

1370 role="rc-web">web</literal> section of a <filename

1371 role="special">~/.hgrc</filename> file are only for use

1372 with the <command role="hg-cmd">hg serve</command>

1373 command.</para>

1374 <itemizedlist>

1375 <listitem><para><envar role="rc-item-web">accesslog</envar>:

1376 Path. The name of a file into which to write an access

1377 log. By default, the <command role="hg-cmd">hg

1378 serve</command> command writes this information to

1379 standard output, not to a file. Log entries are written

1380 in the standard <quote>combined</quote> file format used

1381 by almost all web servers.</para>

1382 </listitem>

1383 <listitem><para><envar role="rc-item-web">address</envar>:

1384 String. The local address on which the server should

1385 listen for incoming connections. By default, the server

1386 listens on all addresses.</para>

1387 </listitem>

1388 <listitem><para><envar role="rc-item-web">errorlog</envar>:

1389 Path. The name of a file into which to write an error

1390 log. By default, the <command role="hg-cmd">hg

1391 serve</command> command writes this information to

1392 standard error, not to a file.</para>

1393 </listitem>

1394 <listitem><para><envar role="rc-item-web">ipv6</envar>:

1395 Boolean. Whether to use the IPv6 protocol. By default,

1396 IPv6 is not used.</para>

1397 </listitem>

1398 <listitem><para><envar role="rc-item-web">port</envar>:

1399 Integer. The TCP port number on which the server should

1400 listen. The default port number used is 8000.</para>

1401 </listitem></itemizedlist>

1402

1403 </sect3>

1404 <sect3>

1405 <title>Choosing the right <filename

1406 role="special">~/.hgrc</filename> file to add <literal

1407 role="rc-web">web</literal> items to</title>

1408

1409 <para>It is important to remember that a web server like

1410 Apache or <literal>lighttpd</literal> will run under a user

1411 ID that is different to yours. CGI scripts run by your

1412 server, such as <filename

1413 role="special">hgweb.cgi</filename>, will usually also run

1414 under that user ID.</para>

1415

1416 <para>If you add <literal role="rc-web">web</literal> items to

1417 your own personal <filename role="special">~/.hgrc</filename> file, CGI scripts won't read that

1418 <filename role="special">~/.hgrc</filename> file. Those

1419 settings will thus only affect the behaviour of the <command

1420 role="hg-cmd">hg serve</command> command when you run it.

1421 To cause CGI scripts to see your settings, either create a

1422 <filename role="special">~/.hgrc</filename> file in the

1423 home directory of the user ID that runs your web server, or

1424 add those settings to a system-wide <filename

1425 role="special">~/.hgrc</filename> file.</para>

1426

1427

1428 </sect3>

1429 </sect2>

1430 </sect1>

1431 </chapter>

1432

1433 <!--

1434 local variables:

1435 sgml-parent-document: ("00book.xml" "book" "chapter")

1436 end:

1437 -->