hgbook: 7e838acf7350 es/collab.tex

hgbook

view es/collab.tex @ 417:7e838acf7350

translated more paragraphs of collab to spanish

author	Igor TAmara <igor@tamarapatino.org>
date	Thu Nov 13 23:09:45 2008 -0500 (2008-11-13)
parents	c7234c5d01b2
children	7f5d542be96b

line source

1 \chapter{Colaborar con otros}

2 \label{cha:collab}

4 Debido a su naturaleza descentralizada, Mercurial no impone política

5 alguna de cómo deben trabajar los grupos de personas. Sin embargo, si

6 usted es nuevo al control distribuido de versiones, es bueno tener

7 herramientas y ejemplos a la mano al pensar en posibles modelos de

8 flujo de trabajo.

10 \section{La interfaz web de Mercurial}

12 Mercurial tiene una poderosa interfaz web que provee bastantes

13 capacidades útiles.

15 Para uso interactivo, la interfaz le permite visualizar uno o varios

16 repositorios. Puede ver la historia de un repositorio, examinar cada

17 cambio(comentarios y diferencias), y ver los contenidos de cada

18 directorio y fichero.

20 Adicionalmente la interfaz provee feeds de RSS de los cambios de los

21 repositorios. Que le permite ``subscribirse''a un repositorio usando

22 su herramienta de lectura de feeds favorita, y ser notificado

23 automáticamente de la actividad en el repositorio tan pronto como

24 sucede. Me gusta mucho más este modelo que el estar suscrito a una

25 lista de correo a la cual se envían las notificaciones, dado que no

26 requiere configuración adicional de parte de quien sea que está

27 administrando el repositorio.

29 La interfaz web también permite clonar repositorios a los usuarios

30 remotos, jalar cambios, y (cuando el servidor está configurado para

31 permitirlo) publicar cambios en el mismo. El protocolo de tunneling

32 de Mercurial comprime datos agresivamente, de forma que trabaja

33 eficientemente incluso con conexiones de red con poco ancho de banda.

35 La forma más sencilla de iniciarse con la interfaz web es usar su

36 navegador para visitar un repositorio existente, como por ejemplo el

37 repositorio principal de Mercurial \url{http://www.selenic.com/repo/hg?style=gitweb}.

39 Si está interesado en proveer una interfaz web a sus propios

40 repositorios, Mercurial provee dos formas de hacerlo. La primera es

41 usando la orden \hgcmd{serve}, que está enfocada a servir ``de forma

42 liviana'' y por intervalos cortos. Para más detalles de cómo usar

43 esta orden vea la sección~\ref{sec:collab:serve} más adelante. Si

44 tiene un repositorio que desea hacer permanente, Mercurial tiene

45 soporte embebido del \command{ssh} para publicar cambios con seguridad

46 al repositorio central, como se documenta en la

47 sección~\ref{sec:collab:ssh}. Es muy usual que se publique una copia

48 de sólo lectura en el repositorio que está corriendo sobre HTTP usando

49 CGI, como en la sección~\ref{sec:collab:cgi}. Publicar sobre HTTP

50 satisface las necesidades de la gente que no tiene permisos de

51 publicación y de aquellos que quieren usar navegadores web para

52 visualizar la historia del repositorio.

54 \subsection{Trabajo con muchas ramas}

56 Los proyectos de cierta talla tienden naturlamente a progresar de

57 forma simultánea en varios frentes. En el caso del software, es común

58 que un proyecto tenga versiones periódicas oficiales. Una versión

59 puede entrar a ``modo mantenimiento'' por un tiempo después de su

60 primera publicación; las versiones de mantenimiento tienden a contener

61 solamente arreglos de fallos, pero no nuevas características. En

62 paralelo con las versiones de mantenimiento puede haber una o muchas

63 versiones futuras pueden estar en desarrollo. La gente usa normalmente

64 la palabra ``rama'' para referirse a una de las direcciones

65 ligeramente distintas en las cuales procede el desarrollo.

67 Mercurial está especialmente preparado para administrar un buen número

68 de ramas simultáneas pero no idénticas. Cada ``dirección de

69 desarrollo'' puede vivir en su propio repositorio central, y puede

70 mezclar los cambios de una a otra de acuerdo con las necesidades. Dado

71 que los repositorios son independientes, uno del otro, los cambios

72 inestables de una rama de desarrollo nunca afectarán una rama estable

73 a menos que alguien explícitamente mezcle los cambios.

75 A continuación un ejemplo de cómo podría hacerse esto en la

76 práctica. Digamos que tiene una ``rama principal'' en un servidor

77 central.

78 \interaction{branching.init}

79 Alguien lo clona, hace cambios locales, los prueba, y los publica allí

80 mismo.

82 Una vez que la rama principal alcanza una estado de versión se puede

83 usar la orden \hgcmd{tag} para dar un nombre permanente a la revisión.

84 \interaction{branching.tag}

85 Digamos que en la rama principal ocurre más desarrollo.

86 \interaction{branching.main}

87 Cuando se usa la etiqueta con que se identificó la versión, la gente

88 puede clonar el repositorio en cualquier momento en el futuro

89 empleando \hgcmd{update} para obtener una copia del directorio de

90 trabajo exacta como cuando se creó la etiqueta de la revisión que se

91 consignó.

92 \interaction{branching.update}

94 Adicionalmente, justo después de que la rama principal se etiquete,

95 alguien puede clonarla en el servidor a una nueva rama ``estable'',

96 también en el servidor.

97 \interaction{branching.clone}

99 Alguien que requiera hacer un cambio en la rama estable puede clonar

100 \emph{ese} repositorio, hacer sus cambios, consignar y publicarlos

101 posteriormente al inicial.

102 \interaction{branching.stable}

103 Puesto que los repositorios de Mercurial son independientes, y que

104 Mercurial no mueve los cambios de un lado a otro automáticamente, las

105 ramas estable y principal están \emph{aisladas} la una de la otra.

106 Los cambios que haga en la rama principal no ``se filtran'' a la rama

107 estable o vice versa.

108

109 Es usual que los arreglos de fallos de la rama estable deban hacerse

110 aparecer en la rama principal también. En lugar de reescribir el

111 arreglo del fallo en la rama principal, puede jalar y mezclar los

112 cambios de la rama estable a la principal, Mercurial traerá tales

113 arreglos por usted.

114 \interaction{branching.merge}

115 La rama principal contendtrá aún los cambios que no están en la

116 estable y contendrá además todos los arreglos de fallos de la rama

117 estable. La rama estable permanece incólume a tales cambios.

118

119 \subsection{Ramas de Características}

120

121 En proyectos grandes, una forma efectiva de administrar los cambios es

122 dividir el equipo en grupos más pequeños. Cada grupo tiene una rama

123 compartida, clonada de una rama ``principal'' que conforma el proyecto

124 completo. Aquellos que trabajan en ramas individuales típicamente

125 están aislados de los desarrollos de otras ramas.

126

127 \begin{figure}[ht]

128 \centering

129 \grafix{feature-branches}

130 \caption{Ramas de Características}

131 \label{fig:collab:feature-branches}

132 \end{figure}

133

134 Cuando una rama particular alcanza un estado deseado, alguien del

135 equipo de características jala y fusiona de la rama principal hacia

136 la rama de características y publica posteriormente a la rama principal.

137

138 \subsection{El tren de publicación}

139

140 Algunos proyectos se organizan al estilo``tren'': Una versión se

141 planifica para ser liberada cada cierto tiempo, y las características

142 que estén listas cuando ha llegado el momento ``tren'', se incorporan.

143

144 Este modelo tiene cierta similitud a las ramas de características. La

145 diferencia es que cuando una característica pierde el tren, alguien en

146 el equipo de características jala y fusiona los cambios que se fueron

147 en la versión liberada hacia la rama de característica, y el trabajo

148 continúa sobre lo fusionado para que la característica logre estar en

149 la próxima versión.

150

151 \subsection{El modelo del kernel linux}

152

153 El desarrollo del Kernel Linux tiene una estructura jerárquica

154 bastante horizontal, rodeada de una nube de caos aparente. Dado que la

155 mayoría de desarrolladores usan \command{git}, una herramienta distribuida

156 de control de versiones con capacidades similares a Mercurial, resulta

157 de utilidad describir la forma en que el trabajo fluye en tal

158 ambiente; si le gustan las ideas, la aproximación se traduce bien

159 entre Git y Mercurial.

160

161 En el centro de la comunidad está Linus Torvalds, el creador de Linux.

162 Él publica un único repositorio que es considerado el árbol

163 ``oficial'' actual por la comunidad completa de

164 desarrolladores. Cualquiera puede clonar el árbol de Linus, pero él es

165 muy selectivo acerca de los árboles de los cuales jala.

166

167 Linux tiene varios ``lugartenientes confiables''. Como regla, él jala

168 todos los cambios que ellos publican, en la mayoría de los casos sin

169 siquiera revisarlos. Algunos de sus lugartenientes generalmente

170 aceptan ser los ``mantenedores'', responsables de subsistemas

171 específicos dentro del kernel. Si un hacker cualquiera desea hacer un

172 cambio a un subsistema y busca que termine en el árbol de Linus, debe

173 encontrar quien es el mantenedor del subsistema y solicitarle que

174 tenga en cuenta su cambio. Si el mantenedor revisa los cambios y está

175 de acuerdo en tomarlos, estos pasarán al árbol de Linus de acuerdo a

176 lo expuesto.

177

178 Cada lugarteniente tiene su forma particular de revisar, aceptar y

179 publicar los cambios; y para decidir cuando hacerlos presentes a

180 Linus. Adicionalmente existen varias ramas conocidas que mucha gente

181 usa para propósitos distintos. Por ejemplo, pocas personas mantienen

182 repositorios ``estables'' de versiones anteriores del kernel, a los

183 cuales aplican arreglos de fallos críticos necesarios. Algunos

184 mantenedores publican varios árboles: uno para cambios

185 experimentales; uno para cambios que van a ofrecer al mantenedor

186 principal; y así sucesivamente. Otros publican un solo árbol.

187

188 Este modelo tiene dos características notables. La primera es que son

189 de ``jalar exclusivamente''. Usted debe solicitar, convencer o

190 incluso rogar a otro desarrollador para que tome sus cabmios, porque

191 casi no hay árboles en los cuales más de una persona pueda publicar, y

192 no hay forma de publicar cambios en un árbol que otra persona controla.

193

194 El segundo está basado en reputación y meritocracia. Si usted es un

195 desconocido, Linus probablemente ignorará sus cambios, sin siquiera

196 responderle. Pero un mantenedor de un subsistema probablemente los

197 revisara, y los acogerá en caso de que aprueben su criterio de

198 aplicabilidad. A medida que usted ofrezca ``mejores'' cambios a un

199 mantenedor, habrá más posibilidad de que se confie en su juicio y se

200 acepten los cambios. Si usted es reconocido y matiene una rama

201 durante bastante tiempo para algo que Linus no ha aceptado, personas

202 con intereses similares pueden jalar sus cambios regularmente para

203 estar al día con su trabajo.

204

205 La reputación y meritocracia no necesariamente es transversal entre

206 ``personas'' de diferentes subsistemas. Si usted es respetado pero es

207 un hacker en almacenamiento y trata de arreglar un fallo de redes,

208 tal cambio puede recibir un nivel de escrutinio de un mantenedor de

209 redes comparable con el que se le haría a un completo extraño.

210

211 Personas que vienen de proyectos con un ordenamiento distinto, sienten

212 que el proceso comparativamente caótico del Kernel Linux es

213 completamente lunático. Es objeto de los caprichos individuales; la

214 gente desecha cambios cuando lo desean; y la fase de desarrollo es

215 alucinante. A pesar de eso Linux es una pieza de software exitosa y

216 bien reconocida.

217

218 \subsection{Solamente jalar frente a colaboración pública}

219

220 Una fuente perpetua de discusiones en la comunidad de código abierto

221 yace en el modelo de desarrollo en el cual la gente solamente jala

222 cambios de otros ``es mejor que'' uno en el cual muchas personas

223 pueden publicar cambios a un repositorio compartido.

224

225 Tícamente los partidarios del modelo de publicar usan las herramientas

226 que se apegan a este modelo. Si usted usa una herramienta

227 centralizada de control de versiones como Subversion, no hay forma de

228 elegir qué modelo va a usar: La herramienta le ofrece publicación

229 compartida, y si desea hacer cualquier otra cosa, va a tener que

230 aplicar una aproximación artificial (tal como aplicar parches a mano).

231

232 Una buena herramienta distribuida de control de versiones, tal como

233 Mercurial soportará los dos modelos. Usted y sus colaboradores

234 pueden estructurar cómo trabajarán juntos basados en sus propias

235 necesidades y preferencias, sin depender de las peripecias que la

236 herramienta les obligue a hacer.

237

238 \subsection{Cuando la colaboración encuentra la administración ramificada}

239

240 Una vez que usted y su equipo configurar algunos repositorios

241 compartidos y comienzan a propagar cambios entre sus repositorios

242 locales y compartidos, comenzará a encarar un reto relacionado, pero

243 un poco distinto: Administrar las direcciones en las cuales su equipo

244 puede moverse. A pesar de que está intimamente ligado acerca de cómo

245 interactúa su equipo, es lo suficientemente denso para ameritar un

246 tratamiento en el capítulo~\ref{chap:branch}.

247

248 \section{Aspectos técnicos de la colaboración}

249

250 Lo que resta del capítulo lo dedicamos a las cuestiones de servir

251 datos a sus colaboradores.

252

253 \section{Compartir informalmente con \hgcmd{serve}}

254 \label{sec:collab:serve}

255

256 La orden \hgcmd{serve} de Mercurial satisface de forma espectacular

257 las necesidades de un grupo pequeño, acoplado y de corto

258 tiempo. Se constituye en una demostración de cómo se siente usar los

259 comandos usando la red.

260

261 Ejecute \hgcmd{serve} dentro de un repositorio, y en pocos segundos

262 iniciará un servidor HTTP especializado; aceptará conexiones desde

263 cualquier cliente y servirá datos de este repositorio mientrs lo

264 mantenga funcionando. Todo el que sepa el URL del servidor que ha

265 iniciado, y que puede comunicarse con su computador por la red, puede

266 usar un navegador web o Mercurial para leer datos del repositorio. Un

267 URL para una instancia de \hgcmd{serve} ejecutándose en un portátil

268 debería lucir algo \Verb|http://my-laptop.local:8000/|.

269

270 La orden \hgcmd{serve} \emph{no} es un servidor web de propósito

271 general. Solamente puede hacer dos cosas:

272 \begin{itemize}

273 \item Permitir que se pueda visualizar la historia del repositorio que

274 está sirviendo desde navegadores web.

275 \item Hablar el protocolo de conexión de Mercurial para que puedan hacer

276 \hgcmd{clone} o \hgcmd{pull} (jalar) cambios de tal repositorio.

277 \end{itemize}

278 En particular, \hgcmd{serve} no permitirá que los usuarios remotos

279 puedan \emph{modificar} su repositorio. Es de tipo solo lectura.

280

281 Si está comenzando con Mercurial, no hay nada que le impida usar

282 \hgcmd{serve} para servir un repositorio en su propio computador, y

283 usar posteriormente órdenes como \hgcmd{clone}, \hgcmd{incoming}, para

284 comunicarse con el servidor como si el repositorio estuviera alojado

285 remotamente. Lo que además puede ayudarle a adecuarse rápidamente para

286 usar comandos en repositorios alojados en la red.

287

288 \subsection{Cuestiones adicionales para tener en cuenta}

289

290 Dado que permite lectura sin autenticación a todos sus clientes,

291 debería usar \hgcmd{serve} exclusivamente en ambientes en los cuáles

292 no tenga problema en que otros vean, o en los cuales tenga control

293 completo acerca de quien puede acceder a su red y jalar cambios de su

294 repositorio.

295

296 La orden \hgcmd{serve} no tiene conocimiento acerca de programas

297 cortafuegos que puedan estar instalados en su sistema o en su red. No

298 puede detectar o controlar sus cortafuegos. Si otras personas no

299 pueden acceder a su instancia \hgcmd{serve}, lo siguiente que debería hacer

300 (\emph{después} de asegurarse que tienen el URL correcto) es verificar

301 su configuración de cortafuegos.

302

303 De forma predeterminada, \hgcmd{serve} escucha conexiones entrantes en

304 el puerto~8000. Si otro proceso está escuchando en tal puerto, usted

305 podrá especificar un puerto distinto para escuchar con la opción

306 \hgopt{serve}{-p} .

307

308 Normalmente, cuando se inicia \hgcmd{serve}, no imprime nada, lo cual

309 puede ser desconcertante. Si desea confirmar que en efecto está

310 ejecutándose correctamente, y darse cuenta qué URL debería enviar a

311 sus colaboradores, inícielo con la opción \hggopt{-v}.

312

313 \section{Using the Secure Shell (ssh) protocol}

314 \label{sec:collab:ssh}

315

316 You can pull and push changes securely over a network connection using

317 the Secure Shell (\texttt{ssh}) protocol. To use this successfully,

318 you may have to do a little bit of configuration on the client or

319 server sides.

320

321 If you're not familiar with ssh, it's a network protocol that lets you

322 securely communicate with another computer. To use it with Mercurial,

323 you'll be setting up one or more user accounts on a server so that

324 remote users can log in and execute commands.

325

326 (If you \emph{are} familiar with ssh, you'll probably find some of the

327 material that follows to be elementary in nature.)

328

329 \subsection{How to read and write ssh URLs}

330

331 An ssh URL tends to look like this:

332 \begin{codesample2}

333 ssh://bos@hg.serpentine.com:22/hg/hgbook

334 \end{codesample2}

335 \begin{enumerate}

336 \item The ``\texttt{ssh://}'' part tells Mercurial to use the ssh

337 protocol.

338 \item The ``\texttt{bos@}'' component indicates what username to log

339 into the server as. You can leave this out if the remote username

340 is the same as your local username.

341 \item The ``\texttt{hg.serpentine.com}'' gives the hostname of the

342 server to log into.

343 \item The ``:22'' identifies the port number to connect to the server

344 on. The default port is~22, so you only need to specify this part

345 if you're \emph{not} using port~22.

346 \item The remainder of the URL is the local path to the repository on

347 the server.

348 \end{enumerate}

349

350 There's plenty of scope for confusion with the path component of ssh

351 URLs, as there is no standard way for tools to interpret it. Some

352 programs behave differently than others when dealing with these paths.

353 This isn't an ideal situation, but it's unlikely to change. Please

354 read the following paragraphs carefully.

355

356 Mercurial treats the path to a repository on the server as relative to

357 the remote user's home directory. For example, if user \texttt{foo}

358 on the server has a home directory of \dirname{/home/foo}, then an ssh

359 URL that contains a path component of \dirname{bar}

360 \emph{really} refers to the directory \dirname{/home/foo/bar}.

361

362 If you want to specify a path relative to another user's home

363 directory, you can use a path that starts with a tilde character

364 followed by the user's name (let's call them \texttt{otheruser}), like

365 this.

366 \begin{codesample2}

367 ssh://server/~otheruser/hg/repo

368 \end{codesample2}

369

370 And if you really want to specify an \emph{absolute} path on the

371 server, begin the path component with two slashes, as in this example.

372 \begin{codesample2}

373 ssh://server//absolute/path

374 \end{codesample2}

375

376 \subsection{Finding an ssh client for your system}

377

378 Almost every Unix-like system comes with OpenSSH preinstalled. If

379 you're using such a system, run \Verb|which ssh| to find out if

380 the \command{ssh} command is installed (it's usually in

381 \dirname{/usr/bin}). In the unlikely event that it isn't present,

382 take a look at your system documentation to figure out how to install

383 it.

384

385 On Windows, you'll first need to choose download a suitable ssh

386 client. There are two alternatives.

387 \begin{itemize}

388 \item Simon Tatham's excellent PuTTY package~\cite{web:putty} provides

389 a complete suite of ssh client commands.

390 \item If you have a high tolerance for pain, you can use the Cygwin

391 port of OpenSSH.

392 \end{itemize}

393 In either case, you'll need to edit your \hgini\ file to tell

394 Mercurial where to find the actual client command. For example, if

395 you're using PuTTY, you'll need to use the \command{plink} command as

396 a command-line ssh client.

397 \begin{codesample2}

398 [ui]

399 ssh = C:/path/to/plink.exe -ssh -i "C:/path/to/my/private/key"

400 \end{codesample2}

401

402 \begin{note}

403 The path to \command{plink} shouldn't contain any whitespace

404 characters, or Mercurial may not be able to run it correctly (so

405 putting it in \dirname{C:\\Program Files} is probably not a good

406 idea).

407 \end{note}

408

409 \subsection{Generating a key pair}

410

411 To avoid the need to repetitively type a password every time you need

412 to use your ssh client, I recommend generating a key pair. On a

413 Unix-like system, the \command{ssh-keygen} command will do the trick.

414 On Windows, if you're using PuTTY, the \command{puttygen} command is

415 what you'll need.

416

417 When you generate a key pair, it's usually \emph{highly} advisable to

418 protect it with a passphrase. (The only time that you might not want

419 to do this id when you're using the ssh protocol for automated tasks

420 on a secure network.)

421

422 Simply generating a key pair isn't enough, however. You'll need to

423 add the public key to the set of authorised keys for whatever user

424 you're logging in remotely as. For servers using OpenSSH (the vast

425 majority), this will mean adding the public key to a list in a file

426 called \sfilename{authorized\_keys} in their \sdirname{.ssh}

427 directory.

428

429 On a Unix-like system, your public key will have a \filename{.pub}

430 extension. If you're using \command{puttygen} on Windows, you can

431 save the public key to a file of your choosing, or paste it from the

432 window it's displayed in straight into the

433 \sfilename{authorized\_keys} file.

434

435 \subsection{Using an authentication agent}

436

437 An authentication agent is a daemon that stores passphrases in memory

438 (so it will forget passphrases if you log out and log back in again).

439 An ssh client will notice if it's running, and query it for a

440 passphrase. If there's no authentication agent running, or the agent

441 doesn't store the necessary passphrase, you'll have to type your

442 passphrase every time Mercurial tries to communicate with a server on

443 your behalf (e.g.~whenever you pull or push changes).

444

445 The downside of storing passphrases in an agent is that it's possible

446 for a well-prepared attacker to recover the plain text of your

447 passphrases, in some cases even if your system has been power-cycled.

448 You should make your own judgment as to whether this is an acceptable

449 risk. It certainly saves a lot of repeated typing.

450

451 On Unix-like systems, the agent is called \command{ssh-agent}, and

452 it's often run automatically for you when you log in. You'll need to

453 use the \command{ssh-add} command to add passphrases to the agent's

454 store. On Windows, if you're using PuTTY, the \command{pageant}

455 command acts as the agent. It adds an icon to your system tray that

456 will let you manage stored passphrases.

457

458 \subsection{Configuring the server side properly}

459

460 Because ssh can be fiddly to set up if you're new to it, there's a

461 variety of things that can go wrong. Add Mercurial on top, and

462 there's plenty more scope for head-scratching. Most of these

463 potential problems occur on the server side, not the client side. The

464 good news is that once you've gotten a configuration working, it will

465 usually continue to work indefinitely.

466

467 Before you try using Mercurial to talk to an ssh server, it's best to

468 make sure that you can use the normal \command{ssh} or \command{putty}

469 command to talk to the server first. If you run into problems with

470 using these commands directly, Mercurial surely won't work. Worse, it

471 will obscure the underlying problem. Any time you want to debug

472 ssh-related Mercurial problems, you should drop back to making sure

473 that plain ssh client commands work first, \emph{before} you worry

474 about whether there's a problem with Mercurial.

475

476 The first thing to be sure of on the server side is that you can

477 actually log in from another machine at all. If you can't use

478 \command{ssh} or \command{putty} to log in, the error message you get

479 may give you a few hints as to what's wrong. The most common problems

480 are as follows.

481 \begin{itemize}

482 \item If you get a ``connection refused'' error, either there isn't an

483 SSH daemon running on the server at all, or it's inaccessible due to

484 firewall configuration.

485 \item If you get a ``no route to host'' error, you either have an

486 incorrect address for the server or a seriously locked down firewall

487 that won't admit its existence at all.

488 \item If you get a ``permission denied'' error, you may have mistyped

489 the username on the server, or you could have mistyped your key's

490 passphrase or the remote user's password.

491 \end{itemize}

492 In summary, if you're having trouble talking to the server's ssh

493 daemon, first make sure that one is running at all. On many systems

494 it will be installed, but disabled, by default. Once you're done with

495 this step, you should then check that the server's firewall is

496 configured to allow incoming connections on the port the ssh daemon is

497 listening on (usually~22). Don't worry about more exotic

498 possibilities for misconfiguration until you've checked these two

499 first.

500

501 If you're using an authentication agent on the client side to store

502 passphrases for your keys, you ought to be able to log into the server

503 without being prompted for a passphrase or a password. If you're

504 prompted for a passphrase, there are a few possible culprits.

505 \begin{itemize}

506 \item You might have forgotten to use \command{ssh-add} or

507 \command{pageant} to store the passphrase.

508 \item You might have stored the passphrase for the wrong key.

509 \end{itemize}

510 If you're being prompted for the remote user's password, there are

511 another few possible problems to check.

512 \begin{itemize}

513 \item Either the user's home directory or their \sdirname{.ssh}

514 directory might have excessively liberal permissions. As a result,

515 the ssh daemon will not trust or read their

516 \sfilename{authorized\_keys} file. For example, a group-writable

517 home or \sdirname{.ssh} directory will often cause this symptom.

518 \item The user's \sfilename{authorized\_keys} file may have a problem.

519 If anyone other than the user owns or can write to that file, the

520 ssh daemon will not trust or read it.

521 \end{itemize}

522

523 In the ideal world, you should be able to run the following command

524 successfully, and it should print exactly one line of output, the

525 current date and time.

526 \begin{codesample2}

527 ssh myserver date

528 \end{codesample2}

529

530 If, on your server, you have login scripts that print banners or other

531 junk even when running non-interactive commands like this, you should

532 fix them before you continue, so that they only print output if

533 they're run interactively. Otherwise these banners will at least

534 clutter up Mercurial's output. Worse, they could potentially cause

535 problems with running Mercurial commands remotely. Mercurial makes

536 tries to detect and ignore banners in non-interactive \command{ssh}

537 sessions, but it is not foolproof. (If you're editing your login

538 scripts on your server, the usual way to see if a login script is

539 running in an interactive shell is to check the return code from the

540 command \Verb|tty -s|.)

541

542 Once you've verified that plain old ssh is working with your server,

543 the next step is to ensure that Mercurial runs on the server. The

544 following command should run successfully:

545 \begin{codesample2}

546 ssh myserver hg version

547 \end{codesample2}

548 If you see an error message instead of normal \hgcmd{version} output,

549 this is usually because you haven't installed Mercurial to

550 \dirname{/usr/bin}. Don't worry if this is the case; you don't need

551 to do that. But you should check for a few possible problems.

552 \begin{itemize}

553 \item Is Mercurial really installed on the server at all? I know this

554 sounds trivial, but it's worth checking!

555 \item Maybe your shell's search path (usually set via the \envar{PATH}

556 environment variable) is simply misconfigured.

557 \item Perhaps your \envar{PATH} environment variable is only being set

558 to point to the location of the \command{hg} executable if the login

559 session is interactive. This can happen if you're setting the path

560 in the wrong shell login script. See your shell's documentation for

561 details.

562 \item The \envar{PYTHONPATH} environment variable may need to contain

563 the path to the Mercurial Python modules. It might not be set at

564 all; it could be incorrect; or it may be set only if the login is

565 interactive.

566 \end{itemize}

567

568 If you can run \hgcmd{version} over an ssh connection, well done!

569 You've got the server and client sorted out. You should now be able

570 to use Mercurial to access repositories hosted by that username on

571 that server. If you run into problems with Mercurial and ssh at this

572 point, try using the \hggopt{--debug} option to get a clearer picture

573 of what's going on.

574

575 \subsection{Using compression with ssh}

576

577 Mercurial does not compress data when it uses the ssh protocol,

578 because the ssh protocol can transparently compress data. However,

579 the default behaviour of ssh clients is \emph{not} to request

580 compression.

581

582 Over any network other than a fast LAN (even a wireless network),

583 using compression is likely to significantly speed up Mercurial's

584 network operations. For example, over a WAN, someone measured

585 compression as reducing the amount of time required to clone a

586 particularly large repository from~51 minutes to~17 minutes.

587

588 Both \command{ssh} and \command{plink} accept a \cmdopt{ssh}{-C}

589 option which turns on compression. You can easily edit your \hgrc\ to

590 enable compression for all of Mercurial's uses of the ssh protocol.

591 \begin{codesample2}

592 [ui]

593 ssh = ssh -C

594 \end{codesample2}

595

596 If you use \command{ssh}, you can configure it to always use

597 compression when talking to your server. To do this, edit your

598 \sfilename{.ssh/config} file (which may not yet exist), as follows.

599 \begin{codesample2}

600 Host hg

601 Compression yes

602 HostName hg.example.com

603 \end{codesample2}

604 This defines an alias, \texttt{hg}. When you use it on the

605 \command{ssh} command line or in a Mercurial \texttt{ssh}-protocol

606 URL, it will cause \command{ssh} to connect to \texttt{hg.example.com}

607 and use compression. This gives you both a shorter name to type and

608 compression, each of which is a good thing in its own right.

609

610 \section{Serving over HTTP using CGI}

611 \label{sec:collab:cgi}

612

613 Depending on how ambitious you are, configuring Mercurial's CGI

614 interface can take anything from a few moments to several hours.

615

616 We'll begin with the simplest of examples, and work our way towards a

617 more complex configuration. Even for the most basic case, you're

618 almost certainly going to need to read and modify your web server's

619 configuration.

620

621 \begin{note}

622 Configuring a web server is a complex, fiddly, and highly

623 system-dependent activity. I can't possibly give you instructions

624 that will cover anything like all of the cases you will encounter.

625 Please use your discretion and judgment in following the sections

626 below. Be prepared to make plenty of mistakes, and to spend a lot

627 of time reading your server's error logs.

628 \end{note}

629

630 \subsection{Web server configuration checklist}

631

632 Before you continue, do take a few moments to check a few aspects of

633 your system's setup.

634

635 \begin{enumerate}

636 \item Do you have a web server installed at all? Mac OS X ships with

637 Apache, but many other systems may not have a web server installed.

638 \item If you have a web server installed, is it actually running? On

639 most systems, even if one is present, it will be disabled by

640 default.

641 \item Is your server configured to allow you to run CGI programs in

642 the directory where you plan to do so? Most servers default to

643 explicitly disabling the ability to run CGI programs.

644 \end{enumerate}

645

646 If you don't have a web server installed, and don't have substantial

647 experience configuring Apache, you should consider using the

648 \texttt{lighttpd} web server instead of Apache. Apache has a

649 well-deserved reputation for baroque and confusing configuration.

650 While \texttt{lighttpd} is less capable in some ways than Apache, most

651 of these capabilities are not relevant to serving Mercurial

652 repositories. And \texttt{lighttpd} is undeniably \emph{much} easier

653 to get started with than Apache.

654

655 \subsection{Basic CGI configuration}

656

657 On Unix-like systems, it's common for users to have a subdirectory

658 named something like \dirname{public\_html} in their home directory,

659 from which they can serve up web pages. A file named \filename{foo}

660 in this directory will be accessible at a URL of the form

661 \texttt{http://www.example.com/\~username/foo}.

662

663 To get started, find the \sfilename{hgweb.cgi} script that should be

664 present in your Mercurial installation. If you can't quickly find a

665 local copy on your system, simply download one from the master

666 Mercurial repository at

667 \url{http://www.selenic.com/repo/hg/raw-file/tip/hgweb.cgi}.

668

669 You'll need to copy this script into your \dirname{public\_html}

670 directory, and ensure that it's executable.

671 \begin{codesample2}

672 cp .../hgweb.cgi ~/public_html

673 chmod 755 ~/public_html/hgweb.cgi

674 \end{codesample2}

675 The \texttt{755} argument to \command{chmod} is a little more general

676 than just making the script executable: it ensures that the script is

677 executable by anyone, and that ``group'' and ``other'' write

678 permissions are \emph{not} set. If you were to leave those write

679 permissions enabled, Apache's \texttt{suexec} subsystem would likely

680 refuse to execute the script. In fact, \texttt{suexec} also insists

681 that the \emph{directory} in which the script resides must not be

682 writable by others.

683 \begin{codesample2}

684 chmod 755 ~/public_html

685 \end{codesample2}

686

687 \subsubsection{What could \emph{possibly} go wrong?}

688 \label{sec:collab:wtf}

689

690 Once you've copied the CGI script into place, go into a web browser,

691 and try to open the URL \url{http://myhostname/~myuser/hgweb.cgi},

692 \emph{but} brace yourself for instant failure. There's a high

693 probability that trying to visit this URL will fail, and there are

694 many possible reasons for this. In fact, you're likely to stumble

695 over almost every one of the possible errors below, so please read

696 carefully. The following are all of the problems I ran into on a

697 system running Fedora~7, with a fresh installation of Apache, and a

698 user account that I created specially to perform this exercise.

699

700 Your web server may have per-user directories disabled. If you're

701 using Apache, search your config file for a \texttt{UserDir}

702 directive. If there's none present, per-user directories will be

703 disabled. If one exists, but its value is \texttt{disabled}, then

704 per-user directories will be disabled. Otherwise, the string after

705 \texttt{UserDir} gives the name of the subdirectory that Apache will

706 look in under your home directory, for example \dirname{public\_html}.

707

708 Your file access permissions may be too restrictive. The web server

709 must be able to traverse your home directory and directories under

710 your \dirname{public\_html} directory, and read files under the latter

711 too. Here's a quick recipe to help you to make your permissions more

712 appropriate.

713 \begin{codesample2}

714 chmod 755 ~

715 find ~/public_html -type d -print0 | xargs -0r chmod 755

716 find ~/public_html -type f -print0 | xargs -0r chmod 644

717 \end{codesample2}

718

719 The other possibility with permissions is that you might get a

720 completely empty window when you try to load the script. In this

721 case, it's likely that your access permissions are \emph{too

722 permissive}. Apache's \texttt{suexec} subsystem won't execute a

723 script that's group-~or world-writable, for example.

724

725 Your web server may be configured to disallow execution of CGI

726 programs in your per-user web directory. Here's Apache's

727 default per-user configuration from my Fedora system.

728 \begin{codesample2}

729 <Directory /home/*/public_html>

730 AllowOverride FileInfo AuthConfig Limit

731 Options MultiViews Indexes SymLinksIfOwnerMatch IncludesNoExec

732 <Limit GET POST OPTIONS>

733 Order allow,deny

734 Allow from all

735 </Limit>

736 <LimitExcept GET POST OPTIONS>

737 Order deny,allow

738 Deny from all

739 </LimitExcept>

740 </Directory>

741 \end{codesample2}

742 If you find a similar-looking \texttt{Directory} group in your Apache

743 configuration, the directive to look at inside it is \texttt{Options}.

744 Add \texttt{ExecCGI} to the end of this list if it's missing, and

745 restart the web server.

746

747 If you find that Apache serves you the text of the CGI script instead

748 of executing it, you may need to either uncomment (if already present)

749 or add a directive like this.

750 \begin{codesample2}

751 AddHandler cgi-script .cgi

752 \end{codesample2}

753

754 The next possibility is that you might be served with a colourful

755 Python backtrace claiming that it can't import a

756 \texttt{mercurial}-related module. This is actually progress! The

757 server is now capable of executing your CGI script. This error is

758 only likely to occur if you're running a private installation of

759 Mercurial, instead of a system-wide version. Remember that the web

760 server runs the CGI program without any of the environment variables

761 that you take for granted in an interactive session. If this error

762 happens to you, edit your copy of \sfilename{hgweb.cgi} and follow the

763 directions inside it to correctly set your \envar{PYTHONPATH}

764 environment variable.

765

766 Finally, you are \emph{certain} to by served with another colourful

767 Python backtrace: this one will complain that it can't find

768 \dirname{/path/to/repository}. Edit your \sfilename{hgweb.cgi} script

769 and replace the \dirname{/path/to/repository} string with the complete

770 path to the repository you want to serve up.

771

772 At this point, when you try to reload the page, you should be

773 presented with a nice HTML view of your repository's history. Whew!

774

775 \subsubsection{Configuring lighttpd}

776

777 To be exhaustive in my experiments, I tried configuring the

778 increasingly popular \texttt{lighttpd} web server to serve the same

779 repository as I described with Apache above. I had already overcome

780 all of the problems I outlined with Apache, many of which are not

781 server-specific. As a result, I was fairly sure that my file and

782 directory permissions were good, and that my \sfilename{hgweb.cgi}

783 script was properly edited.

784

785 Once I had Apache running, getting \texttt{lighttpd} to serve the

786 repository was a snap (in other words, even if you're trying to use

787 \texttt{lighttpd}, you should read the Apache section). I first had

788 to edit the \texttt{mod\_access} section of its config file to enable

789 \texttt{mod\_cgi} and \texttt{mod\_userdir}, both of which were

790 disabled by default on my system. I then added a few lines to the end

791 of the config file, to configure these modules.

792 \begin{codesample2}

793 userdir.path = "public_html"

794 cgi.assign = ( ".cgi" => "" )

795 \end{codesample2}

796 With this done, \texttt{lighttpd} ran immediately for me. If I had

797 configured \texttt{lighttpd} before Apache, I'd almost certainly have

798 run into many of the same system-level configuration problems as I did

799 with Apache. However, I found \texttt{lighttpd} to be noticeably

800 easier to configure than Apache, even though I've used Apache for over

801 a decade, and this was my first exposure to \texttt{lighttpd}.

802

803 \subsection{Sharing multiple repositories with one CGI script}

804

805 The \sfilename{hgweb.cgi} script only lets you publish a single

806 repository, which is an annoying restriction. If you want to publish

807 more than one without wracking yourself with multiple copies of the

808 same script, each with different names, a better choice is to use the

809 \sfilename{hgwebdir.cgi} script.

810

811 The procedure to configure \sfilename{hgwebdir.cgi} is only a little

812 more involved than for \sfilename{hgweb.cgi}. First, you must obtain

813 a copy of the script. If you don't have one handy, you can download a

814 copy from the master Mercurial repository at

815 \url{http://www.selenic.com/repo/hg/raw-file/tip/hgwebdir.cgi}.

816

817 You'll need to copy this script into your \dirname{public\_html}

818 directory, and ensure that it's executable.

819 \begin{codesample2}

820 cp .../hgwebdir.cgi ~/public_html

821 chmod 755 ~/public_html ~/public_html/hgwebdir.cgi

822 \end{codesample2}

823 With basic configuration out of the way, try to visit

824 \url{http://myhostname/~myuser/hgwebdir.cgi} in your browser. It

825 should display an empty list of repositories. If you get a blank

826 window or error message, try walking through the list of potential

827 problems in section~\ref{sec:collab:wtf}.

828

829 The \sfilename{hgwebdir.cgi} script relies on an external

830 configuration file. By default, it searches for a file named

831 \sfilename{hgweb.config} in the same directory as itself. You'll need

832 to create this file, and make it world-readable. The format of the

833 file is similar to a Windows ``ini'' file, as understood by Python's

834 \texttt{ConfigParser}~\cite{web:configparser} module.

835

836 The easiest way to configure \sfilename{hgwebdir.cgi} is with a

837 section named \texttt{collections}. This will automatically publish

838 \emph{every} repository under the directories you name. The section

839 should look like this:

840 \begin{codesample2}

841 [collections]

842 /my/root = /my/root

843 \end{codesample2}

844 Mercurial interprets this by looking at the directory name on the

845 \emph{right} hand side of the ``\texttt{=}'' sign; finding

846 repositories in that directory hierarchy; and using the text on the

847 \emph{left} to strip off matching text from the names it will actually

848 list in the web interface. The remaining component of a path after

849 this stripping has occurred is called a ``virtual path''.

850

851 Given the example above, if we have a repository whose local path is

852 \dirname{/my/root/this/repo}, the CGI script will strip the leading

853 \dirname{/my/root} from the name, and publish the repository with a

854 virtual path of \dirname{this/repo}. If the base URL for our CGI

855 script is \url{http://myhostname/~myuser/hgwebdir.cgi}, the complete

856 URL for that repository will be

857 \url{http://myhostname/~myuser/hgwebdir.cgi/this/repo}.

858

859 If we replace \dirname{/my/root} on the left hand side of this example

860 with \dirname{/my}, then \sfilename{hgwebdir.cgi} will only strip off

861 \dirname{/my} from the repository name, and will give us a virtual

862 path of \dirname{root/this/repo} instead of \dirname{this/repo}.

863

864 The \sfilename{hgwebdir.cgi} script will recursively search each

865 directory listed in the \texttt{collections} section of its

866 configuration file, but it will \texttt{not} recurse into the

867 repositories it finds.

868

869 The \texttt{collections} mechanism makes it easy to publish many

870 repositories in a ``fire and forget'' manner. You only need to set up

871 the CGI script and configuration file one time. Afterwards, you can

872 publish or unpublish a repository at any time by simply moving it

873 into, or out of, the directory hierarchy in which you've configured

874 \sfilename{hgwebdir.cgi} to look.

875

876 \subsubsection{Explicitly specifying which repositories to publish}

877

878 In addition to the \texttt{collections} mechanism, the

879 \sfilename{hgwebdir.cgi} script allows you to publish a specific list

880 of repositories. To do so, create a \texttt{paths} section, with

881 contents of the following form.

882 \begin{codesample2}

883 [paths]

884 repo1 = /my/path/to/some/repo

885 repo2 = /some/path/to/another

886 \end{codesample2}

887 In this case, the virtual path (the component that will appear in a

888 URL) is on the left hand side of each definition, while the path to

889 the repository is on the right. Notice that there does not need to be

890 any relationship between the virtual path you choose and the location

891 of a repository in your filesystem.

892

893 If you wish, you can use both the \texttt{collections} and

894 \texttt{paths} mechanisms simultaneously in a single configuration

895 file.

896

897 \begin{note}

898 If multiple repositories have the same virtual path,

899 \sfilename{hgwebdir.cgi} will not report an error. Instead, it will

900 behave unpredictably.

901 \end{note}

902

903 \subsection{Downloading source archives}

904

905 Mercurial's web interface lets users download an archive of any

906 revision. This archive will contain a snapshot of the working

907 directory as of that revision, but it will not contain a copy of the

908 repository data.

909

910 By default, this feature is not enabled. To enable it, you'll need to

911 add an \rcitem{web}{allow\_archive} item to the \rcsection{web}

912 section of your \hgrc.

913

914 \subsection{Web configuration options}

915

916 Mercurial's web interfaces (the \hgcmd{serve} command, and the

917 \sfilename{hgweb.cgi} and \sfilename{hgwebdir.cgi} scripts) have a

918 number of configuration options that you can set. These belong in a

919 section named \rcsection{web}.

920 \begin{itemize}

921 \item[\rcitem{web}{allow\_archive}] Determines which (if any) archive

922 download mechanisms Mercurial supports. If you enable this

923 feature, users of the web interface will be able to download an

924 archive of whatever revision of a repository they are viewing.

925 To enable the archive feature, this item must take the form of a

926 sequence of words drawn from the list below.

927 \begin{itemize}

928 \item[\texttt{bz2}] A \command{tar} archive, compressed using

929 \texttt{bzip2} compression. This has the best compression ratio,

930 but uses the most CPU time on the server.

931 \item[\texttt{gz}] A \command{tar} archive, compressed using

932 \texttt{gzip} compression.

933 \item[\texttt{zip}] A \command{zip} archive, compressed using LZW

934 compression. This format has the worst compression ratio, but is

935 widely used in the Windows world.

936 \end{itemize}

937 If you provide an empty list, or don't have an

938 \rcitem{web}{allow\_archive} entry at all, this feature will be

939 disabled. Here is an example of how to enable all three supported

940 formats.

941 \begin{codesample4}

942 [web]

943 allow_archive = bz2 gz zip

944 \end{codesample4}

945 \item[\rcitem{web}{allowpull}] Boolean. Determines whether the web

946 interface allows remote users to \hgcmd{pull} and \hgcmd{clone} this

947 repository over~HTTP. If set to \texttt{no} or \texttt{false}, only

948 the ``human-oriented'' portion of the web interface is available.

949 \item[\rcitem{web}{contact}] String. A free-form (but preferably

950 brief) string identifying the person or group in charge of the

951 repository. This often contains the name and email address of a

952 person or mailing list. It often makes sense to place this entry in

953 a repository's own \sfilename{.hg/hgrc} file, but it can make sense

954 to use in a global \hgrc\ if every repository has a single

955 maintainer.

956 \item[\rcitem{web}{maxchanges}] Integer. The default maximum number

957 of changesets to display in a single page of output.

958 \item[\rcitem{web}{maxfiles}] Integer. The default maximum number

959 of modified files to display in a single page of output.

960 \item[\rcitem{web}{stripes}] Integer. If the web interface displays

961 alternating ``stripes'' to make it easier to visually align rows

962 when you are looking at a table, this number controls the number of

963 rows in each stripe.

964 \item[\rcitem{web}{style}] Controls the template Mercurial uses to

965 display the web interface. Mercurial ships with two web templates,

966 named \texttt{default} and \texttt{gitweb} (the latter is much more

967 visually attractive). You can also specify a custom template of

968 your own; see chapter~\ref{chap:template} for details. Here, you

969 can see how to enable the \texttt{gitweb} style.

970 \begin{codesample4}

971 [web]

972 style = gitweb

973 \end{codesample4}

974 \item[\rcitem{web}{templates}] Path. The directory in which to search

975 for template files. By default, Mercurial searches in the directory

976 in which it was installed.

977 \end{itemize}

978 If you are using \sfilename{hgwebdir.cgi}, you can place a few

979 configuration items in a \rcsection{web} section of the

980 \sfilename{hgweb.config} file instead of a \hgrc\ file, for

981 convenience. These items are \rcitem{web}{motd} and

982 \rcitem{web}{style}.

983

984 \subsubsection{Options specific to an individual repository}

985

986 A few \rcsection{web} configuration items ought to be placed in a

987 repository's local \sfilename{.hg/hgrc}, rather than a user's or

988 global \hgrc.

989 \begin{itemize}

990 \item[\rcitem{web}{description}] String. A free-form (but preferably

991 brief) string that describes the contents or purpose of the

992 repository.

993 \item[\rcitem{web}{name}] String. The name to use for the repository

994 in the web interface. This overrides the default name, which is the

995 last component of the repository's path.

996 \end{itemize}

997

998 \subsubsection{Options specific to the \hgcmd{serve} command}

999

1000 Some of the items in the \rcsection{web} section of a \hgrc\ file are

1001 only for use with the \hgcmd{serve} command.

1002 \begin{itemize}

1003 \item[\rcitem{web}{accesslog}] Path. The name of a file into which to

1004 write an access log. By default, the \hgcmd{serve} command writes

1005 this information to standard output, not to a file. Log entries are

1006 written in the standard ``combined'' file format used by almost all

1007 web servers.

1008 \item[\rcitem{web}{address}] String. The local address on which the

1009 server should listen for incoming connections. By default, the

1010 server listens on all addresses.

1011 \item[\rcitem{web}{errorlog}] Path. The name of a file into which to

1012 write an error log. By default, the \hgcmd{serve} command writes this

1013 information to standard error, not to a file.

1014 \item[\rcitem{web}{ipv6}] Boolean. Whether to use the IPv6 protocol.

1015 By default, IPv6 is not used.

1016 \item[\rcitem{web}{port}] Integer. The TCP~port number on which the

1017 server should listen. The default port number used is~8000.

1018 \end{itemize}

1019

1020 \subsubsection{Choosing the right \hgrc\ file to add \rcsection{web}

1021 items to}

1022

1023 It is important to remember that a web server like Apache or

1024 \texttt{lighttpd} will run under a user~ID that is different to yours.

1025 CGI scripts run by your server, such as \sfilename{hgweb.cgi}, will

1026 usually also run under that user~ID.

1027

1028 If you add \rcsection{web} items to your own personal \hgrc\ file, CGI

1029 scripts won't read that \hgrc\ file. Those settings will thus only

1030 affect the behaviour of the \hgcmd{serve} command when you run it. To

1031 cause CGI scripts to see your settings, either create a \hgrc\ file in

1032 the home directory of the user ID that runs your web server, or add

1033 those settings to a system-wide \hgrc\ file.

1034

1035

1036 %%% Local Variables:

1037 %%% mode: latex

1038 %%% TeX-master: "00book"

1039 %%% End: