On 23 Jan 2012, at 16:04, Julien Nioche wrote: > check your URL filter : the link above contains a '?' which by default > would get the URL to be filtered out
That was definitely the problem. Nutch is happily fetching those documents now! Thanks very much for your help. Ian. --

