Please try altering the http.accept property in nutch-default.xml to the following
<property> <name>http.accept</name> <value>text/html,application/xhtml+xml,application/xml,application/pdf;q=0.9,*/*;q=0.8</value> <description>Value of the "Accept" request header field. </description> </property> On Mon, Jan 14, 2013 at 2:54 AM, paddz <[email protected]> wrote: > Thanks for your advice gora, it is being served. > > Patrick > > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Crawling-PDFs-no-file-extension-tp4032174p4033107.html > Sent from the Nutch - User mailing list archive at Nabble.com. > -- *Lewis*

