Please try altering the http.accept property in nutch-default.xml to the
following

<property>
  <name>http.accept</name>
  
<value>text/html,application/xhtml+xml,application/xml,application/pdf;q=0.9,*/*;q=0.8</value>
  <description>Value of the "Accept" request header field.
  </description>
</property>



On Mon, Jan 14, 2013 at 2:54 AM, paddz <[email protected]> wrote:

> Thanks for your advice gora, it is being served.
>
> Patrick
>
>
>
> --
> View this message in context:
> http://lucene.472066.n3.nabble.com/Crawling-PDFs-no-file-extension-tp4032174p4033107.html
> Sent from the Nutch - User mailing list archive at Nabble.com.
>



-- 
*Lewis*

Reply via email to