Must I have index-more enabled to get the pdf titles to work.
I did a test with some pdf files, all pdf titles were ignored (nutch 0.7.1).
Håvard W. Kongsgård wrote:
It'd be nice if this was changed so that if a PDF has no title then
the first xx words become the new title.
(but it seems that the Google title process is more advanced that this)
Jérôme Charron wrote:
When searching with nutch the title of pdf documents is a url to the
file like:
http://www.ists.dartmouth.edu/library/wse0901.pdf
In Nutch, the title of PDF file is displayed if a title is available,
otherwise the URL
of the document is displayed.
Regards
Jérôme
--
http://motrech.free.fr/
http://www.frutch.org/