Must I have index-more enabled to get the PDF titles to work?
I ran a test with some PDF files, and all PDF titles were ignored (Nutch 0.7.1).



Håvard W. Kongsgård wrote:

It'd be nice if this were changed so that, if a PDF has no title, the first xx words become the new title.
(But it seems that the Google title process is more advanced than this.)
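
A minimal sketch of the fallback described above, assuming the parser already hands back the PDF's metadata title and extracted text. This is not Nutch code: the class TitleFallback, the method chooseTitle and the maxWords parameter are hypothetical names made up for illustration.

```java
import java.util.Arrays;
import java.util.stream.Collectors;

// Hypothetical helper, not part of Nutch: picks a display title for a PDF.
final class TitleFallback {

    // Order of preference: metadata title, then the first maxWords words
    // of the extracted text, then the document URL.
    static String chooseTitle(String pdfTitle, String bodyText, String url, int maxWords) {
        if (pdfTitle != null && !pdfTitle.trim().isEmpty()) {
            return pdfTitle.trim();
        }
        if (bodyText != null && !bodyText.trim().isEmpty() && maxWords > 0) {
            // Split on any run of whitespace and keep at most maxWords words.
            String[] words = bodyText.trim().split("\\s+");
            return Arrays.stream(words)
                    .limit(maxWords)
                    .collect(Collectors.joining(" "));
        }
        return url;
    }

    public static void main(String[] args) {
        String url = "http://www.ists.dartmouth.edu/library/wse0901.pdf";
        // No metadata title here, so the first words of the text are used.
        System.out.println(chooseTitle(null, "Some extracted text from the first page", url, 4));
        // Neither title nor text, so the URL is used.
        System.out.println(chooseTitle(null, "", url, 4));
    }
}
```

Whether a fixed word count gives a useful title depends on the document; the first words of many PDFs are headers or page numbers, which is probably why a more advanced process does better.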



Jérôme Charron wrote:

When searching with Nutch, the title of PDF documents is a URL to the
file, like:
http://www.ists.dartmouth.edu/library/wse0901.pdf


In Nutch, the title of a PDF file is displayed if a title is available;
otherwise, the URL of the document is displayed.

Regards

Jérôme

--
http://motrech.free.fr/
http://www.frutch.org/



