When searching with nutch the title of pdf documents is a url to the file like:
http://www.ists.dartmouth.edu/library/wse0901.pdf

I have noticed that google and ultraseek creates a normal title like:
WebALPS: A Survey of E-Commerce Privacy and Security Applications

Is it possible to make nutch do the same?


-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=103432&bid=230486&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to