Okay, I will replace the - with a + and crawl again.

Thanks,
Andy

-----Original Message-----
From: Michael Plax [mailto:[EMAIL PROTECTED]
Sent: Friday, January 20, 2006 2:49 PM
To: [email protected]
Subject: Re: New server not processing pdf files in asp pages, how to add plugins
Hello,

I'm new to Nutch, so this is only a guess: it may have happened because of the URL filter (regex-urlfilter.txt):

# skip URLs containing certain characters as probable queries, etc.
[EMAIL PROTECTED]

Michael

----- Original Message -----
From: "Andy Morris" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Friday, January 20, 2006 11:10 AM
Subject: New server not processing pdf files in asp pages, how to add plugins

Okay, I have had Nutch running for some time now and it was doing great searching regular html files. We have moved to an asp only website and Nutch is not finding any image files or pdf files. What do I need to add to the site_xml file to find these files?

Andy

_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general
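
For readers following the thread, here is a minimal Python sketch of why flipping a rule's leading - to a + can change which pages get crawled. It assumes the filter file is read as an ordered list of regular expressions, each prefixed with + (accept) or - (reject), where the first matching rule decides and a URL matching no rule is rejected. The rule `-[?]`, the example URL, and the function names are hypothetical illustrations; they are not the elided rule from the message above and not Nutch's own code.

```python
import re


def load_rules(text):
    """Parse '+regex' / '-regex' lines, skipping blank lines and '#' comments."""
    rules = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        sign, pattern = line[0], line[1:]
        if sign not in "+-":
            raise ValueError(f"rule must start with + or -: {line!r}")
        # Store (allow?, compiled regex) in file order.
        rules.append((sign == "+", re.compile(pattern)))
    return rules


def accepts(rules, url):
    """The first rule whose regex is found in the URL decides; no match rejects."""
    for allow, regex in rules:
        if regex.search(url):
            return allow
    return False


# Hypothetical filter file: reject URLs with a query string, accept the rest.
BEFORE = """
# hypothetical rule: reject URLs that contain a '?'
-[?]
# accept everything else
+.
"""

# The change discussed in the thread: turn the '-' into a '+'.
AFTER = BEFORE.replace("-[?]", "+[?]")

# Hypothetical ASP URL that serves a PDF through a query string.
URL = "http://www.example.com/docs.asp?file=report.pdf"

if __name__ == "__main__":
    print("before:", accepts(load_rules(BEFORE), URL))
    print("after: ", accepts(load_rules(AFTER), URL))
```

Under these assumptions the query-string URL is rejected by the first rule set and accepted by the second, while a plain URL such as http://www.example.com/index.html is accepted by both. Whether the fetched PDF is then indexed is a separate question that depends on which parse plugins are enabled.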
