Okay, I will replace the - with a + and crawl again.
Thanks,
Andy 

-----Original Message-----
From: Michael Plax [mailto:[EMAIL PROTECTED] 
Sent: Friday, January 20, 2006 2:49 PM
To: [email protected]
Subject: Re: New server not processing pdf files in asp pages, how to
add plugins

Hello,

I'm new to Nutch so my guess maybe it's happed because of filter
(regex-urlfilter.txt)

# skip URLs containing certain characters as probable queries, etc.
[EMAIL PROTECTED]

Michael


----- Original Message -----
From: "Andy Morris" <[EMAIL PROTECTED]>
To: <[email protected]>
Sent: Friday, January 20, 2006 11:10 AM
Subject: New server not processing pdf files in asp pages, how to add
plugins


Okay, I have had nutch running for some time now  and it was doing great
searching regulat html files.  We have moved to an asp only website and
nutch is not finding any image files or pdf files.  What do I need to
add to the site_xml file to find these files?

Andy



-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid3432&bid#0486&dat1642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to