Lukas Vlcek wrote:

How can I learn that?
What I do is running regular one-step command [/bin/nutch crawl]

In that case your nutch-default.xml / nutch-site.xml decides, there is a boolean option there. If you didn't change this, then it defaults to true (i.e. your fetcher is parsing the content).

Is it easy to reproduce this if I knew the seed urls? If that's the case, please send me the seed urls (contact me off the list, if it's sensitive).

--
Best regards,
Andrzej Bialecki     <><
___. ___ ___ ___ _ _   __________________________________
[__ || __|__/|__||\/|  Information Retrieval, Semantic Web
___|||__||  \|  ||  |  Embedded Unix, System Integration
http://www.sigram.com  Contact: info at sigram dot com




-------------------------------------------------------
This SF.net email is sponsored by: Splunk Inc. Do you grep through log files
for problems?  Stop!  Download the new AJAX search engine that makes
searching your log files as easy as surfing the  web.  DOWNLOAD SPLUNK!
http://ads.osdn.com/?ad_id=7637&alloc_id=16865&op=click
_______________________________________________
Nutch-developers mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-developers

Reply via email to