Hi,

My fetching is stuck in the middle of somewhere in
fetching. And when I took a look at fetch log, I got
the following error messages,

Any reason why this happens? Should I stop current
session and restart crawling again?

by the way, my config for ftp content is set as
unlimited "<name>http.content.limit</name> 
<value>-1</value> ". Will that be the reason?

thanks

"
060408 072825 SEVERE error writing
output:java.io.IOException: key out of order: 33079
after 33079
java.io.IOException: key out of order: 33079 after
33079
        at
org.apache.nutch.io.MapFile$Writer.checkKey(MapFile.java:134)
        at
org.apache.nutch.io.MapFile$Writer.append(MapFile.java:120)
        at
org.apache.nutch.io.ArrayFile$Writer.append(ArrayFile.java:39)
        at
org.apache.nutch.fetcher.Fetcher$FetcherThread.outputPage(Fetcher.java:318)
        at
org.apache.nutch.fetcher.Fetcher$FetcherThread.handleFetch(Fetcher.java:301)
        at
org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:160)
Exception in thread "main" java.lang.RuntimeException:
SEVERE error logged.  Exiting fetcher.
        at
org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:394)
        at
org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:528)
"

__________________________________________________
Do You Yahoo!?
Tired of spam?  Yahoo! Mail has the best spam protection around 
http://mail.yahoo.com 


-------------------------------------------------------
This SF.Net email is sponsored by xPML, a groundbreaking scripting language
that extends applications into web and mobile media. Attend the live webcast
and join the prime developer group breaking into this new coding territory!
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=110944&bid=241720&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to