Hi,
My fetch is stuck partway through. When I looked at the
fetch log, I found the error messages below.
Is there any known reason why this happens? Should I stop
the current session and restart the crawl?
By the way, my config for ftp content is set to
unlimited: "<name>http.content.limit</name>
<value>-1</value>". Could that be the reason?
Thanks
"
060408 072825 SEVERE error writing output:java.io.IOException: key out of order: 33079 after 33079
java.io.IOException: key out of order: 33079 after 33079
    at org.apache.nutch.io.MapFile$Writer.checkKey(MapFile.java:134)
    at org.apache.nutch.io.MapFile$Writer.append(MapFile.java:120)
    at org.apache.nutch.io.ArrayFile$Writer.append(ArrayFile.java:39)
    at org.apache.nutch.fetcher.Fetcher$FetcherThread.outputPage(Fetcher.java:318)
    at org.apache.nutch.fetcher.Fetcher$FetcherThread.handleFetch(Fetcher.java:301)
    at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:160)
Exception in thread "main" java.lang.RuntimeException: SEVERE error logged. Exiting fetcher.
    at org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:394)
    at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:528)
"
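For what it is worth, my reading of the message is that the
writer refuses any key that does not sort strictly after the
previous one, and that the same index (33079) was appended
twice. Below is a minimal sketch of that kind of check. It is
only my assumption of the behaviour, not the actual Nutch
MapFile source: the class OrderedWriterSketch and its methods
are made up for illustration, and I do not know why the
fetcher would produce the same index twice.

```java
import java.io.IOException;

// Hypothetical stand-in for an append-only, sorted writer.
// This is NOT the Nutch MapFile code, only an illustration of the check.
public class OrderedWriterSketch {
    private long lastKey;   // key of the most recently appended entry
    private long size = 0;  // number of entries appended so far

    // Rejects any key that does not sort strictly after the previous one,
    // which is the behaviour the log message suggests (an assumption).
    public void append(long key) throws IOException {
        if (size != 0 && key <= lastKey) {
            throw new IOException("key out of order: " + key + " after " + lastKey);
        }
        lastKey = key;
        size++;
    }

    public long size() {
        return size;
    }

    public static void main(String[] args) {
        OrderedWriterSketch writer = new OrderedWriterSketch();
        try {
            writer.append(33078);
            writer.append(33079);
            writer.append(33079); // same index appended a second time
        } catch (IOException e) {
            System.out.println(e.getMessage());
        }
    }
}
```

If that reading is right, the duplicate append is what aborts
the fetcher, independent of how large the fetched content is.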