Hi,
My fetch got stuck partway through. When I looked at the fetch
log, I found the error messages quoted below.
Any idea why this happens? Should I stop the current
session and restart the crawl?
By the way, my config for ftp content is set to
unlimited: "<name>http.content.limit</name>
<value>-1</value>". Could that be the reason?
Thanks
"
060408 072825 SEVERE error writing output:java.io.IOException: key out of order: 33079 after 33079
java.io.IOException: key out of order: 33079 after 33079
        at org.apache.nutch.io.MapFile$Writer.checkKey(MapFile.java:134)
        at org.apache.nutch.io.MapFile$Writer.append(MapFile.java:120)
        at org.apache.nutch.io.ArrayFile$Writer.append(ArrayFile.java:39)
        at org.apache.nutch.fetcher.Fetcher$FetcherThread.outputPage(Fetcher.java:318)
        at org.apache.nutch.fetcher.Fetcher$FetcherThread.handleFetch(Fetcher.java:301)
        at org.apache.nutch.fetcher.Fetcher$FetcherThread.run(Fetcher.java:160)
Exception in thread "main" java.lang.RuntimeException: SEVERE error logged. Exiting fetcher.
        at org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:394)
        at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:528)
"
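
For reference, here is a minimal sketch of one way a message of the form
"key out of order: N after N" could arise. It is only a guess at the failure
mode, not a confirmed diagnosis, and it does not use Nutch's real classes:
KeyOrderSketch, SortedWriter, CountingWriter and failNextWrite are all
hypothetical names. The assumption is that the writer records the key before
writing the data, and that the caller's counter only advances after a
successful append, so an append that fails halfway makes the next one reuse
the same key.

```java
import java.io.IOException;

// Hypothetical stand-in for an append-only, sorted-key writer; not Nutch's actual classes.
public class KeyOrderSketch {

    /** Accepts only strictly increasing keys, as the logged check appears to require. */
    static class SortedWriter {
        private long lastKey = -1;
        private boolean failNextWrite = false; // hook to simulate an I/O failure

        void failNextWrite() { failNextWrite = true; }

        void append(long key, String value) throws IOException {
            if (key <= lastKey) {
                throw new IOException("key out of order: " + key + " after " + lastKey);
            }
            lastKey = key;               // assumption: key is recorded before the data is written
            if (failNextWrite) {
                failNextWrite = false;
                throw new IOException("simulated write failure");
            }
            // a real writer would persist (key, value) here
        }
    }

    /** Assigns keys from a counter that only advances after a successful append. */
    static class CountingWriter {
        private final SortedWriter out = new SortedWriter();
        private long count = 0;

        void failNextWrite() { out.failNextWrite(); }

        void append(String value) throws IOException {
            out.append(count, value);
            count++;                     // skipped if the append above throws
        }
    }

    /** Returns the message of the exception raised by the append that follows a failed one. */
    static String reproduce() {
        CountingWriter w = new CountingWriter();
        try {
            w.append("page-0");
            w.failNextWrite();
            try {
                w.append("page-1");      // fails after key 1 was already recorded
            } catch (IOException first) {
                // swallowed here, as a worker thread might log the error and continue
            }
            w.append("page-2");          // reuses key 1, so the order check rejects it
            return "no error";
        } catch (IOException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        System.out.println(reproduce()); // prints "key out of order: 1 after 1"
    }
}
```

If something like this is what happened, the first failure (the one that
left the key recorded) would be the real problem, and it should appear
earlier in the fetch log than the "key out of order" lines.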
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general