Hi Folks,

 

 I've recently encountered the following error using the crawl tool:

 

050426 214400 fetching
http://search.csmonitor.com/specials/neocon/index.html

050426 214401 fetching http://perspolis.usc.edu/Users/shahram/

050426 214401 fetching
http://www.cnn.com/rssclick/2005/TECH/science/04/26/stem.cells.ethics.ap/ind
ex.html?section=cnn_health

Found a new channel: Christian Science Monitor | Commentary

050426 214401 SEVERE error writing output:java.lang.NullPointerException

050426 214401 status: segment 20050426213847, 300 pages, 68 errors, 11258062
bytes, 310265 ms

050426 214401 status: 0.9669153 pages/s, 283.47897 kb/s, 37526.875
bytes/page

050426 214401 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

Exception in thread "main" java.lang.RuntimeException: SEVERE error logged.
Exiting fetcher.

        at org.apache.nutch.fetcher.Fetcher.run(Fetcher.java:304)

        at org.apache.nutch.fetcher.Fetcher.main(Fetcher.java:439)

        at org.apache.nutch.tools.CrawlTool.main(CrawlTool.java:140)

050426 214403 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

050426 214404 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

050426 214406 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

050426 214406 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

050426 214408 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

050426 214409 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

050426 214411 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

050426 214413 SEVERE error writing output:java.io.IOException: key out of
order: 367 after 367

[EMAIL PROTECTED] nutch]$

 

I have the parse-pdf, parse-rss, parse-msword, and parse-html, I have all
the basic indexing and querying, and protocol plugins enabled. Further, I'm
running Nutch 0.7-dev, off a redhat linux box. I've just started to
encounter this problem recently. Does anyone have any clue  as to what
causes it? I can attach the full crawl log if necessary. Please let me know.

 

Thanks very much,

 

  Chris Mattmann

 

 

______________________________________________
Chris A. Mattmann
[EMAIL PROTECTED] 
Staff Member
Modeling and Data Management Systems Section (387)

Data Management Systems and Technologies Group

_________________________________________________
Jet Propulsion Laboratory            Pasadena, CA
Office: 171-266B                        Mailstop:  171-246
_______________________________________________________

Disclaimer:  The opinions presented within are my own and do not reflect
those of either NASA, JPL, or the California Institute of Technology.

 

Reply via email to