Upon further testing, the crawler seems to be hitting this impossible error when the sites are frequently updated during the crawl. Any clue?
Thanks. On 1/4/06, Sunnyvale Fl <[EMAIL PROTECTED]> wrote: > > I keep running into an impossible situation error when crawling with Nutch > 0.7 - can't quite decipher what it means. The error message is > > java.io.IOException: Impossible situation. There is a score-edit for > http://www.foo.com/a, which comes after the current Page > http://www.foo.com/b > > After the crash, I'll delete the unfinished segments, tmp dirs, and > webdb.new, and re-crawl the same sites, and the problem goes away. Once > in a while I hit the same error again and I have to clean up the db for that > purpose. Any ideas? > > Thanks! >
