Hello,
I need to know more about the parse-ext plugin ... what can it do for example ?
Then, I get the following error when I crawl with index-more plugin :
050302 183540 Updating /nutch-0.6/agoria.2mar/db
050302 183540 Updating for /nutch-0.6/agoria.2mar/segments/20050302183116
050302 183540 Processing document 0
050302 183541 Finishing update
050302 183542 Processing pagesByURL: Sorted 2931 instructions in 0.915 seconds.
050302 183542 Processing pagesByURL: Sorted 3203.27868852459 instructions/second
Exception in thread "main" java.io.IOException: already exists: /nutch-0.6/agoria.2mar/db/webdb.new/pagesByURL
at net.nutch.io.MapFile$Writer.<init>(MapFile.java:67)
at net.nutch.db.WebDBWriter$CloseProcessor.closeDown(WebDBWriter.java:536)
at net.nutch.db.WebDBWriter.close(WebDBWriter.java:1531)
at net.nutch.tools.UpdateDatabaseTool.close(UpdateDatabaseTool.java:301)
at net.nutch.tools.UpdateDatabaseTool.main(UpdateDatabaseTool.java:351)
at net.nutch.tools.CrawlTool.main(CrawlTool.java:128)
Thanks for help.
Christophe.
------------------------------------------------------- SF email is sponsored by - The IT Product Guide Read honest & candid reviews on hundreds of IT Products from real users. Discover which products truly live up to the hype. Start reading now. http://ads.osdn.com/?ad_id=6595&alloc_id=14396&op=click _______________________________________________ Nutch-developers mailing list [email protected] https://lists.sourceforge.net/lists/listinfo/nutch-developers
