Hi, I would like to use Nutch only as a (whole-web) crawler, without the indexing stage. After I've completed the fetching stage, how can I access the database with the crawled data, in particular the text of the fetched pages? I tried using segread and readdb from the command line, unfortunately with no success.
Cheers, Olena
