You can just delete the parse output folders and start the parsing tool.
Parsing a given page again makes only sense for debug reasons since hadoop io system can not update entries.
If you need to debug I suggest to write you a junit test.

HTH
Stefan


Am 29.05.2006 um 01:01 schrieb Stefan Neufeind:

Hi,

was is needed to re-parse documents that were already fetched into a
segment? Is another "nutch index ..."-run sufficient, or how could I
send the documents through the parse-plugins again?


Regards,
 Stefan




-------------------------------------------------------
All the advantages of Linux Managed Hosting--Without the Cost and Risk!
Fully trained technicians. The highest number of Red Hat certifications in
the hosting industry. Fanatical Support. Click to learn more
http://sel.as-us.falkag.net/sel?cmd=lnk&kid=107521&bid=248729&dat=121642
_______________________________________________
Nutch-general mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/nutch-general

Reply via email to