You can just delete the parse output folders and run the parsing tool again.
Re-parsing a single page only makes sense for debugging, since the
Hadoop I/O system cannot update existing entries.
If you need to debug, I suggest writing a JUnit test.
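
A minimal sketch of the delete-and-re-parse step, assuming a Nutch 0.8-style segment stored on the local filesystem, where the parse output lives in the `crawl_parse`, `parse_data` and `parse_text` subfolders. The helper name `clear_parse_output` and the segment path are hypothetical; check the folder names in your own segment before deleting anything.

```shell
# Hypothetical helper: remove the parse output of one segment so that
# the segment can be parsed again. The folder names are an assumption
# based on the Nutch 0.8 segment layout. The fetched data (content,
# crawl_fetch, ...) is left untouched.
clear_parse_output() {
  segment="$1"
  for dir in crawl_parse parse_data parse_text; do
    # ${segment:?} aborts if no segment path was given,
    # so this can never expand to "rm -rf /crawl_parse".
    rm -rf "${segment:?}/$dir"
  done
}

# Usage (the segment path is only an example):
#   clear_parse_output crawl/segments/20060529010100
#   bin/nutch parse crawl/segments/20060529010100
```

If the segment is stored in the distributed filesystem instead, the folders would have to be removed with the Hadoop filesystem shell rather than with `rm`.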
HTH
Stefan
On 29.05.2006 at 01:01, Stefan Neufeind wrote:
Hi,
what is needed to re-parse documents that were already fetched into a
segment? Is another "nutch index ..." run sufficient, or how can I
send the documents through the parse plugins again?
Regards,
Stefan