Dear Wiki user, You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change notification.
The following page has been changed by AdrianoPalombo: http://wiki.apache.org/nutch/bin/nutch_parse ------------------------------------------------------------------------------ It assumes, under given segment, existence of ./fetcher_output/, which is typically generated after a non-parsing fetcher run (i.e., fetcher is started with option -noParsing). - Contents in one segemnt are parsed and saved in these steps: + Contents in one segment are parsed and saved in these steps: 1. ./fetcher_output/ and ./content/ are looped together (possibly by multiple ParserThreads), and content is parsed for each entry. The entry number and resultant ParserOutput are saved in ./parser.unsorted.
