Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The following page has been changed by AdrianoPalombo:
http://wiki.apache.org/nutch/bin/nutch_parse

------------------------------------------------------------------------------
  
  It assumes, under given segment, existence of ./fetcher_output/, which is 
typically generated after a non-parsing fetcher run (i.e., fetcher is started 
with option -noParsing).
  
- Contents in one segemnt are parsed and saved in these steps:
+ Contents in one segment are parsed and saved in these steps:
  
  1. ./fetcher_output/ and ./content/ are looped together (possibly by multiple 
ParserThreads), and content is parsed for each entry. The entry number and 
resultant ParserOutput are saved in ./parser.unsorted.
  
