Dear Wiki user,

You have subscribed to a wiki page or wiki category on "Nutch Wiki" for change 
notification.

The "bin/nutch parsechecker" page has been changed by LewisJohnMcgibbney:
http://wiki.apache.org/nutch/bin/nutch%20parsechecker

New page:
Parsechecker is an alias for org.apache.nutch.parse.ParserChecker

This class takes an URL, fetches it (without storing) and returns the URL, the 
parse_data and all of the parse_text of that URL. It is extremely useful for 
checking parser implementations from the command line.

Usage: 

{{{
bin/nutch ParserChecker [-dumpText] [-forceAs mimeType] url
}}}

'''[-dumpText]''': TEnables us to dump the parse_text into a text file

'''[-forceAs mimeType]''': Forces mimType for the given URL arguement.

''url ''': The URL you wish to check the parser on.

CommandLineOptions

Reply via email to