[ 
https://issues.apache.org/jira/browse/NUTCH-1898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Lewis John McGibbney updated NUTCH-1898:
----------------------------------------
    Description: 
The ability to obtain the raw HTML alongside the other parse data that the 
existing parsechecker outputs would complement the tool.
This issue should merely append the raw HTML markup to the existing output. It 
should be an optional parameter, like -dumpText.

  was:Obtaining raw HTML


> Add -dumpRawHTML parameter to parsechecker tool
> -----------------------------------------------
>
>                 Key: NUTCH-1898
>                 URL: https://issues.apache.org/jira/browse/NUTCH-1898
>             Project: Nutch
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.9, 2.2.1
>            Reporter: Lewis John McGibbney
>            Priority: Minor
>             Fix For: 2.4, 1.10
>
>
> The ability to obtain the raw HTML alongside the other parse data that the 
> existing parsechecker outputs would complement the tool.
> This issue should merely append the raw HTML markup to the existing output. 
> It should be an optional parameter, like -dumpText.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
