Output from ParserChecker Url missing a newline
-----------------------------------------------

                 Key: NUTCH-1209
                 URL: https://issues.apache.org/jira/browse/NUTCH-1209
             Project: Nutch
          Issue Type: Bug
          Components: parser
    Affects Versions: 1.4
         Environment: While testing this: 
http://www.mail-archive.com/user@nutch.apache.org/msg04688.html
            Reporter: Chris A. Mattmann
            Assignee: Chris A. Mattmann
            Priority: Trivial
             Fix For: 1.5


While working on:

http://www.mail-archive.com/user@nutch.apache.org/msg04688.html

I found out that the ParserChecker is missing a newline in its report.

E.g., note:

{noformat}
./bin/nutch org.apache.nutch.parse.ParserChecker 
http://vault.fbi.gov/watergate/watergate-summary-part-01-of-02/view
{noformat}

produces:

{noformat}
fetching: http://vault.fbi.gov/watergate/watergate-summary-part-01-of-02/view
parsing: http://vault.fbi.gov/watergate/watergate-summary-part-01-of-02/view
contentType: application/xhtml+xml
---------
Url
---------------
http://vault.fbi.gov/watergate/watergate-summary-part-01-of-02/view---------
ParseData
---------
Version: 5
...snip
{noformat}

Note that there is no space between *view* and -----.



--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to