Better parsed text
------------------

                 Key: NUTCH-624
                 URL: https://issues.apache.org/jira/browse/NUTCH-624
             Project: Nutch
          Issue Type: Improvement
            Reporter: Vinci


I found the parsed text by default parser Neko is not easy to process - it just 
add a space to the end of the tag. Can neko (or other parser) change the 
behaviour to 
1.adding tab (for inline element)
2.add a tab+newline  for block level element end
instead of  space, so we can have a better parsed text?

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to