Automatic whitespace for block elements in XHTMLContentHandler
--------------------------------------------------------------

                 Key: TIKA-188
                 URL: https://issues.apache.org/jira/browse/TIKA-188
             Project: Tika
          Issue Type: Improvement
          Components: parser
            Reporter: Jukka Zitting
            Priority: Minor


As discussed in TIKA-171, it would be a good idea to make the 
XHTMLContentHandler automatically add extra whitespace to separate block level 
elements from each other. This would prevent extracted words to accidentally 
get concatenated in clients that only care about the character events.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to