Automatic whitespace for block elements in XHTMLContentHandler --------------------------------------------------------------
Key: TIKA-188 URL: https://issues.apache.org/jira/browse/TIKA-188 Project: Tika Issue Type: Improvement Components: parser Reporter: Jukka Zitting Priority: Minor As discussed in TIKA-171, it would be a good idea to make the XHTMLContentHandler automatically add extra whitespace to separate block level elements from each other. This would prevent extracted words to accidentally get concatenated in clients that only care about the character events. -- This message is automatically generated by JIRA. - You can reply to this email to add a comment to the issue online.