Daniel Bonniot de Ruisselet created TIKA-1017:
-------------------------------------------------
Summary: DefaultHtmlMapper misses some safe elements
Key: TIKA-1017
URL: https://issues.apache.org/jira/browse/TIKA-1017
Project: Tika
Issue Type: Bug
Reporter: Daniel Bonniot de Ruisselet
The DefaultHtmlMapper source says that its list of "safe" elements is based on
the XHTML 1.0 Strict DTD:
http://www.w3.org/TR/xhtml1/DTD/xhtml1-strict.dtd
However, elements such as <sub> and <i>, which that DTD defines, are not in the
safe list. Is this intentional (in which case a comment giving the rationale
would be useful), or should they be added?
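
To make the question concrete, here is a minimal, self-contained sketch of a safe-element lookup with the two elements added. It is not Tika's actual source: the class name SafeElementSketch, the short list of "already safe" elements, and the case-insensitive lookup are all assumptions made for illustration. Only the method name mapSafeElement is meant to echo the mapper's interface.

```java
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;

// Hypothetical stand-in for a safe-element mapper; NOT Tika's real implementation.
class SafeElementSketch {
    private static final Map<String, String> SAFE = new HashMap<>();

    static {
        // A few block-level elements assumed to already be on the safe list.
        for (String name : new String[] {"p", "h1", "ul", "li"}) {
            SAFE.put(name, name);
        }
        // The inline elements this report asks about (both appear in the
        // XHTML 1.0 Strict DTD).
        SAFE.put("sub", "sub");
        SAFE.put("i", "i");
    }

    // Returns the normalized element name if it is considered safe, else null.
    static String mapSafeElement(String name) {
        return SAFE.get(name.toLowerCase(Locale.ROOT));
    }

    public static void main(String[] args) {
        System.out.println(mapSafeElement("SUB"));    // prints "sub"
        System.out.println(mapSafeElement("script")); // prints "null"
    }
}
```

If the omission turns out to be intentional, the same sketch shows where a rationale comment would naturally go: next to the entries that are deliberately left out.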