[ https://issues.apache.org/jira/browse/TIKA-1782?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14976405#comment-14976405 ]
Tim Allison edited comment on TIKA-1782 at 10/27/15 1:49 PM: ------------------------------------------------------------- Y, I think so. The stacktrace seems to suggest a more profound issue, and my build with this patch works on RHEL and java 1.7. Was your build with trunk working before this patch? was (Author: talli...@mitre.org): Y, I think so. The stacktrace seems to suggest a more profound issue, and my build with this patch works on RHEL and java 1.7. What do you think? > XHTMLContentHandler doesn't pass attributes of html element > ----------------------------------------------------------- > > Key: TIKA-1782 > URL: https://issues.apache.org/jira/browse/TIKA-1782 > Project: Tika > Issue Type: Bug > Components: parser > Affects Versions: 1.11 > Reporter: Markus Jelsma > Fix For: 1.12 > > Attachments: TIKA-1782.patch > > > XHTMLContentHandler.startElement() uses lazyHead() for the html element > because it's defined in the AUTO Set. As a consequence, attributes of the > html element are not passed to downstream content handlers. -- This message was sent by Atlassian JIRA (v6.3.4#6332)