Mismatched start and end elements in HtmlParser
-----------------------------------------------

                 Key: TIKA-525
                 URL: https://issues.apache.org/jira/browse/TIKA-525
             Project: Tika
          Issue Type: Bug
          Components: parser
    Affects Versions: 0.7
            Reporter: Geoff Jarrad
             Fix For: 0.8


Due to the use of conditionals when determining whether or not to output a HTML 
element, the HtmlParser sometimes blocks a start element but erroneously passes 
the end element.
Example: Consider the action of the element <base target="_top"/> for the 
methods:

<pre>
    public void startElement(
            String uri, String local, String name, Attributes atts)
            throws SAXException {
...
            } else if ("BASE".equals(name) && atts.getValue("href") != null) {
...
                xhtml.startElement(uri, local, "base", atts);
           }
</pre>

and

<pre>
    public void endElement(
            String uri, String local, String name) throws SAXException {
...
            } else if ("BASE".equals(name)) {
                xhtml.endElement("base");
            }
...
</pre>

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to