Hello,
We have a unit test (TIKA-980) where we want to read the attributes of the
body element, just as we read the attributes of all other elements. The body
element, however, always yields zero attributes: the Attributes object passed
to the handler is empty.
public void startElement(String uri, String local, String name,
        Attributes attributes) throws SAXException {
    System.out.print(local + ": " + Integer.toString(attributes.getLength()));
    ..
}
The HTML is very simple: <body itemscope itemtype="http://schema.org/WebPage">
but it always prints "body: 0". I can read attributes for all other elements,
and building the microdata works well, except when I have attributes on the
body element.
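
To help isolate the problem, here is a minimal sketch that runs the same kind of
handler against the JDK's built-in SAX parser instead of Tika. The class name
AttributeCounter, the countAttributes helper and the sample markup are made up
for illustration, and the input is written as well-formed XML
(itemscope="itemscope"), because the plain JDK parser does not accept bare HTML
attributes. If body reports its attributes here, the handler logic is probably
fine and the attributes are more likely being dropped somewhere before the
handler is called.

```java
import java.io.StringReader;
import java.util.LinkedHashMap;
import java.util.Map;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper class, not part of Tika.
public class AttributeCounter {

    // Parses the given well-formed XML and records, per element local name,
    // how many attributes were passed to startElement.
    static Map<String, Integer> countAttributes(String xml) throws Exception {
        Map<String, Integer> counts = new LinkedHashMap<>();
        SAXParserFactory factory = SAXParserFactory.newInstance();
        // Namespace awareness is needed so that the local name is populated.
        factory.setNamespaceAware(true);
        factory.newSAXParser().parse(
                new InputSource(new StringReader(xml)),
                new DefaultHandler() {
                    @Override
                    public void startElement(String uri, String local,
                            String name, Attributes attributes) {
                        counts.put(local, attributes.getLength());
                    }
                });
        return counts;
    }

    public static void main(String[] args) throws Exception {
        // Sample markup (an assumption): XHTML-style equivalent of the body tag.
        String xml = "<html>"
                + "<body itemscope=\"itemscope\" itemtype=\"http://schema.org/WebPage\">"
                + "<p class=\"x\">hi</p>"
                + "</body></html>";
        countAttributes(xml).forEach((element, count) ->
                System.out.println(element + ": " + count));
    }
}
```

With the plain JDK parser this reports two attributes for body, which is the
behaviour we expected to see from the Tika HTML parser as well.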
Any hints to share?
Thanks,
Markus