Hi, I was looking at http://tika.apache.org/0.8/formats.html and found several issues with it:
- Says that it lists the formats supported by Tika 0.6 instead of 0.8. - Says that it has links to parser class javadocs when it doesn't. - Though the page promises that the parser class java docs have more detailed information about each document format and how it is parsed, the two I looked at, OOXMLParser and OfficeParser, had no details in their javadoc. Paul
