On Tue, 11 Mar 2014, Mirko Sertic wrote:
a) Is the Tika class stateful? Do i have to create a new instance for every document, or can i reuse it?
You can share one Tika object or TikaConfig object throughout your application
b) Are the parsers stateful? Do i have to create a new parser for every document, or can i reuse it?
You can re-use the same parser multiple times, from multiple threads
c) How can i tune the org.openxmlformat.schema classes?
These are auto-generated by Apache XMLBeans from the Microsoft provided schemas. You'd need to help the XMLBeans project if you wanted to speed these up
Nick
