On Tue, 11 Mar 2014, Mirko Sertic wrote:
a) Is the Tika class stateful? Do i have to create a new instance for every document, or can i reuse it?

You can share one Tika object or TikaConfig object throughout your application

b) Are the parsers stateful? Do i have to create a new parser for every document, or can i reuse it?

You can re-use the same parser multiple times, from multiple threads

c) How can i tune the org.openxmlformat.schema classes?

These are auto-generated by Apache XMLBeans from the Microsoft provided schemas. You'd need to help the XMLBeans project if you wanted to speed these up

Nick

Reply via email to