Hi,

[Please avoid cross-posting between user@ and d...@. Responding only on u...@]

On Sat, Oct 16, 2010 at 9:27 PM, qubit <[email protected]> wrote:
> One of the persons on the project has discovered a configuration file that
> he thinks could be tweaked to improve performance of our editor/transcriber.
> But we can't seem to find the documentation for this file.
> Looking through the source tree I discovered a file TikaConfig.java that
> generates a default configuration file.  This implies that the configuration
> file is generated rather than being part of the interface.

We used to have a default tika-config.xml file, but it was replaced by
a mechanism that automatically collects all parsers available on the
classpath.

The TikaConfig class still supports the XML configuration file format,
and you can use it through one of the non-default constructors.
However, the only documentation is the source code of that class.

If you need to tweak the set of parsers used by your application, a
better alternative is probably the new
AutoDetectParser(Parser... parsers) constructor, available in the svn
trunk (and in the upcoming 0.8 release).
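
As a rough sketch of what that could look like (untested against your
setup): the choice of TXTParser and HtmlParser is only an illustration,
the class and method names are made up for this example, and it assumes
the tika-core and tika-parsers jars are on your classpath.

```java
import java.io.InputStream;
import java.nio.file.Files;
import java.nio.file.Paths;

import org.apache.tika.metadata.Metadata;
import org.apache.tika.parser.AutoDetectParser;
import org.apache.tika.parser.ParseContext;
import org.apache.tika.parser.Parser;
import org.apache.tika.parser.html.HtmlParser;
import org.apache.tika.parser.txt.TXTParser;
import org.apache.tika.sax.BodyContentHandler;

// Hypothetical example class; not part of Tika itself.
public class RestrictedParserExample {

    // Build an auto-detecting parser that only delegates to the listed
    // parsers instead of everything found on the classpath.
    // The two parsers chosen here are an assumption for illustration.
    static Parser buildParser() {
        return new AutoDetectParser(new TXTParser(), new HtmlParser());
    }

    // Extract the body text of a document using the restricted parser set.
    static String extractText(InputStream in) throws Exception {
        BodyContentHandler handler = new BodyContentHandler();
        buildParser().parse(in, handler, new Metadata(), new ParseContext());
        return handler.toString();
    }

    public static void main(String[] args) throws Exception {
        if (args.length != 1) {
            System.err.println("usage: RestrictedParserExample <file>");
            return;
        }
        try (InputStream in = Files.newInputStream(Paths.get(args[0]))) {
            System.out.println(extractText(in));
        }
    }
}
```

Replace the two parser instances with whichever parsers your application
actually needs.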

BR,

Jukka Zitting
