>I am just a bit worried that the one abstraction to rule them all may preclude 
>me from easily handling more esoteric parts of some document formats.

If you find that you have needs that are too use-case-specific to fold into 
Tika, you can easily create your own parser/modification of one of our parsers 
within the Tika framework. [1]  As someone who learned the hard way from doing 
my own custom content extraction work before drinking the Tika koolaid, I 
highly encourage going with the one abstraction until it fails you and then 
either modifying Tika through patches/issues or creating your own custom 
parsers and using those within the Tika framework.

>I presume that the best way to request enhancements is to create a JIRA entry 
>so it can be tracked?

Y, please.

Cheers,

         Tim

[1] https://tika.apache.org/1.13/parser_guide.html

Reply via email to