On Wed, 6 Apr 2011, Markus Jelsma wrote:
> However, removing all types other than text/plain from tika-mimetypes might do the trick. Will Tika then fall back to that type even if it doesn't initially detect the file as such?
Tika will fall back to application/octet-stream if it doesn't know what a file is.
In the case of something like text/csv, which extends text/plain: if you remove the entry for text/csv, Tika should fall back on text/plain (except if the match was only occurring on the latter).
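
To make the fallback idea concrete, here is a rough toy model in Python. It is not Tika's code or API: the registry, the two matching rules and the detect() function are all made up for illustration, and real Tika detection (magic bytes, globs, the text detector) is more involved. It only sketches the idea that removing a subtype's entry leaves the parent type as the best remaining match, with application/octet-stream as the last resort.

```python
# Toy model only: every name here is hypothetical and is not Tika's API.
OCTET_STREAM = "application/octet-stream"


def looks_like_text(name, data):
    """Content rule (assumed): no NUL bytes and the data decodes as UTF-8."""
    if b"\x00" in data:
        return False
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def has_csv_extension(name, data):
    """Name rule (assumed): the file name ends in .csv."""
    return name.lower().endswith(".csv")


# type name -> (parent type or None, matching rule)
REGISTRY = {
    "text/plain": (None, looks_like_text),
    "text/csv": ("text/plain", has_csv_extension),
}


def depth(registry, type_name):
    """Count the ancestors of type_name that are still in the registry."""
    d = 0
    parent = registry[type_name][0]
    while parent in registry:
        d += 1
        parent = registry[parent][0]
    return d


def detect(registry, name, data):
    """Return the most specific matching type, else the generic fallback."""
    matches = [t for t, (_, rule) in registry.items() if rule(name, data)]
    if not matches:
        return OCTET_STREAM
    return max(matches, key=lambda t: depth(registry, t))


# Same registry with the text/csv entry removed.
without_csv = {t: v for t, v in REGISTRY.items() if t != "text/csv"}

print(detect(REGISTRY, "data.csv", b"a,b\n1,2\n"))       # text/csv
print(detect(without_csv, "data.csv", b"a,b\n1,2\n"))    # text/plain
print(detect(without_csv, "blob.bin", b"\x00\xff\x10"))  # application/octet-stream
```

In this toy, a file that matched only through the removed text/csv rule (say, binary data named x.csv) has nothing left to match once the entry is gone, so it ends up as application/octet-stream rather than text/plain.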
Nick
