Hi, I just started playing with Tika and I have a few questions I'm trying to detect the mimetype of a file using both
Tika.detect(InputStream) and Tika.detect(File) I get 2 different results. I'm testing with a Microsoft Word (.doc) file. As a stream, I get application/x-tika-msoffice. As a file I get application/msword Why are they different? I was also wondering why the mimetype is not returned in the metadata when parsing a file Thank you Peter
