> Can you try with a recent Tika nightly build? e.g. https://builds.apache.org/job/Tika-trunk/lastBuild/org.apache.tika$tika-app/
-----Original Message----- From: Nick Burch [mailto:[email protected]] Sent: Monday, July 25, 2016 3:03 PM To: [email protected] Subject: Re: Problem with detection of .mbox file On Mon, 25 Jul 2016, Vjeran Marcinko wrote: > I fist noticed that my .mbox file doesn't get parsed by MBoxParser, > and later, after debugging Tika source code, I found what the problem > is - default detector doesn't even recognize it as "applciation/mbox" > MIME type, and although file extension is .mbox, it ignores this hint > because its "magic" way of detecting file type based on some amount of > initial bytes detects it is "text/html" Can you try with a recent Tika nightly build? Only there have been some tweaks done around that sort of thing recently If a nightly build / build from Git still shows the issue, please open a bug in Jira and attach a problematic file, then we can take a look! Nick
