tika-user  

Re: Detecting rfc822 (email) messages

Nick Burch
Wed, 10 Feb 2010 04:04:01 -0800

On Tue, 9 Feb 2010, François Cassistat wrote:
I've got to index and sort files where many are in .eml format (message/rfc822). Apache Tika detects them as plain/text.

Currently it looks like tika only handles such files with a .mbox extension, not .eml. Did you try editing tika-mimetypes.xml and adding in .eml too?

Nick