tika-user  

Re: Detecting rfc822 (email) messages

Jukka Zitting
Wed, 10 Feb 2010 06:51:13 -0800

Hi,

2010/2/9 François Cassistat <f...@maya-systems.com>:
> I was thinking to hack my application to add a check when I receive the
> mimetype plain/text file from Tika, but maybe I should try to write my own
> parser (this could be great to add support for indexing emails attachments).
> Any pointers?

See the o.a.tika.parser.mbox.MboxParser class for our current email
parsing functionality, and http://james.apache.org/mime4j/ for a more
complete email parser library. It would be great if we could integrate
mime4j to Tika!

BR,

Jukka Zitting