I have raised NIFI-2380 to track this improvement. While raising the ticket I was wondering:
are you happy to give the use the option to chose if to extract the winmail.dat or not? I mean something like: - PROPERTY: "Extract Attachments within a TNEF (i.e. winmail.data): true / false If yes, then every time a decoding occur we test the name (or something better in case it is possible) and then extract it. An attachment created by a TNEF file would have an attribute email.attachment.tnefdecoded (or whatever name we decide) set to yes. If no, processing continues as it is today (i.e. purely based on Apache Commons MimeMessageParser). Another possible solution would be an additional processor but IMNSHO this would be overkill and counter productive. Ken to hear your thoughts On Sun, Jul 17, 2016 at 4:46 PM, Andre <[email protected]> wrote: > Dan, > > Ingesting Microsoft Journals seem like a great suggestion for a new > processor ( ParseExchangeJounal ?). > > Regarding TNEF: As far as I know, Apache Commons - Mail does not pase > "winmail.dat" > type attachments. As far as I understand the only ASL compatible > implementation of a TNEF extractor is Apache's POI and even that > implementation is not part of POI's main release. > > If TNEF support is required we will ether have to code from scratch or > perhaps use https://github.com/koodaamo/tnefparse together with > ExecuteScript (although since tnefparse is LGPL, this solution cannot be > packaged as part of NiFi). > > Cheers > > On Sun, Jul 17, 2016 at 10:53 AM, djmdata <[email protected]> wrote: > >> What is the JIRA #? >> >> I have a production system that reads email from a custom SMTP listener >> and >> places the SMTP payload into Kafka. A Storm topology reads messages from >> Kafka and parses the emails (Java code using JavaMail API) into useful >> info >> (subject, text, attachments, body, etc...). >> >> I'm looking at plugging NiFi into this to replace the custom SMTP >> listener. >> If you had a processor that could act as a reliable (we can't lose emails) >> and performant SMTP listener alternative we would use it. >> >> Your "email parser processor" is an interesting idea - but beware of the >> mess you'll find in the wild with email. In our case, we try to parse >> Exchange (full of non-standard wonders like "TNEF" attachments") as well >> as >> email from virtually anywhere (GMail, Yahoo, Joe's email client...). If >> you >> can crack that you'll be on to something. We have even more complexity in >> that we read "Microsoft Journals" which wrap the standard SMTP layout in a >> Microsoft layer (you'll see this at large Exchange shops doing this kind >> of >> thing for use cases like compliance). >> >> >> >> -- >> View this message in context: >> http://apache-nifi-developer-list.39713.n7.nabble.com/ListenSMTP-processor-tp10510p12827.html >> Sent from the Apache NiFi Developer List mailing list archive at >> Nabble.com. >> > >
