Hi,

Thanks for the reply. As you suggested, I have filed an improvement request describing this issue (TIKA-803).

Thanks and Regards,
Swapna.

-----Original Message-----
From: Jukka Zitting [mailto:[email protected]]
Sent: Wednesday, December 07, 2011 9:04 PM
To: [email protected]
Subject: Re: Body of Outlook msg files

Hi,

On Wed, Dec 7, 2011 at 10:28 AM, Swapna Vuppala <[email protected]> wrote:
> Am interested in knowing where the body of the message goes to.

Currently the Outlook parser doesn't mark the message body in any special way, so there's no easy way to achieve your use case.

The best way forward on this would be to file an improvement request [1] to make the Outlook parsing result mark the message body with something like <div class="message-body">...</div>. We might also want to reconsider the decision to put the message subject and other header fields in the XHTML body, or at least make that behavior configurable.

[1] https://issues.apache.org/jira/browse/TIKA

BR,

Jukka Zitting
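
For illustration only, here is a rough sketch of how a client could pick out the message body if the parser output carried the marker suggested above. It assumes the hypothetical <div class="message-body"> wrapper, which the Outlook parser does not emit as of this thread, and the class name MessageBodyExtractor is made up for the example. It uses only the JDK's SAX classes and collects the text found inside the marked div.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: collects the text inside <div class="message-body">.
class MessageBodyExtractor extends DefaultHandler {
    // Assumed marker class, taken from the suggestion in this thread.
    private static final String MARKER_CLASS = "message-body";

    private final StringBuilder body = new StringBuilder();
    // Nesting depth inside the marked div; 0 means we are outside it.
    private int depth = 0;

    @Override
    public void startElement(String uri, String localName, String qName, Attributes atts) {
        // localName is empty when the parser is not namespace-aware.
        String name = localName.isEmpty() ? qName : localName;
        if (depth > 0) {
            depth++; // nested element inside the message body
        } else if ("div".equals(name) && MARKER_CLASS.equals(atts.getValue("class"))) {
            depth = 1; // entering the marked div
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if (depth > 0) {
            depth--; // reaches 0 when the marked div closes
        }
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        if (depth > 0) {
            body.append(ch, start, length);
        }
    }

    String getBody() {
        return body.toString().trim();
    }

    // Convenience method for trying the handler on an XHTML string.
    static String extract(String xhtml) throws Exception {
        MessageBodyExtractor handler = new MessageBodyExtractor();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new InputSource(new StringReader(xhtml)), handler);
        return handler.getBody();
    }

    public static void main(String[] args) throws Exception {
        String xhtml = "<html><body><h1>Subject</h1>"
                + "<div class=\"message-body\"><p>Hello <b>world</b></p></div>"
                + "</body></html>";
        System.out.println(extract(xhtml));
    }
}
```

Since the class is a plain SAX ContentHandler, the same handler could presumably be passed to Tika's Parser.parse(InputStream, ContentHandler, Metadata, ParseContext) in place of parsing a string, once the marker is available.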
