Hi,

Thanks for the reply. As you suggested, I have filed an improvement
request for this issue (TIKA-803).

Thanks and Regards,
Swapna.

-----Original Message-----
From: Jukka Zitting [mailto:[email protected]] 
Sent: Wednesday, December 07, 2011 9:04 PM
To: [email protected]
Subject: Re: Body of Outlook msg files

Hi,

On Wed, Dec 7, 2011 at 10:28 AM, Swapna Vuppala <[email protected]> wrote:
> I am interested in knowing where the body of the message goes.

Currently the Outlook parser doesn't mark the message body in any
special way, so there's no easy way to achieve your use case.

The best way forward on this would be to file an improvement request
[1] to make the Outlook parsing result mark the message body with
something like <div class="message-body">...</div>. We might also want
to reconsider the decision to put the message subject and other header
fields in the XHTML body, or at least make that behavior configurable.

[1] https://issues.apache.org/jira/browse/TIKA
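
To illustrate how a client could use such a marker, here is a rough
sketch of a SAX handler that keeps only the text inside
<div class="message-body">. It assumes the proposed markup exists, which
the Outlook parser does not produce today. The class names
MessageBodyExtractor and BodyHandler are made up for this example, and
the sample XHTML is hand-written rather than real Tika output.

```java
import java.io.StringReader;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.InputSource;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical helper: not part of Tika.
public class MessageBodyExtractor {

    /** Collects character data found inside <div class="message-body">. */
    static class BodyHandler extends DefaultHandler {
        private final StringBuilder text = new StringBuilder();
        private int depth = 0; // > 0 while inside the marked div

        @Override
        public void startElement(String uri, String localName,
                                 String qName, Attributes atts) {
            if (depth > 0) {
                depth++; // nested element inside the message body
            } else if ("div".equals(qName)
                    && "message-body".equals(atts.getValue("class"))) {
                depth = 1; // assumed marker, as proposed in this thread
            }
        }

        @Override
        public void endElement(String uri, String localName, String qName) {
            if (depth > 0) {
                depth--; // reaches 0 when the marked div closes
            }
        }

        @Override
        public void characters(char[] ch, int start, int length) {
            if (depth > 0) {
                text.append(ch, start, length);
            }
        }

        String text() {
            return text.toString().trim();
        }
    }

    /** Parses well-formed XHTML and returns the text of the marked div. */
    static String extract(String xhtml) {
        try {
            BodyHandler handler = new BodyHandler();
            SAXParserFactory.newInstance().newSAXParser()
                    .parse(new InputSource(new StringReader(xhtml)), handler);
            return handler.text();
        } catch (Exception e) {
            throw new IllegalStateException("Could not parse XHTML", e);
        }
    }

    public static void main(String[] args) {
        // Hand-written sample, not actual parser output.
        String xhtml = "<html><body><h1>Subject line</h1>"
                + "<div class=\"message-body\"><p>Hello <b>world</b></p></div>"
                + "<p>footer</p></body></html>";
        System.out.println(extract(xhtml)); // prints "Hello world"
    }
}
```

A handler like this could also be passed directly to a parser's
parse(...) call instead of going through a string, so the header fields
in the XHTML body would simply be skipped.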

BR,

Jukka Zitting
