Hi,

On Thu, Feb 5, 2009 at 7:10 AM, Jana, Kumar Raja <kj...@ptc.com> wrote:
> I see 50 copies of the content in the extracted text output.

OK. This is probably some issue with the Outlook parser from POI or with
the way we use it in Tika.

> I have attached a sample Outlook (msg) file to this mail (which happens
> to be a mail from you to the dev group). Hope it helps.

Unfortunately the mailing list filters seem to have stripped the
attachment. Can you file a bug report about this in Jira and attach the
example mail there?

BR,

Jukka Zitting