[
https://issues.apache.org/jira/browse/TIKA-623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13015046#comment-13015046
]
Tran Nam Quang commented on TIKA-623:
-------------------------------------
Cool! I'll start writing the Tika parser as soon as I can. Could take a couple
of days though.
Richard, I have one question regarding the API: PSTMessage has two methods,
getDescriptorNodeId() and getInternetMessageId(). Both return identifiers,
apparently. My question is: Which one is an unique identifier that will never,
ever change? Cause I wouldn't want the Tika parser to extract identifiers that
are "internal-only" and not unique.
> Add support for Outlook PST
> ---------------------------
>
> Key: TIKA-623
> URL: https://issues.apache.org/jira/browse/TIKA-623
> Project: Tika
> Issue Type: New Feature
> Components: parser
> Reporter: Tran Nam Quang
>
> Hello everyone,
> As you might know, Outlook stores its mails and other stuff in a single PST
> file. There's a relatively new Java library called java-libpst for reading
> Outlook PST files. It is licensed under the LGPL and available over here:
> http://code.google.com/p/java-libpst/
> I have tested the library on Outlook 2000 and Outlook 2003, with good
> results. It would be great if the library could be integrated into Tika.
> Best regards
> Tran Nam Quang
--
This message is automatically generated by JIRA.
For more information on JIRA, see: http://www.atlassian.com/software/jira