Paul Woods created TIKA-2898:
--------------------------------
Summary: wrong email send date being set in OutlookPSTParser
Key: TIKA-2898
URL: https://issues.apache.org/jira/browse/TIKA-2898
Project: Tika
Issue Type: Bug
Components: handler
Affects Versions: 1.21
Reporter: Paul Woods
Attachments: albert_meyers_000_1_1.pst
When parsing a PST file from the Enron data set, the OutlookPSTParser reports
an email created date of May 2009 for every email. Looking at the underlying
PSTMessage coming up from libpst, I can see that the createdDate field does
hold that value. I'm not sure where that date comes from, but the Enron data
set was released in 2002, so it cannot be the date the emails were sent.
The PST message does have the correct date value in its clientSubmitTime
field. However, getClientSubmitTime() is not used when setting metadata in
the OutlookPSTParser.
Can the result of this call be added to the metadata?
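
A rough sketch of what I have in mind, not a patch against the actual parser.
MessageDates is a hypothetical stand-in for PSTMessage, getCreatedDate() is a
hypothetical getter named after the createdDate field above, and the
"created" / "sent" keys are placeholders for whatever Tika metadata
properties are appropriate. Only getClientSubmitTime() is taken from the real
message class.

```java
import java.util.Date;
import java.util.LinkedHashMap;
import java.util.Map;

public class SubmitTimeSketch {

    // Hypothetical stand-in for the two date values on the PST message.
    public interface MessageDates {
        Date getCreatedDate();       // assumed getter for the createdDate field
        Date getClientSubmitTime();  // the call this issue asks to expose
    }

    // Placeholder keys; the real parser would use Tika metadata properties.
    public static final String CREATED = "created";
    public static final String SENT = "sent";

    // Copy both dates into the metadata, skipping any that are missing.
    public static Map<String, Date> extractDates(MessageDates msg) {
        Map<String, Date> metadata = new LinkedHashMap<>();
        Date created = msg.getCreatedDate();
        if (created != null) {
            metadata.put(CREATED, created);
        }
        Date submitted = msg.getClientSubmitTime();
        if (submitted != null) {
            metadata.put(SENT, submitted);
        }
        return metadata;
    }

    public static void main(String[] args) {
        // Two arbitrary, distinct timestamps standing in for the wrong
        // created date and the correct submit time.
        final Date created = new Date(2_000_000L);
        final Date submitted = new Date(1_000_000L);
        MessageDates msg = new MessageDates() {
            public Date getCreatedDate() { return created; }
            public Date getClientSubmitTime() { return submitted; }
        };
        Map<String, Date> metadata = extractDates(msg);
        System.out.println(metadata.keySet());
    }
}
```

Keeping the created date and adding the submit time under a separate key
would let callers choose which one to trust, rather than silently replacing
one with the other.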
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)