[ https://issues.apache.org/jira/browse/TIKA-1504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14265864#comment-14265864 ]
Badger commented on TIKA-1504:
------------------------------
Thanks, I'd come to the same conclusion after experimenting with the Enron
data. When parsing each email it's pretty clear that the date sent is the most
significant date, not the file attribute time, and that explained what I was
seeing.
I'd just incorrectly assumed that if no created date could be derived from the
metadata, Tika would fall back to the file time.
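For anyone who does want that fallback, it can be done on the caller's side.
Below is a minimal sketch, not anything Tika provides: DateFallback and
resolveDate are hypothetical names, the metadataDate argument stands in for
whatever metadata.get(TikaCoreProperties.DATE) returned (null when the parser
set nothing), and the sketch assumes the value is an ISO-8601 instant such as
2015-01-06T10:00:00Z. It uses the file's last-modified time as the fallback,
since creation time is not reliably available on every file system.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.attribute.BasicFileAttributes;
import java.time.Instant;
import java.time.format.DateTimeParseException;

// Hypothetical helper, not part of Tika: prefers a date taken from the
// document's metadata and falls back to the file system timestamp.
final class DateFallback {

    private DateFallback() {}

    /**
     * @param metadataDate value read from the parsed metadata, or null if the
     *                     parser did not populate it (assumed ISO-8601 instant)
     * @param file         the file that was parsed
     * @return the metadata date if present and parseable, otherwise the
     *         file's last-modified time
     */
    static Instant resolveDate(String metadataDate, Path file) throws IOException {
        if (metadataDate != null && !metadataDate.trim().isEmpty()) {
            try {
                return Instant.parse(metadataDate.trim());
            } catch (DateTimeParseException e) {
                // Value was not in the assumed format; use the file time instead.
            }
        }
        BasicFileAttributes attrs = Files.readAttributes(file, BasicFileAttributes.class);
        return attrs.lastModifiedTime().toInstant();
    }
}
```

Calling DateFallback.resolveDate(null, path) returns the file's last-modified
time, while passing a parseable date string returns that date unchanged.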
> TikaCoreProperties.DATE not populated for XML files
> ---------------------------------------------------
>
> Key: TIKA-1504
> URL: https://issues.apache.org/jira/browse/TIKA-1504
> Project: Tika
> Issue Type: Bug
> Components: core, parser
> Affects Versions: 1.5, 1.6
> Environment: Windows 7
> Reporter: Badger
>
> Using the default parser configuration, it appears that when an XML file is
> parsed the metadata property for the creation date is not populated. I'm
> using TikaCoreProperties.DATE, which works for other document types but not
> for XML documents.
> This can be confirmed by dropping any XML file into the Tika GUI or through
> code.
> --
> I wasn't sure how to go about reporting this as a bug, so I signed up for a
> JIRA account. Apologies if I was meant to send it to a dev list for triage.