[ 
https://issues.apache.org/jira/browse/TIKA-1504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14265864#comment-14265864
 ] 

Badger commented on TIKA-1504:
------------------------------

Thanks, I'd come to the same conclusion after experimenting with the Enron 
data. When parsing each email it's pretty clear that the sent date is the most 
significant one, not the file attribute time, and what was happening all made 
sense. 

I'd just incorrectly assumed that if there were no metadata-derived creation 
date, it would fall back to the file time. 
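
For anyone who does want that fallback, it has to be done in the calling code. 
Below is a minimal sketch using only the JDK, not Tika's own behaviour or API. 
The class DateFallback and the method resolveDate are hypothetical names, and 
the metadataDate argument is assumed to be whatever date string the caller 
pulled from the parsed metadata (ISO-8601 instant form), or null if the parser 
supplied none. 

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.attribute.BasicFileAttributes;
import java.time.Instant;

public class DateFallback {

    /**
     * Hypothetical helper: prefer the date taken from document metadata and
     * fall back to the file system creation time only when none is present.
     *
     * @param metadataDate ISO-8601 instant string from the metadata, or null
     * @param file         the file that was parsed
     */
    static Instant resolveDate(String metadataDate, Path file) throws IOException {
        if (metadataDate != null && !metadataDate.isEmpty()) {
            // A metadata-derived date exists, so it takes priority.
            return Instant.parse(metadataDate);
        }
        // No metadata date: read the creation time from the file attributes.
        BasicFileAttributes attrs = Files.readAttributes(file, BasicFileAttributes.class);
        return attrs.creationTime().toInstant();
    }

    public static void main(String[] args) throws IOException {
        Path tmp = Files.createTempFile("date-fallback", ".xml");
        try {
            // Metadata date present: returned as parsed.
            System.out.println(resolveDate("2015-01-06T10:15:30Z", tmp));
            // Metadata date missing: file creation time is used instead.
            System.out.println(resolveDate(null, tmp));
        } finally {
            Files.delete(tmp);
        }
    }
}
```

Note that on some file systems the reported creation time is really the last 
modified time, so the fallback value is only a rough stand-in. 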

> TikaCoreProperties.DATE not populated for XML files
> ---------------------------------------------------
>
>                 Key: TIKA-1504
>                 URL: https://issues.apache.org/jira/browse/TIKA-1504
>             Project: Tika
>          Issue Type: Bug
>          Components: core, parser
>    Affects Versions: 1.5, 1.6
>         Environment: Windows 7
>            Reporter: Badger
>
> Using the default parser configuration, it appears that when an XML file is 
> parsed the metadata property for the creation date is not populated. I'm 
> using TikaCoreProperties.DATE, which works for other document types but not 
> for XML documents.
> This can be confirmed by dropping any XML file into the Tika GUI or through 
> code. 
> -- 
> I wasn't sure how to go about reporting this as a bug, so I signed up for a 
> JIRA account. Apologies if I was meant to send it to a dev list for triage. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
