[ 
https://issues.apache.org/jira/browse/TIKA-3496?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tim Allison resolved TIKA-3496.
-------------------------------
    Fix Version/s: 2.0.1
       Resolution: Fixed

> Dates should have a timezone?
> -----------------------------
>
>                 Key: TIKA-3496
>                 URL: https://issues.apache.org/jira/browse/TIKA-3496
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Tim Allison
>            Priority: Major
>             Fix For: 2.0.1
>
>
> In working on the Solr pipe emitter, I noticed that some dates as stored in 
> our Metadata do not have a timezone, which causes a problem for Solr.
> I noticed this issue in a JPEG with date: "2011-06-11T09:30:54". 
> In a comment in our JPEG parser, I see:
> {noformat}
> // Unless we have GPS time we don't know the time zone so date must be set
>                 // as ISO 8601 datetime without timezone suffix (no Z or +/-)
> {noformat}
> So, the question is should we try to add a timezone (arbitrarily assign 'Z') 
> in the Solr (and OpenSearch) emitter or should we store the date as if it 
> were Z in the JPEG parser?  
> Or do something else?
> The challenge with doing anything on the emitter side, is that we aren't 
> currently storing the property type in the metadata.  So, at emit time, we 
> only have string keys and string values.  We can't easily guess which fields 
> should be a date in order to reformat for the sake of Solr. We could make a 
> request to Solr/OpenSearch to figure out what the field types are, but that 
> seems really awful...
> Ideas?



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to