[
https://issues.apache.org/jira/browse/TIKA-3815?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17566129#comment-17566129
]
Tilman Hausherr commented on TIKA-3815:
---------------------------------------
The build fails for me (in Germany):
{noformat}
[ERROR] Tests run: 23, Failures: 1, Errors: 0, Skipped: 0, Time elapsed: 2.131
s <<< FAILURE! - in org.apache.tika.parser.mail.RFC822ParserTest
[ERROR] org.apache.tika.parser.mail.RFC822ParserTest.testDates Time elapsed:
0.042 s <<< FAILURE!
org.opentest4j.AssertionFailedError: failed to match: Sun, 15 May 2016 01:32:00
UTC ==> expected: <2016-05-15T01:32:00Z> but was: <2016-05-15T03:32:00Z>
at org.junit.jupiter.api.AssertionUtils.fail(AssertionUtils.java:55)
at
org.junit.jupiter.api.AssertionUtils.failNotEqual(AssertionUtils.java:62)
at
org.junit.jupiter.api.AssertEquals.assertEquals(AssertEquals.java:182)
at org.junit.jupiter.api.Assertions.assertEquals(Assertions.java:1152)
at
org.apache.tika.parser.mail.RFC822ParserTest.testDate(RFC822ParserTest.java:428)
at
org.apache.tika.parser.mail.RFC822ParserTest.testDates(RFC822ParserTest.java:390)
{noformat}
> Inconsistent Date/Time information extracted from Exif data
> -----------------------------------------------------------
>
> Key: TIKA-3815
> URL: https://issues.apache.org/jira/browse/TIKA-3815
> Project: Tika
> Issue Type: Bug
> Components: parser
> Affects Versions: 2.4.1, 1.28.4
> Reporter: Luís Filipe Nassif
> Assignee: Luís Filipe Nassif
> Priority: Major
> Fix For: 2.4.2
>
> Attachments: IMG_20220616_111848_HDR.jpg
>
>
> Running tika-app-2.4.1.jar on the attached image, these metadata is returned:
> Exif IFD0:Date/Time: 2022:06:16 11:18:49
> Exif SubIFD:Date/Time Digitized: 2022:06:16 11:18:49
> Exif SubIFD:Date/Time Original: 2022:06:16 11:18:49
> Exif SubIFD:Time Zone: -03:00
> Exif SubIFD:Time Zone Digitized: -03:00
> Exif SubIFD:Time Zone Original: -03:00
> File Modified Date: Thu Jun 16 11:18:50 -03:00 2022
> GPS:GPS Date Stamp: 2022:06:16
> GPS:GPS Time-Stamp: 14:18:47.000 UTC
> dcterms:created: 2022-06-16T08:18:49
> dcterms:modified: 2022-06-16T08:18:49
> exif:DateTimeOriginal: 2022-06-16T08:18:49
>
> The right value is 2022-06-16T14:18:49Z. Although there is no timezone
> specified for some values, I think it makes no sense converting them to
> timezones different than GMT, the one used to take the picture (-03:00) or
> the one used to run the application (-03:00), so Tika could be making an
> incorrect timezone conversion on the last 3 fields.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)