Tim Allison created TIKA-4043:
---------------------------------
Summary: Fix build for variations in tesseract and timezone info
in RTFs
Key: TIKA-4043
URL: https://issues.apache.org/jira/browse/TIKA-4043
Project: Tika
Issue Type: Task
Reporter: Tim Allison
From [~grossws]:
> * OCR (tesseract) multipage test still fails the same way: it extracts "Page?2"
> instead of "Page 2" on my laptop;
> * RTFParserTest testMetaDataCounts fails in a different time zone, since the
> RTF format itself stores only local date/time in its metadata and I land on a
> different side of midnight with my local time (known issue, requires some
> changes in metadata to handle correctly). Building with TZ=UTC works fine.
We should fix these.
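
To illustrate the RTF time zone failure described above, here is a minimal sketch using only java.time. It is not Tika's parser code: the class name RtfLocalTimeDemo, the helper toUtc, and the sample timestamp are hypothetical, and it assumes the zone-less RTF timestamp is interpreted in the build machine's default zone before being normalized to UTC.

```java
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.ZoneOffset;
import java.time.ZonedDateTime;

public class RtfLocalTimeDemo {

    // Interpret a zone-less timestamp in the given zone, then express
    // the same instant in UTC. (Hypothetical helper, not a Tika API.)
    static ZonedDateTime toUtc(LocalDateTime local, ZoneId zone) {
        return local.atZone(zone).withZoneSameInstant(ZoneOffset.UTC);
    }

    public static void main(String[] args) {
        // Hypothetical creation time, stored without any zone information.
        LocalDateTime local = LocalDateTime.of(2023, 5, 10, 1, 30);

        // Build machine running with TZ=UTC: the date is unchanged.
        System.out.println(toUtc(local, ZoneOffset.UTC).toLocalDate());
        // prints 2023-05-10

        // Build machine at UTC+3: the same local time lands on the
        // previous day once converted to UTC.
        System.out.println(toUtc(local, ZoneOffset.ofHours(3)).toLocalDate());
        // prints 2023-05-09
    }
}
```

A test that asserts on the resulting date therefore passes or fails depending on the default zone. Pinning the zone in the test, or comparing the local date/time without a zone conversion, would be two possible ways to make it deterministic.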