Tim Allison created TIKA-4043:
---------------------------------

             Summary: Fix build for variations in tesseract and timezone info 
in RTFs
                 Key: TIKA-4043
                 URL: https://issues.apache.org/jira/browse/TIKA-4043
             Project: Tika
          Issue Type: Task
            Reporter: Tim Allison


From [~grossws]:

> * OCR (tesseract) multipage test is still the same: it extracts "Page?2" 
> instead of "Page 2" on my laptop;
> * RTFParserTest testMetaDataCounts fails because of a different time zone: the 
> RTF format itself stores only local date/time in its metadata, and I fall on a 
> different side of midnight with my local time (known issue, requires some 
> changes in metadata to handle correctly). Building with TZ=UTC works fine.

We should fix these.
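
For the time zone failure, here is a minimal sketch of the effect, not the actual Tika code. It assumes the parser reads a zone-less local timestamp from the RTF metadata and that the build machine's zone is applied before comparing dates in UTC. The class name, method name, and timestamp are hypothetical and chosen only for illustration.

```java
import java.time.LocalDate;
import java.time.LocalDateTime;
import java.time.ZoneId;
import java.time.ZoneOffset;

public class RtfLocalTimeDemo {

    // Interpret a zone-less timestamp in the given zone, then report
    // the calendar date that instant falls on in UTC.
    static LocalDate utcDate(LocalDateTime local, ZoneId zone) {
        return local.atZone(zone)
                    .withZoneSameInstant(ZoneOffset.UTC)
                    .toLocalDate();
    }

    public static void main(String[] args) {
        // Hypothetical creation time as stored in an RTF file (no zone info).
        LocalDateTime stored = LocalDateTime.of(2010, 10, 13, 23, 30);

        // Same stored value, different build machines.
        System.out.println(utcDate(stored, ZoneOffset.UTC));         // 2010-10-13
        System.out.println(utcDate(stored, ZoneOffset.ofHours(3)));  // 2010-10-13
        System.out.println(utcDate(stored, ZoneOffset.ofHours(-5))); // 2010-10-14
    }
}
```

A test that asserts a fixed date therefore passes or fails depending on which side of midnight the local zone lands on, which matches the report that building with TZ=UTC works.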



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
