[
https://issues.apache.org/jira/browse/TIKA-3260?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17258556#comment-17258556
]
Tim Allison commented on TIKA-3260:
-----------------------------------
I'm attaching an example file shared by Peter Kronenberg on the user list. If
we include our own {{rms_flat}} and change {{python}} to {{python3}}, I can
trace through the debugger success...
This is the text I get:
{noformat}
[his 1s an example of a non-text-searchable PDF. Because it was created from
an image rather than a text document, it cannot be rendered as plain text by the
PDF reader. Thus, attempting to select the text on the page as though it were a
text document or website will not work, regardless of how neatly it 1s
organized.
{noformat}
> Update rotation.py to work with python3 and a more modern matplotlib
> --------------------------------------------------------------------
>
> Key: TIKA-3260
> URL: https://issues.apache.org/jira/browse/TIKA-3260
> Project: Tika
> Issue Type: Improvement
> Reporter: Tim Allison
> Priority: Major
> Attachments: apache-tika-8408777197187584954.png,
> skewed5_image_text.png
>
>
> When I tried to work with rotation.py, I found that we should allow python to
> be python3 (not require an alias), and I found that rms_flat (once
> deprecated) has actually been removed in recent versions of matplotlib.
--
This message was sent by Atlassian Jira
(v8.3.4#803005)