[ 
https://issues.apache.org/jira/browse/TIKA-3347?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17315582#comment-17315582
 ] 

Tim Allison commented on TIKA-3347:
-----------------------------------

Thank you, [~tilman]!  

Most of the tests now pass.  I started a TIKA-3347 branch.

1) The access checker tests in PDFParserTest don't pass because the parser now 
has problems with the four files (flate compression exception)...the relevant 
tests are ignored (  @Ignore("failing in 3.x")) 

2) The PreflightParser needs help.  I need to figure out what changed.  My 
hasty attempts didn't work...code compiles, but I'm sure I didn't do the right 
thing.

3) The marked content tests aren't working.  My fixes in the 
PDFMarkedContent2XHTML were not correct...I'll look into these.

> Upgrade to PDFBox 3.x when available
> ------------------------------------
>
>                 Key: TIKA-3347
>                 URL: https://issues.apache.org/jira/browse/TIKA-3347
>             Project: Tika
>          Issue Type: Task
>            Reporter: Tim Allison
>            Priority: Major
>
> 3.0.0-RC1 was recently released.  We should integrate it on a dev branch asap 
> so that we can help with regression testing...



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to