[jira] Resolved: (PDFBOX-38) Text extraction fails for PDFs from Apple's quartzfilter

Jukka Zitting (JIRA) Thu, 02 Dec 2010 14:48:33 -0800

     [ 
https://issues.apache.org/jira/browse/PDFBOX-38?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]


Jukka Zitting resolved PDFBOX-38.
---------------------------------

    Resolution: Incomplete

Test documents are not available.

> Text extraction fails for PDFs from Apple's quartzfilter
> --------------------------------------------------------
>
>                 Key: PDFBOX-38
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-38
>             Project: PDFBox
>          Issue Type: Bug
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1117537
> Originally submitted by benlitchfield on 2005-02-06 14:24.
> Text extraction fails on pdf-documents filtered with 
> Apple's quartzfilter (regardless of the filtering done). 
> Only some number of line breaks are extracted from the 
> filtered document. I tried the text extraction with the 
> included utility class as well as my own implementation 
> (following example from the utility class).
> Please find enclosed two documents:
>     loremipsum.pdf - a simple pdf with a page of dummy 
> text, with this the extractioni works OK
>     loremipsum.filtered.pdf - pdf filtered with quarzfilter  
> (with a dummy filter specification not doing anything 
> useful). With this extraction fails.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

[jira] Resolved: (PDFBOX-38) Text extraction fails for PDFs from Apple's quartzfilter

Reply via email to