[ 
https://issues.apache.org/jira/browse/PDFBOX-277?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Jukka Zitting resolved PDFBOX-277.
----------------------------------

    Resolution: Not A Problem

You can't extract text from a page that just contains an image.

> Have fix for a bug with images
> ------------------------------
>
>                 Key: PDFBOX-277
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-277
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Parsing
>            Priority: Minor
>
> [imported from SourceForge]
> http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1723353
> Originally submitted by nobody on 2007-05-22 04:06.
> Hello there.
> I was using PDFBox to extract text from documents. A few days ago I had an 
> issue with one of them: PDFTextStripper  was unable to parse some pages, 
> there were simply empty. After a quick analysis I found that those pages were 
> with embedded images. Then I have downloaded latest sources and started 
> debugging the PDFStreamParser. 
> The issue was in a specific PDF format, the image data look like this 
> ID xxxxxxxxxxxxxxxxxEI Q.
> It seems that you've got a lot of troubles with parsing an EI operator. I 
> have fixed this and wanted to give you a solution. Please let me know if you 
> are interesting.
> ---
> Thanks,
> Andrew.
> [comment on SourceForge]
> Originally sent by benlitchfield.
> Logged In: YES 
> user_id=601708
> Originator: NO
> kubovich,
> Yes I am interested in the patch, please send it along, I'll email you as 
> well.
> Ben
> [comment on SourceForge]
> Originally sent by skyf.
> Logged In: YES 
> user_id=1815045
> Originator: NO
> I have this problem too.
> please send the solution to me,thanks.
> my email: [email protected] 
> [comment on SourceForge]
> Originally sent by nobody.
> Logged In: NO 
> Forgot to provide my email: kubovich @ inbox.ru

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to