[
https://issues.apache.org/jira/browse/PDFBOX-1823?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13860137#comment-13860137
]
Andreas Lehmkühler commented on PDFBOX-1823:
--------------------------------------------
I'm afraid no one can suggest a workaround for an unknown problem.
Did you check whether the text can be extracted at all? Try saving the text
using Acrobat Reader. If that doesn't work, PDFBox most likely won't be able
to extract the text either.
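
To put a number on "can the text be extracted at all", here is a minimal
sketch in plain Java. It assumes you already have the extracted text as a
String (for example from PDFTextStripper.getText(PDDocument)) and measures
how much of it looks unmapped. The class name ExtractionCheck and the method
suspiciousRatio are hypothetical helpers, not part of PDFBox, and the choice
of what counts as "suspicious" is an assumption for illustration.

```java
// Hypothetical helper, not part of PDFBox: estimates how much of an
// extracted string consists of characters that usually indicate a missing
// or broken font-to-Unicode mapping.
final class ExtractionCheck {

    private ExtractionCheck() {
    }

    // Returns the fraction (0.0 to 1.0) of code points that are the Unicode
    // replacement character, private-use, unassigned, or control characters
    // other than tab, line feed and carriage return.
    static double suspiciousRatio(String text) {
        if (text == null || text.isEmpty()) {
            return 0.0;
        }
        int total = 0;
        int suspicious = 0;
        for (int i = 0; i < text.length(); ) {
            int cp = text.codePointAt(i);
            i += Character.charCount(cp);
            total++;
            if (isSuspicious(cp)) {
                suspicious++;
            }
        }
        return (double) suspicious / total;
    }

    private static boolean isSuspicious(int cp) {
        if (cp == '\t' || cp == '\n' || cp == '\r') {
            return false; // ordinary whitespace in extracted text
        }
        if (cp == 0xFFFD) {
            return true; // Unicode replacement character
        }
        int type = Character.getType(cp);
        return Character.isISOControl(cp)
                || type == Character.PRIVATE_USE
                || type == Character.UNASSIGNED;
    }
}
```

A ratio close to zero suggests the text itself is fine and the boxes come from
whatever displays it. A high ratio suggests the PDF carries no usable Unicode
mapping for that font, in which case copying the text out of Acrobat Reader
would probably fail as well.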
> Apache PDFBox 1.6.0 TextStripper not able to recognise characters having
> "Frutiger LT - 45" fonts
> -------------------------------------------------------------------------------------------------
>
> Key: PDFBOX-1823
> URL: https://issues.apache.org/jira/browse/PDFBOX-1823
> Project: PDFBox
> Issue Type: Bug
> Components: FontBox
> Affects Versions: 1.6.0
> Environment: jdk1.6
> Reporter: Chitrang Natu
> Labels: newbie
> Original Estimate: 504h
> Remaining Estimate: 504h
>
> When I try to extract content from PDFs, I can successfully extract all text
> with the PDFBox API, but I have trouble with text set in 'Frutiger' style
> fonts. For these I get square boxes in place of the characters.
> It seems PDFBox FontBox supports only 14 UTF character sets, and none of them
> is a Frutiger style font.
> Can anybody please suggest something? That would be of great help. I am in
> urgent need of a solution.