[ 
https://issues.apache.org/jira/browse/PDFBOX-1919?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14029118#comment-14029118
 ] 

Tilman Hausherr commented on PDFBOX-1919:
-----------------------------------------

The weird thing is that I get this with Adobe Reader (latest version, on 
Windows), and it is what [~Co42] wants:
{quote}
Report of the
IN NORTHE RN IRELAND
www.cih.org/northernireland/housingcommission
independent
Commission on the
Future for Housing
{quote}

Please understand that rendering has nothing to do with text extraction. A 
font (very simplified explanation!) is 1. a map from a code to a glyph (= an 
image), and 2. a map from a code to a character value. The PDF file contains 
only the codes and the positions where they are to be displayed. IMHO the 
second map is the one that is incorrect here.
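
To illustrate the two maps, here is a minimal Java sketch. It does not use 
PDFBox at all; the class name, the method names, the codes and the map 
contents are all made up for the illustration. The point is only that a page 
can look right on screen (first map) while the extracted text is wrong 
(second map).

```java
import java.util.Map;

public class FontMapsSketch {
    // Map 1 (hypothetical data): code -> glyph. A real glyph is an outline or
    // image; a string stands in for "what gets drawn" here.
    static final Map<Integer, String> CODE_TO_GLYPH = Map.of(1, "H", 2, "i", 3, "!");

    // Map 2 (hypothetical data): code -> character value. The entry for
    // code 2 is deliberately wrong to mimic a font with a bad mapping.
    static final Map<Integer, String> CODE_TO_TEXT = Map.of(1, "H", 2, "1", 3, "!");

    // Rendering looks only at the first map.
    public static String render(int[] codes) {
        StringBuilder drawn = new StringBuilder();
        for (int code : codes) {
            drawn.append(CODE_TO_GLYPH.getOrDefault(code, "?"));
        }
        return drawn.toString();
    }

    // Text extraction looks only at the second map.
    public static String extractText(int[] codes) {
        StringBuilder text = new StringBuilder();
        for (int code : codes) {
            text.append(CODE_TO_TEXT.getOrDefault(code, "?"));
        }
        return text.toString();
    }

    public static void main(String[] args) {
        // The page content stores only codes (plus positions, omitted here).
        int[] pageCodes = {1, 2, 3};
        System.out.println("rendered:  " + render(pageCodes));      // rendered:  Hi!
        System.out.println("extracted: " + extractText(pageCodes)); // extracted: H1!
    }
}
```

In this sketch the same codes give "Hi!" on screen but "H1!" on extraction, 
and nothing in the rendering path can detect or repair that.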

> Font descriptor flags are not implemented
> -----------------------------------------
>
>                 Key: PDFBOX-1919
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1919
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Text extraction
>    Affects Versions: 1.8.5, 1.8.6, 2.0.0
>            Reporter: Corentin Regal
>         Attachments: PDFBOX-1919.AdobeReader.txt, PDFBOX-1919.pdf, 
> PDFBOX-1919.txt
>
>
> The font descriptor flags are not set.
> They are described in "PDF Reference 1.7", section 5.7.1, Font 
> Descriptor Flags.
> The methods in PDFontDescriptor are ready but never called:
> setFlags()
> setSerif()
> setAllCap(), which is used in a lot of PDFs
> ...
> I saw some TODOs in the code that relate to this issue. Is it planned to be 
> implemented soon?



--
This message was sent by Atlassian JIRA
(v6.2#6252)
