[ 
https://issues.apache.org/jira/browse/PDFBOX-4116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16369555#comment-16369555
 ] 

Tilman Hausherr commented on PDFBOX-4116:
-----------------------------------------

"embedded" is the correct word. The embedded subsetted fonts shouldn't be 
reused for the two reasons I mentioned: (1) the Unicode mapping is sometimes 
missing, and (2) glyphs are missing. (1) is what you ran into; (2) follows from 
subsetting: a subset only contains the glyphs the document uses, so you 
may have "a", "b" and "d" but not "c".

"This may or may not work" is because the "raw" command parameter is just 
the end of a chain of calculations. Sometimes the code there is identical to 
the Unicode value, but often it is not.

In other words: you'll often have a very bad day reusing subsetted fonts. 
Better get the original font.
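
To make the two failure modes concrete, here is a minimal sketch of the chain 
"character code -> glyph name -> Unicode" for a subsetted font. It is a toy 
model, not PDFBox code: the class SubsetFontSketch and its methods are 
hypothetical, and only the "code 33 maps to minus without Unicode" entry is 
taken from the report. The other entries are made up for illustration.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

/** Toy model (hypothetical, not PDFBox): code -> glyph name -> optional Unicode. */
public class SubsetFontSketch {
    // Encoding of the subset: character code -> glyph name.
    private final Map<Integer, String> codeToGlyph = new LinkedHashMap<>();
    // Glyph name -> Unicode. Entries may be missing (failure mode 1).
    private final Map<String, Integer> glyphToUnicode = new LinkedHashMap<>();

    public void addGlyph(int code, String glyphName, Integer unicode) {
        codeToGlyph.put(code, glyphName);
        if (unicode != null) {
            glyphToUnicode.put(glyphName, unicode);
        }
    }

    /** Extraction direction: code -> Unicode, empty if the chain is broken. */
    public Optional<Integer> toUnicode(int code) {
        String glyph = codeToGlyph.get(code);
        if (glyph == null) {
            return Optional.empty();
        }
        return Optional.ofNullable(glyphToUnicode.get(glyph));
    }

    /** Creation direction: Unicode -> code, fails if the subset cannot express it. */
    public int encode(int unicode) {
        for (Map.Entry<Integer, String> e : codeToGlyph.entrySet()) {
            Integer u = glyphToUnicode.get(e.getValue());
            if (u != null && u == unicode) {
                return e.getKey();
            }
        }
        throw new IllegalArgumentException(
                String.format("U+%04X is not available in this font", unicode));
    }

    /** Builds a subset resembling the one in the report (partly assumed). */
    public static SubsetFontSketch reportLikeSubset() {
        SubsetFontSketch font = new SubsetFontSketch();
        font.addGlyph(33, "minus", null); // from the report: no Unicode value
        font.addGlyph(97, "a", 0x61);     // assumed entries for illustration
        font.addGlyph(98, "b", 0x62);
        font.addGlyph(100, "d", 0x64);    // "c" was never put into the subset
        return font;
    }

    public static void main(String[] args) {
        SubsetFontSketch font = reportLikeSubset();
        System.out.println("code 33 has Unicode: " + font.toUnicode(33).isPresent());
        System.out.println("code for 'a': " + font.encode('a'));
        for (char ch : new char[] {'!', 'c'}) {
            try {
                font.encode(ch);
            } catch (IllegalArgumentException ex) {
                System.out.println(ex.getMessage());
            }
        }
    }
}
```

In this model, code 33 is the raw value of "!" (0x21) but selects the "minus" 
glyph, so writing the raw code only happens to work when code and Unicode 
coincide. Encoding "c" fails for the second reason: the glyph is simply not 
in the subset.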

I am surprised that you wrote "I am working on extracting information from 
PDF publications" when what you actually did was add text.

> could not add text without unicode in the font
> ----------------------------------------------
>
>                 Key: PDFBOX-4116
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-4116
>             Project: PDFBox
>          Issue Type: Wish
>          Components: PDModel
>    Affects Versions: 2.0.8
>         Environment: Windows
>            Reporter: xing Wang
>            Priority: Minor
>         Attachments: image-2018-02-19-09-23-00-110.png
>
>
> !image-2018-02-19-09-23-00-110.png!
> As shown in the debugger, the PDType1Font maps code 33 to "minus", but 
> there is no Unicode value associated with it. 
> If we use `contentStream.showText("\u0021");` to add content, it causes 
> the following error: 
> Exception in thread "main" java.lang.IllegalArgumentException: U+0021 
> ('exclam') is not available in this font AMZNGR+CMSY10 (generic: 
> FREBPT+CMSY10) encoding: built-in (Type 1) with differences
> at org.apache.pdfbox.pdmodel.font.PDType1Font.encode(PDType1Font.java:439)
> at org.apache.pdfbox.pdmodel.font.PDFont.encode(PDFont.java:323)
> at org.apache.pdfbox.debugger.CreatePDF.main(CreatePDF.java:63)
> The best I could do was to use "appendRawCommands", but it is marked as 
> deprecated. Why is that, and is there a replacement for this function?
>  



