[
https://issues.apache.org/jira/browse/PDFBOX-4648?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16928282#comment-16928282
]
Tilman Hausherr edited comment on PDFBOX-4648 at 9/12/19 6:47 AM:
------------------------------------------------------------------
The squares appear only in the Adobe output, so there is nothing we can do
about them.
The "511 TM" text is also missing from Adobe's text extraction, because the
font has no ToUnicode stream.
"SLIM CUT" appears fine here, even with 2.0.4.
Please try again with 2.0.16 and make sure you have a current Java version on
your computer. Then download and run PDFDebugger and look for the font F4 in
your file. Here's how it looks on my system:
!image-2019-09-12-08-47-32-706.png!
> OpenType Layout tables used in font ABCDEE+Times New Roman,Bold are not
> implemented in PDFBox and will be ignored
> -----------------------------------------------------------------------------------------------------------------
>
> Key: PDFBOX-4648
> URL: https://issues.apache.org/jira/browse/PDFBOX-4648
> Project: PDFBox
> Issue Type: Improvement
> Components: Text extraction
> Affects Versions: 2.0.4
> Reporter: wanling
> Priority: Major
> Attachments: 5e214f828f164322a6600f183191dda5-Adobe.txt,
> 5e214f828f164322a6600f183191dda5-PDFBox.txt,
> 5e214f828f164322a6600f183191dda5.pdf, image-2019-09-12-08-46-39-391.png,
> image-2019-09-12-08-47-32-706.png
>
>
> No PostScript name information is provided for the font Arial-BoldMT
> OpenType Layout tables used in font ABCDEE+Times New Roman,Bold are not
> implemented in PDFBox and will be ignored
> No Unicode mapping for CID+47 (47) in font ABCDEE+Times New Roman,Bold
>
> Adobe's output is normal, but PDFBox can't see _parts_ of the text (not all
> of it). OCI can't see it completely.