[ 
https://issues.apache.org/jira/browse/PDFBOX-2842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14604129#comment-14604129
 ] 

Tilman Hausherr commented on PDFBOX-2842:
-----------------------------------------

Some fonts are not rendering correctly, instead tiny glyphs appear:

166292-fi-ligature.pdf
4PP-Highlighting.pdf
AXSH01.pdf
AXSH02.pdf
AXSTITCH.pdf
CIB-coons-vs-tensormesh.pdf
CIB-coonsmesh.pdf
FOP-2252.pdf
FUNSH01.pdf
ICEPDF-402.pdf
ICEPDF-411.pdf
LATTICE1.pdf
PATTYP1.pdf
PDFBOX-1010.pdf
PDFBOX-1094-065514-XStep32767.pdf
PDFBOX-1147.pdf
PDFBOX-1288.pdf
PDFBOX-1292.pdf
PDFBOX-142-acroform.pdf
PDFBOX-1422.pdf
PDFBOX-1434.pdf
PDFBOX-1442.pdf
PDFBOX-1515.pdf
PDFBOX-1538.pdf
PDFBOX-1606.pdf
PDFBOX-161.pdf
PDFBOX-1661.pdf
PDFBOX-1708-faxcrash.pdf
PDFBOX-1724.pdf
PDFBOX-1735-confidential.pdf
PDFBOX-1739-RegisSTAR.pdf
PDFBOX-1770.pdf
PDFBOX-1814-extreme_slow.pdf
PDFBOX-1879.pdf
PDFBOX-1917.pdf
PDFBOX-1934.pdf
PDFBOX-1988.pdf
PDFBOX-1998.pdf
PDFBOX-2001.pdf
PDFBOX-2007.pdf
PDFBOX-2016.pdf
PDFBOX-2023-zerofontheight.pdf
PDFBOX-2025.pdf
PDFBOX-205-T05140.pdf
PDFBOX-205-hexnumberproblem.pdf
PDFBOX-2080.pdf
PDFBOX-2126.pdf
PDFBOX-2128-CAPITAL.pdf
PDFBOX-2133.pdf
PDFBOX-2134.pdf
PDFBOX-2135.pdf
PDFBOX-2137.pdf
PDFBOX-2140.pdf
PDFBOX-2141.pdf
PDFBOX-2158.pdf
PDFBOX-2163-029016.pdf
PDFBOX-2187-002489.pdf
PDFBOX-2191-006816.pdf
PDFBOX-2192-006972.pdf
PDFBOX-2193-013973.pdf
PDFBOX-2203-019672.pdf
PDFBOX-2216-025977.pdf
PDFBOX-2228-031207.pdf
PDFBOX-2244-040483.pdf
PDFBOX-2250-055794.pdf
PDFBOX-2250-107425-empty-xref.pdf
PDFBOX-2250-110264-xref-zeronumber.pdf
PDFBOX-2250-113223.pdf
PDFBOX-2281.pdf
PDFBOX-2295-c21-5916.pdf
PDFBOX-2298-PDFBOX-935.pdf
PDFBOX-2307-159827.pdf
PDFBOX-2307-161334.pdf
PDFBOX-2307-162362.pdf
PDFBOX-2309-PDFBOX-147.pdf
PDFBOX-2310.pdf
PDFBOX-2335-203040-p17.pdf
PDFBOX-2345.pdf
PDFBOX-2350-289451-endless.pdf
PDFBOX-2365-284078-p38-PDFDocEncoding.pdf
PDFBOX-2367.pdf
PDFBOX-2376-087731.pdf
PDFBOX-2376-466070.pdf
PDFBOX-2376-504130.pdf
PDFBOX-2376-943662.pdf
PDFBOX-2385-146515.pdf
PDFBOX-2385-351923.pdf
PDFBOX-2385-539663.pdf
PDFBOX-2385-937619.pdf
PDFBOX-239-zerowidth.pdf
PDFBOX-2398.pdf
PDFBOX-2421-357567.pdf
PDFBOX-2448-115258.pdf
PDFBOX-2505-032618-p96.pdf
PDFBOX-2526.pdf
PDFBOX-2552-GermanPaper.pdf
PDFBOX-2552-JPEDAL-A104-10-2003E.pdf
PDFBOX-2552-altimage.pdf
PDFBOX-2553.pdf
PDFBOX-2557.pdf
PDFBOX-2583.pdf
PDFBOX-2586.pdf
PDFBOX-2641-168002-p2.pdf
PDFBOX-2642-277053-p3.pdf
PDFBOX-2829.pdf
PDFBOX-448.pdf
PDFBOX-488.pdf
PDFBOX-563-acroform.pdf
PDFBOX-582.pdf
PDFBOX-677.pdf
PDFBOX-705.pdf
PDFBOX-870-a_metro-vlc.pdf
PDFBOX-940.pdf
PDFBOX-958-WrycanLoremIpsumTest.pdf
RGBCUBE.pdf
Shading2Function2text.pdf
TENSOR.pdf
UnderstandingWebSphereClassLoaders.pdf
annots.pdf
bugzilla858109.pdf
color_gradient.pdf
example_013.pdf
example_026.pdf
example_030.pdf
gs-bugzilla228714.pdf
gs-bugzilla691181.pdf
image_demo.pdf
jpeg_demo.pdf
pattern-shading-2-4-noMatrix.pdf
png_demo.pdf
pslib-shading.pdf
source.pdf
swftools-fonts.pdf
swftools-transparency.pdf
tiger-as-form-xobject.pdf

In the future, I suggest that such major changes be tested in a temporary 
branch.

> Overhaul font substitution
> --------------------------
>
>                 Key: PDFBOX-2842
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-2842
>             Project: PDFBox
>          Issue Type: Improvement
>          Components: FontBox, PDModel
>    Affects Versions: 2.0.0
>            Reporter: John Hewson
>            Assignee: John Hewson
>             Fix For: 2.0.0
>
>
> The improved font substitution mechanisms in 2.0 are not quite sufficient to 
> handle all PDFs. Specifically, CJK substitution and substitution of TTF in 
> place of CFF fonts is not possible with the current design.
> The CJK problems can be seen in PDFBOX-2509 and PDFBOX-2563, which does not 
> solve the problem. Additional font API weaknesses can be found in PDFBOX-2578 
> and PDFBOX-2366. This meta-issue aims to address all of those sub-issues.
> The current problems are:
> - FontBox does not provide a generic font type, so we have handle 
> TrueTypeFont, CFFFont, and Type1Font separately. This hinders cross-format 
> substitution.
> - ExternalFonts has no knowledge of the CIDSystemInfo which is necessary for 
> CJK substitution
> - FontProvider contains too much public logic which should be internal to 
> PDFBox, e.g. substitution logic, this makes it brittle and means we won't be 
> able to add additional logic after 2.0 is released, e.g. CJK substitution.
> - Too much confusion about the role of ExternalFonts, particularly with 
> regards to mapping of built-in fonts and the definition of substitute vs. 
> fallback font.
> - ExternalFonts is a black box: the user cannot tell whether the font 
> returned is an exact match, or a last-resort fallback.
> - Confusing font substitution API, users preferred having a flat file format
> - PDSimpleFont#getEncoding() can return null for TTFs which use built-in 
> encodings. This has caused a lot of bugs - there must be a better way.
> - We still have some confusing names, for example a CustomEncoding is known 
> as a "built-in encoding" in the spec.
> - There is no fallback CFF font, we resort to AdobeBlank instead, which has 
> no rendering.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org

Reply via email to