[ https://issues.apache.org/jira/browse/FOP-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17551111#comment-17551111 ]
J Frank edited comment on FOP-2701 at 6/7/22 2:37 PM: ------------------------------------------------------ as per screen shot I can see ligature are missing for font FrutigerNeueLTW1 G-Light irrespective of language and embedding mode I tried installing FrutigerNeueLTW1 font but unable to use in pdf via fop, any idea if I am missing anything in fonts WARNING: Font "Frutiger,normal,400" not found. Substituting with "any,normal,400". Can you check if you are getting any warning for font as well as if font cache is enabled or not in fop.xconf? Update: got the fonts working on local and tried with Font FrutigerNeueLTW1, ligatures are missing irrespective of embedding mode; These characters are searchable in both pdfs, doesn't look like an issue. !image-2022-06-07-15-31-01-526.png! was (Author: jagruti.fr...@gmail.com): as per screen shot I can see ligature are missing for font FrutigerNeueLTW1 G-Light irrespective of language and embedding mode I tried installing FrutigerNeueLTW1 font but unable to use in pdf via fop, any idea if I am missing anything in fonts WARNING: Font "Frutiger,normal,400" not found. Substituting with "any,normal,400". Can you check if you are getting any warning for font as well as if font cache is enabled or not in fop.xconf? Update: got the fonts working on local and tried with Font FrutigerNeueLTW1, ligatures are missing irrespective of embedding mode, these characters are searchable, doesn't look like an issue. !image-2022-06-07-15-31-01-526.png! > Some of the latin ligatures make text not searchable in PDF > ----------------------------------------------------------- > > Key: FOP-2701 > URL: https://issues.apache.org/jira/browse/FOP-2701 > Project: FOP > Issue Type: Bug > Components: font/opentype > Affects Versions: 2.1 > Environment: Windows 10, Calibri font. > Reporter: Dan Caprioara > Assignee: J Frank > Priority: Major > Attachments: 3-fonts-copy-paste-result.png, 3-fonts-fop.xconf, > 3-fonts-latn-ligatures-FOP.fo, 3-fonts-latn-ligatures-FOP.pdf, Screenshot > 2022-06-07 092013.png, fop.xconf, image-2022-05-31-15-50-26-058.png, > image-2022-05-31-15-50-39-029.png, image-2022-05-31-15-52-01-435.png, > image-2022-06-07-15-31-01-526.png, latn-ligatures-Antenna-House.pdf, > latn-ligatures-FOP.pdf, out.pdf, test.fo > > > This problem happens using the Calibri font, that is packed in the MS Office > suite and Windows 10. > I tested with the following text: {{file settings}}. > The resulted PDF text contains ligatures: {{(fi)le se(tti)ngs}} > Searching for {{file}} in Acrobat Reader results in the first word being > selected. This is Ok. But searching for {{set}}, or {{settings}} gives no > results. > The same example, run with Antenna House works fine, you get results when > searching for {{settings}}. > Here is the complete FO file: > {code:xml} > <?xml version="1.0" encoding="UTF-8"?> > <fo:root xmlns:fo="http://www.w3.org/1999/XSL/Format"> > <fo:layout-master-set> > <fo:simple-page-master master-name="a"> > <fo:region-body/> > </fo:simple-page-master> > </fo:layout-master-set> > <fo:page-sequence master-reference="a"> > <fo:flow flow-name="xsl-region-body"> > <fo:block font-family="Calibri" font-size="40pt">file > settings</fo:block> > </fo:flow> > </fo:page-sequence> > </fo:root> > {code} > Some considerations: > # A workaround would be to reject all the substitutions that are not part of > org.apache.fop.fonts.type1.AdobeStandardEncoding. This would leave the (fi) > ligature, but reject the (tti) one. But this seems to work only for Calibri > and not for Roboto!! > # I think there might be some issues with the font embedding, and some > substitution mapping data is lost. It is just a guess, I am not sure how PDF > deals with substitutions. > I know that setting in FO xml:lang to "en" disables the ligatures, but is not > a solution for my project. I would appreciate any suggestions. -- This message was sent by Atlassian Jira (v8.20.7#820007)