[jira] [Commented] (FOP-3065) Shifted combining chars

2022-06-03 Thread Simon Steiner (Jira)


[ 
https://issues.apache.org/jira/browse/FOP-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17546386#comment-17546386
 ] 

Simon Steiner commented on FOP-3065:


The font vendor may list the supported languages, for example at:
https://docs.microsoft.com/en-us/typography/font-list/cambria

> Shifted combining chars
> ---
>
> Key: FOP-3065
> URL: https://issues.apache.org/jira/browse/FOP-3065
> Project: FOP
> Issue Type: Bug
> Affects Versions: 2.4, 2.6, 2.7
> Reporter: Alexander Dyuzhev
> Priority: Critical
> Attachments: fop.xconf, image-2022-04-26-08-56-51-840.png, 
> image-2022-04-29-11-53-00-968.png, image-2022-04-29-11-54-27-704.png, 
> image-2022-04-29-11-55-20-495.png, image-2022-04-30-13-54-09-030.png, 
> image-2022-04-30-13-54-23-021.png, image-2022-05-29-22-24-51-772.png, 
> shifted-1.pdf, shifted.fo-1.xml
>
>
> Combining chars (above and below main char) render shifted. For instance:
> n
> N
> f
> F
> 
> h
> !image-2022-04-26-08-56-51-840.png!
>  
> Whether a combining char renders shifted to the left or to the right depends on the font.



--
This message was sent by Atlassian Jira
(v8.20.7#820007)


[jira] [Commented] (FOP-3065) Shifted combining chars

2022-06-03 Thread Alexander Dyuzhev (Jira)


[ 
https://issues.apache.org/jira/browse/FOP-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17546062#comment-17546062
 ] 

Alexander Dyuzhev commented on FOP-3065:


How can I check that a font supports a given xml:lang value? Is there an 
attribute/property in the font?

For instance, if xml:lang="ar", then how can I check the font?
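
As a rough illustration of one way to answer this: OpenType fonts declare the scripts and language systems they have layout rules for in the ScriptList of their GSUB and GPOS tables, so listing those tags shows whether an entry such as script 'arab' with language system 'ARA ' exists for xml:lang="ar". The sketch below is an example under stated assumptions, not FOP code: the function names are made up for illustration, it only reads the first font of a .ttc collection, and whether FOP consults exactly these entries for a given xml:lang is not confirmed in this thread.

```python
import struct


def find_table(font: bytes, tag: bytes, base: int = 0):
    """Return (offset, length) of a table in an sfnt font, or None if absent."""
    num_tables = struct.unpack_from(">H", font, base + 4)[0]
    for i in range(num_tables):
        record = base + 12 + 16 * i  # 12-byte header, 16-byte table records
        rec_tag, _checksum, offset, length = struct.unpack_from(">4sIII", font, record)
        if rec_tag == tag:
            return offset, length
    return None


def script_languages(font: bytes, table_tag: bytes = b"GPOS"):
    """Map each OpenType script tag to the language-system tags it declares."""
    base = 0
    if font[:4] == b"ttcf":
        # Font collection: use the first font's offset (hypothetical choice).
        base = struct.unpack_from(">I", font, 12)[0]
    entry = find_table(font, table_tag, base)
    if entry is None:
        return {}
    table = entry[0]
    # The ScriptList offset is stored at byte 4 of the GSUB/GPOS header.
    script_list = table + struct.unpack_from(">H", font, table + 4)[0]
    script_count = struct.unpack_from(">H", font, script_list)[0]
    result = {}
    for i in range(script_count):
        tag, rel = struct.unpack_from(">4sH", font, script_list + 2 + 6 * i)
        script = script_list + rel
        _default, lang_count = struct.unpack_from(">HH", font, script)
        langs = []
        for j in range(lang_count):
            lang_tag, _offset = struct.unpack_from(">4sH", font, script + 4 + 6 * j)
            langs.append(lang_tag.decode("ascii"))
        result[tag.decode("ascii")] = langs
    return result
```

It could be called as script_languages(open(path, "rb").read()), where path points at the font file, and once more with b"GSUB" for the substitution table. An empty list for a script means it declares no language-specific systems beyond an optional default.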



[jira] [Commented] (FOP-3065) Shifted combining chars

2022-06-03 Thread Simon Steiner (Jira)


[ 
https://issues.apache.org/jira/browse/FOP-3065?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17545958#comment-17545958
 ] 

Simon Steiner commented on FOP-3065:


You would need to use a font that supports the xml:lang value you use.



[jira] [Comment Edited] (FOP-2701) Some of the latin ligatures make text not searchable in PDF

2022-06-03 Thread Jira


[ 
https://issues.apache.org/jira/browse/FOP-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17545761#comment-17545761
 ] 

Martin Hönings edited comment on FOP-2701 at 6/3/22 7:27 AM:
-

Indeed, the workaround with embedding-mode="full" doesn't work for the fonts 
"cambria.ttc" and "Calibri.ttf".
In my project I observed the ligature problem with the font 
"FrutigerNeueLTW1G-Light.otf", and there the workaround does work.
I have added the following attachments:
 - 3-fonts-fop.xconf
 - 3-fonts-latn-ligatures-FOP.fo
 - 3-fonts-latn-ligatures-FOP.pdf
 - 3-fonts-copy-paste-result.png

Directory structure:
fop/conf/3-fonts-fop.xconf
fop/conf/FrutigerNeueLTW1G-Light.otf
fop/3-fonts-latn-ligatures-FOP.fo

Execute:
cd fop
fop -fo 3-fonts-latn-ligatures-FOP.fo -pdf 3-fonts-latn-ligatures-FOP.pdf -c conf\3-fonts-fop.xconf

The example compares the ligature problem with three fonts and the embedding 
modes "subset" and "full".
The problem affects different ligatures in different fonts.
The embedding mode "full" workaround doesn't work with all fonts.
Maybe that depends on the font type or on the font itself.

Unfortunately I cannot supply the "FrutigerNeueLTW1G-Light.otf" font, since it 
is not free.
Maybe you can get a free licence as a developer.

 

[^3-fonts-fop.xconf]

[^3-fonts-latn-ligatures-FOP.fo]

[^3-fonts-latn-ligatures-FOP.pdf]

!3-fonts-copy-paste-result.png!



> Some of the latin ligatures make text not searchable in PDF
> ---
>
> Key: FOP-2701
> URL: https://issues.apache.org/jira/browse/FOP-2701
> Project: FOP
> Issue Type: Bug
> Components: font/opentype
> Affects Versions: 2.1
> Environment: Windows 10, Calibri font.
> Reporter: Dan Caprioara
> Priority: Major
> Attachments: 3-fonts-copy-paste-result.png, 3-fonts-fop.xconf, 
> 3-fonts-latn-ligatures-FOP.fo, 3-fonts-latn-ligatures-FOP.pdf, fop.xconf, 
> image-2022-05-31-15-50-26-058.png, image-2022-05-31-15-50-39-029.png, 
> image-2022-05-31-15-52-01-435.png, latn-ligatures-Antenna-House.pdf, 
> latn-ligatures-FOP.pdf, out.pdf, test.fo
>
>
> This problem occurs with the Calibri font, which ships with the MS Office 
> suite and Windows 10.
> I tested with the following text: {{file settings}}.
> The resulting PDF text contains ligatures: {{(fi)le se(tti)ngs}}
> Searching for {{file}} in Acrobat Reader selects the first word, which is 
> OK. But searching for {{set}} or {{settings}} gives no results.
> The same example run with Antenna House works fine: searching for 
> {{settings}} returns results.
> Here is the complete FO file:
> {code:xml}
> 
> http://www.w3.org/1999/XSL/Format;>
> 
> 
> 
> 
> 
> 
> 
> file 
> settings
> 
> 
> 
> {code}
> Some considerations:
> # A workaround would be to reject all the substitutions that are not part of 
> org.apache.fop.fonts.type1.AdobeStandardEncoding. This would leave the (fi) 
> ligature, but reject the (tti) one. But this seems to work only for Calibri 
> and not for Roboto!!
> # I think there might be some issues with the font embedding, and some 
> substitution mapping data is lost. It is just a guess, I am not sure how PDF 
> deals with substitutions.
> I know that setting xml:lang to "en" in the FO disables the ligatures, but 
> that is not a solution for my project. I would appreciate any suggestions.
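
On the question of how PDF deals with substitutions: a PDF font can carry a ToUnicode CMap that maps each glyph code to one or more Unicode characters, and text search relies on that mapping. A ligature glyph stays searchable when its entry expands to the full character sequence. The sketch below uses made-up glyph ids and helper names to show such an entry and how a viewer would rebuild the text from it. It illustrates the mechanism only and does not describe what FOP actually writes.

```python
def bfchar_entry(glyph_id: int, text: str) -> str:
    """Format one ToUnicode bfchar line: <glyph code> <UTF-16BE text>."""
    code = f"{glyph_id:04X}"
    target = text.encode("utf-16-be").hex().upper()
    return f"<{code}> <{target}>"


def extract_text(glyph_ids, to_unicode):
    """Rebuild searchable text from glyph codes, as a viewer would."""
    # Glyphs without a mapping become U+FFFD and break the search match.
    return "".join(to_unicode.get(g, "\ufffd") for g in glyph_ids)


# Hypothetical glyph ids; real ids depend on the font and the subset.
to_unicode = {10: "s", 11: "e", 12: "tti", 13: "n", 14: "g"}
glyphs = [10, 11, 12, 13, 14, 10]  # s, e, (tti) ligature, n, g, s

print(bfchar_entry(12, "tti"))           # <000C> <007400740069>
print(extract_text(glyphs, to_unicode))  # settings
```

If the entry for glyph 12 were missing or pointed at a single private-use code point, the extracted string would no longer contain "set", which matches the symptom described in the report.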





[jira] [Updated] (FOP-2701) Some of the latin ligatures make text not searchable in PDF

2022-06-03 Thread Jira


 [ 
https://issues.apache.org/jira/browse/FOP-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martin Hönings updated FOP-2701:

Attachment: 3-fonts-latn-ligatures-FOP.fo
            3-fonts-latn-ligatures-FOP.pdf
            3-fonts-copy-paste-result.png



[jira] [Commented] (FOP-2701) Some of the latin ligatures make text not searchable in PDF

2022-06-03 Thread Jira


[ 
https://issues.apache.org/jira/browse/FOP-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17545761#comment-17545761
 ] 

Martin Hönings commented on FOP-2701:
-

Indeed, the workaround with embedding-mode="full" doesn't work for the fonts 
"cambria.ttc" and "Calibri.ttf".
In my project I observed the ligature problem with the font 
"FrutigerNeueLTW1G-Light.otf", and there the workaround does work.
I have added the following attachments:
- 3-fonts-fop.xconf
- 3-fonts-latn-ligatures-FOP.fo
- 3-fonts-latn-ligatures-FOP.pdf
- 3-fonts-copy-paste-result.png

Directory structure:
fop/conf/3-fonts-fop.xconf
fop/conf/FrutigerNeueLTW1G-Light.otf
fop/3-fonts-latn-ligatures-FOP.fo

Execute:
cd fop
fop -fo 3-fonts-latn-ligatures-FOP.fo -pdf 3-fonts-latn-ligatures-FOP.pdf -c conf\3-fonts-fop.xconf

The example compares the ligature problem with three fonts and the embedding 
modes "subset" and "full".
The problem affects different ligatures in different fonts.
The embedding mode "full" workaround doesn't work with all fonts.
Maybe that depends on the font type or on the font itself.

Unfortunately I cannot supply the "FrutigerNeueLTW1G-Light.otf" font, since it 
is not free.
Maybe you can get a free licence as a developer.



[jira] [Updated] (FOP-2701) Some of the latin ligatures make text not searchable in PDF

2022-06-03 Thread Jira


 [ 
https://issues.apache.org/jira/browse/FOP-2701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martin Hönings updated FOP-2701:

Attachment: 3-fonts-fop.xconf
