[
https://issues.apache.org/jira/browse/FOP-2358?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14032727#comment-14032727
]
Vincent Hennebert commented on FOP-2358:
----------------------------------------
If you [enable
accessibility|http://xmlgraphics.apache.org/fop/1.1/accessibility.html], then
the actual text (without any hyphenation character) will be stored in the
output using special PDF constructs.
I realise that this is not what you're asking, but it might be an acceptable
work-around.
To actually get the soft hyphen copy-pasted, we would have to replace the
hyphen-minus (U+002D) that is currently used with a soft hyphen (U+00AD), but
it's more easily said than done.
We would have to put in place a custom encoding of some sort that targets the
font's hyphen glyph to display on the screen (more precisely, the hyphenation
character set in the FO file by the hyphenation-character property), but
returns the soft hyphen on copy-paste (thanks to an appropriate entry in the
ToUnicode CMap).
But distinction would also have to be made between a soft hyphen that was
manually added to the input text, and a hyphen that was automatically generated
by the hyphenation process. And that distinction is not there at the moment.
Not impossible to achieve, but a bit of work.
> Soft hyphen is not retained on copy/paste
> -----------------------------------------
>
> Key: FOP-2358
> URL: https://issues.apache.org/jira/browse/FOP-2358
> Project: Fop
> Issue Type: Bug
> Affects Versions: 1.1
> Environment: $ fop -v
> FOP Version 1.1
> $ java -version
> java version "1.7.0_51"
> Java(TM) SE Runtime Environment (build 1.7.0_51-b13)
> Java HotSpot(TM) 64-Bit Server VM (build 24.51-b03, mixed mode)
> Reporter: Mark Craig
> Priority: Minor
> Attachments: render-shy.fo, render-shy.pdf
>
>
> Soft hyphen is rendered as hyphen + space.
> As a result, when the text that originally contained a soft hyphen is copied
> and pasted to an editor capable of ignoring soft hyphens or recalculating the
> hyphenation, the "hard" hyphen remains.
> This is particularly unhelpful in hyphenated literals such as URLs or OIDs,
> where adding a hyphen + space means that the copy is broken.
--
This message was sent by Atlassian JIRA
(v6.2#6252)