On 15/1/14 20:00, Philip Taylor wrote:
> Arthur Reutenauer wrote:
>> You should. Characters such as U+FB01 are deprecated and shouldn't be
>> used in text.
> /Characters/ ... : yes. But consider a Unicode-in/Unicode-out
> preprocessor; might it not generate ﬁ in the output stream,
> since it thinks it is generating glyphs,
A preprocessor generating such a "Unicode-out" stream would necessarily
be confused, because Unicode is not a glyph encoding, it's a character
encoding.
If a preprocessor wants to generate a stream of glyphs rather than
characters, it needs to do so according to some glyph encoding standard
(which is usually font-specific), and this can no longer be expected to
work in conjunction with Unicode character-based hyphenation patterns.
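
To make the character/glyph distinction concrete, here is a minimal sketch in Python of what a pipeline stage could do if it wanted to hand real characters on to TeX. It assumes that NFKC compatibility normalization is acceptable for the text in question (it folds other compatibility characters too, not only ligatures); the helper name to_characters is made up for illustration.

```python
import unicodedata

# U+FB01 LATIN SMALL LIGATURE FI: a presentation form, not an ordinary letter.
LIGATURE_FI = "\ufb01"


def to_characters(text: str) -> str:
    """Fold compatibility code points (such as ligatures) back to plain characters.

    Hypothetical helper: NFKC is one possible choice, and it also rewrites
    other compatibility characters, which may or may not be wanted.
    """
    return unicodedata.normalize("NFKC", text)


# "office" as a glyph-minded preprocessor might emit it: o, f, U+FB01, c, e.
word = "of" + LIGATURE_FI + "ce"

print(unicodedata.name(LIGATURE_FI))           # official character name
print(unicodedata.decomposition(LIGATURE_FI))  # compatibility mapping to f + i
print(len(word), len(to_characters(word)))     # code point counts before/after
```

Patterns written in terms of the letters f and i have nothing to match against the single code point U+FB01, which is why the stream has to be folded back to characters before hyphenation sees it.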
JK
> yet in a pipeline
> environment that output might get re-used as TeX input ...
> ** Phil.