Re: [tex-hyphen] Names of files in OFFO

Claudio Beccari Sat, 19 Mar 2016 02:08:40 -0700

The example of greek is a good one, but for qwhat concerns the TeXSystemit is a bad one.

When unicode/utf8 engines are used the unicode encoded patterns areavailable because Apostolos Syropoulos created the several years ago(since these engine have been available) and I suppose they are OK.

At the moment the pattern files for 8-bit engines (in practice pdftexand knuthian tex) LGR encoded greek fonts deal only with the latintranslitteration and do not deal with direct greek utf8 encoded greektext. I preapred the necessary extensions to cope with the LICR encodingcreated by Günter Milde, the actual maintainer of the pdftex+babelrelated files (greek.ldf, textalpha.sty, alphabeta.sty, and severalother ones) and the uft8 direct input of the three varieties of greek:monotoniko, politoniko, ancient; 18 months ago, more or less, I sent thenew pattern files to some greek TeXies for the necessary controls, butup to now I did not get any feedback.

Tonos is the only accent used in monotoniko, but it generally has thesame shape as an acute/oxia one, but ins ome instances it is and"unslanted acute" a straight stroke ove the vovel. But unicode does notdeal directly with shapes of the single glyphs, it deal with the namesand give a sample shape in order to make it clear what tha name deals with.

Obviously the tonos and the oxia may be identical in shape in mostfonts, but in some other ones they are different; and they may be soboth in self combining glyphs or in preaccented ones. Unicode has todeal with them as two distinct gliphs.

May be hyphenation patterns for polytoniko may be considered a supersetof monotoniko, but the patterns for ancient are different, not onlybecase there is a different lexicon, but also for hyphenation rules thatfor ancient greek are mre etymological than for modern greek.

Therefore we have a situation similar to the one we discussed formodern. medieval classical, ecclesiastic. latin not long ago.


Claudio

On 17/03/2016 19:20, Barbara Beeton wrote:

     On Thu, Mar 17, 2016 at 01:55:27PM -0400, Barbara Beeton wrote:
     > that's all very well, and i understand
     > how *unicode* works.  what i'd really
     > like to see is how this equivalence
     > is determined in a (la)tex source file.

       In the case of Greek hyphenation, by making as many copies of the
     patterns containing an oxia-tonos as is necessary.  That's very
     pedestrian, but works; it's done by a script, of course.

okay.  then there *are* two entries for
every possibility (although only the
ones with oxia would be needed for
"properly encoded" classical greek).

     >                       there has been
     > a discussion on the unicode discussion
     > list to the effect that the NamesList
     > file should *not* be used for this
     > sort of analysis.

       Well, the authoritative data is UnicodeData.txt, and it's just as easy
     to parse (easier, in fact), so that's what should be used.  Do you have
     a pointer to the discussion?

i've had it bookmarked for over a week,
ever since i got an inquiry regarding
the source of several symbols in the
"miscellaneous symbols" block.  i'll
go back and reread the discussion.
thanks.
                                        -- bb

Re: [tex-hyphen] Names of files in OFFO

Reply via email to