Mojca Miklavec wrote: > Hello Hans, > > On 8/24/07, Hans Hagen wrote: >> Hi, >> >> I uploaded a new version of mkiv (regular zip). > > Thanks a lot! > >> - case changing using attributes and node processing >> >> simple test file for spacing and casing: > > I'm attaching a not-so-trivial test file for "casing", just to see how > well it works for Croatian. > > A few observations: > > - LM doesn't have any lj, nj, dz, dž, ... (probably another request > for the Polish guys)
hm, just write a small proposal ... however, dealing with non present chars is to be dealt with anyway > - It would be great if MK IV did the trasformation from digraphs to > normal letters in case those digraphs are not present in the font > itself (for ij, lj, nj, dz, dž, ... just as it would be great if > ccaron was automatically composed out of c and caron if the letter > wasn't present in that font). \definefontfeature [test][mode=node,language=dflt,script=latn,complement=yes] {\font\test = lmtypewriter8-regular*test at 12.3pt \test ljubljana Ljubljana LJUBLJANA } currently the complement only replaces LATIN/compat combinations (see char-def.lua) > Visually there is probably no difference in plain text, except in > exactly the cases for which you're sending the tests (that's casing > and spacing). See http://en.wikipedia.org/wiki/Gaj's_Latin_alphabet > how the word "MJENJACNICA" is split into letters. > Normal people still type n+j in text, not the digraph "?" (nj), but in > case you get some text with those digraphs which are valid Unicode > letters, it would be nice if they were processed ... dealing with n+j in text is too dangerous to catch, unless we start implementing complex language depenent replacements, and even then it's messy (what to do when one really wants a nj (two char)) ... so, thos old docs can best be converted to proper utf then >> \starttext >> >> test: oeps {\setcharacterspacing[frenchpunctuation] x: xx \bfd x: xx} >> oeps: test >> >> test \WORD{test TEST \TeX} test >> >> test \word{test TEST \TeX} test >> >> test \Word{test TEST \TeX} test > > Another few observations: > - \word doesn't work in XeTeX no, neither in pdftex i think; new > - What exactly is \Words supposed to do (with non-first letters in a word)? make first chars uppercase but only when the next is a char; (i changed it a bit, defs were not seen (overloaded later by macros) > - ConTeXt with XeTeX outputs dozens of empty lines to the console. indeed, has to do with the fact that i need to test if a font is present on the system (file vs name stuff) and the empty lines are a side effect of entering/exiting batchmode > An extra challenge would be to get this work (but unless some Croats > ask you for that or unless you have too much time left, don't bother > about that - it needs slightly more than only lccode and uccode of a > letter since there are three forms: one for lowercase [ljubljana -> > lj], one for all-uppercase words [LJUBLJANA -> LJ] and one for the > first letter of a word starting with an uppercase [Ljubljana -> Lj]): > > In Unicode: > > \word{?ub?ana} -> ?ub?ana > \Word{?ub?ana} -> ?ub?ana > \WORD{?ub?ana} -> ?UB?ANA > > \word{?ub?ana} -> ?ub?ana > \Word{?ub?ana} -> ?ub?ana > \WORD{?ub?ana} -> ?UB?ANA > > \word{?UB?ANA} -> ?ub?ana > \Word{?UB?ANA} -> ?ub?ana > \WORD{?UB?ANA} -> ?UB?ANA as long as we have utf it's already taken care of > In Latin transcript (in case you have problems seing some Unicode letters): > > \word{ljubljana} -> ljubljana > \Word{ljubljana} -> Ljubljana > \WORD{ljubljana} -> LJUBLJANA > > \word{Ljubljana} -> ljubljana > \Word{Ljubljana} -> Ljubljana > \WORD{Ljubljana} -> LJUBLJANA > > \word{LJUBLJANA} -> ljubljana > \Word{LJUBLJANA} -> Ljubljana > \WORD{LJUBLJANA} -> LJUBLJANA > > See also: > > http://unicode.org/cldr/data/common/collation/hr.xml > http://en.wikipedia.org/wiki/Gaj's_Latin_alphabet > >> {\setcharacterkerning[extrakerning]\input zapf\endgraf } > > (That could be "backported" to XeTeX. I think it enables a similar > feature now, but I should check.) hm, i'm not going to backport everything; keep in mind that i these features are not font related; actually future mkiv versions will also do dynamic feature change so ... anyhow, ... new upload to play with Hans -- ----------------------------------------------------------------- Hans Hagen | PRAGMA ADE Ridderstraat 27 | 8061 GH Hasselt | The Netherlands tel: 038 477 53 69 | fax: 038 477 53 74 | www.pragma-ade.com | www.pragma-pod.nl ----------------------------------------------------------------- ___________________________________________________________________________________ If your question is of interest to others as well, please add an entry to the Wiki! maillist : ntg-context@ntg.nl / http://www.ntg.nl/mailman/listinfo/ntg-context webpage : http://www.pragma-ade.nl / http://tex.aanhet.net archive : https://foundry.supelec.fr/projects/contextrev/ wiki : http://contextgarden.net ___________________________________________________________________________________