Dear all,
in the context of a historical corpus of Slovene I'd want to mark texts that 
are written in the Bohorič alphabet 
(http://en.wikipedia.org/wiki/Bohori%C4%8D_alphabet).
So far I've just been using @xml:lang="sl-boh" but I know this is sinful - but 
I'm not sure how it should be encoded.
First, I'm not sure if it even qualifies as a "script", as e.g. I can't find a 
script for old English which used the long s, but maybe because this only 
substitutes one character for another - with Bohorič  it's more complicated. 
Even taking it as a script (so I could write sl-Boho), 
http://www.tei-c.org/release/doc/tei-p5-doc/en/html/CH.html#CHSH does say that 
they should be taken from ISO 15924, 
http://unicode.org/iso15924/iso15924-codes.html and there is no Boho there; I 
also can't find an extension mechanism as there is with languages.
Any tips gratefully received.
Best,
Tomaž
-- 
Tomaž Erjavec, http://nl.ijs.si/et/
Dept. of Knowledge Technologies, Jožef Stefan Institute, Ljubljana

_______________________________________________
lugos-slo mailing list
[email protected]
http://liste2.lugos.si/cgi-bin/mailman/listinfo/lugos-slo

Одговори путем е-поште