Ken Irwin wrote:
Hi all,

I'm looking for a good source to help me understand character sets and how to 
use them. I pretty much know nothing about this - the whole world of Unicode, 
ASCII, octal, UTF-8, etc. is baffling to me.

Other people have recommended a whole lot of fabulous resources, so I won't cover ground they already have.

If, however, you need to deal with characters which don't qualify for inclusion in Unicode (or which do qualify but which haven't yet been assigned code points). I recommend tei:glyph:

http://www.tei-c.org/release/doc/tei-p5-doc/en/html/ref-glyph.html

We use this to represent typographically interesting but short-lived approaches to the representation of Māori in printed works. See for example the 'wh' ligature (which looks like a 'vh' and is pronounced in modern usage like 'f') in the following text:

http://www.nzetc.org/tm/scholarly/tei-Auc1911NgaM-t1-body-d4.html

for the underlying TEI XML representation see:

http://www.nzetc.org/tei-source/Auc1911NgaM.xml

cheers
stuart
--
Stuart Yeates
http://www.nzetc.org/       New Zealand Electronic Text Centre
http://researcharchive.vuw.ac.nz/     Institutional Repository

Reply via email to