Byte Order Marks

2001-04-19 Thread Tomas McGuinness
specification I could refer to? Thanks. Tom Tomas McGuinness Consultant -- University Technology Park* +353 21 4933 277 Curraheen Rd, Cork *+353 21 4933

Byte Order Marks

2001-04-10 Thread Tomas McGuinness
the first 3 octets to be EF BB EF could I assume I am dealing with a UTF-8 Document. Apart from UTF and Unicode/UCS encoding formats do any other "legacy" character sets use Byte Order Marks? Regrads, Tom. Tomas McGuinness

gb2312

2001-04-10 Thread Tomas McGuinness
Hi, Is the character set gb2312 encoded in a two octet scheme? If so does it pad out its ascii characters to two octets e.g. the character is 0x3C in ascii so does it become 0x003C in gb2312? Regrards, Tom. Tomas McGuinness Consultant

Code charts

2001-04-09 Thread Tomas McGuinness
table chart I have anyway. Does the Simplified Chinese character set have this character or is my mapping table incorrect? Could anyone tell me if its possible to download these code mapping charts from the internet. Thanks in advance, Toms McGuinness Tomas McGuinness Consultant

[unicode] UCS-2 Files

2001-03-22 Thread Tomas McGuinness
Hi, I have a question relating to UCS-2. I am currently developing a product that will support UCS-2 and I have been sent several documents encoded in UCS-2. I have no reader or writer for UCS-2 but I have performed Hexdumps in UNIX. At the beginning of the UCS-2 characters there are two rogue