A classic general overview (on the topic of what the heck ARE
character sets???):
http://www.joelonsoftware.com/articles/Unicode.html
On Wed, Dec 16, 2009 at 11:02 AM, Ken Irwin kir...@wittenberg.edu wrote:
Hi all,
I'm looking for a good source to help me understand character sets and how
This probably one place to start:
http://www.joelonsoftware.com/articles/Unicode.html
Mike
-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Ken
Irwin
Sent: Wednesday, December 16, 2009 10:02 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB]
So, character encodings are really confusing, even for those who have
dealt with them before. I'm not sure if there is a good 'dealing with
character encodings for dummies' book, but if there is, I think I could
use it too!
But from your case, I can say:Ideally your source records are in
On Wed, Dec 16, 2009 at 11:24 AM, Walker, David dwal...@calstate.edu wrote:
If you're looking to convert that data to UTF-8 (which I assume you would),
then your best friend is a program from Index Data called yaz-marcdump, which
comes with the Yaz toolkit. It runs on Linux and Windows, and
When you see these kind of errors:
Revista de Música Latinoamericana[weird characters instead of
diacritics]
if you can look at the data in a web browser it can be used as a tool to
help you identify the correct encoding. Web browsers usually render
character sets based on whatever
If you're looking for a book-length treatment, 'Unicode Explained' is
fairly readable, and the first three chapters are about character
encodings in general:
http://books.google.com/books?id=PcWU2yxc8WkCprintsec=frontcover
On 12/16/2009 12:02 PM, Ken Irwin wrote:
Hi all,
I'm looking for a
Hi Ken,
In an effort to better understand character sets myself, I have brought
together some information on my website, with an emphasis on library automation
and the internet environment:
Coded Character Sets A Technical Primer for Librarians
http://rocky.uta.edu/doran/charsets/
Make
Hi all -- thanks for these fabulous replies. I'm learning a lot.
Armed with a bit of new knowledge, I've done some tinkering. I think I've
solved my original quandaries, and have opened new cans of worms. I have a few
more specific questions:
1) It appears that once I switch my MySQL table
Ken,
Great suggestions so far--I have just one thing to add.
If you ever reach the point at which you find yourself examining code tables to
figure out what character set something is using, you might also want to find a
good hex editor so that you can examine your data byte by byte. Since
Ken Irwin wrote:
Hi all,
I'm looking for a good source to help me understand character sets and how to
use them. I pretty much know nothing about this - the whole world of Unicode,
ASCII, octal, UTF-8, etc. is baffling to me.
Other people have recommended a whole lot of fabulous resources,
State University
330-672-1918
-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On
Behalf Of Ken Irwin
Sent: Wednesday, December 16, 2009 1:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] character-sets for dummies?
Hi all -- thanks
/
-Original Message-
From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
Ken Irwin
Sent: Wednesday, December 16, 2009 12:26 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] character-sets for dummies?
Hi all -- thanks for these fabulous replies. I'm learning
Hi
2009/12/17 stuart yeates stuart.yea...@vuw.ac.nz:
If, however, you need to deal with characters which don't qualify for
inclusion in Unicode (or which do qualify but which haven't yet been
assigned code points). I recommend tei:glyph:
Andrew Cunningham wrote:
Hi
2009/12/17 stuart yeates stuart.yea...@vuw.ac.nz:
If, however, you need to deal with characters which don't qualify for
inclusion in Unicode (or which do qualify but which haven't yet been
assigned code points). I recommend tei:glyph:
14 matches
Mail list logo