Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Chad Fennell
A classic general overview (on the topic of what the heck ARE character sets???): http://www.joelonsoftware.com/articles/Unicode.html On Wed, Dec 16, 2009 at 11:02 AM, Ken Irwin kir...@wittenberg.edu wrote: Hi all, I'm looking for a good source to help me understand character sets and how

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Hagedon, Mike
This probably one place to start: http://www.joelonsoftware.com/articles/Unicode.html Mike -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Ken Irwin Sent: Wednesday, December 16, 2009 10:02 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB]

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Jonathan Rochkind
So, character encodings are really confusing, even for those who have dealt with them before. I'm not sure if there is a good 'dealing with character encodings for dummies' book, but if there is, I think I could use it too! But from your case, I can say:Ideally your source records are in

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Nate Vack
On Wed, Dec 16, 2009 at 11:24 AM, Walker, David dwal...@calstate.edu wrote: If you're looking to convert that data to UTF-8 (which I assume you would), then your best friend is a program from Index Data called yaz-marcdump, which comes with the Yaz toolkit.  It runs on Linux and Windows, and

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Brian Stamper
When you see these kind of errors: Revista de Música Latinoamericana[weird characters instead of diacritics] if you can look at the data in a web browser it can be used as a tool to help you identify the correct encoding. Web browsers usually render character sets based on whatever

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Cory Rockliff
If you're looking for a book-length treatment, 'Unicode Explained' is fairly readable, and the first three chapters are about character encodings in general: http://books.google.com/books?id=PcWU2yxc8WkCprintsec=frontcover On 12/16/2009 12:02 PM, Ken Irwin wrote: Hi all, I'm looking for a

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Doran, Michael D
Hi Ken, In an effort to better understand character sets myself, I have brought together some information on my website, with an emphasis on library automation and the internet environment: Coded Character Sets A Technical Primer for Librarians http://rocky.uta.edu/doran/charsets/ Make

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Ken Irwin
Hi all -- thanks for these fabulous replies. I'm learning a lot. Armed with a bit of new knowledge, I've done some tinkering. I think I've solved my original quandaries, and have opened new cans of worms. I have a few more specific questions: 1) It appears that once I switch my MySQL table

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Thomale, J
Ken, Great suggestions so far--I have just one thing to add. If you ever reach the point at which you find yourself examining code tables to figure out what character set something is using, you might also want to find a good hex editor so that you can examine your data byte by byte. Since

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread stuart yeates
Ken Irwin wrote: Hi all, I'm looking for a good source to help me understand character sets and how to use them. I pretty much know nothing about this - the whole world of Unicode, ASCII, octal, UTF-8, etc. is baffling to me. Other people have recommended a whole lot of fabulous resources,

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread KREYCHE, MICHAEL
State University 330-672-1918 -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Ken Irwin Sent: Wednesday, December 16, 2009 1:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] character-sets for dummies? Hi all -- thanks

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Doran, Michael D
/ -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Ken Irwin Sent: Wednesday, December 16, 2009 12:26 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] character-sets for dummies? Hi all -- thanks for these fabulous replies. I'm learning

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread Andrew Cunningham
Hi 2009/12/17 stuart yeates stuart.yea...@vuw.ac.nz: If, however, you need to deal with characters which don't qualify for inclusion in Unicode (or which do qualify but which haven't yet been assigned code points). I recommend tei:glyph:

Re: [CODE4LIB] character-sets for dummies?

2009-12-16 Thread stuart yeates
Andrew Cunningham wrote: Hi 2009/12/17 stuart yeates stuart.yea...@vuw.ac.nz: If, however, you need to deal with characters which don't qualify for inclusion in Unicode (or which do qualify but which haven't yet been assigned code points). I recommend tei:glyph: