Re: [CODE4LIB] more on MARC char encoding

2012-04-19 Thread Deng, Sai
Hi list, I am a Metadata librarian but not a programmer, sorry if my question seems naïve. We use XSLT stylesheet to transform some harvested DC records from DSpace to MARC in MarcEdit, and then export them to OCLC. Some characters do not display correctly and need manual editing, for example:

Re: [CODE4LIB] Archivists' Toolkit: Adding Digital Objects via MySQL

2012-04-19 Thread Al Matthews
Hi. Is there a reason not to attempt this instead through the CLI? Al Matthews, Software Dev, Atlanta University Center From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Rosalyn Metz [rosalynm...@gmail.com] Sent: Wednesday, April 18, 2012

Re: [CODE4LIB] Archivists' Toolkit: Adding Digital Objects via MySQL

2012-04-19 Thread Rosalyn Metz
Al, Looking at the CLI quickly it looks like its related to batch exporting. I'm trying to quickly create +1500 new digital object records in AT (ie. go into a resource, click new instance, choose digital object, add the title, add the mets identifier, click save). If I misread what the CLI

[CODE4LIB] NYC code4lib meetup: next Weds April 25

2012-04-19 Thread Yitzchak Schaffer
Hello all, The code4lib-nyc chapter in conjunction with METRO is holding our somewhat-quarterly jam session: next Wednesday, April 25 10am-noon at the METRO Training Center, 57 E 11th Street, NYC. Come talk about your projects, and find out what everybody's working on! Folks without

Re: [CODE4LIB] more on MARC char encoding

2012-04-19 Thread Reese, Terry
Actually -- the issue isn't one of MARC8 versus UTF8 (since this data is being harvested from DSpace and is UTF8 encoded). It's actually an issue with user entered data -- specifically, smart quotes and the like. These values obviously are not in the MARC8 characterset and cause many who

Re: [CODE4LIB] more on MARC char encoding

2012-04-19 Thread Jonathan Rochkind
Ah, thanks Terry. That canned cleaner in MarcEdit sounds potentially useful -- I'm in a continuing battle to keep the character encoding in our local marc corpus clean. (The real blame here is on cataloger interfaces that let catalogers save data that are illegal bytes for the character set

Re: [CODE4LIB] more on MARC char encoding

2012-04-19 Thread LeVan,Ralph
We see Unicode data pasted into MARC8 records all the time. It happens enough that my MARC8-Unicode converter takes a second look at illegal MARC8 bytes and tries a UTF-8 encoding as well. Ralph -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of

Re: [CODE4LIB] more on MARC char encoding

2012-04-19 Thread Jonathan Rochkind
On 4/19/2012 3:23 PM, LeVan,Ralph wrote: We see Unicode data pasted into MARC8 records all the time. It happens enough that my MARC8-Unicode converter takes a second look at illegal MARC8 bytes and tries a UTF-8 encoding as well. Right. I see it too. I'm arguing that means cataloger entry

Re: [CODE4LIB] more on MARC char encoding: Now we're about ISO_2709 and MARC21

2012-04-19 Thread Robert Haschart
On 4/18/2012 12:08 PM, Jonathan Rochkind wrote: On 4/18/2012 11:09 AM, Doran, Michael D wrote: I don't believe that is the case. Take UTF-8 out of the picture, and consider the MARC-8 character set with its escape sequences and combining characters. A character such as an n with a tilde

[CODE4LIB] ruby-marc, better ruby 1.9 char encoding support, testers wanted

2012-04-19 Thread Jonathan Rochkind
I have implemented fairly complete and robust proper support for character encodings in ruby-marc when reading 'binary' marc under ruby 1.9. It's currently in a git branch, not yet released, and not yet in git master. https://github.com/ruby-marc/ruby-marc/tree/char_encodings If anyone who

[CODE4LIB] Job: Head of Metadata Services at Georgetown University

2012-04-19 Thread jobs
Head of Metadata Services Georgetown University Library is seeking a dynamic, forward-thinking, innovative, energetic and teamoriented person to serve as Head of the Metadata Services Unit within the Technical ServicesDepartment. The successful candidate will have overall responsibility