Hi list,
I am a Metadata librarian but not a programmer, sorry if my question seems
naïve. We use XSLT stylesheet to transform some harvested DC records from
DSpace to MARC in MarcEdit, and then export them to OCLC.
Some characters do not display correctly and need manual editing, for example:
Hi. Is there a reason not to attempt this instead through the CLI?
Al Matthews, Software Dev,
Atlanta University Center
From: Code for Libraries [CODE4LIB@LISTSERV.ND.EDU] On Behalf Of Rosalyn Metz
[rosalynm...@gmail.com]
Sent: Wednesday, April 18, 2012
Al,
Looking at the CLI quickly it looks like its related to batch exporting.
I'm trying to quickly create +1500 new digital object records in AT (ie.
go into a resource, click new instance, choose digital object, add the
title, add the mets identifier, click save).
If I misread what the CLI
Hello all,
The code4lib-nyc chapter in conjunction with METRO is holding our
somewhat-quarterly jam session:
next Wednesday, April 25 10am-noon at the METRO Training Center,
57 E 11th Street, NYC.
Come talk about your projects, and find out what everybody's working on!
Folks without
Actually -- the issue isn't one of MARC8 versus UTF8 (since this data is being
harvested from DSpace and is UTF8 encoded). It's actually an issue with user
entered data -- specifically, smart quotes and the like. These values
obviously are not in the MARC8 characterset and cause many who
Ah, thanks Terry.
That canned cleaner in MarcEdit sounds potentially useful -- I'm in a
continuing battle to keep the character encoding in our local marc
corpus clean.
(The real blame here is on cataloger interfaces that let catalogers save
data that are illegal bytes for the character set
We see Unicode data pasted into MARC8 records all the time. It happens enough
that my MARC8-Unicode converter takes a second look at illegal MARC8 bytes and
tries a UTF-8 encoding as well.
Ralph
-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of
On 4/19/2012 3:23 PM, LeVan,Ralph wrote:
We see Unicode data pasted into MARC8 records all the time. It happens enough
that my MARC8-Unicode converter takes a second look at illegal MARC8 bytes and
tries a UTF-8 encoding as well.
Right. I see it too. I'm arguing that means cataloger entry
On 4/18/2012 12:08 PM, Jonathan Rochkind wrote:
On 4/18/2012 11:09 AM, Doran, Michael D wrote:
I don't believe that is the case. Take UTF-8 out of the picture, and
consider the MARC-8 character set with its escape sequences and
combining characters. A character such as an n with a tilde
I have implemented fairly complete and robust proper support for
character encodings in ruby-marc when reading 'binary' marc under ruby 1.9.
It's currently in a git branch, not yet released, and not yet in git
master. https://github.com/ruby-marc/ruby-marc/tree/char_encodings
If anyone who
Head of Metadata Services
Georgetown University Library is seeking a dynamic, forward-thinking,
innovative, energetic and teamoriented person to serve as Head of the Metadata
Services Unit within the Technical ServicesDepartment.
The successful candidate will have overall responsibility
11 matches
Mail list logo