Hi Carlos,
I am writing to you about the following: next month I'll be giving a
training course called UNIX for librarians. ... Sadly there's no
material available in Spanish about this topic.
I'm guessing that you won't find much material (in any language) on the topic
of UNIX for
Hi Jane,
These answers assume that the data you are processing:
1) is encoded in the MARC-8 character set, and
2) consists of the MARC-8 default basic and extended Latin characters.
Dave,Ayod\2003
Paòt,Kaâs\2002
Baks,Dasa\2003
,Viâs\2002
Problem 1: As you can see, I don't really want the
dope.sh is a shell script that facilitates discovery of the Oracle-Perl
environment on a Unix (Solaris) system [1]. I distribute an open-source
Perl application that incorporates a DBI/DBD::Oracle connection. The
users that implement the application generally (but not always) have the
requisite
I'm not sure that everybody who subscribes to this listserv is aware
that perl4lib listserv postings end up in the perl.perl4lib Google
Group. I know that I was a bit surprised to find that out.
Although serving a similar purpose, I make a distinction between
listservs and news groups. The main
Hi Jason,
I believe that MARC::Charset only does MARC-8 to UTF-8 conversion and vice
versa, so won't be a solution for automating your Latin-1 to MARC-8 conversion,
unless you were planning to do Latin-1 => UTF-8 => MARC-8.
A few years ago, I wrote an imperfect MARC-8 to Latin-1 character set
Subject: Re: MARC-8 to UTF-8 conversion
On 12/5/05, Doran, Michael D [EMAIL PROTECTED] wrote:
So... this is all very interesting (and I've definitely learned
something here), but like I suggested previously, this level of
digging
may be a bit beyond the casual Perl user. ;-)
Yep
: Henri-Damien LAURENT [mailto:[EMAIL PROTECTED]
Sent: Wednesday, March 14, 2007 4:18 AM
To: Doran, Michael D; perl4lib
Subject: Re: MARC::Charset
Doran, Michael D a écrit :
Hi Henri,
Although in my email client, the character in question
appears as a MICRO SIGN (µ), I am assuming
Hi Ashley,
I think &#12345; is now legal in MARC-8 to indicate a
Unicode character that isn't in the MARC-8 repertoire.
Yes, that's also my understanding [1,2], though I've not personally come across
any records yet that use that method. (Although not being a cataloger, I don't
://rocky.uta.edu/doran/
-Original Message-
From: Ashley Sanders [mailto:[EMAIL PROTECTED]
Sent: Wednesday, March 14, 2007 10:52 AM
To: Doran, Michael D
Cc: perl4lib
Subject: Re: MARC::Charset
Michael,
So, basically, you either need prior knowledge about the actual
character encoding
I can also see that this record is broken because the XML entity
&apos; is in a MARC communications format file.
The character entity &apos; *is valid* in a MARC-XML file.
It is one of the few standard character entities allowed in
an XML file, e.g., &amp;, &lt;, &gt;, and &apos;.
A recent
Hi Michael,
An example is the author (personal name) of the book that can
be found at http://catalog.loc.gov/ by searching for ISBN
5040039875 (I'm guessing the fact that the website appears to
be displaying a corrupted name may be part of the problem here).
The Library of Congress
Hi Laurence,
I'm trying to create MARC records from serials data exported
from SFX, using MARC::Charset version 0.98 to convert UTF-8
strings to MARC-8. It seems to be failing on extended Latin
characters like U+00C5 CAPITAL LETTER A WITH RING ABOVE
The encoding, U+00C5 (CAPITAL LETTER A
Hi Henri,
Is there a reason why MARC::File::XML considers only a very
strict subset of UTF-8 as valid?
I would guess that it has to do with adhering to the MARC-21 repertoire of
characters, so as to facilitate the round-trip conversion between the MARC-8
and Unicode character sets [1,2].
Hi Jackie,
I'm working on a very similar problem... converting theses/dissertations
records (in XML) to MARC records. I'm still in the testing stage, but have had
similar problems with records with diacritics in the 100 or 245 fields (however
diacritics in a 520a field don't seem to cause any
]
# http://rocky.uta.edu/doran/
-Original Message-
From: Brian Sheppard [mailto:[EMAIL PROTECTED]
Sent: Thursday, February 21, 2008 1:00 PM
To: Doran, Michael D
Cc: perl4lib@perl.org
Subject: Re: Help for utf-8 output
I'd suggest you first make sure your XML is really UTF-8
Back-story:
I have a Perl CGI program. The CGI program needs to utilize variables in one
of several separate configuration files (packages). The different packages all
contain the same variables, but with different values for those variables.
Each package represents a different language for
# 817-688-1926 mobile
# [EMAIL PROTECTED]
# http://rocky.uta.edu/doran/
-Original Message-
From: Leif Andersson [mailto:[EMAIL PROTECTED]
Sent: Sunday, April 27, 2008 3:20 PM
To: Doran, Michael D; Perl4lib
Subject: Re: Importing Perl package variables into a Perl
script
:52 PM
To: Doran, Michael D
Cc: [EMAIL PROTECTED]; Perl4lib
Subject: Re: Stripping out Unicode combining characters (diacritics)
On Mon, May 5, 2008 at 8:26 PM, Doran, Michael D [EMAIL PROTECTED] wrote:
[snip]
I'm pulling my hair out on this... so any help would be appreciated. If
there's any
PROTECTED]
# http://rocky.uta.edu/doran/
-Original Message-
From: Mike Rylander [mailto:[EMAIL PROTECTED]
Sent: Mon 5/5/2008 8:57 PM
To: Doran, Michael D
Cc: Perl4lib
Subject: Re: Importing Perl package variables into a Perl script with require
On Fri, Apr 25, 2008 at 8:46 PM, Doran, Michael
Librarian
# University of Texas at Arlington
# 817-272-5326 office
# 817-688-1926 mobile
# [EMAIL PROTECTED]
# http://rocky.uta.edu/doran/
-Original Message-
From: Doran, Michael D [mailto:[EMAIL PROTECTED]
Sent: Monday, May 05, 2008 7:27 PM
To: [EMAIL PROTECTED]
Cc: Perl4lib
Subject
Hi Emmanuel,
I'm trying to convert an ISIS database to MARC21
What is the character set encoding of the data in the ISIS database?
What is the desired character set encoding for the MARC21 records? I.e. MARC-8
or MARC Unicode (UTF-8)?
If they are dissimilar character encodings, is the data
I am working on a Perl script that retrieves data from our Voyager ILS via an
SQL query. Among other data, I have MARC records in blob form, and the script
processes one MARC record at a time. I want to be able to parse and
modify/convert the MARC record (using MARC::Record) before
, January 06, 2011 6:19 PM
To: Doran, Michael D; perl4lib
Subject: RE: MARC blob to MARC::Record object
How do I make the MARC blob into a MARC::Record object (without having
to first save it to a file and read it in with MARC::File/Batch)? The
MARC blob is already in a variable, so
, Systems Librarian
# University of Texas at Arlington
# 817-272-5326 office
# 817-688-1926 mobile
# do...@uta.edu
# http://rocky.uta.edu/doran/
-Original Message-
From: Leif Andersson [mailto:leif.anders...@sub.su.se]
Sent: Monday, January 10, 2011 8:35 AM
To: Doran, Michael D
I never got an answer to this back in 2008 and thought I might have better luck
now...
-- Michael
-Original Message-
From: Doran, Michael D
Sent: Thursday, February 21, 2008 11:03 AM
To: perl4lib@perl.org
Subject: marcdump hex switch
I have MARC::Record 2.0 installed [1
Hi Mark,
Over the years, I've done a few projects that involved manipulation of, and/or
creating MARC holdings (MFHD) records using the Perl MARC::Record module. No
problems that I know of.
-- Michael
# Michael Doran, Systems Librarian
# University of Texas at Arlington
# 817-272-5326 office
I am migrating a perl script from a server running perl v5.8.5 on Solaris 9 to
a server running perl v5.12.2 on Redhat Linux 5.5. The new environment doesn't
seem to like the syntax I'm using to open a file, and I'm scratching my head
over why that is the case.
The part that is not working
...@mail.libs.uga.edu wrote:
On Sat, Mar 17, 2012 at 5:25 PM, Doran, Michael D do...@uta.edu wrote:
It looks like the read pointer was going to the beginning of the file on
Solaris, but the end of the file on Linux. I've edited the script to do
separate opens for when I need to read the file and when
Hi Devon,
I just recently came across this presentation which lays out pretty much
all the issues with Unicode in perl, and makes some recommendations for
best practices.
While Nick Patch's presentation is excellent, I'm not sure that it lays out
pretty much all the issues with Unicode in