Re: MARC::Record / MARC::File::XML bug when fields contain newlines?

2012-01-12 Thread arvinport...@lycos.com
Thanks Terry. I could have sworn I have processed MARC records containing 
newlines with no problems in the past (I.e., not records converted from XML), 
though I've never tried to validate them with MARCEdit. They were simply 
acceptable to III. The MARC-8 character map I use lists both 0A and 0D as 
valid. However I hunted down a more official version of the map here:

http://lcweb2.loc.gov/diglib/codetables/42.html

And sure enough neither 0A or 0D is listed. Learn something new and all that.

Looks like MARC::Record is doing its job correctly. Perhaps changing 
MARC::File::XML is in order.

Arvin

On Jan 12, 2012, Reese, Terry terry.re...@oregonstate.edu wrote: 

I wanted to follow up on this message (saw it through the digest).  New lines 
actually are invalid characters in a USMARC record.  There are a number of 
characters that are not allowed, and these happen to be two of them.  
Essentially, MARC fields cannot be multi-line.  This is why MarcEdit's 
MARCengine automatically will sanitize this data away during the breaking 
process and why records in the mnemonic format cannot be multi-lined.

--TR

-Original Message-
From: perl4lib-digest-h...@perl.org [mailto:perl4lib-digest-h...@perl.org] 
Sent: Thursday, January 12, 2012 8:31 AM
To: perl4lib@perl.org
Subject: perl4lib Digest 12 Jan 2012 16:31:25 - Issue 479


perl4lib Digest 12 Jan 2012 16:31:25 - Issue 479

Topics (messages 2952 through 2953):

MARC::Record / MARC::File::XML bug when fields contain newlines?
2952 by: arvinporthog.lycos.com
2953 by: Jon Gorman

Administrivia:

To subscribe to the digest, e-mail:
perl4lib-digest-subscr...@perl.org

To unsubscribe from the digest, e-mail:
perl4lib-digest-unsubscr...@perl.org

To post to the list, e-mail:
perl4lib@perl.org


--


Re: Re: Identifying file formats for older files

2011-09-01 Thread arvinport...@lycos.com
Unix Solaris would be best because that's where all the files are archived. If 
I have to move them to a Windows PC to do it then Windows will do as well.

Arvin

On Sep 1, 2011, Bob Paver bob.pa...@gmail.com wrote: 

What operating system? Windows, some flavor of Unix, or other?

BP


On Sep 1, 2011, at 10:55 AM, arvinport...@lycos.com arvinport...@lycos.com 
wrote:

 I have several hundred files, most from the MS DOS days, without meaningful 
 file extensions. Most are probably in some old version of MS Word but I don't 
 know for sure. I'm trying to find a way to generate a list of the files and 
 their formats. I have tried both File::Type and File::MMagic on a test 
 directory of known, modern, files but the results weren't very good.
 
 Anyone have recommendations? I'd prefer it be in perl but it doesn't have to 
 be.
 
 Arvin