Hi Brian: thanks for writing,
On Mon, Aug 25, 2003 at 04:29:37PM -0300, Brian Cassidy wrote:
As part of a previous project I was importing MARC records into an RDBMS
structure. In order to facilitate better searching, it was suggested to
me that I do some normalization on my data and that NACO
On Wed, Aug 27, 2003 at 09:15:25AM -0300, Brian Cassidy wrote:
* normalize()
inputs: either a MARC::Record object or a string. This should probably
accept an arbitrary number of inputs so, you can do
my @normrecs = normalize( @records );
rather than
my @normrecs;
foreach my $rec (
On Wed, Sep 10, 2003 at 01:57:31PM -0400, Joshua Ferraro wrote:
sub fetch_handler {
my ($args) = @_;
# warn in fetch_handler; ## troubleshooting
my $offset = $args-{OFFSET};
$offset -= 1; ## because $args-{OFFSET} 1 = record #1
On Thu, Sep 11, 2003 at 08:40:48AM -0500, Chuck Bearden wrote:
I hope this helps.
This helps for the order of the fields, but from looking at his program it looks
like the more pernicious problem is the order of the subfields within each
field!
//Ed
On Fri, Sep 19, 2003 at 07:58:01PM +0530, Saiful Amin wrote:
I never had to worry about the record_length (pos 00-04) or the
base_address (pos 12-16) in the leader. I think they are automagically
updated while writing the record via $rec-as_usmarc().
saiful++
Yes, they should be
On Thu, Sep 25, 2003 at 07:54:29AM -0400, Joshua Ferraro wrote:
Does anyone know how to add separators/terminators when building a single
MARC record?
Joshua, MARC::Record does this for you. Where is the code you used to generate
these records? Is it the Koha code?
//Ed
In case you missed and are interested in such things, perl.com ran a
good article recently on building a full text search engine with Perl
and any old relational database.
http://www.perl.com/pub/a/2003/09/25/searching.html
It provides examples of how to build and use a reverse (inverted)
On Wed, Nov 12, 2003 at 12:15:38PM +, Stephen Graham wrote:
Can't use string ( ) as a HASH ref while strict refs in use at
/usr/lib/perl5/5.8.0/ExtUtils/MM_Unix.pm line 541.
Weird, I'd be willing to try to help you figure this out if you can point me
to the Zeta source. I googled for a
On Wed, Nov 19, 2003 at 07:43:52AM -0500, Morbus Iff wrote:
The LC also uses $ to represent sub-tags (I think that's what
they're called; just woke up... the $a/$b things). But, I
seem to see _a and _b more often. Which is more prevalent?
LC's MARCMaker/MARCBreaker utilities use $ if I
On Wed, Nov 19, 2003 at 11:50:05AM -0500, Morbus Iff wrote:
Has anyone encountered targeted spam from perl4lib or oss4lib posts?
I've posted numerous times to perl4lib, and once to oss4lib. Just now,
I suddenly got a spam for BowkerLink, which submits to Ulrich's
Periodicals Directory,
On Tue, Nov 25, 2003 at 11:14:09AM -0500, Paul Hoffman wrote:
Are you familiar with Test::More? It has some cool features that can
be tricky (conditionally skipping tests, TODO tests, etc.), so holler
if you have questions. I haven't examined MARC::Record's test suite
closely, but what
Bryan:
On Tue, Nov 18, 2003 at 02:31:59PM -0600, Bryan Baldus wrote:
When I ran Lint on a file of records, one of the errors I received was 250:
Subfield _b is not allowed.
The LC doc [1] is meticulously formatted (which is what allows
specs to do what it does). Unfortunately the 250 has a
On Sun, Dec 07, 2003 at 08:53:04PM +0100, Leif Andersson wrote:
Recently on this list it was discussed whether letters as indicators
should be allowed or not. As I understood it, it was concluded that
Field.pm and USMARC.pm should be fixed to allow for this.
Good, our national dialect of the
On Tue, Dec 16, 2003 at 03:52:56PM +0100, Tajoli Zeno wrote:
1)When you call LOC without a specific character you recive data in MARC-8
character set.
2) In MARC-8 character set a letter like è [e grave] is done with TWO
bytes one for the sign [the grave accent] and one for the letter
On Mon, Jan 05, 2004 at 03:54:09PM -0500, Eric Lease Morgan wrote:
The code works, but is really slow. Can you suggest a way to improve my code
or use some other technique for extracting things like author, title, and id
from my XML?
It's slow because you're building a DOM for the entire
On Mon, Jan 05, 2004 at 10:27:39PM -0500, Eric Lease Morgan wrote:
Since my original implementation is still the fastest, and the newer
implementations do not improve the speed of the application, then I must
assume that the process is slow because of the XSLT transformations
themselves. These
? I'm not
confident that the MARC will have survived translation into the body of
your email message.
Thanks!
//Ed
--
Ed Summers
aim: inkdroid
web: http://www.inkdroid.org
The imagination of nature is far, far greater than the imagination of man. [Richard
Feynman]
...for advanced users only :)
But it would be even nicer to know exactly what's going on here first.
//Ed
--
Ed Summers
aim: inkdroid
web: http://www.inkdroid.org
The deeper I go the darker it gets. [Peter Gabriel]
On Fri, Jan 16, 2004 at 05:15:34PM -0600, David Christensen wrote:
Actually, I *was* wondering how to package that all up as a single thingy. I
image it would be something like Tk-MARC-0.1, but I've no idea how to
bundle packages I'm searching through docs as we speak :-)
Well you
I'm forwarding this along in case there are any perl4lib folks who are
interested in GIS systems/data.
//Ed
From: Aran Deltac [EMAIL PROTECTED]
To: [EMAIL PROTECTED]
Subject: Geography Namspace
Date: Sat, 7 Feb 2004 12:13:59 -0500
I've begun some preliminary work on the geography/geo/gis
On Wed, Feb 25, 2004 at 10:56:17AM -0600, Holly Bravender wrote:
Take me off your list! Thank you.
Holly, please send a message to [EMAIL PROTECTED] and respond
to the confirmation that you should receive. Instructions are available
at http://perl4lib.perl.org
If you have trouble please
Hi Rob:
On Wed, Feb 25, 2004 at 03:31:07PM -0500, Robert Fox wrote:
1. Am I using the best XML processing module that I can for this sort of
task?
XPath expressions require building a document object model (DOM) of your XML
file. Building a DOM for a huge file is extremely expensive since it
On Tue, Feb 17, 2004 at 10:55:35AM -0300, Oberdan Luiz May wrote:
I'm running perl 5.8.3 on Solaris 2.6, with the last version of all
modules needed, the latest Berkeley DB, all compiled with GCC 3.3.2 . Any
hints?
There was a bug in MARC::Charset v0.5 which was causing the EastAsian
was pleased with, so I wrote a little command line utility [2] that
does the same thing, with documentation if you want to try it out yourself.
//Ed
[1] http://www.oclc.org/research/researchworks/
[2] http://www.inkdroid.org/code/tools/naf
--
Ed Summers
aim: inkdroid
web: http://www.inkdroid.org
Life
Hi Enrico:
On Sun, Mar 14, 2004 at 02:47:26PM -0500, Enrico Silterra wrote:
I think that having various derived classes of MARC records.
Holding, Bib Records, Name Authority, etc would be useful.
Interesting question. MARC::Record should handle holdings, authority,
classification, and
On Mon, Mar 15, 2004 at 09:21:49AM -0500, Enrico Silterra wrote:
For instance, a holding record has no title fields at all. I think, maybe,
the title method should throw an exception, or error when you try to grab
the 245 of a holding or other record. (or call a user defined error handler)
perl.com just published an article about creating dictionaries with Perl
by Sean Burke.
--
Sean Burke is a linguist who helps save dying languages by
creating dictionaries for them. He shows us how he uses Perl to
lay out and print these dictionaries, using RTF::Writer and some
data structure
On Fri, Apr 02, 2004 at 11:35:40AM -0500, Michael Bowden wrote:
Sirsi uses some non standard subfields to create links between records.
Typically these subfields are '?' and '='. How can I add these non
standard subfields to records that I am creating/editing with
MARC::Record?
MARC::Record
from STDIN?
Try using the magic filehandle. So in foo.pl :
while ( defined( $line = ) ) {
...
}
The magic filehandle will read stuff from @ARGV and will also read from
STDIN.
//Ed
--
Eric Lease Morgan
(574) 631-8604
--
Ed Summers
aim: inkdroid
web: http
On Wed, Jun 23, 2004 at 11:25:48AM -0700, Jon Legree wrote:
Any suggestions, comments, assistance will be greatly appreciated.
Are we talking about patc_server.cgi?
Just out of curiosity what is the $datapath that is defined at the top of
patc_server.cgi file which indicates what directory to
On Thu, Jul 01, 2004 at 11:22:42AM -0400, Houghton,Andrew wrote:
I'm not sure what MARC::Charset does internally, but MARC-8
defines the diacritic separate from the base character. So
even using binmode(STDOUT,:utf8) will produce two characters,
one for the base character followed by the
A MARC-8 sequence places a combining diacritical mark BEFORE the letter
it's supposed to combine. Whereas Unicode syntax is to put it AFTER the
letter it's supposed to combine with.
Hence for example the letter: Z
is produced by the MARC-8 Sequence:
75 5A (macron below + Z)
but
On Thu, Aug 05, 2004 at 08:32:32AM -0500, Anne Highsmith wrote:
How do I skip over the erroneous record and keep processing the rest of
the file?
my $batch = MARC::Batch-new( 'USMARC', 'file.dat' );
$batch-strict_off();
//Ed
On Tue, Aug 10, 2004 at 02:18:35PM +0200, Paul POULAIN wrote:
perl4lib ML, MARC::Record maintainer(s), any idea ?
Not really no. I think we'd need chapter and verse from the relevant specs
to even start thinking about changing this. Especially after the last go
round :)
//Ed
On Tue, Aug 10, 2004 at 04:22:04PM +0200, Paul POULAIN wrote:
danmarc2 HAS subfields lower than 010...
It would help if documentation could be found that supports this.
It would also be nice if we could see a sample of 10 or so sample records
as well.
//Ed
--
Ed Summers
aim: inkdroid
web
I don't feel like I understand the example very well. Have you considered
changing the regex to match multiple punctuation marks at the end of line?
$abstract =~ m/([a-zA-Z0-9\.]+)[.!?]*\s*$/x;
^
|
On Wed, Aug 18, 2004 at 08:23:59AM -0500, Bryan Baldus wrote:
Both seem to fail to capture the warnings reported by MARC::File::USMARC.
There appears to be a bug in MARC::Batch::next() code at line 123 which
extracts the warnings from the newly instantiated MARC::Record object
and stuffs them
... I've not usually bothered to look at how the tests or the
Makefile.PL work. This is one reason I haven't tried to distribute my
modules through CPAN.
What no OS X yet!? The drag and drop trick is what you are stuck with
in MacPerl, and it's kind of a testament to Perl's flexibility that
Apologies if you already saw this over on xml4lib...but it's relevant
here given the use of Perl. //Ed
===
Nature Publishing Group (NPG) are pleased to announce the lastest
release of their open-source RSS aggregator 'Urchin' to SourceForge
(http://urchin.sf.net). Initially funded by the UK
don't mind the list being referenced from two locations:
$self-{author_ids} = [EMAIL PROTECTED];
Hope this helps!
//Ed
--
Ed Summers
aim: inkdroid
web: http://www.inkdroid.org
The deeper I go the darker it gets. [Peter Gabriel]
On Thu, Nov 04, 2004 at 12:20:11AM -0500, Eric Lease Morgan wrote:
Do y'all know of any Perl-based OAI data repository software.
Check out oai-perl, which is from the same group that produce
eprints.org, and may in fact be the OAI core of eprints.
http://oai-perl.sourceforge.net/
//Ed
The author of Business::ISBN is looking for a grant from the Perl
foundation to update the module to work with 13 digit ISBNs.
Business::ISBN is an essential tool for working with ISBNs. If you
use the module brian would appreciate it if you could send him a note
about how you are using it. Any
On Wed, Nov 24, 2004 at 08:22:47AM +, Ashley Sanders wrote:
Is MARC::Record trying to treat than as Unicode when in fact they
are MARC-8?
MARC::Record currently does no transformation of character sets that
I'm aware of. There is a completely separate module MARC::Charset
which provides
as a separate
package to CPAN before releasing new versions. That way we have a baseline
to work from.
Bryan if you need help doing this for the first time (from SourceForge) let me
know and I'll give you a hand (inkdroid on AIM and Yahoo).
//Ed
--
Ed Summers
aim: inkdroid
web: http
I'm thinking that the MicroLIF failure is due to line endings being
different on Mac versions OS X.
There is code in MARC::File::MicroLIF::_get_chunk that handles DOS
(\r\n) and Unix (\n) line endings, but not Mac (\r).
Does anyone know if \r is a legit line ending in MicroLIF?
//Ed
On Fri, Jan 07, 2005 at 08:53:40AM +0100, Ron Davies wrote:
I will have a similar project in a few months' time, converting a whole
bunch of processing from MARC-8 to UTF-8. I would be very happy to assist
in testing or development of a UTF-8 capability for MARC::Record. Is the
problem
On Thu, Jan 06, 2005 at 10:03:13PM -0600, Bryan Baldus wrote:
Is there any problem with committing the revised version of
MARC::File::USMARC, and adding+committing the three files above to
cvs in the t/ directory?
Nice work :) as long as the tests pass I think comitting sounds like a
good
Hi Jane:
On Tue, Jan 11, 2005 at 01:29:55PM -0500, Jacobs, Jane W wrote:
My result was something like:
Dave,Ayod\2003
Paòt,Kaâs\2002
Baks,Dasa\2003
,Viâs\2002
Problem 1: As you can see, I don't really want the first four characters, I
want the first four SEARCHABLE characters. How
know.
--
Ed Summers
aim: inkdroid
web: http://www.inkdroid.org
He who binds to himself a joy Does the winged life destroy; But he who kisses
the joy as it flies Lives in eternity's sun rise. [William Blake]
On Mon, Jan 24, 2005 at 08:37:41AM -0600, Bryan Baldus wrote:
I generally 'use warnings' or use the -w flag in the modules and scripts
I've been writing. I didn't notice it was missing. I need to add strict and
warnings to CodeData, as well. In modules/package files, is it practice to
leave
like
web4lib/xml4lib do for you.
//Ed
[1] http://www.mail-archive.com/perl4lib%40perl.org/
--
Ed Summers
aim: inkdroid
skype: inkdroid
web: http://www.inkdroid.org
Give and ye shall receive.
[Bram Cohen]
I wondered if any of you had run into similar problems, or if you had
any thoughts on how to tackle this particular issue.
It's ironic that MARC::Record *used* to do what Andrew suggests: using
split() rather than
than substr() with the actual directory lengths. The reason for the
switch was
Ok, this is great information to have moving forward wi the next
MARC::Charset...many thanks Michael and Jason. Micheal you are totally
right the installer really shouldn't fail like that...I'd never tested
it on a system that lacked DB_File so I didn't know. And CPAN testers
didn't pick it up
Am I right that this amounts to less than 1Meg (EastAsian.db +
UTF8.db)? Depending on your system and your needs (more
speed?), that may not be considered large and might fit into
memory fine. Otherwise, I think any of the in-core (non-DB_File)
DBM files ought to suffice for that amount of
Code4lib 2006 Conference – Registration Now Open
Registration is now open for Code4lib 2006. Code4lib 2006 is a loosely
structured conference for library technologists to commune,
gather/create/share ideas and software, be inspired, and forge
collaborations. It is also an outgrowth of the Access
On 1/11/06, Sperr, Edwin [EMAIL PROTECTED] wrote:
Well this is odd:
C:\Documents and Settings\esperrppm install marc-xml
Installing package 'marc-xml'...
Error installing package 'marc-xml': Read a PPD for 'marc-xml', but it
is not in
tended for this build of Perl (MSWin32-x86-multi-thread)
When I downloaded the tarball and installed manually I noticed that
the the MANIFEST references a META.yml file, but the tarball doesn't
include one. Perhaps this is somehow choking up CPAN? The 'make dist'
command should generate a META.yml file for you.
I would ask on the cpan-discuss [1] list
On 3/16/06, Mike Rylander [EMAIL PROTECTED] wrote:
Will some brave soul please test this with some UNIMARC records and
let me know how it goes?
Yes please, add the test to the test suite if possible Joshua and Paul.
miker_++
//Ed
I apologize, but I'm finding it hard to trace what exactly this script
is doing. I did take a look at the first failure and sure enough the
record leader says it's 463 bytes but the record itself is 464 bytes.
So a failure is warranted -- given the current behavior of
MARC::Record.
Perhaps
If what follows seems boring and you use MARC::Charset with any
regularity just upgrade MARC::Charset to v0.97. If you are interested
in knowing why read on...
Thanks for the details [1] Michael. You've uncovered a rather nasty
bug in MARC::Charset = v0.8. MARC::Charset::Compiler processes LCs
Can you post said script, or send us a URL for it?
//Ed
It looks like you don't have an XML parser installed that supports the
features that M::F::X requires:
use XML::SAX qw(Namespaces Validation);
Try executing that, and see if you get a similar exception. FWIW
Namespace support is required for the version of MARC::File::SAX that
is in CVS since
+1
Thanks for working on this Galen.
//Ed
On Sun, Jul 26, 2009 at 8:54 PM, Galen Charltongmcha...@gmail.com wrote:
Hi,
On Wed, Jul 22, 2009 at 5:04 PM, Dan Scottdeni...@gmail.com wrote:
It would be nice to see the 0.91 release get pushed out the door, in
any case. 0.88 was a long time ago.
Hi Michele:
I copied and pasted the XML from your email and ran it through a
simple test script (both attached) and the record seemed to be parsed
ok. What do you see if you run the attached test.pl?
//Ed
test.pl
Description: Binary data
marc:record xmlns:marc=http://www.loc.gov/MARC21/slim;
Hi Michele:
Yes, I see a UTF-8 encoding error in that file when I try to check it
with xmllint (from the libxml2 package):
e...@curry:~/Downloads$ xmllint marc.xml
marc.xml:1: parser error : Input is not proper UTF-8, indicate encoding !
Bytes: 0xE0 0x20 0x3A 0x3C
ld code=ble infrastrutture, l'
On Tue, Mar 16, 2010 at 11:13 PM, Galen Charlton gmcha...@gmail.com wrote:
If there are no major objections, in a week's time I plan to make the
CVS repo read-only and we'll move forward with Git.
Hooray, thanks so much Galen! Sounds like a great plan moving forward.
//Ed
mi...@esilibrary.com
mjordan mjor...@sfu.ca
morbus mor...@disobey.com
moregan more...@flr.follett.com
petdance a...@petdance.com
Also, you'll want to set up your git profile appropriately. So in my case:
git config --global user.name Ed Summers
git config --global user.email e
Hi Leif,
Is the downside to this approach that you are modifying a CPAN module
in place, or is it something to do with the behavior of 'use bytes'?
Would there be any undesirable side effects to adding 'use bytes' to
MARC::File::USMARC::encode on CPAN?
//Ed
On Tue, Oct 12, 2010 at 7:58 AM, Leif
On Tue, Oct 12, 2010 at 9:05 AM, Leif Andersson
leif.anders...@sub.su.se wrote:
To sum up.
I think it is a good idea to make the MARC blob a binary object, so to speak.
I don't know if you should just apply my simple hacks to CPAN code.
Or if it is called for a thourough re-write of some parts
Galen, thanks very much for continuing to develop MARC::Charset. You
should feel free to update the Makefile.PL and README to list you as
the author now, since you have taken an active role in maintaining it.
One of the most gratifying parts of my work as a software developer
has been seeing
On Tue, Nov 8, 2011 at 9:32 AM, Jon Gorman jonathan.gor...@gmail.com
wrote: First, on the Library of Congress data, Internet Archive has
a snapshot of the LoC information from 2007. It was collected by
the Scriblio project
http://www.archive.org/details/marc_records_scriblio_net. There's
also
Hi Anja
Sorry for the delay. Yes please feel free to translate it and make it
available. I think it's awesome that you want to! I am cc'ing the
perl4lib mailing list where people continue to talk about MARC::Record
and related modules.
Best wishes,
//Ed
On Tue, Feb 5, 2013 at 6:39 AM, Anja
it.
Of course, I will include a reference to the original page.
Keep in touch,
Anja Skrba
Anja Skrba
an...@webhostinggeeks.com
http://science.webhostinggeeks.com/
Tel: +38162300604
On Wed, Feb 20, 2013 at 1:47 PM, Ed Summers e...@pobox.com wrote:
Hi Anja
Sorry for the delay. Yes please
I forgot to add that, if you need help converting your translation to POD
and getting it into Git I would be happy to work with you on that.
//Ed
On Tue, Mar 5, 2013 at 11:59 AM, Ed Summers e...@pobox.com wrote:
Hi Anja,
Is your translation available as POD? I think it would make a nice
On Thu, Mar 7, 2013 at 7:54 AM, Anja Skrba an...@webhostinggeeks.comwrote:
Hi Ed,
I never did this so you'll have to explain me how to do it :)
Anja Skrba
an...@webhostinggeeks.com
http://science.webhostinggeeks.com/
Tel: +38162300604
On Wed, Mar 6, 2013 at 10:57 AM, Ed Summers e
75 matches
Mail list logo