[CODE4LIB] Registry blog post of interest ...

2009-10-12 Thread Diane I. Hillmann
As some of you know, the RDA registrars have been working with the Deutsche Nationalbibliothek to enable a German language translation of the RDA elements and vocabularies to be available using the same mechanism as the English original. Today, Veronika Leibrecht, who's been working on this,

[CODE4LIB] Job Posting: Digital Archivist (UVa, Charlottesville, VA)

2009-10-12 Thread Graham, Wayne (wsg4w)
Hi All, The University of Virginia Library in Charlottesville, VA has just posted a new position for a Digital Archivist (http://bit.ly/Rhhws). This is a two-year, grant funded position by the Andrew Mellon Foundation to develop an inter-institutional model for stewardship for born-digital

[CODE4LIB] lingua::stem::snowball

2009-10-12 Thread Eric Lease Morgan
Can someone help me use Lingua::Stem::Snowball more efficiently? I want to count the total number of times a word stem appears in a hash. Here is a short example: use strict; use Lingua::Stem::Snowball; my $idea = 'books'; my %words = ( 'books'= 5, 'library' = 6,

Re: [CODE4LIB] lingua::stem::snowball

2009-10-12 Thread Benjamin Florin
It's been a while since I perled, so this might not be the most idiomatic solution, but you could stem the entire words has list once and create a hash of all the sums (%words_stems), then run the list of idea words (@ideas), checking only the desired stems: use strict; use

Re: [CODE4LIB] lingua::stem::snowball

2009-10-12 Thread Matt Jones
Presumably the call to stem() is the expensive part of your loop, so I'd want to cut that out if that is true. It looks to me that you can pass in an array reference to stem(), so there's no need for calling stem() in a loop at all. I'd think something like the code below should help reduce your