At one point, much to my surprise, someone told me that 050 is defined for
numbers assigned by LC not for LCC numbers per se. It doesn't really sound
like that from the current definition
(http://www.loc.gov/marc/bibliographic/bd050.html), but if you look on the
ITS page
Does anyone have a good regular expression that will match all legal LC
Call Numbers from the LC Classified Schedule, but will generally not
match things that could not possibly be an LC Call Number from the LC
Classified Schedule?
In particular, I need it to NOT match an MLC call number,
Check the regexp that Google uses in their call number normalization:
http://code.google.com/p/library-callnumber-lc/wiki/Home
You may want to remove the prefix part, and allow for a fourth cutter.
The folks at UNC pointed me to this a few months ago.
-Tod
On Mar 31, 2011, at 11:29
Thanks, that looks good!
It's hosted on Google Code, but I don't think that code is anything
Google uses, it looks like it's from our very own Bill Dueber.
On 3/31/2011 12:38 PM, Tod Olson wrote:
Check the regexp that Google uses in their call number normalization:
Except now I wonder if those annoying MLCS call numbers might actually
be properly MATCHED by this regex, when I need em excluded. They are
annoying _similar_ to a classified call number. Well, one way to find out.
And the reason this matters is to try and use an LCC to map to a
'discipline'
The Google Code regex looks like it will accept any 1-3 letters at the
start of the call number. But LCC has no I, O, W, X, or Y
classifications.
So you might want to use something more like ^[A-HJ-NP-VZ] at the
start of the regex.
Also, there are only a few major classifications that use three
To: CODE4LIB@LISTSERV.ND.EDU
Subject: [CODE4LIB] regexp for LCC?
Does anyone have a good regular expression that will match all legal LC
Call Numbers from the LC Classified Schedule, but will generally not
match things that could not possibly be an LC Call Number from the LC
Classified
You could also try to use the code I put in SolrMarc utilities classes
ha ha ha.
- Naomi
On Mar 31, 2011, at 10:25 AM, Keith Jenkins wrote:
The Google Code regex looks like it will accept any 1-3 letters at the
start of the call number. But LCC has no I, O, W, X, or Y
classifications.
So