Re: [CODE4LIB] regexp for LCC?

2011-04-01 Thread Kelley McGrath
At one point, much to my surprise, someone told me that 050 is defined for numbers assigned by LC not for LCC numbers per se. It doesn't really sound like that from the current definition (http://www.loc.gov/marc/bibliographic/bd050.html), but if you look on the ITS page

[CODE4LIB] regexp for LCC?

2011-03-31 Thread Jonathan Rochkind
Does anyone have a good regular expression that will match all legal LC Call Numbers from the LC Classified Schedule, but will generally not match things that could not possibly be an LC Call Number from the LC Classified Schedule? In particular, I need it to NOT match an MLC call number,

Re: [CODE4LIB] regexp for LCC?

2011-03-31 Thread Tod Olson
Check the regexp that Google uses in their call number normalization: http://code.google.com/p/library-callnumber-lc/wiki/Home You may want to remove the prefix part, and allow for a fourth cutter. The folks at UNC pointed me to this a few months ago. -Tod On Mar 31, 2011, at 11:29

Re: [CODE4LIB] regexp for LCC?

2011-03-31 Thread Jonathan Rochkind
Thanks, that looks good! It's hosted on Google Code, but I don't think that code is anything Google uses, it looks like it's from our very own Bill Dueber. On 3/31/2011 12:38 PM, Tod Olson wrote: Check the regexp that Google uses in their call number normalization:

Re: [CODE4LIB] regexp for LCC?

2011-03-31 Thread Jonathan Rochkind
Except now I wonder if those annoying MLCS call numbers might actually be properly MATCHED by this regex, when I need em excluded. They are annoying _similar_ to a classified call number. Well, one way to find out. And the reason this matters is to try and use an LCC to map to a 'discipline'

Re: [CODE4LIB] regexp for LCC?

2011-03-31 Thread Keith Jenkins
The Google Code regex looks like it will accept any 1-3 letters at the start of the call number. But LCC has no I, O, W, X, or Y classifications. So you might want to use something more like ^[A-HJ-NP-VZ] at the start of the regex. Also, there are only a few major classifications that use three

Re: [CODE4LIB] regexp for LCC?

2011-03-31 Thread Doran, Michael D
To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] regexp for LCC? Does anyone have a good regular expression that will match all legal LC Call Numbers from the LC Classified Schedule, but will generally not match things that could not possibly be an LC Call Number from the LC Classified

Re: [CODE4LIB] regexp for LCC?

2011-03-31 Thread Naomi Dushay
You could also try to use the code I put in SolrMarc utilities classes ha ha ha. - Naomi On Mar 31, 2011, at 10:25 AM, Keith Jenkins wrote: The Google Code regex looks like it will accept any 1-3 letters at the start of the call number. But LCC has no I, O, W, X, or Y classifications. So