[CODE4LIB] Web Application Developer Position at Ohio State University

2011-10-24 Thread Black, Elizabeth
Founded in 1870, The Ohio State University is a comprehensive, state-assisted university offering a complete environment for learning for its 3,000 faculty and 56,000 students. The Ohio State University Libraries is seeking a motivated, creative person to join our team charged with designing,

Re: [CODE4LIB] web spam block less awful than Captcha?

2011-10-24 Thread David Mayo
I can say from experience that that won't help - spambots even hit lone forms with nondescript names. - Dave Mayo On Mon, Oct 24, 2011 at 1:48 PM, Jonathan Rochkind rochk...@jhu.edu wrote: On 10/24/2011 1:15 PM, MJ Ray wrote: trying to design things so that the return on investment for

[CODE4LIB] marc-8

2011-10-24 Thread Eric Lease Morgan
In Perl, how do I specify MARC-8 when reading (decoding) and writing (encoding) data? Character encoding is the bane of my existence. I have learned that when reading from a file I ought to specify the type of encoding the file is in and decode accordingly, or else. Once read, it is converted

Re: [CODE4LIB] marc-8

2011-10-24 Thread Doran, Michael D
Hi Eric, In Perl, how do I specify MARC-8 when reading (decoding) and writing (encoding) data? You can't. MARC-8 is a character set that is unknown to the operating system. Your best bet is to convert MARC-8-encoded records into UTF-8. ...it is converted it Perl's internal encoding

Re: [CODE4LIB] marc-8

2011-10-24 Thread Walker, David
I know yaz-marcdump changes the encoding bit in MARC leaders. Does it also convert MARC-8 characters to UTF-8? Yes. We use it for that purpose all the time. --Dave - David Walker Library Web Services Manager California State University -Original Message- From: Code

Re: [CODE4LIB] marc-8

2011-10-24 Thread Ross Singer
On Mon, Oct 24, 2011 at 7:39 PM, Eric Lease Morgan emor...@nd.edu wrote: Okay. How do I go about converting MARC-8 encoded records into UTF-8? I know yaz-marcdump changes the encoding bit in MARC leaders. Does it also convert MARC-8 characters to UTF-8? (I guess I could simply try it and see

Re: [CODE4LIB] marc-8

2011-10-24 Thread Jonathan Rochkind
What _ought_ to be easiest of all is getting our ILS's to NEVER export Marc8 _ever_ again. UTF8 only. Sadly, that only ought to be easiest. But IMO there's no reason any of us should be dealing with Marc8 ever again. The only thing that should deal in Marc8 is an ILS, and should only input

Re: [CODE4LIB] marc-8

2011-10-24 Thread Jon Gorman
In Perl, how do I specify MARC-8 when reading (decoding) and writing (encoding) data? You can't. MARC-8 is a character set that is unknown to the operating system. Your best bet is to convert MARC-8-encoded records into UTF-8. /me throws his hands up in the air and screams! Okay. How

Re: [CODE4LIB] marc-8

2011-10-24 Thread Doran, Michael D
Hi Jonathan, I tried to figure out how to custom add a new encoding to ruby 1.9 with the idea of adding Marc8 as an actuall ruby 1.9 character encoding supported same as any other built in char encoding Not a trivial undertaking. Remember that the MARC-8 environment allows alternate

Re: [CODE4LIB] marc-8

2011-10-24 Thread Jonathan Rochkind
Yeah, but if there's Perl code and Java code to do it, can't be _that_ hard to port to ruby if I could figure out what you need to do to get first-class char encoding support in ruby 1.9 anyway. I mean, you could do it just as a library without that... but it's enough trouble that, yeah,

Re: [CODE4LIB] marc-8

2011-10-24 Thread Doran, Michael D
But I had no idea Marc8 allowed escape sequences to temporarily switch to a different encoding. Really? Oh my god. For you young'uns that were born Unicode and are a bit foggy on the MARC-8 environment (and all its... intricacies), I did a short write-up a few years ago: Coded

Re: [CODE4LIB] ISBN Regular Expression

2011-10-24 Thread Tom Pasley
If you're looking for PHP code, then I've done some work for a long-dormant project: http://code.google.com/p/txtckr/source/browse/trunk/mvc/components/identifiers HTH, Tom On Sat, Oct 22, 2011 at 6:44 AM, Kozlowski,Brendon bkozlow...@sals.eduwrote: Hi all. I'm somewhat surprised that

Re: [CODE4LIB] ISBN Regular Expression

2011-10-24 Thread Jon Gorman
Also, I don't know OpenBook to know your source data, but don't forget a lot of publishers have printed ISBNs in different ways over the past few years. The regex would choke on any hyphens. If users are copying from printed material, they could type them in. For example, one of the books near

Re: [CODE4LIB] ISBN Regular Expression

2011-10-24 Thread Bill Dueber
So much duplication. If only there were some sort of organization that might serve as a clearinghouse for this sort of code that's useful to libraries... [Yes, I know the only appropriate response is, Well, Dueber, step up and do something about it. ] On Mon, Oct 24, 2011 at 4:59 PM, Jon Gorman