Re: [CODE4LIB] Code4Lib Midwest?

2010-03-08 Thread Eric Lease Morgan
I have created a crude map illustrating where people might be coming from for a 
regional Midwest Code4Lib meeting. [1]

If others from the midwest were to update the wiki with their possible 
attendance, then I could make the map more accurate. Right now, the geographic 
center of the meeting is around Joliet, IL.  8-)  In the meantime, I will 
organize a conference call, and allow just about anyone to participate, in 
order to figure out when and where such a regional meeting might take place.

'Make sense?

[1] http://wiki.code4lib.org/index.php/Midwest

-- 
Eric Lease Morgan
University of Notre Dame


Re: [CODE4LIB] Q: XML2JSON converter

2010-03-08 Thread Benjamin Young

On 3/6/10 6:59 PM, Houghton,Andrew wrote:

From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
Bill Dueber
Sent: Saturday, March 06, 2010 05:11 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Q: XML2JSON converter

Anyway, hopefully, it won't be a huge surprise that I don't disagree
with any of the quote above in general; I would assert, though, that
application/json and application/marc+json should both return JSON
(in the same way that text/xml, application/xml, and
application/marc+xml can all be expected to return XML).
Newline-delimited json is starting to crop up in a few places
(e.g. couchdb) and should probably have its own mime type
and associated extension. So I would say something like:

application/json -- return json (obviously)
application/marc+json  -- return json
application/marc+ndj  -- return newline-delimited json
 

This sounds like consensus on how to deal with newline-delimited JSON in a 
standards based manner.

I'm not familiar with CouchDB, but I am using MongoDB which is similar.  I'll 
have to dig into how they deal with this newline-delimited JSON.  Can you 
provide any references to get me started?
   
Rather than using a newline-delimited format (the whole of which would 
not together be considered a valid JSON object) why not use the JSON 
array format with or without new lines? Something like:


[{key:value}, {key,value}]

You could include new line delimiters after the , if you needed to 
make pre-parsing easier (in a streaming context), but may be able to get 
away with just looking for the next , or ] after each valid JSON object.


That would allow the entire stream, if desired, to be saved to disk and 
read in as a single JSON object, or the same API to serve smaller JSON 
collections in a JSON standard way.


CouchDB uses this array notation when returning multiple document 
revisions in one request. CouchDB also offers a slightly more annotated 
structure (which might be useful with streaming as well):


{
  total_rows: 2,
  offset: 0,
  rows:[{key:value}, {key,value}]
}

Rows here plays the same roll as the above array-based format, but 
provides an initial row count for the consumer to use (if it wants) for 
knowing what's ahead. The offset key is specific to CouchDB, but 
similar application specific information could be stored in the header 
of the JSON object using this method.

In all cases, we should agree on a standard record serialization,
though, and the pure-json returns should include something that
indicates what the heck it is (hopefully a URI that can act as a
distinct namespace-type identifier, including a version in it).
 

I agree that our MARC-JSON serialization needs some namespace identifier in 
it and it occurred to me that the way it is handling indicators, e.g., ind1 and ind2 
properties, might be better handled as an array to accommodate IFLA's MARC-XML-ish where 
they can have from 1-9 indicator values.

BTW, our MARC-JSON content is specified in Unicode not MARC-8, per the JSON 
standard, which means you need to use \u notation to specify characters in 
strings, not sure I made that clear in earlier posts.  A downside to the 
current ECMA 262 specification is that it doesn't support \U00XX, as Python 
does, for the extended characters.  Hopefully that will get rectified in a 
future ECMA 262 specification.

   

The question for me, I think, is whether within this community,  anyone
who provides one of these types (application/marc+json and
application/marc+ndj) should automatically be expected to provide both.
I don't have an answer for that.
 
As far as mime-type declarations go in general, I'd recommend avoiding 
any format specific mime types and sticking to the application/json 
format and providing document level hints (if needed) for the content 
type. If you do find a need for the special case mime types, I'd 
recommend still responding to Accepts: application/json whenever 
possible--for the sake of standards. :)


All told, I'm just glad to see this discussion being had. I'll be happy 
to provide some CouchDB test cases (replication, etc) if that's of 
interest to anyone.


Thanks,
Benjamin

I think this issue gets into familiar territory when dealing with RDF formats.  
Let's see, there is N3, NT, XML, Turtle, etc.  Do you need to provide all of 
them?  No, but it's nice of the server to at least provide NT or Turtle and 
XML.  Ultimately it's up to the server.  But the only difference between use 
cases #2 and #3 is whether the output is wrapped in an array, so it's probably 
easy for the server to produce both.

Depending on how much time I get next week I'll talk with the developer network 
folks to see what I need to do to put a specification under their 
infrastructure.  Looks like from my schedule it's going to be another week of 
hell :(


Andy.
   


Re: [CODE4LIB] Q: XML2JSON converter

2010-03-08 Thread Houghton,Andrew
 From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of
 Benjamin Young
 Sent: Monday, March 08, 2010 09:32 AM
 To: CODE4LIB@LISTSERV.ND.EDU
 Subject: Re: [CODE4LIB] Q: XML2JSON converter
 
 Rather than using a newline-delimited format (the whole of which would
 not together be considered a valid JSON object) why not use the JSON
 array format with or without new lines? Something like:
 
 [{key:value}, {key,value}]
 
 You could include new line delimiters after the , if you needed to
 make pre-parsing easier (in a streaming context), but may be able to
 get
 away with just looking for the next , or ] after each valid JSON
 object.
 
 That would allow the entire stream, if desired, to be saved to disk and
 read in as a single JSON object, or the same API to serve smaller JSON
 collections in a JSON standard way.

I think we just went around full circle again.  There appear to be two distinct 
use cases when dealing with MARC collections.  The first conforms to the ECMA 
262 JSON subset.  Which is what you described, above:

[ { key : value }, { key : value } ]

its media type should be specified as application/json.

The second use case, which there was some discussion between Bill Dueber and 
myself, is a newline delimited format where the JSON array specifiers are 
omitted and the objects are specified one per line without commas separating 
objects.  The misunderstanding between Bill and I was that this malformed 
JSON was being sent as media type application/json which is not what he was 
proposing and I misunderstood.  This newline delimited JSON appears to be an 
import/export format in both CouchDB and MongoDB.

In the FAST work I'm doing I'm probably going to take an alternate approach to 
generating our 10,000 MARC record collection files for download.  The approach 
I'm going to take is to create valid JSON but make it easier for the CouchDB 
and MongoDB folks to import the collection of records.  The format will be:

[
{ key : value }
,
{ key : value }
]

the objects will be one per line, but the array specifier and comma delimiters 
between objects will appear on a separate line.  This would allow the CouchDB 
and MongoDB folks to run a simple sed script on the file before import:

sed -e '/^.$/D' file.json  file.txt

or if they are reading the data as a raw text file, they can just ignore all 
lines that start with opening brace, comma, or closing brace, or alternately 
only process lines starting with an opening brace.

However, this doesn't mean that I'm balking on pursuing a separate media type 
specific to the library community that specifies a specific MARC JSON 
serialization encoded as a single line.

I see multiple steps here with the first being a consensus on serializing MARC 
(ISO 2709) in JSON.  Which begins with me documenting it so people can throw 
some darts at.  I don't think what we are proposing is controversial, but it's 
beneficial to have a variety of perspectives as input.


Andy.


Re: [CODE4LIB] Code4Lib Midwest?

2010-03-08 Thread John Wynstra
I'm in Iowa and could see going as far as Chicago for a one day event. 
Indiana, Michigan and Ohio locations would most likely take me out of 
the mix.


P.S. - Could stuff the Fender HM Strat and Fender Hot Rod Deluxe into 
the trunk for the trip.  Excellent coding tools.


On 3/8/2010 8:09 AM, Eric Lease Morgan wrote:

I have created a crude map illustrating where people might be coming from for a 
regional Midwest Code4Lib meeting. [1]

If others from the midwest were to update the wiki with their possible 
attendance, then I could make the map more accurate. Right now, the geographic center of 
the meeting is around Joliet, IL.  8-)  In the meantime, I will organize a conference 
call, and allow just about anyone to participate, in order to figure out when and where 
such a regional meeting might take place.

'Make sense?

[1] http://wiki.code4lib.org/index.php/Midwest



--

John Wynstra
Library Information Systems Specialist
Rod Library
University of Northern Iowa
Cedar Falls, IA  50613
wyns...@uni.edu
(319)273-6399



[CODE4LIB] ignore my last message

2010-03-08 Thread Jonathan Rochkind
As usual, I'm great at sending the WORST messages to the wrong list. My 
email client is messing up all over.  Please do not reply to that one on 
list, please ignore it, and Eric please remove it from teh archives is 
possible.


Man, today is not my day. I've got to stop using email for a year or 
something.


Re: [CODE4LIB] ignore my last message

2010-03-08 Thread Mike Taylor
On 8 March 2010 17:04, Jonathan Rochkind rochk...@jhu.edu wrote:
 As usual, I'm great at sending the WORST messages to the wrong list. My
 email client is messing up all over.  Please do not reply to that one on
 list, please ignore it, and Eric please remove it from teh archives is
 possible.

 Man, today is not my day. I've got to stop using email for a year or
 something.

This kind of thing is always going to happen from time to time on a
list configuration to fail maximally hard when it fails at all.  See
Reply-To Munging Considered Harmful:
http://www.unicom.com/pw/reply-to-harmful.html


Re: [CODE4LIB] ignore my last message

2010-03-08 Thread Walker, David
That was not a reply but a new message.

--Dave

==
David Walker
Library Web Services Manager
California State University
http://xerxes.calstate.edu

From: Code for Libraries [code4...@listserv.nd.edu] On Behalf Of Mike Taylor 
[m...@indexdata.com]
Sent: Monday, March 08, 2010 9:14 AM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] ignore my last message

On 8 March 2010 17:04, Jonathan Rochkind rochk...@jhu.edu wrote:
 As usual, I'm great at sending the WORST messages to the wrong list. My
 email client is messing up all over.  Please do not reply to that one on
 list, please ignore it, and Eric please remove it from teh archives is
 possible.

 Man, today is not my day. I've got to stop using email for a year or
 something.

This kind of thing is always going to happen from time to time on a
list configuration to fail maximally hard when it fails at all.  See
Reply-To Munging Considered Harmful:
http://www.unicom.com/pw/reply-to-harmful.html


Re: [CODE4LIB] ignore my last message

2010-03-08 Thread Jonathan Rochkind
In this case, I have nobody to blame but my email client autocomplete, 
and the fact that i had to try to send the message like five times to 
get it to go through, and one of those times didn't catch my 
autocomplete doing the wrong thing.


Mike Taylor wrote:

On 8 March 2010 17:04, Jonathan Rochkind rochk...@jhu.edu wrote:
  

As usual, I'm great at sending the WORST messages to the wrong list. My
email client is messing up all over.  Please do not reply to that one on
list, please ignore it, and Eric please remove it from teh archives is
possible.

Man, today is not my day. I've got to stop using email for a year or
something.



This kind of thing is always going to happen from time to time on a
list configuration to fail maximally hard when it fails at all.  See
Reply-To Munging Considered Harmful:
http://www.unicom.com/pw/reply-to-harmful.html

  


Re: [CODE4LIB] conf reviews -- need some advice

2010-03-08 Thread Jodi Schneider
On Mon, Mar 8, 2010 at 4:56 PM, Jonathan Rochkind rochk...@jhu.edu wrote:

 So I've received 2 out of 4 conference reviews from scholarship
 attendees.


Actually, there are only 3. One awardee could not attend.


 Will be sent in two subsequent messages to list, to not go over maximum
 message size.
 I am personally NOT very happy with them, not sure if I'm interested in
 publishing them or not.  They unfortunately the kind of what I did on my
 Code4Lib Summer Vacation articles that I was trying to discourage, I'm not
 sure whether they are actually useful for our journal audience?

 Somehow last year I succeeded in encouraging the scholarship recipients to
 avoid this kind of thing and write something actually interesting for a
 general audience. This year, not so much. In part, this may be that i had
 less time to spend on it, and wasn't as careful with my communications with
 the invitees.  I think that initially telling the recipients that their
 required report for the scholarship committee and their article in C4LJ are
 the same thing didn't help either.


Sorry--that's what we've done in past years. Though we were a bit more
upfront about it in communication this year. Feel free to solicit something
else.


 But I'm not sure what to do at this point.  I would really appreciate
 someone else taking a look at these reviews, and giving some feedback on
 whether the journal should publish them at all -- or maybe skip the reviews
 from scholarship awardees this year? Or something else?
 Of the other 2 outstanding, one is promissed by the author for today, the
 other I haven't heard from the author, so assume it's not coming.

 Jonathan



[CODE4LIB] UBC Library EZproxy GUI Wondertool Release

2010-03-08 Thread Paul Joseph
Hi all,

Thanks to everyone who inquired about accessing screenshots and the code for
our home-grown EZproxy GUI Wondertool. We've had a chance to clean up the
code and are now ready to share it with anyone interested in simplifying the
EZproxy configuration file management process. Take a look at the
screenshots, feel free to download the application, and please let me know
if and how well it works for you.

http://sites.google.com/site/ezproxywondertool/

Thanks,

Paul


Re: [CODE4LIB] Code4Lib Midwest?

2010-03-08 Thread Eric Lease Morgan
 ...In the meantime, I will organize a conference call, and allow just about 
 anyone to participate, in order to figure out when and where such a regional 
 meeting might take place...

If you would like to participate in a synchronous chat to discuss the when, 
where, and what of a Code4Lib Midwest regional meeting, then please complete 
the Doodle poll at the other end of this URL:

  http://doodle.com/mfc6cgxaqmwx35dm

Based on how people respond, a conference call or maybe even and IRC chat will 
be scheduled.

-- 
Eric Lease Morgan
University of Notre Dame

(574) 631-8604


[CODE4LIB] Library Review call for papers: open source in libraries

2010-03-08 Thread Alan Poulter
An upcoming issue of Library Review will feature papers from a conference
held last year in London, Breaking the Barriers 2009
(http://www.openlibraries.eu/?page_id=48). We are interested in getting
other perspectives on the applications of open source software in
libraries: case studies, new applications, cost-benefit studies etc.
Please get in contact with any proposals for papers you may have.

For details about Library Review itself please see:

http://info.emeraldinsight.com/products/journals/journals.htm?id=lr

Alan Poulter - Associate Editor
Dept of Computer and Information Sciences
University of Strathclyde
mailto:alan.poul...@cis.strath.ac.uk
http://www.cis.strath.ac.uk/cis/staff/index.php?uid=ap
tel: 0141 548 3911

The University of Strathclyde is a charitable body, registered in
Scotland, with registration number SC015263


[CODE4LIB] Register Now for Code4Lib Northwest!

2010-03-08 Thread Kyle Banerjee
REGISTER NOW at http://www.surveymonkey.com/s/QFJ6C92

EVENT INFORMATION:

When:  Monday June 7th, 2010
Start: 9 AM
End: 4:00 PM, with evening gathering for those interested at one of
Portland's many local pubs/establishments
Where:  White Stag, Portland, Oregon
Cost:  $50 (primarily to cover snacks, refreshments and lunch for the group)
Size:  Facility can accommodate approximately 60 participants.
Website: http://groups.google.com/group/pnwcode4lib/web/code4lib-northwest-2010

PRE-CONFERENCE ACTIVITIES

When:  Sunday, June 6th, 2010
Start: 6 PM
End:  ???
Where:  TBA
A pre-Code4Lib PNW get together for folks to get together, have a
drink and chat.

BACKGROUND:

The code4lib Conference was born in a chatroom discussion in November
2005*. Code4lib members quickly coalesced around the idea and a
popular, annual conference for library technologist and developers was
born.  The Pacific Northwest Code4Lib group was formed to build upon
the original goals of the group/conference.  It aims to connect
developers so they can share information about projects, trends and
technologies. Most importantly, it seeks to develop a community where
people share information and experience and collaborate on work
towards common goals.

FORMAT:

The Code4Lib Northwest meeting will be a one day conference in
traditional Code4Lib style.  It will feature approximately 8 20 minute
sessions, and two periods (a morning and late afternoon) consisting of
lightning talks.  The facilities and the size should help keep the
meeting cozy, and, with luck, everyone that wants to participate (do a
lightning talk) will have the opportunity.

PRESENTATIONS:

In preparation for Code4Lib PNW, a small group of presentations have
been pre-solicited and are available with the draft schedule here:

http://groups.google.com/group/pnwcode4lib/web/code4lib-northwest-2010

Presentation ideas can be submitted to the Google Group or to Kyle
Banerjee (kyle.baner...@gmail.com) or Terry Reese
(terry.re...@oregonstate.edu) directly.  Presentations will be
solicited until April 1st, 2010 to fill in the remainder of the
presentation schedule.

ATTENDEES AND REGISTRATION:

Code4Lib Northwest 2010 event will be capped at 60 participants.
Registration is $50, and it will cover lunch, refreshments throughout
the day. Many thanks go to the Orbis Cascade Alliance and Oregon State
University for underwriting the conference and generously donating
time, facilities and money to keep registration costs down.

QUESTIONS/COMMENTS:

If you have questions, you can send them to either Terry Reese
(terry.re...@oregonstate.edu), Kyle Banerjee (kyle.baner...@gmail.com)
or submit them to the Google group.


-- 
--
Kyle Banerjee
Digital Services Program Manager
Orbis Cascade Alliance
baner...@uoregon.edu / 503.999.9787


[CODE4LIB] University of Virginia job posting

2010-03-08 Thread Mayhood, Erin (elm8s)
Below is a job announcement from the University of Virginia Library. Please 
pass along to qualified applicants.





FACULTY OPENING



DIRECTOR, ONLINE LIBRARY ENVIRONMENT

University of Virginia Library







The University of Virginia Library seeks a creative and flexible leader for the 
position of Director of our online library environment, a comprehensive suite 
of tools and services to provide access to the Library's physical and digital 
collections.  We seek candidates who are interested in pursuing solutions that 
provide faculty and students a cohesive, innovative environment for accessing 
information used in research, teaching, and learning.





Environment:  The University of Virginia Library (http://www.lib.virginia.edu) 
is a leader in innovative customer service, an international leader in digital 
library research and digital scholarship, and is recognized for the strength 
and variety of its collections.  The Library system consists of twelve 
libraries, with independent libraries for health sciences, law, and business. 
The libraries support 12,000 undergraduates, 6,000 graduate students and 1,600 
teaching faculty. The University and the Library have a strong commitment to 
achieving diversity among faculty and staff. The Neoclassical buildings of 
founder Thomas Jefferson's Academical Village still serve as the center of the 
University's Grounds (http://www.virginia.edu/uvatours/slideshow/) and as a 
unique backdrop for teaching, learning, and research.



Responsibilities:  The Director of the online library environment is 
responsible for leading the investigation and implementation of emerging 
information technologies as well as managing the daily operations for the 
Library's access and delivery applications. The Director will head a newly 
formed department of technologists and librarians in carrying out this 
activity. She or he will have oversight of all aspects of the Library's 
Integrated System (ILS - Sirsi/Dynix Unicorn) and will lead development of an 
information architecture that provides a cohesive access and delivery 
environment. She or he will investigate new ways to provide access  delivery 
and workflow services traditionally provided by an ILS and seek to develop 
gateways to other information resources such as the Library's electronic 
resources and institutional repositories. The Director will:



 *   provide leadership and vision that ensures easy, reliable online access to 
a wide array of collections, information, and services in support of research, 
teaching and learning;

 *   manage the daily operations environment for the Library's access and 
delivery applications;

 *   supervise the daily work of both faculty and classified staff positions;

 *   collaborate with partners within the Library and among entities that 
require access to Library content;

 *   and engage professionally in activities related to librarianship and 
digital scholarship.





Qualifications:   Master's degree in Library Science or master's degree or PhD 
in Computer Science, Information Sciences or related area. Successful 
candidates should have demonstrated significant and progressively responsible 
experience managing positions with a range of technology-specific and 
administrative responsibilities.  Experience in libraries or information 
organizations is preferred.  Preferred candidates will also have:



* demonstrated understanding of digital library concepts and standards 
(e.g., metadata standards, media-specific standards);



* experience in systems design and systems architecture;



* an understanding of and commitment to library technologies;



* the ability to communicate clearly, both verbally and in writing;



* demonstrated ability to manage information technology staff and 
projects as well as departmental priorities;



* demonstrated knowledge of emerging technologies and related research;



* strong interpersonal skills;



* and a customer-service orientation.



Salary and Benefits:  Competitive depending on qualifications. This position 
has general faculty status with excellent benefits, including 22 days of 
vacation and TIAA/CREF and other retirement plans. Review of applications will 
begin on March 15, 2010 and will continue until the position is filled.  
Applicants must apply through the University of Virginia online employment 
website at https://jobs.virginia.edu/  Search by position number FP674, 
complete application, and attach cover letter and resume, with contact 
information for three current, professional references.  For assistance with 
this process contact Library Human Resources at (434) 924-3081.



The University of Virginia is an Equal Opportunity/Affirmative Action employer 
strongly committed to achieving excellence through cultural diversity. The 
University actively encourages applications and nominations from members of 
underrepresented groups.