[CODE4LIB] Fedora Book
Does anyone have a good suggestion for a book on Fedora (with a particular focus on the Repository)? I feel like I'm about to embark on some Fedora work. Fun. Rosalyn
[CODE4LIB] 12 Bones BBQ: Check the Guest List
I have moved 4 people up onto the guest list because the Danish contingent won't be joining us, so give the guest list a once over to see if you're on it. If you don't want to join us Wednesday night, we understand, but please remove your name from the list and insert the next person on the wait list. Rosalyn
[CODE4LIB] 12 Bones: Paying, Getting There, Menu, and Pre-Event Cash Bar
Hello All, The final cost of the 12 Bones BBQ Excursion will be $18 per person. The cost went up a bit based on market value for the meat. If this is too rich for you blood, feel free to remove your name from the list and move up the first person on the waitlist. http://wiki.code4lib.org/index.php/C4L2010_social_activities#12_Bones_BBQ_Dinner_Excursion I've also added information to the wiki about getting there, menu, and a pre-event cash bar. I would include all of that here, but then the email would be really long. You can pay via one of two methods: *Pay in Advance via Credit Card / PayPal* I have created a PayPal button if you need to use a credit card. You must have a PayPal account in order to use this method (it takes 2 minutes to create and add your card). Bear in mind that if you go with this method you'll *actually be charged $18.83* since PayPal charges me 2.29%($18.00) + $0.30 whenever someone uses a credit card, so I'm passing that cost on to you (i know, i'm cheap). You'll get a lovely receipt from PayPal at the end of the transaction and I'll check you off of my list. When you get to the door of 12 Bones, you'll give me your name and I'll let you in so you can stuff your face full of BBQ. You can find the payment page here: http://bit.ly/cu0Bxy. * Pay at the Door with CASH* Technically this is the cheaper method. You show up, hand me your $18.00 *in cash*. Remember, I am not a bank therefore I do not accept credit cards or checks at the door, *cash only.* I will then check your name off of the list, hand you a receipt and we all walk away happy. * *Did I mention I will take *cash only* at the door? Okay good. * *If you refuse to pay via one of these two methods, I'll throw you out on your ear. This will provide entertainment for everyone at the BBQ since I'm only 5' 2 and you're most likely taller than me. But don't underestimate me, I'm feisty and I had a big brother that was twice my size. Rosalyn
[CODE4LIB] 12 Bones BBQ excursion
Only 4 spots left. Get them while you can.
[CODE4LIB] Change to 12 Bones BBQ Excursion
On the social activities section of the wiki we had a signup for a lunch time excursion [1] to 12 Bones [2], a great BBQ restaurant in Asheville. There was an overwhelming response, so much so that we are worried we will overwhelm the restaurant at lunchtime. The new plan is to rent out the place on the evening of Wednesday Feb. 24th at 6:30pm. If we can get at least 50 people to sign up we can hit their minimum order amount. For those interested, the cost would be $15 per person, this price includes tax and tip. The menu would consist of 2 meats, 3 sides, buns/cornbread and non alcoholic beverages. Right now we have 33 people signed up. I'm assuming some will drop while others may add. If you're interested in this now dinnertime excursion, please sign up on the wiki: http://wiki.code4lib.org/index.php/C4L2010_social_activities#12_Bones_BBQ_Dinner_Excursion We hope you can come, because I really want some BBQ. Rosalyn [1] http://wiki.code4lib.org/index.php/C4L2010_social_activities#12_Bones_BBQ_Dinner_Excursion [2] http://www.12bones.com/
[CODE4LIB] T-Shirt Vote Winner
Inquiring minds want to know. Who won the t-shirt vote? The winner was Patrick Hochstenbach's submission: http://wiki.code4lib.org/images/9/9e/Code4lib2010_P-Hochstenbach.png Apologies for not sending this out sooner. The T-Shirt Committee (although the lateness is all rosalyn's fault)
[CODE4LIB] Cloud Computing in Higher Education
Hello All, I'm doing a presentation at NERCOMP in March called Cloud Computing in Higher Education: Changing the Way We Provide Systems. To prepare, I've created a survey called Cloud Computing in Academia. If you work in academia and you utilize the cloud or are interested in utilizing the cloud, please feel free to take the survey using the link below: http://www.surveymonkey.com/s/cloud-academia The survey is a total of 4 questions and should only take you a couple of minutes to complete. Your input would be greatly appreciated. I'll publish a link to the results once I close the survey on Friday February 5th. Rosalyn
Re: [CODE4LIB] Cloud Computing in Higher Education
Thanks Ethan for the feedback. I actually thought of that as I was developing the survey (especially considering what Michael Klein and I are presenting at Code4Lib). But the presentation itself is really geared toward IT professionals, the theme of this year's NERCOMP is: The Next-Generation University: Rethinking IT in Disruptive Times. And somehow I ended up in the enterprise systems and services track (which i definitely wasn't expecting). Perhaps our presentation at Code4Lib will make you see that I'm not single cloud minded (Michael, feel free to flog me off list). Rosalyn On Tue, Jan 26, 2010 at 4:51 PM, Ethan Gruber ewg4x...@gmail.com wrote: Hi Rosalyn, I notice that the survey is constructed from the standpoint of web developers or IT professionals (which makes sense since this is code4lib), but cloud computing is/can be used for more than just that. Some institutions have taken advantage of cloud computing for 3D rendering and visualization, but the survey doesn't really adequately address that aspect, especially since we're talking about the academic community as a whole. Ethan On Tue, Jan 26, 2010 at 4:42 PM, Rosalyn Metz rosalynm...@gmail.com wrote: Hello All, I'm doing a presentation at NERCOMP in March called Cloud Computing in Higher Education: Changing the Way We Provide Systems. To prepare, I've created a survey called Cloud Computing in Academia. If you work in academia and you utilize the cloud or are interested in utilizing the cloud, please feel free to take the survey using the link below: http://www.surveymonkey.com/s/cloud-academia The survey is a total of 4 questions and should only take you a couple of minutes to complete. Your input would be greatly appreciated. I'll publish a link to the results once I close the survey on Friday February 5th. Rosalyn
[CODE4LIB] T-Shirt Voting Ending Soon
Hello All, This is just a quick reminder that t-shirt voting will be ending at 5pm EST. That's in one hour. So if you've haven't had a chance to vote, you might want to do so now. Rosalyn
Re: [CODE4LIB] Rails Hosting
Hi Kevin, I'm going to recommend slicehost also. Again, I haven't used it but I met the (former) owner. He sold the business to rackspace, which has an awesome reputation in the cloud computing world. They are #2 behind amazon. Rosalyn On Thu, Jan 14, 2010 at 11:40 AM, Doran, Michael D do...@uta.edu wrote: Hi Kevin, Although I can't recommend any hosting based on personal experience, a while back I had bookmarked a recommended (by another code4libber) hosting site: Slicehost at http://www.slicehost.com/ I think they pretty much get out of the way and let you do what you want, development wise. Regarding Rails in particular, one of their testimonials said The only thing I can say is Wow! ... Rails up and running in 30 minutes. Another said ...I’m a Rails developer and a Linux enthusiast who can’t believe he found a Gentoo VPS with 256MB RAM for $20/month. And yet another ...I’m a freelance Rails developer, and my experience on an Ubuntu VPS has been fantastic compared to my previous shared hosting experience. [1] Again, this is *not* a recommendation from personal experience. -- Michael [1] http://www.slicehost.com/why-slicehost/testimonials # Michael Doran, Systems Librarian # University of Texas at Arlington # 817-272-5326 office # 817-688-1926 mobile # do...@uta.edu # http://rocky.uta.edu/doran/ -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Kevin Reiss Sent: Thursday, January 14, 2010 10:16 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] Rails Hosting Hi, I was curious if anyone could recommend a hosting service that they've had a good ruby on rails experience with. I've been working with bluehost but my experience has not been good. You need to work through a lot of hoops just to get a moderately complicated rails application properly. The applications we are looking at deploying would be moderately active, 1,000 -2000 visits a day. Thanks for any comments in advance. Regards, Kevin Reiss
[CODE4LIB] Vote for the Code4Lib 2010 T-Shirt
Hello All, T-Shirt voting is now open. You can vote for t-shirts at: http://vote.code4lib.org/election/index/14 The t-shirt vote will close on Friday at 5pm. Questions, comments, concerns? Let me know. Rosalyn
Re: [CODE4LIB] c4l2010 T-Shirt Design Contest Extended to Jan. 6th
Hello All, Today is the last day to submit your T-Shirt Designs, I'll be closing it at noon with a times up message. I'm sure you're already sick of doing work, so waste your day creating a t-shirt. Rosalyn
[CODE4LIB] c4l2010 T-Shirt Design Contest Closed
Well, we received a few more entries. But now its time to close the T-shirt contest. Thanks for participating! The T-Shirt Design Committee --Original Message-- From: Rosalyn Metz To: Code for Libraries Subject: Re: c4l2010 T-Shirt Design Contest Extended to Jan. 6th Sent: Jan 6, 2010 8:46 AM Hello All, Today is the last day to submit your T-Shirt Designs, I'll be closing it at noon with a times up message. I'm sure you're already sick of doing work, so waste your day creating a t-shirt. Rosalyn Sent from my BlackBerry® smartphone with SprintSpeed
Re: [CODE4LIB] T-shirt Design Contest
correct me if i'm wrong, but didn't someone already set up a store. i distinctly remember there being a roy tennant thong (as do others if you google it). it appears to have gone away though... On Sun, Jan 3, 2010 at 5:51 PM, Kevin S. Clarke kscla...@gmail.com wrote: I like that idea (and the idea of it as something that exists apart from the conference budget, but perhaps funds scholarships in the following year). I think last year someone suggested putting all the t-shirt submissions in there (not just the winning one - It lets folks buy the conference shirt, but also others that might appeal to them). I think anyone in the community could register http://shop.cafepress.com/lisforge and manage it as a means to fund scholarships (or contribute in some other way - depending on the amount raised). Kevin On Sun, Jan 3, 2010 at 5:32 PM, Michael J. Giarlo leftw...@alumni.rutgers.edu wrote: We've talked before about setting up a code4lib CafePress store. Maybe we've already done it? It's an idea, at least. -Mike On Sun, Jan 3, 2010 at 16:40, Christina Salazar christinagama...@gmail.com wrote: Y'know... I think y'all should order extras and sell and ship them to those of us who cannot attend. I love my past conference t-shirts and they get some interesting reactions when I wear 'em. I'd buy any one of these designs. Seems like you might be able to make a bit of dough for scholarships and whatnot... Christina Salazar On Sun, Jan 3, 2010 at 12:23 PM, Patrick Hochstenbach patrick.hochstenb...@ugent.be wrote: Hello All, Here is the Inkscape entry designed by my lovely wife :) Greetings from Belgium, P@ Skype: patrick.hochstenbach Patrick Hochstenbach Software Architect University Library +32(0)92647980 Ghent University * Rozier 9 * 9000 * Gent
[CODE4LIB] c4l2010 T-Shirt Design Contest Extended to Jan. 6th
Hello All, Since we have yet to receive any submissions for T-Shirt designs, we are extending the contest until January 6th. This will give those of us with time off due to the holidays a chance to ignore family by creating an awesome design for the t-shirts. And those of you without time off can ignore work by creating an awesome design for the t-shirts. Either way something can be ignored. Remember that like in years past, the design should be one color. Please send any submissions to Rosalyn Metz at rosalynm...@gmail.com. Rosalyn
[CODE4LIB] Code4LibCon T-Shirt Design Contest
Hi Everyone, It is time to design this year's code4lib t-shirt. As in year's past, the design should be for the entire front of the t-shirt, and should work with a one-color printing. Please send a high-quality image of your design to me (email address below). I will compile the entries and then Ross (the vote master) Singer will post it to the code4lib site for voting. Please send in your submissions by Monday December 21st. Thanks! Rosalyn rosalynm...@gmail.com *text plagiarised from Jeremy Frumkin
[CODE4LIB] Join Us to Plan a Service for Publishing TEI Documents
*Join Us to Plan a Service for Publishing TEI Documents* Wheaton, Mount Holyoke, and Dickinson Colleges seek scholars, librarians, and technologists, particularly those from small liberal arts colleges, to join us over the next year in a series meetings sponsored by a collaborative planning grant from the Institute of Museum and Library Services (IMLS) to plan a TEI publishing service. Many scholars, archivists, librarians, technologists, and students working with TEI at smaller institutions often do so without peer or technical support. Having completed the considerable task of encoding their text, they ask themselves: now that I have my document encoded in TEI, what do I do with it? As an XML application, TEI has huge potential for multimedia presentation, data mashups, visualizations, or sophisticated print layout. However, without knowledge of XSLT and other XML technologies, the options are either inaccessible or difficult to learn and to use. In its native form as XML, TEI is of limited practical use: publishers generally don't accept it, non-technical colleagues often don't understand it, and XML alone is of little use when presenting at disciplinary conferences. We propose a service that would serve two broad functions: 1. Easy-to-use document conversion from valid TEI into forms that can be appreciated by those not familiar with TEI or XML encoding. Obvious possibilities include conversion to a printable PDF or browseable webpage, but could also include conversion to a variety of formats suitable for consumption by other tools and services for text analysis, visualization, and data mashups, etc. 2. Long-term TEI hosting, offering open or controlled web access, search, metadata control, etc. With a central hosting service, scholars, librarians, and technologists would be able to share their work with colleagues, scholarly communities, or the public by uploading their document and receiving a URL. The service will optionally also register the content with popular search engines and resource collections to make the TEI document findable as well as accessible. With this service, our intention is to support scholars, librarians, and technologists who currently have TEI files, as well as to attract new users interested in TEI but unsure of what can be done with it. Our hope is that a hosting and conversion service will serve to distribute the scholarly work in an attractive, readable form, thereby promoting both their work and the TEI as a scholarly standard. We intend this to be an ongoing service, and are developing it with long-term sustainability as a key priority. Participation in the initiative involves a commitment to: * attend three meetings to be held in March, June, and September 2010 at the three colleges. * bring your institution's perspective and expertise to the planning and design of the service. We are very excited by the potential contribution of this project to the TEI and scholarly community. Please contact tei.publish...@gmail.com by January 23 if you are interested in participating. We are looking for people who have experience with TEI to participate in the meetings and the planning, but everyone else is welcome to contact us. The first meeting will be in March at Mount Holyoke College in Western Massachusetts ( http://www.mtholyoke.edu/). This project is a collaborative effort of Wheaton, Mount Holyoke, and Dickinson Colleges and is sponsored by the collaborative planning grant from the Institute of Museum and Library Services. Expenses for participants, such as travel, lodging, and meals are covered by the IMLS grant. - Rosalyn Metz Systems Administrator for Curricular Support Library Information Services Wheaton College phone: 508-286-3733
[CODE4LIB] Call for Participation: ACRL NEC Annual Conference
Call for Participation Has your library employed search engine optimization or other techniques or strategies to increase findability of your library content on popular search engines, such as Google? Are you looking for an opportunity to present at ACRL NEC? The ITIG special interest group is seeking speakers for the upcoming ACRL NEC annual conference in May, 2010 @ Holy Cross in Worcester, MA. Specifically, we are looking for presenters who have utilized and engaged in search engine optimization techniques that have increased their ranking of library content within search engines. Interested persons can contact either Talia Resendes (talia.resen...@jwu.edu) or Laura Kohl (lk...@bryant.edu)
Re: [CODE4LIB] Library Website Redesign Info and Project Plans
Susan and Beth, I'd love to see a project plan and/or write up if you have one. I'm really curious how people handle the gathering information piece and balance staff v. user perspectives. And I'm sure as I begin actually writing a plan I'll be curious about many of the other things people did. Rosalyn On Thu, Sep 17, 2009 at 7:32 AM, susan teague rector setea...@vcu.edu wrote: I second David's book choice - I've done 3 redesigns at my library (we're on the 3rd right now) and i have referred to the process in this book countless times. I typically follow an iterative approach to writing and executing my project plans; however, it's difficult to get away from traditional waterfall methodology and our plans sometimes fall back into that mold - I can send you a copy of a proj. plan if you're interested Cheers, Susan VCU Libraries http://www.library.vcu.edu | http://www.library.vcu.edu/redesign Walker, David wrote: My wife really likes Web Redesign: Workflow that Works, by Kelly Goto Emily Cotler. The second edition is called Web Redesign 2.0. http://www.web-redesign.com/ http://www.worldcat.org/oclc/57641137 --Dave == David Walker Library Web Services Manager California State University http://xerxes.calstate.edu From: Code for Libraries [code4...@listserv.nd.edu] On Behalf Of Jason Stirnaman [jstirna...@kumc.edu] Sent: Wednesday, September 16, 2009 11:36 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Library Website Redesign Info and Project Plans I just came across this yesterday: http://johncrenshaw.net/blog/web-development-project-process-workflow/ Very high-level and usual systems design approach, but with some good web-specific tips thrown in. Sean Hannan shan...@jhu.edu 09/16/09 10:20 AM We're currently in the middle of a library website redesign as well. For the most part, we have framed our project using Jesse James Garrett's The Elements of User Experience (https://wiki.library.jhu.edu/download/attachments/30737/elements.pdf ). It has been immensely helpful in plotting out our work from the User Experience touchy-feely end to the Information Architecture to the visual design and implementation. -Sean --- Sean Hannan Web Developer Sheridan Libraries Johns Hopkins University On Sep 16, 2009, at 10:52 AM, Rosalyn Metz wrote: Hi All, I'm about to embark on a library website redesign. I've started thinking about creating a project plan, but I honestly don't know where to start. I saw this website redesign presentation Lorcan Dempsey tweeted about: http://www.ucd.ie/library/guides/powerpoint/rpan_ppt2/index.swf And started thinking, I wonder if anyone else has similar slides or project plans or advice. I of course asked the Google but I didn't really find any project plans. (If you're curious what I did find, take a look here: http://delicious.com/rosy1280/library+website-redesign) I do of course realize that every library is different, but I'm hoping that any information you all might be able to provide could help get the juices flowing. Thanks for your help in advance. Rosalyn Rosalyn
[CODE4LIB] Library Website Redesign Info and Project Plans
Hi All, I'm about to embark on a library website redesign. I've started thinking about creating a project plan, but I honestly don't know where to start. I saw this website redesign presentation Lorcan Dempsey tweeted about: http://www.ucd.ie/library/guides/powerpoint/rpan_ppt2/index.swf And started thinking, I wonder if anyone else has similar slides or project plans or advice. I of course asked the Google but I didn't really find any project plans. (If you're curious what I did find, take a look here: http://delicious.com/rosy1280/library+website-redesign) I do of course realize that every library is different, but I'm hoping that any information you all might be able to provide could help get the juices flowing. Thanks for your help in advance. Rosalyn Rosalyn
Re: [CODE4LIB] Implementing OpenURL for simple web resources
you could force a timestamp if people don't include a date. and I like the idea of going to the Internet Archive of a website, because then you're not having to get into the business of handling www.bbc.co.uk differently than cnn.com and someblog.org. i also like the idea of using a redirect. you could theoretically write a source parser (i'm assuming youre using SFX based on what you said about bX) that says if my rfr_id = mylocalid and the item is a website (however you choose to identify the website...which if you're writing your own source parser you could put website in the rft_genre even though its not technically a metadata format but you just want your source parser to forward the url on anyway, so the link resolver isn't actually going to do anything with it) bypass everything and just direct to the internet archive of the website. all of this is of course kind of hackish...but really isn't the whole thing hackish? there were a few source parsers that would be good models for writing something like this. but i have no idea if they still exist because i haven't looked at the back end of sfx in about a year. On Tue, Sep 15, 2009 at 5:12 AM, O.Stephens o.steph...@open.ac.uk wrote: I agree with this Rosalyn. The issue that Nate brought up was that the content at http://www.bbc.co.uk could change over time, and old content might be moved to another URI - http://archive.bbc.co.uk or something. So if course A references http://www.bbc.co.uk on 24/08/09, if the content that was on http://www.bbc.co.uk on 24/08/09 moves to http://archive.bbc.co.uk we can use the mechanism I propose to trap the links to http://www.bbc.co.uk and redirect to http://archive.bbc.co.uk. However, if at a later date course B references http://www.bbc.co.uk we have no way of knowing whether they mean the stuff that is currently on http://www.bbc.co.uk or the stuff that used to be on http://www.bbc.co.uk and is now on http://archive.bbc.co.uk - and we have a redirect that is being applied across the board. Thinking about it, references are required to include a date of access when citing websites, so this is probably the best piece of information to use to know where to resolve to (and we can put this in the DC metadata). Whether this will just get too confusing is a good question - I'll have at think about this. Owen PS using the date we could even consider resolving to the Internet Archive copy of a website if it was available I guess - this might be useful I guess... Owen Stephens TELSTAR Project Manager Library and Learning Resources Centre The Open University Walton Hall Milton Keynes, MK7 6AA T: +44 (0) 1908 858701 F: +44 (0) 1908 653571 E: o.steph...@open.ac.uk -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Rosalyn Metz Sent: 14 September 2009 21:52 To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Implementing OpenURL for simple web resources oops...just re-read original post s/professor/article also your link resolver should be creating a context object with each request. this context object is what makes the openurl unique. so if you want uniqueness for stats purposes i would image the link resolver is already doing that (and just another reason to use an rfr_id that you create). On Mon, Sep 14, 2009 at 4:45 PM, Rosalyn Metz rosalynm...@gmail.com wrote: Owen, rft_id isn't really meant to be a unique identifier (although it can be in situations like a pmid or doi). are you looking for it to be? if so why? if professor A is pointing to http://www.bbc.co.uk and professor B is pointing to http://www.bbc.co.uk why do they have to have unique OpenURLs. Rosalyn On Mon, Sep 14, 2009 at 4:41 PM, Eric Hellman e...@hellman.net wrote: Nate's point is what I was thinking about in this comment in my original reply: If you don't add DC metadata, which seems like a good idea, you'll definitely want to include something that will help you to persist your replacement record. For example, a label or description for the link. I should also point out a solution that could work for some people but not you- put rewrite rules in the gateways serving your network. A bit dangerous and kludgy, but we've seen kludgier things. On Sep 14, 2009, at 4:24 PM, O.Stephens wrote: Nate has a point here - what if we end up with a commonly used URI pointing at a variety of different things over time, and so is used to indicate different content each time. However the problem with a 'short URL' solution (tr.im, purl etc), or indeed any locally assigned identifier that acts as a key, is that as described in the blog post you need prior knowledge of the short URL/identifier to use it. The only 'identifier' our authors know for a website is it's URL - and it seems contrary for us to ask them to use something else. I'll need to think about
Re: [CODE4LIB] indexing pdf files
Eric, I have librarians that would kill for this. In fact I was talking to one about it the other day. She felt there must be a way to handle active reading and make it portable. This would be great in conjunction with RefWorks or Zotero or something along those lines. Rosalyn On Tue, Sep 15, 2009 at 9:31 AM, Eric Lease Morgan emor...@nd.edu wrote: I have been having fun recently indexing PDF files. For the pasts six months or so I have been keeping the articles I've read in a pile, and I was rather amazed at the size of the pile. It was about a foot tall. When I read these articles I actively read them -- meaning, I write, scribble, highlight, and annotate the text with my own special notation denoting names, keywords, definitions, citations, quotations, list items, examples, etc. This active reading process: 1) makes for better comprehension on my part, and 2) makes the articles easier to review and pick out the ideas I thought were salient. Being the librarian I am, I thought it might be cool (kewl) to make the articles into a collection. Thus, the beginnings of Highlights Annotations: A Value-Added Reading List. The techno-weenie process for creating and maintaining the content is something this community might find interesting: 1. Print article and read it actively. 2. Convert the printed article into a PDF file -- complete with embedded OCR -- with my handy-dandy ScanSnap scanner. [1] 3. Use MyLibrary to create metadata (author, title, date published, date read, note, keywords, facet/term combinations, local and remote URLs, etc.) describing the article. [2] 4. Save the PDF to my file system. 5. Use pdttotext to extract the OCRed text from the PDF and index it along with the MyLibrary metadata using Solr. [3, 4] 6. Provide a searchable/browsable user interface to the collection through a mod_perl module. [5, 6] Software is never done, and if it were then it would be called hardware. Accordingly, I know there are some things I need to do before I can truely deem the system version 1.0. At the same time my excitment is overflowing and I thought I'd share some geekdom with my fellow hackers. Fun with PDF files and open source software. [1] ScanSnap - http://tinyurl.com/oafgwe [2] MyLibrary screen dump - http://infomotions.com/tmp/mylibrary.png [3] pdftotext - http://www.foolabs.com/xpdf/ [4] Solr - http://lucene.apache.org/solr/ [5] module source code - http://infomotions.com/highlights/Highlights.pl [6] user interface - http://infomotions.com/highlights/highlights.cgi -- Eric Lease Morgan University of Notre Dame -- Eric Lease Morgan Head, Digital Access and Information Architecture Department Hesburgh Libraries, University of Notre Dame (574) 631-8604
Re: [CODE4LIB] Implementing OpenURL for simple web resources
ok no one shoot me for doing this: in section 9.1 Namespaces [Registry] of the OpenURL standard (z39.88) it actually provides an example of using a URL in the rfr_id field, and i wonder why you couldn't just do the same thing for the rft_id also there is a field called rft_val which currently has no use. this might be a good one for it. just my 2 cents. On Mon, Sep 14, 2009 at 12:57 PM, Jonathan Rochkind rochk...@jhu.eduwrote: Well, in the 'wild' I barely see any rft_id's at all, heh. Aside from the obvious non-http URIs in rft_id, I'm not sure if I've seen http URIs that don't resolve to full text. BUT -- you can do anything with an http URI that you can do with an info uri. There is no requirement or guarantee in any spec that an HTTP uri will resolve at all, let alone resolve to full text for the document cited in an OpenURL. The OpenURL spec says that rft_id is An Identifier Descriptor unambiguously specifies the Entity by means of a Uniform Resource Identifier (URI). It doesn't say that it needs to resolve to full text. In my own OpenURL link-generating software, I _frequently_ put identifiers which are NOT open access URLs to full text in rft_id. Because there's no other place to put them. And I frequently use http URIs even for things that don't resolve to full text, because the conventional wisdom is to always use http for URIs, whether or not they resolve at all, and certainly no requirement that they resolve to something in particular like full text. Examples that I use myself when generating OpenURL rft_ids, of http URIs that do not resolve to full text include ones identifying bib records in my own catalog: http://catalog.library.jhu.edu/bib/NUM [ Will resolve to my catalog record, but not to full text!] Or similarly, WorldCat http URIs. Or, an rft_id to unambigously identify something in terms of it's Google Books record: http://books.google.com/books?id=tl8MCAAJ Also, URIs to unambiguously specify a referent in terms of sudoc: http://purl.org/NET/sudoc/[sudoc] http://purl.org/NET/sudoc/%5Bsudoc%5D = will, as the purl is presently set up by rsinger, resolve to a GPO catalog record, but there's no guarantee of online public full text. I'm pretty sure what I'm doing is perfectly appropriate based on the definition of rft_id, but it's definitely incompatible with a receiving link resolver assuming that all rft_id http URIs will resolve to full text for the rft cited. I don't think it's appropriate to assume that just because a URI is http, that means it will resolve to full text -- it's merely an identifier that unambiguously specifies the referent, same as any other URI scheme. Isn't that what the sem web folks are always insisting in the arguments about how it's okay to use http URIs for any type of identifier at all -- that http is just an identifier (at least in a context where all that's called for is a URI to identify), you can't assume that it resolves to anything in particular? (Although it's nice when it resolves to RDF saying more about the thing identified, it's certainly not expected that it will resolve to full text). Eric, out of curiosity, will your own link resolver software automatically take rft_id's and display them to the user as links? Jonathan Eric Hellman wrote: Could you give us examples of http urls in rft_id that are like that? I've never seen such. On Sep 14, 2009, at 11:58 AM, Jonathan Rochkind wrote: In general, identifiers in URI form are put in rft_id that are NOT meant for providing to the user as a navigable URL. So the receiving software can't assume that whatever url is in rft_is represents an actual access point (available to the user) for the document. Eric Hellman President, Gluejar, Inc. 41 Watchung Plaza, #132 Montclair, NJ 07042 USA e...@hellman.net http://go-to-hellman.blogspot.com/
Re: [CODE4LIB] Implementing OpenURL for simple web resources
sorry eric, i was reading straight from the documentation and according to it it has no use. On Mon, Sep 14, 2009 at 1:55 PM, Eric Hellman e...@hellman.net wrote: It's not correct to say that rft_val has no use; when used, it should contain a URL-encoded package of xml or kev metadata. it would be correct to say it is very rarely used. On Sep 14, 2009, at 1:40 PM, Rosalyn Metz wrote: ok no one shoot me for doing this: in section 9.1 Namespaces [Registry] of the OpenURL standard (z39.88) it actually provides an example of using a URL in the rfr_id field, and i wonder why you couldn't just do the same thing for the rft_id also there is a field called rft_val which currently has no use. this might be a good one for it. just my 2 cents. Eric Hellman President, Gluejar, Inc. 41 Watchung Plaza, #132 Montclair, NJ 07042 USA e...@hellman.net http://go-to-hellman.blogspot.com/
Re: [CODE4LIB] Implementing OpenURL for simple web resources
i'd like to point out that perhaps the reason that SFX (and other link resolvers) don't use the rft_id in a particular way is because no one has pushed it to. for example, it is possible for you to have the word dinosaur link to an openurl and provide services for dinosaurs, but the question is: 1) who would provide a link in their articles on webpages to an openurl about dinosaurs. 2) do users really care. if i were in Owen's position i might create an openurl that looked like this: http://resolver.address/? url_ver=Z39.88-2004 url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx rft_id=mylocalid rft_dat=urlhttp://www.bbc.co.uk/url where mylocalid is an id i created to give these special openurls and where url is the tag that i made up to help my resolver identify the data i'm sending privately. i could then write a program so my link resolver knows that the information contained in the private data field (rft_dat) and is identified by url should direct you to that url. you might also need to make up some other tags (like pdf if all your pdfs are in one spot). On Mon, Sep 14, 2009 at 2:23 PM, Jonathan Rochkind rochk...@jhu.edu wrote: I disagree. Putting URIs that unamiguously identify the referent, and in some cases provide additional 'hooks' by virtue of additional identifiers (local bibID, OCLCnum, LCCN, etc) is a VERY useful thing to do to me. Whether or not they resolve to an end-user appropriate web page or not. If you want to use rft_id to instead be an end-user appropriate access URL (which may or may not be a suitable unambiguous persistent identifier), I guess it depends on how many of the actually existing in-the-wild link resolvers will, in what contexts, treat an http URI as an end-user appropriate access URL. If a lot of the in-the-wild link resolvers will, that may be a practically useful thing to do. Thus me asking if the one you had knowledge of did or didn't. I'm 99% sure that SFX will not, in any context, treat an rft_id as an appropriate end-user access URL. Certainly providing an appropriate end-user access URL _is_ a useful thing to do. So is providing an unambiguous persistent identifier. Both are quite useful things to do, they're just different things, shame that OpenURL kinda implies that you can use the same data element for both. OpenURL's not alone there though, DC does the same thing. Jonathan Eric Hellman wrote: If you have a URL that can be used for a resource that you are describing in metadata, resolvers can do a better job providing services to users if it is put in the openurl. The only place to put it is rft_id. So let's not let one resolver's incapacity to prevent other resolvers from providing better services. If you want to make an OpenURL for a web page, its url is in almost all cases the best unambiguous identifier you could possibly think of. Putting dead http uri's in rft_id is not really a very useful thing to do. On Sep 14, 2009, at 1:45 PM, Jonathan Rochkind wrote: Eric Hellman wrote: http://catalog.library.jhu.edu/bib/NUM identifies a catalog record- I mean what else would you use to id the catalog record. unless you've implemented the http-range 303 redirect recommendation in your catalog (http://www.w3.org/TR/cooluris/), it shouldn't be construed as identifying the thing it describes, except as a private id, and you should use another field for that. Of course. But how is a link resolver supposed to know that, when all it has is rft_id=http://catalog.library.jhu.edu/bib/NUM ?? I suggest that this is a kind of ambiguity in OpenURL, that many of us are using rft_id to, in some contexts, simply provide an unambiguous identifier, and in other cases, provide an end-user access URL (which may not be a good unambiguous identifier at all!). With no way for the link resolver to tell which was intended. So I don't think it's a good idea to do this. I think the community should choose one, and based on the language of the OpenURL spec, rft_id is meant to be an unambiguous identifier, not an end-user access URL. So ideally another way would be provided to send something intended as an end-user access URL in an OpenURL. But OpenURL is pretty much a dead spec that is never going to be developed further in any practical way. So, really, I recommend avoiding OpenURL for some non-library standard web standards whenever you can. But sometimes you can't, and OpenURL really is the best tool for the job. I use it all the time. And it constantly frustrates me with it's lack of flexibility and clarity, leading to people using it in ambiguous ways. Jonathan Eric Hellman President, Gluejar, Inc. 41 Watchung Plaza, #132 Montclair, NJ 07042 USA e...@hellman.net http://go-to-hellman.blogspot.com/
Re: [CODE4LIB] Implementing OpenURL for simple web resources
whoopsthat should be rfr_id not rft_id. http://resolver.address/? url_ver=Z39.88-2004 url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx rfr_id=mylocalid rft_dat=urlhttp://www.bbc.co.uk/url On Mon, Sep 14, 2009 at 2:39 PM, Rosalyn Metz rosalynm...@gmail.com wrote: i'd like to point out that perhaps the reason that SFX (and other link resolvers) don't use the rft_id in a particular way is because no one has pushed it to. for example, it is possible for you to have the word dinosaur link to an openurl and provide services for dinosaurs, but the question is: 1) who would provide a link in their articles on webpages to an openurl about dinosaurs. 2) do users really care. if i were in Owen's position i might create an openurl that looked like this: http://resolver.address/? url_ver=Z39.88-2004 url_ctx_fmt=info:ofi/fmt:kev:mtx:ctx rft_id=mylocalid rft_dat=urlhttp://www.bbc.co.uk/url where mylocalid is an id i created to give these special openurls and where url is the tag that i made up to help my resolver identify the data i'm sending privately. i could then write a program so my link resolver knows that the information contained in the private data field (rft_dat) and is identified by url should direct you to that url. you might also need to make up some other tags (like pdf if all your pdfs are in one spot). On Mon, Sep 14, 2009 at 2:23 PM, Jonathan Rochkind rochk...@jhu.edu wrote: I disagree. Putting URIs that unamiguously identify the referent, and in some cases provide additional 'hooks' by virtue of additional identifiers (local bibID, OCLCnum, LCCN, etc) is a VERY useful thing to do to me. Whether or not they resolve to an end-user appropriate web page or not. If you want to use rft_id to instead be an end-user appropriate access URL (which may or may not be a suitable unambiguous persistent identifier), I guess it depends on how many of the actually existing in-the-wild link resolvers will, in what contexts, treat an http URI as an end-user appropriate access URL. If a lot of the in-the-wild link resolvers will, that may be a practically useful thing to do. Thus me asking if the one you had knowledge of did or didn't. I'm 99% sure that SFX will not, in any context, treat an rft_id as an appropriate end-user access URL. Certainly providing an appropriate end-user access URL _is_ a useful thing to do. So is providing an unambiguous persistent identifier. Both are quite useful things to do, they're just different things, shame that OpenURL kinda implies that you can use the same data element for both. OpenURL's not alone there though, DC does the same thing. Jonathan Eric Hellman wrote: If you have a URL that can be used for a resource that you are describing in metadata, resolvers can do a better job providing services to users if it is put in the openurl. The only place to put it is rft_id. So let's not let one resolver's incapacity to prevent other resolvers from providing better services. If you want to make an OpenURL for a web page, its url is in almost all cases the best unambiguous identifier you could possibly think of. Putting dead http uri's in rft_id is not really a very useful thing to do. On Sep 14, 2009, at 1:45 PM, Jonathan Rochkind wrote: Eric Hellman wrote: http://catalog.library.jhu.edu/bib/NUM identifies a catalog record- I mean what else would you use to id the catalog record. unless you've implemented the http-range 303 redirect recommendation in your catalog (http://www.w3.org/TR/cooluris/), it shouldn't be construed as identifying the thing it describes, except as a private id, and you should use another field for that. Of course. But how is a link resolver supposed to know that, when all it has is rft_id=http://catalog.library.jhu.edu/bib/NUM ?? I suggest that this is a kind of ambiguity in OpenURL, that many of us are using rft_id to, in some contexts, simply provide an unambiguous identifier, and in other cases, provide an end-user access URL (which may not be a good unambiguous identifier at all!). With no way for the link resolver to tell which was intended. So I don't think it's a good idea to do this. I think the community should choose one, and based on the language of the OpenURL spec, rft_id is meant to be an unambiguous identifier, not an end-user access URL. So ideally another way would be provided to send something intended as an end-user access URL in an OpenURL. But OpenURL is pretty much a dead spec that is never going to be developed further in any practical way. So, really, I recommend avoiding OpenURL for some non-library standard web standards whenever you can. But sometimes you can't, and OpenURL really is the best tool for the job. I use it all the time. And it constantly frustrates me with it's lack of flexibility and clarity, leading to people using
Re: [CODE4LIB] Implementing OpenURL for simple web resources
Owen, rft_id isn't really meant to be a unique identifier (although it can be in situations like a pmid or doi). are you looking for it to be? if so why? if professor A is pointing to http://www.bbc.co.uk and professor B is pointing to http://www.bbc.co.uk why do they have to have unique OpenURLs. Rosalyn On Mon, Sep 14, 2009 at 4:41 PM, Eric Hellman e...@hellman.net wrote: Nate's point is what I was thinking about in this comment in my original reply: If you don't add DC metadata, which seems like a good idea, you'll definitely want to include something that will help you to persist your replacement record. For example, a label or description for the link. I should also point out a solution that could work for some people but not you- put rewrite rules in the gateways serving your network. A bit dangerous and kludgy, but we've seen kludgier things. On Sep 14, 2009, at 4:24 PM, O.Stephens wrote: Nate has a point here - what if we end up with a commonly used URI pointing at a variety of different things over time, and so is used to indicate different content each time. However the problem with a 'short URL' solution (tr.im, purl etc), or indeed any locally assigned identifier that acts as a key, is that as described in the blog post you need prior knowledge of the short URL/identifier to use it. The only 'identifier' our authors know for a website is it's URL - and it seems contrary for us to ask them to use something else. I'll need to think about Nate's point - is this common or an edge case? Is there any other approach we could take? Eric Hellman President, Gluejar, Inc. 41 Watchung Plaza, #132 Montclair, NJ 07042 USA e...@hellman.net http://go-to-hellman.blogspot.com/
Re: [CODE4LIB] Implementing OpenURL for simple web resources
oops...just re-read original post s/professor/article also your link resolver should be creating a context object with each request. this context object is what makes the openurl unique. so if you want uniqueness for stats purposes i would image the link resolver is already doing that (and just another reason to use an rfr_id that you create). On Mon, Sep 14, 2009 at 4:45 PM, Rosalyn Metz rosalynm...@gmail.com wrote: Owen, rft_id isn't really meant to be a unique identifier (although it can be in situations like a pmid or doi). are you looking for it to be? if so why? if professor A is pointing to http://www.bbc.co.uk and professor B is pointing to http://www.bbc.co.uk why do they have to have unique OpenURLs. Rosalyn On Mon, Sep 14, 2009 at 4:41 PM, Eric Hellman e...@hellman.net wrote: Nate's point is what I was thinking about in this comment in my original reply: If you don't add DC metadata, which seems like a good idea, you'll definitely want to include something that will help you to persist your replacement record. For example, a label or description for the link. I should also point out a solution that could work for some people but not you- put rewrite rules in the gateways serving your network. A bit dangerous and kludgy, but we've seen kludgier things. On Sep 14, 2009, at 4:24 PM, O.Stephens wrote: Nate has a point here - what if we end up with a commonly used URI pointing at a variety of different things over time, and so is used to indicate different content each time. However the problem with a 'short URL' solution (tr.im, purl etc), or indeed any locally assigned identifier that acts as a key, is that as described in the blog post you need prior knowledge of the short URL/identifier to use it. The only 'identifier' our authors know for a website is it's URL - and it seems contrary for us to ask them to use something else. I'll need to think about Nate's point - is this common or an edge case? Is there any other approach we could take? Eric Hellman President, Gluejar, Inc. 41 Watchung Plaza, #132 Montclair, NJ 07042 USA e...@hellman.net http://go-to-hellman.blogspot.com/
Re: [CODE4LIB] digital storage
Hi Edward, Might I suggest you look into cloud computing services if you're looking at different options. (I know you're all shocked I suggested it). If our budget weren't so abysmal (and going to get worse) we would be using it right now rather than the snap server we purchased with leftover funds. The benefits of using the cloud is of course the elasticity it offers you. The negative is that you have to pay to put your files into the cloud and then pay again to take them out (and since we've already been slashed 30% and are guaranteed another slash...that idea was shot down). Of course the major player out there is Amazon S3. The problem is that you can't use S3 via Amazon's Web Management Console. But there is a company called RightScale (http://www.rightscale.com/index.php) which has a web management console that allows you to upload files quickly and easily without having to write scripts and what not. Anyway, just my two cents. Rosalyn On Thu, Aug 27, 2009 at 8:10 AM, Edward Iglesias edwardigles...@gmail.comwrote: As I was trying to figure out what to do with half a terabyte of archival TIFFS it occurred to me that perhaps someone else had this problem. We are starting to produce massive amounts of digital objects (videos, archival TIFFS, audio interviews). Up until now we have been dealing with ways to display them to the public. Now we are starting to look at dark archives like OCLC's digital archive product. I would welcome any suggestions from those of you who have dealt with this on an archival level. It's one thing to stick the stuff up on a server, but then what? Our CIO suggested storage appliances like this one http://www.drobo.com/products/index.php but I am wary of the proprietary RAID system. Thanks in advance, ~ Edward Iglesias Systems Librarian Central Connecticut State University
Re: [CODE4LIB] digital storage
I have to agree with Ed. You should have a good policy in place for backing up your data. Just throwing it on a server isn't a policy. At the same time I would have to disagree with Ed. You should look at S3 as if it was your own server. What is the guarantee that you supply to your users with your own server. The snap server we use here (instead of S3) is the back up to a back up system already in place. On Thu, Aug 27, 2009 at 9:52 AM, Edward M. Corrado ecorr...@ecorrado.uswrote: Rosalyn's post made me think of one more thing if you are looking into outside entities (such as we are), what are the terms of service and what guarantee do they offer they won't lose your data? I believe that A3 does not offer any guarantee, so if you go with them, you probably want to have some other form of storage as well. Even if they offered a guarantee, what good is it once they loose your documents you were trying to preserve? Edward Corrado Rosalyn Metz wrote: Hi Edward, Might I suggest you look into cloud computing services if you're looking at different options. (I know you're all shocked I suggested it). If our budget weren't so abysmal (and going to get worse) we would be using it right now rather than the snap server we purchased with leftover funds. The benefits of using the cloud is of course the elasticity it offers you. The negative is that you have to pay to put your files into the cloud and then pay again to take them out (and since we've already been slashed 30% and are guaranteed another slash...that idea was shot down). Of course the major player out there is Amazon S3. The problem is that you can't use S3 via Amazon's Web Management Console. But there is a company called RightScale (http://www.rightscale.com/index.php) which has a web management console that allows you to upload files quickly and easily without having to write scripts and what not. Anyway, just my two cents. Rosalyn On Thu, Aug 27, 2009 at 8:10 AM, Edward Iglesias edwardigles...@gmail.comwrote: As I was trying to figure out what to do with half a terabyte of archival TIFFS it occurred to me that perhaps someone else had this problem. We are starting to produce massive amounts of digital objects (videos, archival TIFFS, audio interviews). Up until now we have been dealing with ways to display them to the public. Now we are starting to look at dark archives like OCLC's digital archive product. I would welcome any suggestions from those of you who have dealt with this on an archival level. It's one thing to stick the stuff up on a server, but then what? Our CIO suggested storage appliances like this one http://www.drobo.com/products/index.php but I am wary of the proprietary RAID system. Thanks in advance, ~ Edward Iglesias Systems Librarian Central Connecticut State University
Re: [CODE4LIB] Suggest a keynote speaker for Code4Lib 2010!
Hey James, Tim (or his bot...there is speculation) just tweeted this: http://twitter.com/timoreilly/statuses/2874552986 so maybe your dream is closer to reality than you think. Rosalyn On Mon, Jul 27, 2009 at 9:37 AM, King, James (NIH/OD/ORS) [E]james.k...@nih.gov wrote: Hi, I nominate two possibilities: * Jeff Patterson CEO of Safari Books Safari Books is the online host of the O'Reilly Book series. Jeff is a very down-to-earth guy but is also 'geeky' enough to work with this crowd. BIO: http://www.safaribooksonline.com/Corporate/Company/boardDirectors.php Jeff Patterson joined Safari Books Online from CMP Technology LLC, where he served in a number of senior executive positions over nine years. His most recent position was president of the Business Technology Group that included leading IT media brands such as InformationWeek, TechWeb and the Web 2.0 Conference. Prior to CMP, he co-founded Beacon Technology Partners LLC, a market research firm specializing in measuring IT audience behavior and attitudes, and held management positions in media, manufacturing and advertising companies including Cahners Publishing Company (now Reed Business Information); National Semiconductor; Foote, Cone Belding/San Francisco; Raychem Corporation; and Pinne, Garvin Hock/San Francisco. Jeff earned his B.A. from the University of California, Berkeley. *** Tim O'Reilly Founder and CEO of O'Reilly Media We can dream, can't we? In case you don't see the connection, he is the CEO of O'Reilly, home of the technology series of books named after him and author of radar.oreilly.com. I've tried to get him for a keynote for a conference and he accepted but the conference decided to go another route so I know he will consider keynotes for technology crowds. BIO: http://oreilly.com/oreilly/tim_bio.html Tim O'Reilly is the founder and CEO of O'Reilly Media, thought by many to be the best computer book publisher in the world. The company also publishes online through the O'Reilly Network and hosts conferences on technology topics. Tim is an activist for open source, open standards, and sensible intellectual property laws. Since 1978, Tim has led the company's pursuit of its core goal: to be a catalyst for technology change by capturing and transmitting the knowledge of alpha geeks and other innovators. His active engagement with technology communities drives both the company's product development and its marketing. Tim has built a culture where advocacy, meme-making, and evangelism are key tenets of the business philosophy. -James ___ James King, MLS Information Architect National Institutes of Health Library Building 10, Room 1L-07A 10 Center Drive, MSC 1150 Bethesda, MD 20892-1150 Phone: (301) 496-2187 Fax: (301) 402-0254 E-mail: kin...@mail.nih.gov http://nihlibrary.nih.gov/ ___ Amazing Research. Amazing Help. Please consider the environment before printing this e-mail -Original Message- From: Code for Libraries [mailto:code4...@listserv.nd.edu] On Behalf Of Andreas Orphanides Sent: Thursday, July 23, 2009 10:24 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] Suggest a keynote speaker for Code4Lib 2010! Hi folks, The time has come once again to commence discussion of possible keynote speakers for the upcoming Code4Lib 2010 conference in Asheville! If you've got any suggestions for a speaker who'd be engaging, knowledgeable, and foolhardy enough to accept this high honor, throw their names to the list for discussion. We here at Code4Lib 2010 World Headquarters, deep under the sea, will accept nominations until *September 16, 2009*. Shortly thereafter we will open the polls for online voting. All suggestions and comments are welcome! Discuss away! Andreas Orphanides Code4Lib 2010 Keynote Speakers Committee