Re: [CODE4LIB] The Berman Catalog
I actually created a minimal RubyOnRails view on the Berman data a few years ago. http://catalog.sanfordberman.org The src code for this is readily available if anyone is interested in helping Madeline work on it: http://inkdroid.org/bzr/berman/ The data has an interesting history, for anyone working with machine readable cataloging data. //Ed
Re: [CODE4LIB] code4lib 2009
On Mon, Jun 23, 2008 at 11:20 AM, Roy Tennant [EMAIL PROTECTED] wrote: JodiS suggested Abigail Sellen in the chatroom, which I'm posting here so we don't lose track of the suggestion. She does look quite interesting: http://research.microsoft.com/~asellen/. There is a video of her talking at http://www.archive.org/details/Abigailsellen_geekdinner. To this name I might add Sue Dumais's (http://research.microsoft.com/~sdumais/). She's also at MSR: I am interested in algorithms and interfaces for improved information retrieval, as well as general issues in and human-computer interaction. I joined Microsoft Research in July 1997. I work on a wide variety of information access and management issues, including: personal information management, web search, question answering, information retrieval, text categorization, collaborative filtering, interfaces for improved search and navigation, and user/task modeling. I've seen her speak once and she's good. Here are the vote-getting keynoters who were considered and not chosen last year: David Weinberger, Aaron Swartz, Mitchell Baker, Herbert Van De Sompel, Carl Lagoze, Alexis Rossi, MacKenzie Smith, Cathy Marshall, Jeni Tennison, Karen Calhoun, Karen Coombs, Sandy Payette, Kathy Sierra, Marek Tuszynski, Zaheda Bhorat. -Mike
Re: [CODE4LIB] The Berman Catalog
Are there any rights issues? If not, it would be good to make the raw files available, say by putting them up at the Internet Archive. kc md wrote: I have the raw data files of the former Hennepin County Library catalog and authority files. This is the innovative, unique catalog created by Sandy Berman. 1970s-2002. I would like to import the data into a MYSQL database. I assume this can be done with Perl, but don't know if an existing parser would work or if a custom program would be needed. I have no programming skills. There must be someone... here who knows and values Berman's work and is ready, willing and able to devote their knowledge and skills to making it accessible once again. Please contact me with questions on or off list. Thank You! Madeline Douglass [EMAIL PROTECTED] http://www.sanfordberman.org -- --- Karen Coyle / Digital Library Consultant [EMAIL PROTECTED] http://www.kcoyle.net ph.: 510-540-7596 skype: kcoylenet fx.: 510-848-3913 mo.: 510-435-8234
[CODE4LIB] Code4Lib Journal, Issue 3
The third issue of the Code4Lib Journal is now available at http://journal.code4lib.org/. Issue 3 continues the breadth and depth of articles that you found in the first 2 issues. Check out the table of contents included below: Code4Lib Journal * Editorial Introduction - Issue - http://journal.code4lib.org/articles/104 Ron Peterson * Alternative Solutions for Off-Campus Authentication - http://journal.code4lib.org/articles/73 Rebekah Kilzer, Elizabeth L. Black and James Muir The Ohio State University Libraries created scripts to overcome the local limitations of the proxy server and to offer resource connections at the point of need. All libraries struggle to provide seamless authentication for access to paid resources, such as research databases and electronic journals. In order to obtain access to this content, the libraries must sign contracts promising to limit access to these resources to their user community. The challenge then comes in balancing the patron’s need for easy access to these rich data sources from any computer and the vendors’ desire to protect their assets. * Distributed Version Control and Library Metadata - http://journal.code4lib.org/articles/86 Galen M. Charlton Distributed version control systems (DVCSs) are effective tools for managing source code and other artifacts produced by software projects with multiple contributors. This article describes DVCSs and compares them with traditional centralized version control systems, then describes extending the DVCS model to improve the exchange of library metadata. * The Planets Testbed: Science for Digital Preservation - http://journal.code4lib.org/articles/83 Brian Aitken (HATII), Petra Helwig (NANETH), Andrew Jackson (BL), Andrew Lindley (ARC), Eleonora Nicchiarelli (ONB), Seamus Ross (HATII) The preservation of digital objects requires specific software tools or services. These can be characterisation tools that abstract the essential characteristics of a digital object from a file, migration tools that convert digital objects to different formats, or emulation tools that render digital objects in their original context on a new infrastructure. Until recently digital preservation has been characterised by practices and processes that could best be described as more art and craft than science. The Planets Testbed provides a controlled environment where preservation tools can be tested and evaluated, and where experiment results can be empirically compared. This paper presents an overview of the Testbed application, an analysis of the experiment methodology and a description of the Testbed’s web service approach. * Bringing Sheet Music to Life: My Experiences with OMR - http://journal.code4lib.org/articles/84 Andrew Bullen This article describes the process of digitizing sheet music celebrating Pullman porters and rail travel from the 1870s-1920s. The process involves 1) digitizing sheet music, 2) running the digitized sheet music through an Optical Musical Recognition (OMR) software package, 3)cleaning up the resulting file, 4) converting it into an .mp3/MIDI file, and 5) tweaking it to use the voices/instruments of a music editing software program. The pros and cons of some popular OMR programs are discussed. * Building an Archival Collections Portal - http://journal.code4lib.org/articles/77 Terry Catapano, Joanna DiPasquale, and Stuart Marquis Columbia University Libraries has developed the Archival Collections Portal, a unified search system helping users discover archival resources in a streamlined way. We combined the power of Lucene and Solr to search XML, parse JSON objects, create EAD-compliant documents, and deliver results in an easy-to-use interface. By reusing MARC records and employing new search engine features and techniques, we are able to bring important and hard-to-find collections to researchers and archivists. The canonical home page of the Portal is http://www.columbia.edu/library/archival/. * Developing an Academic Image Collection with Flickr - http://journal.code4lib.org/articles/74 Jeremy McWilliams A group at Lewis Clark College in Portland are in the process of developing an educational collection of contemporary ceramics images using the photo sharing site Flickr as a back end. This article discusses the evolution of the project, Flickr machine tags, and the concept of Flickr as an application database layer. The article includes code samples for creating and querying machine tags using the Flickr API. * Making Patron Data Work Harder: User Search Terms as Access Points? - http://journal.code4lib.org/articles/78 Jason A. Clark Montana State University (MSU) Libraries are experimenting with re-using patron-generated data to create browseable access points for the Electronic Theses and Dissertations (ETD) collection. A beta QueryCatcher module logs recent search terms and the number of associated hits. These terms are used to create browseable
[CODE4LIB] Job Posting: Director for Knowledge Integration, Stanford, CA
Stanford's Lane Medical Library and Knowledge Management Center has an opening for a Director for Knowledge Integration. For a full description and information about applying, please see: http://jobs.stanford.edu Job ID: 30457 Reporting to the Associate Dean for Knowledge Management, Director of Lane Medical Library, the Director for Knowledge Integration is responsible for the strategic direction and daily operation of web presence and integrated services. This position defines, designs, and supports contextually-based knowledge integration applications that simplify the workflow of clinicians, researchers and biomedical trainees for the Lane Medical Library Knowledge Management Center. The Director for Knowledge Integration monitors emerging technologies to enable online access, dissemination, and manipulation of knowledge resources and services. The Director is a member of Lane Medical Library’s senior management team which has the responsibility to create the overall vision and guide the process of setting priorities, develop appropriate strategies in achieving objectives, enable successful outcomes through the best use of library assets including, staff, funds, collections, technology and facilities, and ensure world class knowledge-based services in support of the School’s research, clinical and educational mission. RESPONSIBILITIES *Knowledge Systems Integration* • Research, evaluate and implement emerging digital library and knowledge management technologies. • Oversee the implementation, integration and support of disparate library systems and technical components, including the integrated library system, link resolver and proxy server. • Consult with and provide technical project planning and implementation support to the Director for Resource Management and the Access Services Manager. • Identify and resolve technical problems in software development processes and deployed systems. • Responsible for the library knowledge center’s web presence and services. Manage the operational issues of web services and ancillary products and tools. Support all production activities as necessary. • Analyze automation of work processes to streamline user services and make effective use of library resources. *Knowledge Architecture and Planning* • Assess the changing information landscape of the knowledge management center; review the state of the art of information management technologies; promote and prototype systems to push current strengths into the contextually-based care, learning and research interfaces of tomorrow to meet strategic user workflow based on needs assessment. • Build effective partnerships with medical center departments through the Lane Liaison and Informationist programs to insure seamless and embedded point of use of web services. • Collaborate with library director and senior library managers for strategic vision, planning, and allocation of technical resources. • Collaborate with Systems Development and Data Center staff on the development, deployment and routine maintenance of knowledge management technologies. • Promote cross-functional, cross-organizational, effective and productive partnerships with knowledge management team members, including IRT Systems Development and Data Center staff, ITS and SULAIR partners, as well as interdisciplinary SOM or external development partners. • Chair and coordinate the interdisciplinary knowledge management team. • Synthesize ideas into clearly articulated project plans. *Project Management* • Establish and maintain processes to manage scope over the knowledge integration lifecycle. Monitor and manage resources and milestones to achieve implementation schedules and ensure goals are reached. • Manage the efforts of the web development group to ensure technical interoperability and alignment of design choices with strategic planning. Work closely with this group to inform design and development of the integrated and dynamic web application. • Plan and facilitate cross-functional meetings with and between knowledge management team members. Track, report and communicate action items. Identify cross-functional and project dependencies and scope changes that may impact timelines. • Inform associate dean and directors with clear and accurate information about effort, timeliness and quality of functional and technical aspects of projects. Participate in the overall strategic planning for the Lane Medical Library Knowledge Management Center • Participate in grant activities, contribute to scholarship in areas of expertise, and participate in the national digital library scene. • Supervise a team of 2.0 FTE professional staff including hiring, performance appraisals, goal setting, and professional development. QUALIFICATIONS • MS in Library Science or related field, emphasis on computer systems. • Minimum of five years related experience required. • Demonstrated ability to
Re: [CODE4LIB] The Berman Catalog
Thanks for the encouraging words about the Berman Catalog to fellow colleagues... I don't know if you know, but Although I did find the Ruby code here http://inkdroid.org/bzr/berman/ I did not find the Marc record reader or parser you created...was it MARC : : Forgiving Reader? to read the records, then import them into the MYSQL database. and I did not find the MYSQL database/databases? you designed. I assume these are still resident on your server.If I am remembering this right, I think that you had imported some but not all of the catalog records and there was not time to create the authority file database(s) ? All I have to work with right now is the raw data. So I need to get 100% of the raw data catalog and authority files into one or perhaps several MYSQL databases, before the work on the Ruby display can proceed. I've recently heard of a Ruby based OPAC Blacklight, but don't know if that would work for this project. So if I can find someone to help, he or she would have to do what I assume you already did 1. Assess the state of the raw data to find out if all the delimiters are where they should be and perhaps fix any data imperfections that would be an obstacle to transfer 2. Create a structure for the MYSQL database after analysing the structure of the data...field lengths/repeating fields, etc. 3. Either use an existing or create a custom Marc Reader or parser to read the records and import them into MYSQL. There would also have to be a way of checking if everything transferred okay without data loss...I used to call this data validation. Now...if you would send me a copy of MARC : : ForgivingReader and the MYSQL database you created either populated with the records you put into it or blank but with the structure you defined, that would eliminate the need to reinvent the wheel and start over. And although I hope this message will bring me someone who can help, so far the responses I've received are from people with their own projects. So if you do know anyone who can help, let me know. If it's someone who would need funding, I'll find out if funds could be raised. Times a wastin...Sandy and George Carlin have/had the same condition. Madeline Douglass [EMAIL PROTECTED]
[CODE4LIB] Rapi: An open-source OPAC package
Hi, Rapi is an open-source project of the WING group in the School of Computing, National University of Singapore licensed under the MIT license. Rapi provides an OPAC package that allows you to: 1. Build a Lucene index from your MARC files 2. Screen scrape live circulation data from your own iii OPAC 3. Wrap your OPAC with a customizable user interface The user interface packaged with Rapi has been tested with Firefox 2 and 3 as well as Internet Explorer 7. The user interface supports a variety of features including tabs, an overview+details view, and a suggestion bar among many others. Note that although the user interface supports query suggestions, the package currently does not provide any suggestion modules. With that said, if you do have query suggestion modules, they can be easily integrated with the package. As an example, our live demo at http://linc.comp.nus.edu.sg incorporates a spelling suggestion module. For more information, Rapi is available at http://linc.comp.nus.edu.sg/code/ We look forward to your feedback. Thanks, Jesse School of Computing National University of Singapore
Re: [CODE4LIB] use of OpenSearch response elements in libraries?
I believe WorldCat qualifies, although the API is not yet ready for general release (but soon): http://worldcat.org/devnet/index.php/SearchAPIDetails Roy On 6/23/08 6/23/08 8:55 PM, Godmar Back [EMAIL PROTECTED] wrote: Hi, are there any examples of functioning OpenSearch interfaces to library catalogs or library information systems? I'm specifically interested in those that not only advertise a text/html interface to their catalog, but who include OpenSearch response elements. One example I've found is Evergreen; though it's not clear to what extent this interface is used or implemented. For instance, their demo installation's OpenSearch description advertises an ATOM feed, but what's returned doesn't validate. (*) Are there other examples deployed (and does anybody know applications that consume OpenSearch feeds?) - Godmar (*) See, for instance: http://demo.gapines.org/opac/extras/opensearch/1.1/PINES/atom-full/keyword/?se archTerms=musicstartPage=startIndex=count=searchLang which is not a valid ATOM feed: http://validator.w3.org/feed/check.cgi?url=http%3A%2F%2Fdemo.gapines.org%2Fopa c%2Fextras%2Fopensearch%2F1.1%2FPINES%2Fatom-full%2Fkeyword%2F%3FsearchTerms%3 Dmusic%26startPage%3D%26startIndex%3D%26count%3D%26searchLang --
Re: [CODE4LIB] use of OpenSearch response elements in libraries?
Thanks --- let me do some query refinement then -- does anybody know of examples where record metadata (e.g., MARCXML or DC) is returned as an OpenSearch response? [ If I understand the proposed Worldcat API correctly, OpenSearch is used only for pre-formatted citations in HTML. ] - Godmar On Tue, Jun 24, 2008 at 12:54 AM, Roy Tennant [EMAIL PROTECTED] wrote: I believe WorldCat qualifies, although the API is not yet ready for general release (but soon): http://worldcat.org/devnet/index.php/SearchAPIDetails Roy On 6/23/08 6/23/08 € 8:55 PM, Godmar Back [EMAIL PROTECTED] wrote: Hi, are there any examples of functioning OpenSearch interfaces to library catalogs or library information systems? I'm specifically interested in those that not only advertise a text/html interface to their catalog, but who include OpenSearch response elements. One example I've found is Evergreen; though it's not clear to what extent this interface is used or implemented. For instance, their demo installation's OpenSearch description advertises an ATOM feed, but what's returned doesn't validate. (*) Are there other examples deployed (and does anybody know applications that consume OpenSearch feeds?) - Godmar (*) See, for instance: http://demo.gapines.org/opac/extras/opensearch/1.1/PINES/atom-full/keyword/?se archTerms=musicstartPage=startIndex=count=searchLang which is not a valid ATOM feed: http://validator.w3.org/feed/check.cgi?url=http%3A%2F%2Fdemo.gapines.org%2Fopa c%2Fextras%2Fopensearch%2F1.1%2FPINES%2Fatom-full%2Fkeyword%2F%3FsearchTerms%3 Dmusic%26startPage%3D%26startIndex%3D%26count%3D%26searchLang --