Re: [CODE4LIB] The Berman Catalog

2008-06-23 Thread Ed Summers
I actually created a minimal RubyOnRails view on the Berman data a few
years ago.

  http://catalog.sanfordberman.org

The src code for this is readily available if anyone is interested in
helping Madeline work on it:

  http://inkdroid.org/bzr/berman/

The data has an interesting history, for anyone working with machine
readable cataloging data.

//Ed


Re: [CODE4LIB] code4lib 2009

2008-06-23 Thread Michael J. Giarlo
On Mon, Jun 23, 2008 at 11:20 AM, Roy Tennant [EMAIL PROTECTED] wrote:
 JodiS suggested Abigail Sellen in the chatroom, which I'm posting here so we
 don't lose track of the suggestion. She does look quite interesting:
 http://research.microsoft.com/~asellen/. There is a video of her talking
 at http://www.archive.org/details/Abigailsellen_geekdinner.

To this name I might add Sue Dumais's
(http://research.microsoft.com/~sdumais/).  She's also at MSR: I am
interested in algorithms and interfaces for improved information
retrieval, as well as general issues in and human-computer
interaction. I joined Microsoft Research in July 1997. I work on a
wide variety of information access and management issues, including:
personal information management, web search, question answering,
information retrieval, text categorization, collaborative filtering,
interfaces for improved search and navigation, and user/task
modeling.  I've seen her speak once and she's good.

Here are the vote-getting keynoters who were considered and not chosen
last year: David Weinberger, Aaron Swartz,  Mitchell Baker, Herbert
Van De Sompel, Carl Lagoze, Alexis Rossi, MacKenzie Smith, Cathy
Marshall, Jeni Tennison, Karen Calhoun, Karen Coombs, Sandy Payette,
Kathy Sierra, Marek Tuszynski, Zaheda Bhorat.

-Mike


Re: [CODE4LIB] The Berman Catalog

2008-06-23 Thread Karen Coyle
Are there any rights issues? If not, it would be good to make the raw 
files available, say by putting them up at the Internet Archive.


kc

md wrote:
I have the raw data files of the former Hennepin County Library 
catalog and authority files.   


This is the innovative, unique catalog created
by Sandy Berman. 1970s-2002.

I would like to import the data into a MYSQL database.  I assume
this can be done with Perl, but don't know if an existing parser
would work or if a custom program would be needed.

I have no programming skills.   There must be someone...
here who knows and values Berman's work and is ready, 
willing and able to devote their knowledge

and skills to making it accessible once again.

Please contact me with questions on or off list.

Thank You!

Madeline Douglass
[EMAIL PROTECTED]

http://www.sanfordberman.org 



  



--
---
Karen Coyle / Digital Library Consultant
[EMAIL PROTECTED] http://www.kcoyle.net
ph.: 510-540-7596   skype: kcoylenet
fx.: 510-848-3913
mo.: 510-435-8234



[CODE4LIB] Code4Lib Journal, Issue 3

2008-06-23 Thread Ron Peterson
The third issue of the Code4Lib Journal is now available at 
http://journal.code4lib.org/.  Issue 3 continues the breadth and depth of 
articles that you found in the first 2 issues.  Check out the table of contents 
included below:

Code4Lib Journal
* Editorial Introduction - Issue - http://journal.code4lib.org/articles/104
Ron Peterson

* Alternative Solutions for Off-Campus Authentication - 
http://journal.code4lib.org/articles/73
Rebekah Kilzer, Elizabeth L. Black and James Muir
The Ohio State University Libraries created scripts to overcome the local 
limitations of the proxy server and to offer resource connections at the point 
of need. All libraries struggle to provide seamless authentication for access 
to paid resources, such as research databases and electronic journals. In order 
to obtain access to this content, the libraries must sign contracts promising 
to limit access to these resources to their user community. The challenge then 
comes in balancing the patron’s need for easy access to these rich data sources 
from any computer and the vendors’ desire to protect their assets.

* Distributed Version Control and Library Metadata - 
http://journal.code4lib.org/articles/86
Galen M. Charlton
Distributed version control systems (DVCSs) are effective tools for managing 
source code and other artifacts produced by software projects with multiple 
contributors. This article describes DVCSs and compares them with traditional 
centralized version control systems, then describes extending the DVCS model to 
improve the exchange of library metadata.

* The Planets Testbed: Science for Digital Preservation - 
http://journal.code4lib.org/articles/83
Brian Aitken (HATII), Petra Helwig (NANETH), Andrew Jackson (BL), Andrew 
Lindley (ARC), Eleonora Nicchiarelli (ONB), Seamus Ross (HATII)
The preservation of digital objects requires specific software tools or 
services. These can be characterisation tools that abstract the essential 
characteristics of a digital object from a file, migration tools that convert 
digital objects to different formats, or emulation tools that render digital 
objects in their original context on a new infrastructure. Until recently 
digital preservation has been characterised by practices and processes that 
could best be described as more art and craft than science. The Planets Testbed 
provides a controlled environment where preservation tools can be tested and 
evaluated, and where experiment results can be empirically compared. This paper 
presents an overview of the Testbed application, an analysis of the experiment 
methodology and a description of the Testbed’s web service approach.

* Bringing Sheet Music to Life: My Experiences with OMR - 
http://journal.code4lib.org/articles/84
Andrew Bullen
This article describes the process of digitizing sheet music celebrating 
Pullman porters and rail travel from the 1870s-1920s. The process involves 1) 
digitizing sheet music, 2) running the digitized sheet music through an Optical 
Musical Recognition (OMR) software package, 3)cleaning up the resulting file, 
4) converting it into an .mp3/MIDI file, and 5) tweaking it to use the 
voices/instruments of a music editing software program. The pros and cons of 
some popular OMR programs are discussed.

* Building an Archival Collections Portal - 
http://journal.code4lib.org/articles/77
Terry Catapano, Joanna DiPasquale, and Stuart Marquis
Columbia University Libraries has developed the Archival Collections Portal, a 
unified search system helping users discover archival resources in a 
streamlined way. We combined the power of Lucene and Solr to search XML, parse 
JSON objects, create EAD-compliant documents, and deliver results in an 
easy-to-use interface. By reusing MARC records and employing new search engine 
features and techniques, we are able to bring important and hard-to-find 
collections to researchers and archivists. The canonical home page of the 
Portal is http://www.columbia.edu/library/archival/.

* Developing an Academic Image Collection with Flickr - 
http://journal.code4lib.org/articles/74
Jeremy McWilliams
A group at Lewis  Clark College in Portland are in the process of developing 
an educational collection of contemporary ceramics images using the photo 
sharing site Flickr as a back end. This article discusses the evolution of the 
project, Flickr machine tags, and the concept of Flickr as an application 
database layer. The article includes code samples for creating and querying 
machine tags using the Flickr API.

* Making Patron Data Work Harder: User Search Terms as Access Points? - 
http://journal.code4lib.org/articles/78
Jason A. Clark
Montana State University (MSU) Libraries are experimenting with re-using 
patron-generated data to create browseable access points for the Electronic 
Theses and Dissertations (ETD) collection. A beta QueryCatcher module logs 
recent search terms and the number of associated hits. These terms are used to 
create browseable 

[CODE4LIB] Job Posting: Director for Knowledge Integration, Stanford, CA

2008-06-23 Thread Ryan Max Steinberg
Stanford's Lane Medical Library and Knowledge Management Center has an 
opening for a Director for Knowledge Integration.


For a full description and information about applying, please see:

 http://jobs.stanford.edu
 Job ID: 30457



Reporting to the Associate Dean for Knowledge Management, Director of 
Lane Medical Library, the Director for Knowledge Integration is 
responsible for the strategic direction and daily operation of web 
presence and integrated services. This position defines, designs, and 
supports contextually-based knowledge integration applications that 
simplify the workflow of clinicians, researchers and biomedical trainees 
for the Lane Medical Library  Knowledge Management Center. The Director 
for Knowledge Integration monitors emerging technologies to enable 
online access, dissemination, and manipulation of knowledge resources 
and services.


The Director is a member of Lane Medical Library’s senior management 
team which has the responsibility to create the overall vision and guide 
the process of setting priorities, develop appropriate strategies in 
achieving objectives, enable successful outcomes through the best use of 
library assets including, staff, funds, collections, technology and 
facilities, and ensure world class knowledge-based services in support 
of the School’s research, clinical and educational mission.




RESPONSIBILITIES

*Knowledge Systems Integration*
• Research, evaluate and implement emerging digital library and 
knowledge management technologies.
• Oversee the implementation, integration and support of disparate 
library systems and technical components, including the integrated 
library system, link resolver and proxy server.
• Consult with and provide technical project planning and implementation 
support to the Director for Resource Management and the Access Services 
Manager.
• Identify and resolve technical problems in software development 
processes and deployed systems.
• Responsible for the library  knowledge center’s web presence and 
services. Manage the operational issues of web services and ancillary 
products and tools. Support all production activities as necessary.
• Analyze automation of work processes to streamline user services and 
make effective use of library resources.



*Knowledge Architecture and Planning*
• Assess the changing information landscape of the knowledge management 
center; review the state of the art of information management 
technologies; promote and prototype systems to push current strengths 
into the contextually-based care, learning and research interfaces of 
tomorrow to meet strategic user workflow based on needs assessment.
• Build effective partnerships with medical center departments through 
the Lane Liaison and Informationist programs to insure seamless and 
embedded point of use of web services.
• Collaborate with library director and senior library managers for 
strategic vision, planning, and allocation of technical resources.
• Collaborate with Systems Development and Data Center staff on the 
development, deployment and routine maintenance of knowledge management 
technologies.
• Promote cross-functional, cross-organizational, effective and 
productive partnerships with knowledge management team members, 
including IRT Systems Development and Data Center staff, ITS and SULAIR 
partners, as well as interdisciplinary SOM or external development partners.

• Chair and coordinate the interdisciplinary knowledge management team.
• Synthesize ideas into clearly articulated project plans.

*Project Management*
• Establish and maintain processes to manage scope over the knowledge 
integration lifecycle. Monitor and manage resources and milestones to 
achieve implementation schedules and ensure goals are reached.
• Manage the efforts of the web development group to ensure technical 
interoperability and alignment of design choices with strategic 
planning. Work closely with this group to inform design and development 
of the integrated and dynamic web application.
• Plan and facilitate cross-functional meetings with and between 
knowledge management team members. Track, report and communicate action 
items. Identify cross-functional and project dependencies and scope 
changes that may impact timelines.


• Inform associate dean and directors with clear and accurate 
information about effort, timeliness and quality of functional and 
technical aspects of projects. Participate in the overall strategic 
planning for the Lane Medical Library  Knowledge Management Center
• Participate in grant activities, contribute to scholarship in areas of 
expertise, and participate in the national digital library scene.
• Supervise a team of 2.0 FTE professional staff including hiring, 
performance appraisals, goal setting, and professional development.




QUALIFICATIONS

• MS in Library Science or related field, emphasis on computer systems.
• Minimum of five years related experience required.
• Demonstrated ability to 

Re: [CODE4LIB] The Berman Catalog

2008-06-23 Thread md
Thanks for the encouraging words about the Berman Catalog to fellow
colleagues...

I don't know if you know, but

Although I did find the Ruby code here http://inkdroid.org/bzr/berman/

I did not find the Marc record reader or parser you created...was it
MARC : : Forgiving Reader? to read the records, then import them
into the MYSQL database.

and

I did not find the MYSQL database/databases? you designed.

I assume these are still resident on your server.If I am remembering
this
right, I think that you had imported some but not all of the catalog records
and there was not time to create the authority file database(s) ?

All I have to work with right now is the raw data.

So  I need to get 100% of the raw data catalog and authority files into one
or
perhaps several MYSQL databases, before the work on the Ruby display
can proceed.

I've recently heard of a Ruby based OPAC Blacklight, but don't know if
that would work for this project.

So if I can find someone to help, he or she would have to do what I assume
you already did

1. Assess the state of the raw data to find out if all the delimiters
are where they should be and perhaps fix any data imperfections
that would be an obstacle to transfer

2. Create a structure for the MYSQL database after analysing the
structure of the data...field lengths/repeating fields, etc.

3. Either use an existing or create a custom Marc Reader or parser
to read the records and import them into MYSQL.

There would also have to be a way of checking if everything transferred okay
without data loss...I used to call this data validation.

Now...if you would send me a copy of MARC : : ForgivingReader
and the MYSQL database you created either populated with
the records you put into it or blank but with the structure you
defined, that would eliminate the need to reinvent the wheel
and start over.

And although I hope this message will bring me someone
who can help, so far the responses I've received are from
people with their own projects.   So if you do know anyone
who can help, let me know.

If it's someone who would need funding, I'll find out
if funds could be raised.

Times a wastin...Sandy and George Carlin have/had
the same condition.

Madeline Douglass
[EMAIL PROTECTED]


[CODE4LIB] Rapi: An open-source OPAC package

2008-06-23 Thread Jesse Prabawa
Hi,

Rapi is an open-source project of the WING group in the School of Computing,
National University of Singapore licensed under the MIT license.

Rapi provides an OPAC package that allows you to:
  1. Build a Lucene index from your MARC files
  2. Screen scrape live circulation data from your own iii OPAC
  3. Wrap your OPAC with a customizable user interface

The user interface packaged with Rapi has been tested with Firefox 2
and 3 as well as Internet Explorer 7. The user interface supports a
variety of features including tabs, an overview+details view, and a
suggestion bar among many others. Note that although the user
interface supports query suggestions, the package currently does not
provide any suggestion modules. With that said, if you do have query
suggestion modules, they can be easily integrated with the package.
As an example, our live demo at http://linc.comp.nus.edu.sg
incorporates a spelling suggestion module.

For more information, Rapi is available at http://linc.comp.nus.edu.sg/code/

We look forward to your feedback. Thanks,


Jesse
School of Computing
National University of Singapore


Re: [CODE4LIB] use of OpenSearch response elements in libraries?

2008-06-23 Thread Roy Tennant
I believe WorldCat qualifies, although the API is not yet ready for general
release (but soon):

http://worldcat.org/devnet/index.php/SearchAPIDetails

Roy


On 6/23/08 6/23/08 € 8:55 PM, Godmar Back [EMAIL PROTECTED] wrote:

 Hi,
 
 are there any examples of functioning OpenSearch interfaces to library
 catalogs or library information systems?
 
 I'm specifically interested in those that not only advertise a text/html
 interface to their catalog, but who include OpenSearch response elements.
 One example I've found is Evergreen; though it's not clear to what extent
 this interface is used or implemented. For instance, their demo
 installation's OpenSearch description advertises an ATOM feed, but what's
 returned doesn't validate. (*)
 
 Are there other examples deployed (and does anybody know applications that
 consume OpenSearch feeds?)
 
  - Godmar
 
 (*) See, for instance:
 http://demo.gapines.org/opac/extras/opensearch/1.1/PINES/atom-full/keyword/?se
 archTerms=musicstartPage=startIndex=count=searchLang
 which is not a valid ATOM feed:
 http://validator.w3.org/feed/check.cgi?url=http%3A%2F%2Fdemo.gapines.org%2Fopa
 c%2Fextras%2Fopensearch%2F1.1%2FPINES%2Fatom-full%2Fkeyword%2F%3FsearchTerms%3
 Dmusic%26startPage%3D%26startIndex%3D%26count%3D%26searchLang
 

-- 


Re: [CODE4LIB] use of OpenSearch response elements in libraries?

2008-06-23 Thread Godmar Back
Thanks --- let me do some query refinement then -- does anybody know of
examples where record metadata (e.g., MARCXML or DC) is returned as an
OpenSearch response?  [ If I understand the proposed Worldcat API correctly,
OpenSearch is used only for pre-formatted citations in HTML. ]

 - Godmar

On Tue, Jun 24, 2008 at 12:54 AM, Roy Tennant [EMAIL PROTECTED] wrote:

 I believe WorldCat qualifies, although the API is not yet ready for general
 release (but soon):

 http://worldcat.org/devnet/index.php/SearchAPIDetails

 Roy


 On 6/23/08 6/23/08 € 8:55 PM, Godmar Back [EMAIL PROTECTED] wrote:

  Hi,
 
  are there any examples of functioning OpenSearch interfaces to library
  catalogs or library information systems?
 
  I'm specifically interested in those that not only advertise a text/html
  interface to their catalog, but who include OpenSearch response elements.
  One example I've found is Evergreen; though it's not clear to what extent
  this interface is used or implemented. For instance, their demo
  installation's OpenSearch description advertises an ATOM feed, but what's
  returned doesn't validate. (*)
 
  Are there other examples deployed (and does anybody know applications
 that
  consume OpenSearch feeds?)
 
   - Godmar
 
  (*) See, for instance:
 
 http://demo.gapines.org/opac/extras/opensearch/1.1/PINES/atom-full/keyword/?se
  archTerms=musicstartPage=startIndex=count=searchLang
  which is not a valid ATOM feed:
 
 http://validator.w3.org/feed/check.cgi?url=http%3A%2F%2Fdemo.gapines.org%2Fopa
 
 c%2Fextras%2Fopensearch%2F1.1%2FPINES%2Fatom-full%2Fkeyword%2F%3FsearchTerms%3
  Dmusic%26startPage%3D%26startIndex%3D%26count%3D%26searchLang
 

 --