[CODE4LIB] Job Posting: infrastructure developer at Stanford University Libraries

2008-05-14 Thread Lynn McRae

Software Engineer/Java Developer for Digital Library Systems and
Services
This is a 2 year fixed term position.

Full description and application process found at 
http://jobs.stanford.edu/find_a_job.html
Search for keyword 30271

Short Description:

The System Software Developer for Digital Library Systems will play an
integral role in defining, developing and delivering information
systems and infrastructure for the library of the future at Stanford
University. As part of the Digital Library Systems and Services (DLSS)
unit within Stanford University Libraries, the successful candidate
will help lead development efforts to support scholarship in the
digital age by delivering on the promises of the digital library.

The incumbent will be a part of the Digital Library Infrastructure
group, focused on realizing core components of Stanford's emerging
digital library architecture. The team will deliver cross-cutting
digital library services and infrastructure, such as metadata
generation, transformation and management; service oriented library
cyberinfrastructure; and digital asset  rights management. This group
works closely with peer teams focused on other aspects of digital
library functionality, including Web/UI application development, and
the creation of common and reusable technologies and components. The
successful candidate will be responsible for the design and initial
development of key technologies needed to support Stanford’s vision of
the digital library of the future.

Duties and Responsibilities:

   - Principle responsibility for implementing the Digital Object
Registry, a core XML- and RDF-based metadata management component of
our digital library, plus critical related services.
   - Lead role in defining and implementing an underlying strategy for a
simple, REST-based service oriented architecture to deliver
infrastructure services.
   - Provide analysis and software engineering support for implementing
and leveraging the open source Fedora framework as part of our overall
digital library access and management architecture.
   - Contribute analysis, expertise and development to other SULAIR
cyberinfrastructure and content middleware efforts, e.g., digitization
 workflow tools; metadata generation, transformation, editing  QA
tools; personalization and collaboration.
   - Work collaboratively with the the DLSS web development team,
Stanford Digital Repository team, metadata experts, digitization staff
and information architects to help define and develop an extensible,
robust and modular digital library architecture, infrastructure and
application environment.


Qualifications:

   - Proven success as a software engineer in a J2EE environment, and
experience in contributing to and/or defining the technical
architecture of complex systems.
   - Demonstrated ability to write solid, simple, elegant code both
independently and in a team-programming environment and within
schedule limitations.
   - Demonstrated success in working in an academic environment.
   - Demonstrated ability to work collaboratively on a project from
specification to launch; and to work with multiple levels of staff,
and colleagues at peer institutions and open source communities.
   - Knowledge of agile software development practices and test driven
design principles. Demonstrated understanding of best practices for
software development, and an ability to introduce and reinforce
application of those practices in a team environment.
   - Expertise in Java, familiarity with C and C+. Working knowledge of,
or ability to quickly learn, relevant scripting technologies such as
Perl, Ruby on Rails, Python, AJAX, etc.
   - Demonstrated expertise with XML and related tools and technologies
(e.g., XML schema, schema management and databases, XSLT, X-forms).
Familiarity with RDF and its relationship to Library metadata standards.
   - Knowledge of SQL and relational database concepts.
   - Masters degree or equivalent professional experience in Computer
Science, Information Science or related field.
   - Five or more years experience designing middleware, services or
infrastructure for software systems.

Desired:

   - Experience in digital library community.
   - Familiarity with library-related metadata and metadata standards,
particularly MARC, MODS, METS, TEI, EAD.
   - Software Quality Assurance (functional and performance) and testing
experience with various applications.

--
(Mr.) Lynn McRae
Infrastructure/Integration Architect
Stanford University Libraries


[CODE4LIB] FW: CDL releases eXtensible Text Framework (XTF) 2.1

2008-05-14 Thread Roy Tennant
-- Forwarded Message
From: Lisa Schiff [EMAIL PROTECTED]
Date: Tue, 13 May 2008 14:53:30 -0700
To: [EMAIL PROTECTED]
Subject: CDL releases eXtensible Text Framework (XTF) 2.1

**This announcement is being widely released; apologies in advance for
duplication.**
 
FOR IMMEDIATE RELEASE
 
Contact: Lisa Schiff
California Digital Library
University of California, Office of the President
415 20th St., 4th Floor
Oakland, CA 94612
(510) 587-6132
[EMAIL PROTECTED]
http://www.cdlib.org/inside/projects/xtf/
 
California Digital Library Announces New Release of the eXtensible Text
Framework (XTF)
 
Oakland, CA, May 12, 2008 - The California Digital Library (CDL) is pleased
to announce a new release of its search and display technology, the
eXtensible Text Framework (XTF) http://www.cdlib.org/inside/projects/xtf/
version 2.1 (http://www.cdlib.org/inside/projects/xtf/
).  XTF is an open source, highly flexible software application that
supports the search, browse and display of heterogeneous digital content.
XTF offers efficient and practical methods for creating customized end-user
interfaces for distinct digital content collections.
 
Highlights from the 2.1 release include:
* Extensive  interface improvements, including new search forms, built-in
faceted browsing,  and a new look and feel.
* Increased  support for document and information exchange formats.
 * XHTML  and OAI-PMH output
 * NLM  article format indexing and output
 * Microsoft  Word indexing
* Streamlined  XSLT stylesheets for simpler deployment and adaptation.
* Updated  documentation that has been moved to the XTF project wiki
http://xtf.wiki.sourceforge.net/ , allowing XTF  implementers to share
solutions with entire user community.
* Freeform  Boolean query language, offered as an experimental feature.
* Backward  compatibility with existing XTF implementations.
 A complete list of changes http://xtf.wiki.sourceforge.net/changeLog_2.1
 is available on the XTF Project page on SourceForge
 http://sourceforge.net/projects/xtf/ , where the distribution (including
 documentation) can also be downloaded.
  
 Since the first deployment of XTF in 2005, the development strategy has been
 to build and maintain an indexing and display technology that is not only
 customizable, but also draws upon tested components already in use by the
 digital library and search communities - in particular the Lucene text search
 engine, Java, XML, and XSLT.  By coordinating these pieces in a single
 platform that can be used to create multiple unique applications, CDL has
 succeeded in dramatically reducing the investment in infrastructure, staff
 training and development for new digital content projects.
  
 XTF offers a suite of customizable features that support diverse intellectual
 access to content.  Interfaces can be designed to support the distinct tools
 and presentations that are useful and meaningful to specific audiences.  In
 addition, XTF offers the following core features:
 * Easy to deploy: Drops directly in to a  Java application server such as
 Tomcat or Resin; has been tested on Solaris,  Mac, Linux, and Windows
 operating systems.
 * Easy to configure: Can create indexes on  any XML element or attribute;
 entire presentation layer is customizable via  XSLT.
 * Robust:  Optimized to perform well on large documents (e.g., a single text
 that exceeds 10MB of encoded text); scales to perform well on collections of
 millions of documents; provides full Unicode support.
 * Extensible:  
 * Works  well with a variety of authentication systems (e.g., IP address
 lists, LDAP,  Shibboleth).
 * Provides  an interface for external data lookups to support
 thesaurus-based term  expansion, recommender systems, etc.
 * Can  power other digital library services (e.g., XTF contains an OAI-PMH
 data  provider that allows others to harvest metadata, and an SRU interface
 that  exposes searches to federated search engines).
 * Can be deployed as separate, modular pieces of a  third-party system
 (e.g., the module that displays snippets of matching  text).
 * Powerful for  the end user:
 * Spell  checking of queries
 * Faceted  displays for browsing
 * Dynamically  updated browse lists
 * Session-based  bookbags
 These basic features can be tuned and modified.  For instance, the same
 bookbag feature that allows users to store links to entire books, can also
 store links to citable elements of an object, such as a note or other
 reference.
  
 A sampling of XTF-based applications both within and outside of the CDL
 include:
  
 * Mark Twain Project Online http://www.marktwainproject.org/
 (http://www.marktwainproject.org), developed by the Mark Twain Papers
 Project,  the CDL and the University of California  Press.
 * Calisphere http://www.calisphere.org/  (http://www.calisphere.org/),  a
 curated collection of primary sources keyed to the curriculum standards of
 California¹s  K-12 community, developed by the CDL.
 * The Encyclopedia of  Chicago 

Re: [CODE4LIB] place for code examples?

2008-05-14 Thread Andrew Darby
Out of curiosity, as an infrequent wikier, why the strong preference
for dokuwiki?

On Mon, May 12, 2008 at 10:30 AM, Jonathan Rochkind [EMAIL PROTECTED] wrote:
 I would still personally really like a dokuwiki hosted at code4lib.org.
 I am not happy with the mediawiki. If anyone is interested in setting up
 and admining a dokuwiki, I would support that.

 Jonathan

 Ryan Eby wrote:

 Just catching up now on Code4Lib emails. I threw this idea around
 awhile back in #code4lib and it seemed like most thought people used
 their own blog for snippets and code hosts for real projects and it
 wasn't worth doing. I was thinking wiki for code snippets with maybe
 svn for various small libraries / hacks. At the time I was looking at
 just using trac so the two would be together.

 If there's enough movement for just the snippets parts then I agree
 with others that dokuwiki or drupal category would probably be useful.

 Eby

 On Fri, Mar 28, 2008 at 5:07 PM, Keith Jenkins [EMAIL PROTECTED] wrote:

 Does there already exist some place to put some code examples to share
 with the code4lib community?  (I'm thinking of snippets somewhere on
 the order of 10-100 lines, like the definition of a php function.)

 Keith





 --
 Jonathan Rochkind
 Digital Services Software Engineer
 The Sheridan Libraries
 Johns Hopkins University
 410.516.8886
 rochkind (at) jhu.edu




--
Andrew Darby
Web Services Librarian
Ithaca College Library
http://www.ithaca.edu/library/
[EMAIL PROTECTED]


Re: [CODE4LIB] place for code examples?

2008-05-14 Thread Jonathan Rochkind

1. It is set up for including code samples in a way that MediaWiki is not.
2. It allows one wiki installation to include various sub-sections which
are kept seperarate, in seperate namespaces. We have several different
independent projects on our MediaWiki, but MediaWiki mixes them all
together.
3. In general, it was written for documentation and code content, rather
than for encyclopedia content.  Code4Lib needs tilt more toward the former.

Jonathan

Andrew Darby wrote:

Out of curiosity, as an infrequent wikier, why the strong preference
for dokuwiki?

On Mon, May 12, 2008 at 10:30 AM, Jonathan Rochkind [EMAIL PROTECTED] wrote:


I would still personally really like a dokuwiki hosted at code4lib.org.
I am not happy with the mediawiki. If anyone is interested in setting up
and admining a dokuwiki, I would support that.

Jonathan

Ryan Eby wrote:


Just catching up now on Code4Lib emails. I threw this idea around
awhile back in #code4lib and it seemed like most thought people used
their own blog for snippets and code hosts for real projects and it
wasn't worth doing. I was thinking wiki for code snippets with maybe
svn for various small libraries / hacks. At the time I was looking at
just using trac so the two would be together.

If there's enough movement for just the snippets parts then I agree
with others that dokuwiki or drupal category would probably be useful.

Eby

On Fri, Mar 28, 2008 at 5:07 PM, Keith Jenkins [EMAIL PROTECTED] wrote:



Does there already exist some place to put some code examples to share
with the code4lib community?  (I'm thinking of snippets somewhere on
the order of 10-100 lines, like the definition of a php function.)

Keith






--
Jonathan Rochkind
Digital Services Software Engineer
The Sheridan Libraries
Johns Hopkins University
410.516.8886
rochkind (at) jhu.edu






--
Andrew Darby
Web Services Librarian
Ithaca College Library
http://www.ithaca.edu/library/
[EMAIL PROTECTED]




--
Jonathan Rochkind
Digital Services Software Engineer
The Sheridan Libraries
Johns Hopkins University
410.516.8886
rochkind (at) jhu.edu


[CODE4LIB] Job Posting - DSpace developer

2008-05-14 Thread Randy Stern

As you may know, Harvard's faculty in the Faculty of Arts and Sciences has
recently voted to provide open access to scholarly articles created by
faculty. This job posting is in support of that goal. If interested, please
contact Randy Stern.

Digital Library Software Engineer
Harvard University Library
Office for Information Systems
Grade 57, One Year Term

Duties  Responsibilities:

Reporting to the Manager of Systems Development in the Office for
Information Systems, serves as the lead developer for software applications
and tools for the implementation of a new institutional repository within
the Harvard University Library. Responsibilities include the configuration,
customization, and on-going support of a DSpace instance at Harvard, as
well as the development, maintenance, and integration of institutional
repository software tools to create an extremely user friendly deposit and
repository management and reporting process. May also integrate
authentication processes and other DSpace modules with Harvard systems.

This position includes customizing the user interface of DSpace, utilizing
XML, XSL, and JSP technologies.

This position requires the ability to grasp a high level view of
requirements from discussions with stakeholders, recommend solutions, and
iteratively translate that into specifications, prototypes, and working
code with accompanying documentation.

Requirements: BA/BS in computer science with a minimum of 4 years
development experience in java. Ability to produce results and work
independently with general guidance in an environment in which requirements
evolve over time. Strong interpersonal, verbal and written communication
skills. Experience designing, developing, deploying, and managing both
stand alone and Internet applications utilizing Unix and World Wide Web
technologies. Experience with java stand alone and web applications, SQL,
JDBC, XML, XSL, HTML.

Desirable: Experience with IT and library systems in a higher education
environment; experience with Open Source software; familiarity with library
metadata standards such as Dublin Core, METS, MODS, and the OAI protocol;
knowledge of associated digital storage formats and conversion principles,
procedures, and operations; strong understanding of information
organization and retrieval technologies used to organize, store, and access
digital content; experience with programming best-practices, including
test-drive development and design patterns. Hands-on experience with
DSpace, Perl, CVS, Eclipse, Struts, Tomcat, Cocoon, Maven, Ant a plus.





Randy Stern
Manager of Systems Development
Harvard University Library Office for Information Systems
90 Mount Auburn Street
Cambridge, MA 02138
Tel. +1 (617) 495-3724
Email [EMAIL PROTECTED]


Re: [CODE4LIB] place for code examples?

2008-05-14 Thread Nate Vack
On Fri, Mar 28, 2008 at 4:07 PM, Keith Jenkins [EMAIL PROTECTED] wrote:

 Does there already exist some place to put some code examples to share
 with the code4lib community?  (I'm thinking of snippets somewhere on
 the order of 10-100 lines, like the definition of a php function.)

At the risk of sounding really stupid amidst the discussion of wikis
and repositories... why not pastie? Sure, it's not a 'forever'
solution, but will your code snippet really be interesting in 5 years?

Curious,
-Nate


Re: [CODE4LIB] Q.: deep-linking syntax for Encore?

2008-05-14 Thread Kyle Banerjee
 Is anybody using III's Encore system and happens to know if there is a
 deep-linking syntax, either documented or inferred, for it?

I'm not actually using this, but you can reference an individual
record using the syntax
http://encore.scottsdaleaz.gov/iii/encore/record/C|Rb1425309

where b1425309 is the record number. This will give you a nice looking
public display. If you need anything other than a formatted record
display based on a simple record number or keyword search, you are
much better off working in the old catalog (which all Encore libraries
still have) -- it has significantly more search and display options.

As far as I can tell, Encore allows keyword searching only, and there
is no way to pull a record in MARC view, to search by ISBN/ISSN or do
a number of other tasks. That's probably why clicking on the Advanced
search button in an Encore system or on the Find out more in the
individual record display takes you to the old system.

kyle


Re: [CODE4LIB] Life after Expect

2008-05-14 Thread Ken Irwin

Is their any hope for those of us who rely on our Expect-monkeys in III?
My most important Expect scripts use the create-list function, and I
hope that'll stay around for a while. But I'm sure they'll eventually go
away too.

Has III shown any interest in building in their own macros/automation
features to do the sorts of tasks for which we rely on Expect?

Ken

Kyle Banerjee wrote:

Last week, III announced that they are removing a number of
circulation functions from the telnet menus in a software update that
became generally available this month. From what I've been able to
surmise, functions that will be removed include placing holds and
checking things in or or out. Removing these menu options will break
scripts that have been in use for years at institutions in our
consortium, and lots more staff time will be required to perform
certain tasks after some systems are upgraded.

Apparently, III recently discovered that a bug involving holds was
caused by the character-based system, but it is also related to a
desire to port everything to Millennium. Based on the reasoning behind
the announcement, future updates are likely result in other mission
critical scripts breaking as other character-based functionality is
deprecated.

Just a reminder of the risks of relying on automation that depend on
interfaces that are losing vendor support.

kyle



--
Ken Irwin
Reference Librarian
Thomas Library, Wittenberg University