Re: [CODE4LIB] RFC 5005 ATOM extension and OAI

2007-10-22 Thread Jakob Voss

Ed Summers wrote:


Thanks for posting this Jakob. I was just reading RFC 5005 on the
train yesterday (literally) and the parallels between it and OAI-PMH
struck me as well. It's not quite clear to me how deleted records
would be handled with an atom archive feed. But I guess one could
assume if the identifier is no longer present it has been deleted it.
But that would require pulling the entire archive... I'm not really
sure how much deletes are really used in OAI-PMH repositories anyhow.


OAI-PMH 1.1 was not clear enough on deletions but in 2.0 the
specification contains an example. I think the missing support of
deletions in data providers has to do with the missing explicit support
in service providers and vice versa (henn-and-egg-problem).


Stuart Weibel has written [1] about the subject of blog archiving in
the past. And I remember hearing Jon Udell and Dan Chudnov talk about
it [2]. Who knows what technorati, bloglines and googlereader are
doing in this area. I guess the reality is that blogs are on the web
and as such will be archived by InternetArchive [3]. But perhaps that
doesn't really fit quite right? That's my feeling.


Thanks. BlogML was new to me - sounds interesting but looks very shaggy
and over-engineered - you do not even get the spec in HTML but have to
download an archive that contains tons of nasty .NET files and an XML
schema instead of a textual description with examples and discussion. I
copied the XML schema here: http://www.gbv.de/wikis/cls/BlogML. I think
extending ATOM is the better way.


I think your general point is correct. Libraries need to be
integrating themselves into the web these days rather than expecting
the web to integrate into them.


I doubt that archiving weblogs is that complicated [1]! You need a
harvester (partly implemented in many Feed-Reader), an archive (you
could start with just saving validated ATOM-Files), an index (Solr?) and
a reader (also already implemented in many Feed-Readers). I bet you
don't need more then a medium size project with one or two developers
and one or two years to create sustainable tools for basic weblog
archiving. Such a project could be done by any larger library or archive
that is able to get funding. It's not a lack of resources, it's a lack
of visions.


Oh, and would it be alright to add your blog to
http://planet.code4lib.org -- we need more of an international
presence on there IMHO.


The subfeed http://jakoblog.de/category/en/feed/atom/ contains all
English language postings which are probably of higher relevance.

Jakob

[1] Ok, real long-term preservatation *is* complicated but if you only
archive well-formed XML that conforms to a given schema (ATOM, HTML) you
should be in a good position for the next decades.

--
Jakob Voß [EMAIL PROTECTED], skype: nichtich
Verbundzentrale des GBV (VZG) / Common Library Network
Platz der Goettinger Sieben 1, 37073 Göttingen, Germany
+49 (0)551 39-10242, http://www.gbv.de


Re: [CODE4LIB] RFC 5005 ATOM extension and OAI

2007-10-22 Thread Ed Summers
On 10/22/07, Jakob Voss [EMAIL PROTECTED] wrote:
 I doubt that archiving weblogs is that complicated [1]! You need a
 harvester (partly implemented in many Feed-Reader), an archive (you
 could start with just saving validated ATOM-Files), an index (Solr?) and
 a reader (also already implemented in many Feed-Readers). I bet you
 don't need more then a medium size project with one or two developers
 and one or two years to create sustainable tools for basic weblog
 archiving. Such a project could be done by any larger library or archive
 that is able to get funding. It's not a lack of resources, it's a lack
 of visions.

I completely agree.  When developing software it's really important to
focus on the cleanest/clearest solution, rather than getting bogged
down in edge cases and the comments from nay sayers. I hope that my
response didn't come across that way.

A couple follow on questions for you:

In your vision for this software are you expecting that content
providers would have to implement RFC 5005 for your archiving system
to work?

Are you considering archiving media files associated with a blog entry
(images, sound, video, etc?).

//Ed


[CODE4LIB] Position Opening: Entry Web Developer, Touro College Libraries, New York, NY

2007-10-22 Thread Yitzchak Schaffer
TOURO COLLEGE LIBRARIES
Technical  Electronic Services
33 West 23rd Street
New York, NY  10010

Position Opening (available immediately)

*** Entry Level Web Developer (FT) ***

No library experience necessary.  Reporting to the Systems Librarian, the
Web Developer will work with the Systems Librarian and other staff to
design, implement, and maintain the electronic services managed by Technical
 Electronic Services (TES).  Our department provides expert cataloging and
web-based applications and content to Touro Libraries and their users
worldwide.  We are a quiet office of nine professionals and clerical staff.
Although we work independently on specialized tasks most of the time,
collaboration at key points in our projects is critical.  Given the
complexity and specialized nature of library services and their supporting
technology, the Web Developer will have a particular need to maintain close
consultation with colleagues within and beyond TES.


RESPONSIBILITIES


- Participate in a redesign of the Libraries' website, including development
and maintenance of public areas, as well as intranet services to optimize
workflow and staff communication within the Libraries.

- Develop and maintain an Electronic Resource Management System (complex
database-driven web application integrated with EZproxy servers
http://www.usefulutilities.com/ for access management and administration
of licensed e-content.

- Implement and maintain an OAIS-minded digital repository (e.g. Fedora
Commons http://www.fedora-commons.org/ to archive and provide access to
the Libraries' growing digital collections.

- Maintain the web-based front end (catalog) of the Libraries' Innovative
Interfaces Integrated Library System http://www.iii.com/.

- Ensure ongoing access to licensed e-content services.  Perform regular
updates to the Libraries' Serials Solutions e-content discovery system.


REQUIREMENTS FOR CONSIDERATION
==

- Strong knowledge of current (X)HTML and CSS standards and accepted
standards-based web design practices.  Knowledge of XML standards.

- Knowledge of scripting languages such as JavaScript, PHP, ASP, Perl.

- Proven ability to develop database-driven websites (i.e. WAMP/LAMP),
taking into account appropriate security considerations.

- Ability to acquire a working knowledge of new languages and environments
quickly and independently.


EDUCATION AND EXPERIENCE


- BA/BS degree in Computer Science or equivalent experience.

- Demonstrated design of database-driven websites which successfully meet
the needs of clients or users.  Clients or users is freely interpretable
but is understood to refer to parties other than the applicant.

=

Applications will be accepted until the position is filled.  For immediate
consideration, please submit your resume via e-mail, postal mail, or fax.
No calls, please.

Yitzchak Schaffer, MLS
Systems Librarian
Touro College Libraries
Technical  Electronic Services
33 West 23rd Street
New York, NY  10010
Fax (212) 627-3197
[EMAIL PROTECTED]

Touro College does not currently sponsor graduated foreign students to
obtain H status.
Touro College is an Equal Opportunity Employer.


[CODE4LIB] [Position Announcement] Tri-Colleges Web Developer

2007-10-22 Thread Adam Brin
The Tri-College Library Consortium of Bryn Mawr, Haverford, and Swarthmore
Colleges seeks a talented web developer to help expand its digital
initiatives. The ideal candidate has strong interpersonal skills,
proficiency with the LAMP stack (Linux, Apache, MySQL, Perl/PHP), and a keen
interest in working in a highly-collaborative academic environment. The
successful candidate will play a central role in designing, integrating, and
improving services in support of library users. The Consortium's digital
initiatives include the library catalog, image databases, institutional
repository software, blogs, wikis, and other applications.

TO APPLY: http://trilogy.brynmawr.edu/tricojob.html



Adam Brin
_
Tri-Colleges Systems Coordinator
Bryn Mawr | Haverford | Swarthmore
610.526.5294


[CODE4LIB] Job Opening: Digital Repository Developer: Northwestern University Library, Evanston, IL USA

2007-10-22 Thread Stu Baker

(This job posting is being sent to multiple lists; please excuse any
duplication.)

Senior Information Systems Architecture Engineer
(Digital Repository Developer)
12 month grant funded position

Interested applicants need to apply online via Northwestern's
eRecruit web site, referencing Job Opening ID #12352, at:
http://www.northwestern.edu/hr/jobs/

The Northwestern University Library is engaged in a number of data
repository and data management initiatives at a regional and
national.  The Repository Analyst/Engineer works in a highly
collaborative environment with library colleagues and external
partners to adapt the Library's Kirtas book scanning production from
a simple reformatting operation to a dynamic program that will enable
the library to make fully searchable online any object in a book
format.  In addition, the project addresses the need for the creation
of a Fedora-based book interface tool that could potentially be of
benefit to dozens of research libraries currently developing digital
repositories based on Fedora systems architecture.

NOTE: This is a twelve month, grant funded position from Fall 2007
through Fall 2008.

PRINCIPLE ACCOUNTABILITIES:
- Designing, programming, testing and deploying applications and
system interfaces using applicable programming languages in order to
automate ingestion tool digital objects and metadata into the
repository;

- Applying system analysis techniques and procedures to determine
which software and system functional and technical specifications
will be needed for the online book components;

- Evaluating potential commercial and open-source solutions for
integration with the repository;

- Consulting with other Fedora implementation sites about emerging
approaches to delivering complex digital objects;

- Defining and implementing programmatic interfaces with other
systems in support of the repository;

- Designing and testing user interfaces;

- Performing other related duties as required or assigned.

MINIMUM QUALIFICATIONS:
- A bachelor's degree in Computer Science, Information Science, or a
related field or the equivalent combination of education and
experience from which comparable knowledge and abilities can be
acquired;

- At least 4 years of significant development experience in an object
oriented development environment including experience with Java and
Web Services programming in a service-oriented architecture;

- Knowledge of Java, Java Applications, PHP, SQL, JavaScript, CSS,
XML, XPATH and XSLT;

- Demonstrated experience in at least two of the following areas:
Information exchange protocols such as OAI-PMH, SRU, and OpenURL,
Metadata standards, preferably METS, MODS, MARC, Dublin Core, PREMIS,
and MIX)Encoding formats such as TEI and EAD and Programmatic image
processing using TIFF or JPEG2000;

- Strong analytical and problem solving skills;

- Excellent verbal and written communication and interpersonal skills
and the ability to work successfully in a collaborative environment;

- Demonstrated ability to handle multiple priorities and deadlines;

- Creativity in problem solving to independently resolve technical
issues as well as extrapolate from one situation to another;

- Demonstrated experience with Unix/Linux server platforms, related
software, and basic system administration utilities.

PREFERRED QUALIFICATIONS:
- A master's degree in Computer Science, Information Science or the
equivalent combination of education and experience from which
comparable knowledge and abilities can be acquired;

- Experience with IT in a higher education environment;

- Experience with repository software (such as DSpace, Greenstone, or Fedora).

ENVIRONMENT:
Northwestern University, Evanston, Illinois, is a major private
university with an extensive library system. This includes the
Northwestern University Library, three branch libraries on the
Evanston Campus, and three libraries on the Chicago Campus.
Northwestern University Library is a member of the Association of
Research Libraries (ARL) and participates in programs of the
Committee on Institutional Cooperation (CIC). The holdings of the
libraries total over 4 million volumes. See more information about
the Library at: http://www.library.northwestern.edu.

TO APPLY:
Interested applicants need to apply online via Northwestern's
eRecruit web site, referencing Job Opening ID #12352, at:
http://www.northwestern.edu/hr/jobs/

Northwestern University is an Equal Opportunity/Affirmative Action
Employer.  Employment eligibility verification required upon hire.


--

Stu Baker
Head, Digital Library Systems Development @ Northwestern University
1970 Campus Drive | Room 1177 | Evanston, IL  | 60208-2300
847.467.1984
mailto:[EMAIL PROTECTED]