Re: [CODE4LIB] RFC 5005 ATOM extension and OAI
Ed Summers wrote: Thanks for posting this Jakob. I was just reading RFC 5005 on the train yesterday (literally) and the parallels between it and OAI-PMH struck me as well. It's not quite clear to me how deleted records would be handled with an atom archive feed. But I guess one could assume if the identifier is no longer present it has been deleted it. But that would require pulling the entire archive... I'm not really sure how much deletes are really used in OAI-PMH repositories anyhow. OAI-PMH 1.1 was not clear enough on deletions but in 2.0 the specification contains an example. I think the missing support of deletions in data providers has to do with the missing explicit support in service providers and vice versa (henn-and-egg-problem). Stuart Weibel has written [1] about the subject of blog archiving in the past. And I remember hearing Jon Udell and Dan Chudnov talk about it [2]. Who knows what technorati, bloglines and googlereader are doing in this area. I guess the reality is that blogs are on the web and as such will be archived by InternetArchive [3]. But perhaps that doesn't really fit quite right? That's my feeling. Thanks. BlogML was new to me - sounds interesting but looks very shaggy and over-engineered - you do not even get the spec in HTML but have to download an archive that contains tons of nasty .NET files and an XML schema instead of a textual description with examples and discussion. I copied the XML schema here: http://www.gbv.de/wikis/cls/BlogML. I think extending ATOM is the better way. I think your general point is correct. Libraries need to be integrating themselves into the web these days rather than expecting the web to integrate into them. I doubt that archiving weblogs is that complicated [1]! You need a harvester (partly implemented in many Feed-Reader), an archive (you could start with just saving validated ATOM-Files), an index (Solr?) and a reader (also already implemented in many Feed-Readers). I bet you don't need more then a medium size project with one or two developers and one or two years to create sustainable tools for basic weblog archiving. Such a project could be done by any larger library or archive that is able to get funding. It's not a lack of resources, it's a lack of visions. Oh, and would it be alright to add your blog to http://planet.code4lib.org -- we need more of an international presence on there IMHO. The subfeed http://jakoblog.de/category/en/feed/atom/ contains all English language postings which are probably of higher relevance. Jakob [1] Ok, real long-term preservatation *is* complicated but if you only archive well-formed XML that conforms to a given schema (ATOM, HTML) you should be in a good position for the next decades. -- Jakob Voß [EMAIL PROTECTED], skype: nichtich Verbundzentrale des GBV (VZG) / Common Library Network Platz der Goettinger Sieben 1, 37073 Göttingen, Germany +49 (0)551 39-10242, http://www.gbv.de
Re: [CODE4LIB] RFC 5005 ATOM extension and OAI
On 10/22/07, Jakob Voss [EMAIL PROTECTED] wrote: I doubt that archiving weblogs is that complicated [1]! You need a harvester (partly implemented in many Feed-Reader), an archive (you could start with just saving validated ATOM-Files), an index (Solr?) and a reader (also already implemented in many Feed-Readers). I bet you don't need more then a medium size project with one or two developers and one or two years to create sustainable tools for basic weblog archiving. Such a project could be done by any larger library or archive that is able to get funding. It's not a lack of resources, it's a lack of visions. I completely agree. When developing software it's really important to focus on the cleanest/clearest solution, rather than getting bogged down in edge cases and the comments from nay sayers. I hope that my response didn't come across that way. A couple follow on questions for you: In your vision for this software are you expecting that content providers would have to implement RFC 5005 for your archiving system to work? Are you considering archiving media files associated with a blog entry (images, sound, video, etc?). //Ed
[CODE4LIB] Position Opening: Entry Web Developer, Touro College Libraries, New York, NY
TOURO COLLEGE LIBRARIES Technical Electronic Services 33 West 23rd Street New York, NY 10010 Position Opening (available immediately) *** Entry Level Web Developer (FT) *** No library experience necessary. Reporting to the Systems Librarian, the Web Developer will work with the Systems Librarian and other staff to design, implement, and maintain the electronic services managed by Technical Electronic Services (TES). Our department provides expert cataloging and web-based applications and content to Touro Libraries and their users worldwide. We are a quiet office of nine professionals and clerical staff. Although we work independently on specialized tasks most of the time, collaboration at key points in our projects is critical. Given the complexity and specialized nature of library services and their supporting technology, the Web Developer will have a particular need to maintain close consultation with colleagues within and beyond TES. RESPONSIBILITIES - Participate in a redesign of the Libraries' website, including development and maintenance of public areas, as well as intranet services to optimize workflow and staff communication within the Libraries. - Develop and maintain an Electronic Resource Management System (complex database-driven web application integrated with EZproxy servers http://www.usefulutilities.com/ for access management and administration of licensed e-content. - Implement and maintain an OAIS-minded digital repository (e.g. Fedora Commons http://www.fedora-commons.org/ to archive and provide access to the Libraries' growing digital collections. - Maintain the web-based front end (catalog) of the Libraries' Innovative Interfaces Integrated Library System http://www.iii.com/. - Ensure ongoing access to licensed e-content services. Perform regular updates to the Libraries' Serials Solutions e-content discovery system. REQUIREMENTS FOR CONSIDERATION == - Strong knowledge of current (X)HTML and CSS standards and accepted standards-based web design practices. Knowledge of XML standards. - Knowledge of scripting languages such as JavaScript, PHP, ASP, Perl. - Proven ability to develop database-driven websites (i.e. WAMP/LAMP), taking into account appropriate security considerations. - Ability to acquire a working knowledge of new languages and environments quickly and independently. EDUCATION AND EXPERIENCE - BA/BS degree in Computer Science or equivalent experience. - Demonstrated design of database-driven websites which successfully meet the needs of clients or users. Clients or users is freely interpretable but is understood to refer to parties other than the applicant. = Applications will be accepted until the position is filled. For immediate consideration, please submit your resume via e-mail, postal mail, or fax. No calls, please. Yitzchak Schaffer, MLS Systems Librarian Touro College Libraries Technical Electronic Services 33 West 23rd Street New York, NY 10010 Fax (212) 627-3197 [EMAIL PROTECTED] Touro College does not currently sponsor graduated foreign students to obtain H status. Touro College is an Equal Opportunity Employer.
[CODE4LIB] [Position Announcement] Tri-Colleges Web Developer
The Tri-College Library Consortium of Bryn Mawr, Haverford, and Swarthmore Colleges seeks a talented web developer to help expand its digital initiatives. The ideal candidate has strong interpersonal skills, proficiency with the LAMP stack (Linux, Apache, MySQL, Perl/PHP), and a keen interest in working in a highly-collaborative academic environment. The successful candidate will play a central role in designing, integrating, and improving services in support of library users. The Consortium's digital initiatives include the library catalog, image databases, institutional repository software, blogs, wikis, and other applications. TO APPLY: http://trilogy.brynmawr.edu/tricojob.html Adam Brin _ Tri-Colleges Systems Coordinator Bryn Mawr | Haverford | Swarthmore 610.526.5294
[CODE4LIB] Job Opening: Digital Repository Developer: Northwestern University Library, Evanston, IL USA
(This job posting is being sent to multiple lists; please excuse any duplication.) Senior Information Systems Architecture Engineer (Digital Repository Developer) 12 month grant funded position Interested applicants need to apply online via Northwestern's eRecruit web site, referencing Job Opening ID #12352, at: http://www.northwestern.edu/hr/jobs/ The Northwestern University Library is engaged in a number of data repository and data management initiatives at a regional and national. The Repository Analyst/Engineer works in a highly collaborative environment with library colleagues and external partners to adapt the Library's Kirtas book scanning production from a simple reformatting operation to a dynamic program that will enable the library to make fully searchable online any object in a book format. In addition, the project addresses the need for the creation of a Fedora-based book interface tool that could potentially be of benefit to dozens of research libraries currently developing digital repositories based on Fedora systems architecture. NOTE: This is a twelve month, grant funded position from Fall 2007 through Fall 2008. PRINCIPLE ACCOUNTABILITIES: - Designing, programming, testing and deploying applications and system interfaces using applicable programming languages in order to automate ingestion tool digital objects and metadata into the repository; - Applying system analysis techniques and procedures to determine which software and system functional and technical specifications will be needed for the online book components; - Evaluating potential commercial and open-source solutions for integration with the repository; - Consulting with other Fedora implementation sites about emerging approaches to delivering complex digital objects; - Defining and implementing programmatic interfaces with other systems in support of the repository; - Designing and testing user interfaces; - Performing other related duties as required or assigned. MINIMUM QUALIFICATIONS: - A bachelor's degree in Computer Science, Information Science, or a related field or the equivalent combination of education and experience from which comparable knowledge and abilities can be acquired; - At least 4 years of significant development experience in an object oriented development environment including experience with Java and Web Services programming in a service-oriented architecture; - Knowledge of Java, Java Applications, PHP, SQL, JavaScript, CSS, XML, XPATH and XSLT; - Demonstrated experience in at least two of the following areas: Information exchange protocols such as OAI-PMH, SRU, and OpenURL, Metadata standards, preferably METS, MODS, MARC, Dublin Core, PREMIS, and MIX)Encoding formats such as TEI and EAD and Programmatic image processing using TIFF or JPEG2000; - Strong analytical and problem solving skills; - Excellent verbal and written communication and interpersonal skills and the ability to work successfully in a collaborative environment; - Demonstrated ability to handle multiple priorities and deadlines; - Creativity in problem solving to independently resolve technical issues as well as extrapolate from one situation to another; - Demonstrated experience with Unix/Linux server platforms, related software, and basic system administration utilities. PREFERRED QUALIFICATIONS: - A master's degree in Computer Science, Information Science or the equivalent combination of education and experience from which comparable knowledge and abilities can be acquired; - Experience with IT in a higher education environment; - Experience with repository software (such as DSpace, Greenstone, or Fedora). ENVIRONMENT: Northwestern University, Evanston, Illinois, is a major private university with an extensive library system. This includes the Northwestern University Library, three branch libraries on the Evanston Campus, and three libraries on the Chicago Campus. Northwestern University Library is a member of the Association of Research Libraries (ARL) and participates in programs of the Committee on Institutional Cooperation (CIC). The holdings of the libraries total over 4 million volumes. See more information about the Library at: http://www.library.northwestern.edu. TO APPLY: Interested applicants need to apply online via Northwestern's eRecruit web site, referencing Job Opening ID #12352, at: http://www.northwestern.edu/hr/jobs/ Northwestern University is an Equal Opportunity/Affirmative Action Employer. Employment eligibility verification required upon hire. -- Stu Baker Head, Digital Library Systems Development @ Northwestern University 1970 Campus Drive | Room 1177 | Evanston, IL | 60208-2300 847.467.1984 mailto:[EMAIL PROTECTED]