Re: [CODE4LIB] Best way to process large XML files
On 09.06.2012 00:00, Kyle Banerjee wrote: Since you mentioned SimpleXML, Kyle, I assume you're using PHP? Actually I'm using perl. For reasons not related to XML parsing, it is the preferred (but not mandatory) language. Based on a few tests and manual inspection, it looks like the ticket for me is going have a two stage process where the first stage converts the file to valid XML and the second cuts through it with SAX. Originally, I was trying to avoid SAX, but the process has been prettier than expected so far. The XML has not been prettier than expected -- it contains a number of issues including outright invalid XML, invalid characters, and hand coded HTML within some elements (i.e. string data not encoded as such). Gotta love library data. But screwed up stuff is employment security. If things actually worked, I'd be redundant... kyle Since you're using perl I think you mean XML::Simple which is a DOM-parser. You also mentioned LibXML and are considering SAX-parsing so I assume you've only used DOM-parsing then? How about using an XML reader, kind of like SAX but a whole lot cleaner and easier - something like: use XML::LibXML::Reader; my $reader = XML::LibXML::Reader-new(location = $filename_or_uri); while ( $reader-read ) { next unless $reader-name eq 'record' $reader-nodeType eq XML_READER_TYPE_ELEMENT; my $dom = XML::LibXML-load_xml( string = $reader-readOuterXml ); ...do something with the record element's dom-tree... } Documentation [https://metacpan.org/module/XML::LibXML::Reader] HTH -- Teemu Nuutinen, Digital Services, Helsinki University Library
[CODE4LIB] A bibliography in XMP inside PDF
Hi all, I have been wondering whether it is possible to put a bibliography (list of references in a journal article, for example) in an XMP packet in a PDF document. Having read about efforts to get the references from a document by parsing text, I thought it might make sense (and be possible) to put a machine-readable version in the document. I haven't found anyone who had done this (yet), so I started wondering why. Is it too obvious to write about? Or is it impossible? Eventually I'd like to have it automatically created at the same time BibTeX creates the 'normal' bibliography for my pdfLaTeX document. And if the PDF/A standard allows it, have it in such a way that it complies to the long-term archival PDF standard. It appears, however, that PDF/A-1 allows the XMP description of the document itself to have only one author [1], which makes me doubt the use of PDF/A in general. If I manage to get it working, could it be a topic of interest for the Code4Lib Journal? Regards, Ben [1] www.pdfa.org/wp-content/uploads/2011/08/tn0003_metadata_in_pdfa-1_2008-03-182.pdf
[CODE4LIB] OR2012 Reminder - Early Registration Extened to June 18
Dear Colleagues, We are counting down to the Seventh International Conference on Open Repositories (OR2012) on 9-13 July, 2012 and are delighted to announce that we are able to extend the Early Booking rate until Monday 18 June. The Early Booking rate is £295 with a full-price rate of £350 thereafter. There is also a day rate of £115. As a reminder delegates can register for OR2012 via the conference electronic payment system (http://or2012.ed.ac.uk/registration/). The draft programme is available at: https://www.conftool.net/or2012/sessions.php. This is provisional and, as such, timings may change. Further details including Pecha Kucha and Repository Fringe sessions, closing keynote, session chairs etc will also be incorporated in the coming days. Please check the website for updates. Delegates can also register for the workshops on Monday and Tuesday at www.or2012.eventbrite.com. Please note that the workshops are free but you will need the EDN number from your receipt from the conference electronic payment system to register for the workshops. Accommodation is booked separately - http://or2012.ed.ac.uk/delegates/accommodation/. Additional information about the conference is provided on the OR2012 website - http://or2012.ed.ac.uk/. We look forward to welcoming you to Edinburgh for OR2012. Best, William Nixon and Stuart Macdonald On behalf of OR2012 Programme Committee === Dr John B Howard, University Librarian and Adjunct Professor, UCD School of Computer Science and Informatics UCD James Joyce Library University College Dublin Belfield Dublin 4 Ireland An Dr John B Howard, Leabharlannaí Ollscoile, agus Ollamh Adjunct, Scoil na Ríomheolaíochta agus na Faisnéisíochta UCD Leabharlann James Joyce UCD, An Coláiste Ollscoile, Baile Átha Cliath, Belfield, Baile Átha Cliath 4, Éire t: +353 1 716 7067 f:+353 1 283 7667 john.b.how...@ucd.ie http://www.ucd.ie/research/people/library/drjohnhoward/
[CODE4LIB] Fwd: [Air-L] Berkman Jobs: Project Manager, Lead Platform Manager, Senior Systems Architect, and Junior Web Developer
-- Forwarded message -- From: Rebecca Tabasky rtaba...@cyber.law.harvard.edu Date: Thu, Jun 7, 2012 at 8:55 PM Subject: [Air-L] Berkman Jobs: Project Manager, Lead Platform Manager, Senior Systems Architect, and Junior Web Developer To: ai...@listserv.aoir.org Hi all, The Berkman Center for Internet Society at Harvard University has three new job listings for which we are now accepting applications. These technically-inclined leadership positions will provide the right candidates the chance to join Berkman's dynamic community in our collaborative exploration of some of the Internet's most interesting, challenging, and compelling issues, and the freedom and responsibility to help direct our cutting-edge initiatives. We are excited to launch our search for the following positions: ** Project Manager https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=880847PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=880847_5341gqid=0 * Senior Systems Architect https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=881344PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=881344_5341gqid=0 * Lead Platform Manager https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=881338PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=881338_5341gqid=0* In addition, we are still accepting applications for a: ** Junior Web Developer https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=865430PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=865430_5341gqid=0* Berkman's committed and tight-knit community http://cyber.law.harvard.edu/people encourages serious fun and creativity, supports deep inquiry, values novel approaches to solving problems, strives for transparency, continually builds upon best-practices and lessons learned, and is supportive of individual and collective goals. We look forward to growing our community with people who will thrive in and contribute to our unique, fulfilling environment. To learn more about the Berkman Center, please visit the Berkman site http://cyber.law.harvard.edu/. Short descriptions for these positions can be found below; full descriptions can be found in the Harvard Human Resources system. Applications must be submitted through the Harvard site. As ever, we are thankful for your help in sharing these opportunities with your networks - please forward this note and/or share through other networks (we've tweeted these opportunities https://twitter.com/berkmancenter/status/210780763141050368). If you are, or know, a strong candidate for one of the below positions, we most welcome you to reach out! All best, Becca --- *Project Manager https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=880847PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=880847_5341gqid=0:* The Center is looking to hire a Project Manager to drive forward a diverse portfolio of Berkman's research projects; support critical institutional initiatives; and provide high level, strategic management to project teams, focusing on systems/efficacy of the efforts. The project manager will have a broad skill set, including deft management abilities, project team coordination and experience with direct supervision and mentoring; extensive research and writing experience; adaptive workflow management; event planning and implementation; and outstanding writing and verbal skills. In addition, the project manager will have outstanding interpersonal skills; a proven capacity to work independently and as a team member; sound judgment; and exceptional ethical standards. For this role, we have a strong preference for technological interest and experience, ideally managing projects that rely on technical tools or platforms and experience with guiding technical development processes. For a full description of the responsibilities, qualifications, and application instructions, please visit the Harvard Human Resources Project Manager listing https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=880847PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=880847_5341gqid=0. https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=881344PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=881344_5341gqid=0*Senior Systems Architect https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=881344PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=881344_5341gqid=0* and *Lead Platform Manager https://sjobs.brassring.com/1033/asp/tg/cim_jobdetail.asp?jobId=881338PartnerId=25240SiteId=5341type=mailJobReqLang=1recordstart=1JobSiteId=5341JobSiteInfo=881338_5341gqid=0:* Berkman seeks leadership in the planning, design, and implementation of a novel approach that aims to increase the
Re: [CODE4LIB] code4lib Ottawa Meetup - June 13 @ 5PM, Exchange Pub
Just quick reminder that the code4lib Ottawa meetup is happening this Wednesday night at The Exchange Pub. More info is available on the wiki page: http://wiki.code4lib.org/index.php/North#Code4lib_North_Meetups_in_Ottawa If you haven't already sent me a message to RSVP, please do so soon so that I have a better idea of how many seats to reserve. Thanks! Warren Layton On Sun, Jun 3, 2012 at 2:06 PM, Warren A. Layton warren.lay...@gmail.com wrote: Hi all, After a month off, it's time for another code4lib Ottawa meetup! Date: Wednesday June 13th, 5pm Location: The Exchange Pub, 50 Rideau Street (entrance inside the Rideau Centre) The reservation is under Warren / code4lib and the reserved room is downstairs. We'll start with a short recap of the code4lib North unconference in Windsor followed by a few lightning talks (about 5 minutes each). Anyone who wants to demo what they're currently working on or talk about something interesting related to libraries and technology is encouraged to take the floor. An HDTV with an HDMI input is available if you want to show something on screen. We already have some volunteers: * Developers from the Ottawa Public Library will give us a quick preview of the API to their BiblioCommons catalogue. The API will be publicly available this fall. * William Wueppelmann will talk about what's going on at Canadiana.org and how they host and manage their huge digital collection and their efforts to achieve certification as a Trusted Digital Repository (TDR). * Mary Beth Baker will talk about the local tech scene in Ottawa and the potential for collaboration. Afterward, we'll have our usual social gathering. If possible, please send an RSVP to warren.lay...@gmail.com if you wish to attend and/or present a lightning talk. See you at the meetup! Cheers, Warren Layton
Re: [CODE4LIB] Best way to process large XML files
When I need to deal with huge XML files, I use Perl's XML::Parser in stream mode. It's blazing fast, but I have to admit, the code isn't very pretty. There's also XML::LibXML::SAXhttp://search.cpan.org/dist/XML-LibXML/lib/XML/LibXML/SAX.pod, but I can't seem to find any substantive documentation on how this works. (If anyone has any sample code that uses this, I'd love to see it. Please e-mail me off-list as I don't want to de-rail this thread.) Teemu's suggestion about XML::LibXML::Reader is definitely worth considering. I've never clocked it against XML::Parser, but it seems like it *should* be fast. And as Teemu demonstrated, it allows you to write nice compact code. Ron On Fri, Jun 8, 2012 at 2:36 PM, Kyle Banerjee baner...@orbiscascade.orgwrote: I'm working on a script that needs to be able to crosswalk at least a couple hundred XML files regularly, some of which are quite large. I've thought of a number of ways to go about this, but I wanted to bounce this off the list since I'm sure people here deal with this problem all the time. My goal is to make something that's easy to read/maintain without pegging the CPU and consuming too much memory. The performance and load I'm seeing from running the files through LibXML and SimpleXML on the large files is completely unacceptable. SAX is not out of the question, but I'm trying to avoid it if possible to keep the code more compact and easier to read. I'm tempted to streamedit out all line breaks since they occur in unpredictable places and put new ones at the end of each record into a temp file. Then I can read the temp file one line at a time and process using SimpleXML. That way, there's no need to load giant files into memory, create huge arrays, etc and the code would be easy enough for a 6th grader to follow. My proposed method doesn't sound very efficient to me, but it should consume predictable resources which don't increase with file size. How do you guys deal with large XML files? Thanks, kyle rantWhy the heck does the XML spec require a root element, particularly since large files usually consist of a large number of records/documents? This makes it absolutely impossible to process a file of any size without resorting to SAX or string parsing -- which takes away many of the advantages you'd normally have with an XML structure. /rant -- -- Kyle Banerjee Digital Services Program Manager Orbis Cascade Alliance baner...@uoregon.edubaner...@orbiscascade.org / 503.999.9787
[CODE4LIB] Job: Web Designer at British Museum
**Main purpose of Job**: To provide high quality digital design and to support the Museum's website and other digital projects. **Key Responsibilities:** * To develop new page, template and element designs, n line with New Visual Language and Online Style Guide as required * To provide design advice and guidance to the Web team * To integrate new content or information within the Museum's agreed information architecture * To work with the content commissioning team to develop high quality web design supporting the content needs, whether commissioned internally or from external suppliers * To source and prepare graphics, images, photographs, videos and other media to support content development * To maintain, develop and extend knowledge of current and emerging practice and technologies in web and new media, particularly in the museum and cultural sectors. * To ensure accessibility of the Museum web-site **What are the main work pressures and challenges?** * Working on a number of projects at any one time * Ensuring that all design and development work fits within the agreed information * architecture and design templates * Ensuring that accessibility needs and design needs are balanced effectively **What are the performance targets or standards?** * Producing accurate, effective design to agreed accessibility standards in a timely and cost effectivemanner * Effectiveness of working relationships with the web team, Museum stakeholders/clients andexternal suppliers **What changes are likely to occur in the next two years to this job?** * This is a 6 month fixed term post. **Who are the main contacts of the jobholder and what are the relationships to these contacts [eg managing, influencing, advising]?** * Working in equal partnership with other members of the web team. * Building good relationships and effective liaison with other Museum designers, * to ensure consistency of approach. * Influencing and advising the Head of Web, Web team developers. To apply please visit:[http://bit.ly/LNros4](http://bit.ly/LNros4) Brought to you by code4lib jobs: http://jobs.code4lib.org/job/998/
[CODE4LIB] Job: Agile Project Manager at AudioVisual Preservation Solutions
**Position Overview** AudioVisual Preservation Solutions (AVPS) seeks an experienced (mid-level) Agile Project Manager to provide essential support and facilitation to an open source software development project for the public media archival community. The position will begin on July 1, 2012 and continue through October 2013, with the possibility of extension. The project manager will both play a critical leadership role in the Agile development process as well act as primary liaison for clients and stakeholders. This position is full time, based at our office in New York City. No reimbursement for relocation costs will be provided. **Responsibilities** * Oversee the entire project, including overall project planning, project coordination and software development * Oversee Agile development of the application * Develop and document comprehensive project plans, timelines, milestones and deliverables * Manage the complete software development lifecycle * Lead the development and management of project requirements, system features, and user stories * Carefully track and coordinate project progress, ensuring the timely completion of deliverables * Continually prioritize and organize project goals in a way that is clearly accessible to all stakeholders * Manage and track project progress through web-based collaboration tools * Organize and facilitate regular project meetings, including iteration and release planning, daily stand-up meetings, demos, and reviews * Be the primary point of contact for all stakeholders, including clients, developers, stakeholders and AVPS team members. Answer questions, and field inquiries to appropriate team members as needed * Develop documentation and guidelines for software * Help train users of the application * Supervise hand off of application to product owners upon completion of contract * Travel to meetings as needed (10%) **Skills Experience** * At least three years in a project management role * Demonstrated experience with Agile software development coordination, using frameworks such as Scrum or Feature Driven Development (FDD) * Demonstrated leadership skills, with the ability to manage distributed, remote teams * Excellent verbal, written, presentation, and interpersonal communication skills * Extremely organized, responsive, and detail oriented * Experience managing projects with project tracking, issue tracking, and collaboration software such as JIRA and Confluence * Excellent MS Office skills on Mac and PC platforms, Google Docs, diagramming skills using a variety software such as OmniGraffle * Certified Scrum Master and/or PMP Certification a plus * Knowledge of library and information science, video and audio production, and/or public media a plus **About AVPS** AudioVisual Preservation Solutions (AVPS) is a full service audiovisual preservation and information management consulting firm serving the educational, broadcasting, government, non-profit, and corporate sectors. With a strong focus on professional standards and best practices, open communication, efficient workflows, and the innovative use and development of technological resources, AVPS brings a broad knowledge base and extensive experience to efficiently and effectively meeting the challenges faced in the preservation and access of digital content. **To Apply** Please submit resume and cover letter (including salary requirements if applicable) in PDF format to care...@avpreserve.com by June 22, 2012. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/1000/
[CODE4LIB] Job: Sound Archives Librarian at Stanford University
The Stanford University Libraries and Academic Information Resources (SULAIR) invite applications and nominations for the Sound Archives Librarian. This is an Academic Staff continuing appointment. Applicants should apply via: http://jobs.stanford.edu. Please supply a cover letter, a complete statement of qualifications, a full resume of including education and relevant experience, and the names and addresses of three references with knowledge of one's professional qualifications for this position. The person in this position has a wide variety of responsibilities for providing access to materials, assisting the public in use of the collection, transferring archival recordings to digital form for patron use, preserving the Archive's physical collections, and supervising staff and student assistants. The librarian will actively manage the collection of historical recordings acquired by the Archive. The position provides an opportunity for exploring and implementing creative solutions to providing access to large numbers of recordings and an equally creative approach to public service in one of the world's most important sound archives at a leading research institution. **Specific Responsibilities:** * Using existing and emerging metadata schemas (MODS, Dublin Core, MARC, etc.), together with traditional cataloging standards (RDA, LCSH), techniques and EAD archival finding aids, supports creation, organization, and maintenance of various forms of metadata to facilitate discovery and access to historical audio recordings, in their original and digitized formats as appropriate. * Collaborates with Music and ARS staff to evaluate, select and apply metadata schemas and techniques, standards, data elements and thesauri to provide description and access to audio recordings. Actively pursues innovative discovery techniques as they emerge and adapts them to local environment. * Responsible for associated authority work and quality control as defined for particular schemas and according to local procedures. Searches authority files, creates and updates name, name-uniform title and series authority records. * Answers reference questions from patrons regarding the ARS collections and related public service duties. * Digitizes individual audio recordings for preservation, personal research, or class use. Maintains audio room in Archive of Recorded Sound. In collaboration with technicians and expert partners, keeps software up to date, and performs routine maintenance repair. Manages scheduling of use of audio room. Supervises operations manager in audio room tasks. * Collaborates with Media Preservation lab to create discovery/content metadata for digitized audio collections and to manage digitization workflow. * Maintains the ARS website by revising and writing new text using Drupal. * Supervises the Operations Manager and student assistants in the Archive of Recorded Sound. * Oversees inventory control of ARS collections performed by the operations manager and student assistants including barcoding, shelving, and paging materials to and from remote storage. * Assists the Head Librarian in the review and assessment of gift collections as necessary. * Assists the Head Librarian in conducting tours, class and public presentations, and other outreach activities. * Assists in the preparation and writing of grant proposals. **Qualificiations:** * MLS/MLIS degree from an ALA-accredited graduate school is required or work in progress toward the degree. * B.A. in music or equivalent or work in progress toward the degree is required with demonstrated subject knowledge of music history and theory. A graduate degree in music is highly desirable. * Knowledge of reference materials and research tools in music and audio recordings; knowledge of audio recording history; and knowledge of discography is required. * Knowledge of audio archives principles and practices is required. At least 2 years of experience working in a sound archive is required. * Theoretical and practical knowledge of current cataloging/metadata procedures and principles as defined in RDA, MARC21, Dublin Core, MODS, METS, DACS, and EAD. Ability to provide mapping between different metadata standards in a dynamic environment. Knowledge of XML and XSLT. Ability to rapidly learn and implement new cataloging procedures, principles, schemas and relevant thesauri or ontologies, as required. * Experience with Sirsi Symphony, Archivists' Toolkit, and Drupal is desired. The ability to learn these systems rapidly is required. * Theoretical and practical knowledge of digitization principles and practices or the ability to rapidly learn audio digitization, preferably with specific experience in media digitization and with audio equipment. Proficiency with WaveLab or other audio digitization applications desirable. Basic knowledge of audio preservation best practices. * High level of
[CODE4LIB] job: Programmer Analyst (working on digitized materials in a Fedora repository)
For details and to apply visit: http://www.yale.edu/hronline/careers/application/external/index.html And search for the STARS Requisition number: 17151BR Programmer Analyst II Digital Library and Programming Services Yale University Library New Haven, CT Grade: 24 www.yale.edu/jobs Duration: 1 year term starting September 1, 2012, ending August 31, 2013 Schedule: Full-time (37.5 hours per week); Standard Work Week (M-F, 8:30-5:00) Yale University offers exciting opportunities for achievement and growth in New Haven, Connecticut. Conveniently located between Boston and New York, New Haven is the creative capital of Connecticut with cultural resources that include two major art museums, a critically-acclaimed repertory theater, state-of-the-art concert hall, and world-renowned schools of Architecture, Art, Drama, and Music. The University and the Library The Yale University Library, as one of the world's leading research libraries, collects, organizes, preserves, and provides access to and services for a rich and unique record of human thought and creativity. It fosters intellectual growth and supports the teaching and research missions of Yale University and scholarly communities worldwide. A distinctive strength is its rich spectrum of resources, including around 12.8 million volumes and information in all media, ranging from ancient papyri to early printed books to electronic databases. The Library is engaging in numerous projects to expand access to its physical and digital collections. Housed in eighteen buildings including the Sterling Memorial Library, the Beinecke Rare Book and Manuscript Library, and the Bass Library, it employs a dynamic and diverse staff of approximately five hundred who offer innovative and flexible services to library readers. For additional information on the Yale University Library, please vi! sit the Library's web site at www.library.yale.edu. Departmental Focus: Reporting to the Manager of Digital Library and Programming Services, the Programmer Analyst II plans, develops, tests, and documents computer programs, applying knowledge of programming techniques and computer systems. Evaluates user requests for new or modified programs to determine feasibility, cost and time required, compatibility with current systems, and computer capabilities. Departmental Responsibilities 1. Create and maintain digital repositories; participate in Digital Asset Management and Digital Preservation activities. 2. Work within an open source community; develop software based on community standards with strong emphasis on collaboration, code sharing and Application Lifecycle Management. 3. Develops programs or systems of small to moderate size and complexity. Modifies existing systems at all levels of difficulty. 4. Provides informational input into decisions concerning the development and delivery of applications, programs and systems. Defines and analyzes requirements to meet the expectations of stakeholders and intended end users needs, scheduled timeline, and budgetary targets. 5. Analyzes, defines and designs new systems and applications. Writes code in support of business solutions. 6. Responsible for the creation, definition, communication, and management of project plans which includes architectural design, technology selection and methodologies to apply. 7. Troubleshoots problems and provide ongoing maintenance and support for applications and systems. 8. Prepares documentation, user manuals and develops formal proposals for new systems and modifications to existing systems. Mentors technical staff and provide training for end users. 9. Contributes in the development of policies or modifications to exiting policies. 10. Applies and keeps current with existing and emerging technologies and methodologies. Provides ongoing input to the establishment of programming standards, procedures, and methodologies. 11. Ensures established data standards are supported for metadata management, business logic, digital preservation, data modeling and metadata workflow management; 12. Other duties as required or assigned. 13. May be required to assist with disaster recovery operations. Required Education and Experience Bachelor's degree. Three years of experience in analysis and programming of small to medium scale systems or an equivalent combination of education and experience. Required Skills Abilities 1. Strong knowledge Java. Experience with three or more of the following: Java, C++, PHP, Ruby and/or C#. Strong knowledge of SQL using MySQL, Oracle and/or Microsoft SQL Server; experience with database architecture and design; experience managing large data sets using SQL and/or SOLR. 2. Source code repository management using system(s) such as Subversion or Git. Demonstrated use of technologies: AJAX, CSS, HTML, JSON, RDF and HTML5. 3.
[CODE4LIB] Job: Manager of Data Management Services at Johns Hopkins University
The Manager of Data Management Services manages a team that provides data management and planning services, including both data management consulting and technical staff. This position also provides consultative data management planning support to JHU Principle Investigators. **Primary Duties and Responsibilities Manages team providing data management and planning services (50%):** * Leads continued development and expansion of the JHU Data Management Services and champions use within the JHU Community. * Supervises team, including setting goals, performance management, and development. * Works with team to assess services, continuously improve customer support, and benchmark services within the library community. * Maintain relationships with primary JHU stakeholders supporting services and oversee all administrative needs. * Participate in defining requirements for software development. * Tracks and plans for storage expansion and participates in storage option evaluation and implementation. * Works with Head of the Entrepreneurial Library program on budgeting, financial projections, service center fee development, contractual and legal matters, recruitment, and development of services for external provision. * Liaise as needed with the Associate Dean of Research Data Management and Chief IT Architect of the Digital Research and Curation Center. **Provides data management planning services (50%):** * Manage inquiries from JHU PIs for data management planning support. * Provide consultative support to PIs including evaluation of data planning needs, assessing short and long term options and benefits, catering planning to specific granting agency requirements, and editing data management plans. * Track specific scientific domain areas building knowledge and expertise in data types, formats, and needs within domain. * Identify data standards, metadata standards, best practices for data management, etc. to continuously build expertise and improve provision of service. * Maintain knowledge on a broad range of data repositories including their submission, Intellectual Property, and use arrangements, and provide guidance on repository selection for deposit. * Proactively collaborate and coordinate with team to plan for data management. * Collaborate with others in the library to effectively communicate services to faculty, researchers, and departments. * Responsible for manage short and long-term communication and relationship with PIs. **Additional information:** The Sheridan Libraries encompass the Milton S. Eisenhower Library and its collections at the John Work Garrett Library, the George Peabody Library, the Albert D. Hutzler Reading Room, and the DC Centers. Its primary constituency is the students and faculty in the schools of Arts Sciences, Engineering, Carey Business School and the School of Education. A key partner in the academic enterprise, the library is a leader in the innovative application of information technology and has implemented notable diversity and organizational development programs. The Sheridan Libraries are strongly committed to diversity. A strategic goal of the Libraries is to 'work toward achieving diversity when recruiting new and promoting existing staff.' The Libraries prize initiative, creativity, professionalism, and teamwork. For information on the Sheridan Libraries, visit www.library.jhu.edu. **Qualifications:** * Masters of Science, Engineering or Library Science. * A minimum of three (3) years combined of library, information technology, informatics, and/or scientific research experience. * A minimum of two years (2) years of management or supervisory experience. * Experience leading and managing service, product or project teams preferred. * Experience working with scientific data management and/or curation preferred. * Experience with one or more components of the research data life cycle: creation, processing, analyzing, preserving, providing access to, and re-using. * Experience with National Science Foundation proposals a plus. * Superb people and diplomacy skills, strong team-orientation, and professional attitude. * Clear and consistent communicator skilled in networking and outreach efforts * Must be self motivated, pro-active, willing to take on new challenges and solve problems with minimal supervision. * Good listener and skilled negotiator with a high degree of customer orientation. * Strong writing skills. * Strong project planning, management, and execution skills. * Demonstrated ability to work with and easily adapt to new technology. * Experience managing budgets preferred. * Experience managing relationships and/or partnerships with other institutions or vendors a plus. More information available at:[https://hrnt.jhu.edu/jhujobs /job_view.cfm?view_req_id=52688view=sch](https://hrnt.jhu.edu/jhujobs/job_vie
[CODE4LIB] Job: Programmer Analyst II at Yale University
**Programmer Analyst II** Digital Library and Programming Services Yale University Library New Haven, CT Grade: 24 www.yale.edu/jobs Duration: 1 year term starting September 1, 2012, ending August 31, 2013 Schedule: Full-time (37.5 hours per week); Standard Work Week (M-F, 8:30-5:00) Yale University offers exciting opportunities for achievement and growth in New Haven, Connecticut. Conveniently located between Boston and New York, New Haven is the creative capital of Connecticut with cultural resources that include two major art museums, a critically-acclaimed repertory theater, state-of-the-art concert hall, and world-renowned schools of Architecture, Art, Drama, and Music. The University and the Library The Yale University Library, as one of the world's leading research libraries, collects, organizes, preserves, and provides access to and services for a rich and unique record of human thought and creativity. It fosters intellectual growth and supports the teaching and research missions of Yale University and scholarly communities worldwide. A distinctive strength is its rich spectrum of resources, including around 12.8 million volumes and information in all media, ranging from ancient papyri to early printed books to electronic databases. The Library is engaging in numerous projects to expand access to its physical and digital collections. Housed in eighteen buildings including the Sterling Memorial Library, the Beinecke Rare Book and Manuscript Library, and the Bass Library, it employs a dynamic and diverse staff of approximately five hundred who offer innovative and flexible services to library readers. For additional information on the Yale University Library, please visit the Library's web site at www.library.yale.edu. Departmental Focus: Reporting to the Manager of Digital Library and Programming Services, the Programmer Analyst II plans, develops, tests, and documents computer programs, applying knowledge of programming techniques and computer systems. Evaluates user requests for new or modified programs to determine feasibility, cost and time required, compatibility with current systems, and computer capabilities. Departmental Responsibilities 1. Create and maintain digital repositories; participate in Digital Asset Management and Digital Preservation activities. 2. Work within an open source community; develop software based on community standards with strong emphasis on collaboration, code sharing and Application Lifecycle Management. 3. Develops programs or systems of small to moderate size and complexity. Modifies existing systems at all levels of difficulty. 4. Provides informational input into decisions concerning the development and delivery of applications, programs and systems. Defines and analyzes requirements to meet the expectations of stakeholders and intended end users needs, scheduled timeline, and budgetary targets. 5. Analyzes, defines and designs new systems and applications. Writes code in support of business solutions. 6. Responsible for the creation, definition, communication, and management of project plans which includes architectural design, technology selection and methodologies to apply. 7. Troubleshoots problems and provide ongoing maintenance and support for applications and systems. 8. Prepares documentation, user manuals and develops formal proposals for new systems and modifications to existing systems. Mentors technical staff and provide training for end users. 9. Contributes in the development of policies or modifications to exiting policies. 10. Applies and keeps current with existing and emerging technologies and methodologies. Provides ongoing input to the establishment of programming standards, procedures, and methodologies. 11. Ensures established data standards are supported for metadata management, business logic, digital preservation, data modeling and metadata workflow management; 12. Other duties as required or assigned. 13. May be required to assist with disaster recovery operations. Required Education and Experience Bachelor's degree. Three years of experience in analysis and programming of small to medium scale systems or an equivalent combination of education and experience. Required Skills Abilities 1. Strong knowledge Java. Experience with three or more of the following: Java, C++, PHP, Ruby and/or C#. Strong knowledge of SQL using MySQL, Oracle and/or Microsoft SQL Server; experience with database architecture and design; experience managing large data sets using SQL and/or SOLR. 2. Source code repository management using system(s) such as Subversion or Git. Demonstrated use of technologies: AJAX, CSS, HTML, JSON, RDF and HTML5. 3. Excellent customer service skills; demonstrated ability to work with customers and vendors to design and document complex procedures. 4. Ability to work on multiple complex projects concurrently, both independently and as a member of a team in a complex and
[CODE4LIB] Reminder - call for proposals, New England code4lib!
Reminder: We're planning a New England regional code4lib conference and we need your proposals! Dates: Friday, October 26 and Saturday, October 27 Location: Yale University, New Haven, CT Proposal deadline: July 15, 2012. This will be a great opportunity to meet your peers at local institutions and generate conversation on code4lib related topics in which you are interested! Please add your proposals now (please, by July 15, 2012) for (a) Prepared talks (20 minutes) (b) Lightning talks (5 minutes) (c) Posters To submit a proposal, fill out the form code4lib New England - Call for Proposals at: https://docs.google.com/spreadsheet/viewform?formkey=dEQ5SEF4aXljTU5jZFN0UDRsSnJPb2c6MQ If you are interested in making multiple proposals, e.g. for both a prepared talk and a poster, please submit separate proposal forms. About the venue: http://wiki.code4lib.org/index.php/Information_about_meeting_rooms_and_available_equipment Go forth and propose topics! - the code4lib NE planning team (Mike Friscia, Matthew Beacom, Cindy Greenspun, Michelle Hudson, Jay Luker, Joe Montibello; Ernie Marinko, Mark Matienzo, Randy Stern, Kalee Sprague, Tito Sierra)
[CODE4LIB] Job: Agile Project Manager at AudioVisual Preservation Solutions
**Job Description:** AudioVisual Preservation Solutions (AVPS) seeks an experienced (mid-level) Agile Project Manager to provide essential support and facilitation to an open source software development project for the public media archival community. The position will begin on July 1, 2012 and continue through October 2013, with the possibility of extension. The project manager will both play a critical leadership role in the Agile development process as well act as primary liaison for clients and stakeholders. This position is full time, based at our office in New York City. No reimbursement for relocation costs will be provided. **Responsibilities** * Oversee the entire project, including overall project planning, project coordination and software development * Oversee Agile development of the application * Develop and document comprehensive project plans, timelines, milestones and deliverables * Manage the complete software development lifecycle * Lead the development and management of project requirements, system features, and user stories * Carefully track and coordinate project progress, ensuring the timely completion of deliverables * Continually prioritize and organize project goals in a way that is clearly accessible to all stakeholders * Manage and track project progress through web-based collaboration tools * Organize and facilitate regular project meetings, including iteration and release planning, daily stand-up meetings, demos, and reviews * Be the primary point of contact for all stakeholders, including clients, developers, stakeholders and AVPS team members. Answer questions, and field inquiries to appropriate team members as needed * Develop documentation and guidelines for software * Help train users of the application * Supervise hand off of application to product owners upon completion of contract * Travel to meetings as needed (10%) **Desired Skills and Experience** * At least three years in a project management role * Demonstrated experience with Agile software development coordination, using frameworks such as Scrum or Feature Driven Development (FDD) * Demonstrated leadership skills, with the ability to manage distributed, remote teams * Excellent verbal, written, presentation, and interpersonal communication skills * Extremely organized, responsive, and detail oriented * Experience managing projects with project tracking, issue tracking, and collaboration software such as JIRA and Confluence * Excellent MS Office skills on Mac and PC platforms, Google Docs, diagramming skills using a variety software such as OmniGraffle * Certified Scrum Master and/or PMP Certification a plus * Knowledge of library and information science, video and audio production, and/or public media a plus AudioVisual Preservation Solutions (AVPS) is a full service audiovisual preservation and information management consulting firm serving the educational, broadcasting, government, non-profit, and corporate sectors. With a strong focus on professional standards and best practices, open communication, efficient workflows, and the innovative use and development of technological resources, AVPS brings a broad knowledge base and extensive experience to efficiently and effectively meeting the challenges faced in the preservation and access of digital content. To Apply please submit resume and cover letter (including salary requirements if applicable) in PDF format to care...@avpreserve.com by June 22, 2012. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/1003/
Re: [CODE4LIB] Job: Agile Project Manager at AudioVisual Preservation Solutions
Oops, sorry about that Mark. I should have looked more carefully before adding this after seeing it in your TweetStream. I'll remove the duplicate. Also happened today with the Yale posting. I guess I need to come up with some smarts to detect duplicates. //Ed On Mon, Jun 11, 2012 at 5:43 PM, j...@code4lib.org wrote: **Job Description:** AudioVisual Preservation Solutions (AVPS) seeks an experienced (mid-level) Agile Project Manager to provide essential support and facilitation to an open source software development project for the public media archival community. The position will begin on July 1, 2012 and continue through October 2013, with the possibility of extension. The project manager will both play a critical leadership role in the Agile development process as well act as primary liaison for clients and stakeholders. This position is full time, based at our office in New York City. No reimbursement for relocation costs will be provided. **Responsibilities** * Oversee the entire project, including overall project planning, project coordination and software development * Oversee Agile development of the application * Develop and document comprehensive project plans, timelines, milestones and deliverables * Manage the complete software development lifecycle * Lead the development and management of project requirements, system features, and user stories * Carefully track and coordinate project progress, ensuring the timely completion of deliverables * Continually prioritize and organize project goals in a way that is clearly accessible to all stakeholders * Manage and track project progress through web-based collaboration tools * Organize and facilitate regular project meetings, including iteration and release planning, daily stand-up meetings, demos, and reviews * Be the primary point of contact for all stakeholders, including clients, developers, stakeholders and AVPS team members. Answer questions, and field inquiries to appropriate team members as needed * Develop documentation and guidelines for software * Help train users of the application * Supervise hand off of application to product owners upon completion of contract * Travel to meetings as needed (10%) **Desired Skills and Experience** * At least three years in a project management role * Demonstrated experience with Agile software development coordination, using frameworks such as Scrum or Feature Driven Development (FDD) * Demonstrated leadership skills, with the ability to manage distributed, remote teams * Excellent verbal, written, presentation, and interpersonal communication skills * Extremely organized, responsive, and detail oriented * Experience managing projects with project tracking, issue tracking, and collaboration software such as JIRA and Confluence * Excellent MS Office skills on Mac and PC platforms, Google Docs, diagramming skills using a variety software such as OmniGraffle * Certified Scrum Master and/or PMP Certification a plus * Knowledge of library and information science, video and audio production, and/or public media a plus AudioVisual Preservation Solutions (AVPS) is a full service audiovisual preservation and information management consulting firm serving the educational, broadcasting, government, non-profit, and corporate sectors. With a strong focus on professional standards and best practices, open communication, efficient workflows, and the innovative use and development of technological resources, AVPS brings a broad knowledge base and extensive experience to efficiently and effectively meeting the challenges faced in the preservation and access of digital content. To Apply please submit resume and cover letter (including salary requirements if applicable) in PDF format to care...@avpreserve.com by June 22, 2012. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/1003/
[CODE4LIB] Job: Scholarly Communications Liason at Rice University
The Scholarly Communications Liaison serves as the primary point of contact for issues related to scholarly communications and open access, including copyright issues related to scholarly publications. The Scholarly Communications Liaison is a member of the Center for Digital Scholarship team and works closely with Rice faculty and researchers to support open access. S/he facilitates the deposit of scholarly publications into the Rice institutional repository by receiving, digitizing and depositing published articles from faculty into the institutional repository. The Scholarly Communications Liaison interprets, communicates and applies the Rice Open Access policy to scholarly publications, resolving discrepancies between publisher copyrights and permissions and Rice's policies and educates Rice faculty and staff about the open access, author rights and copyrights. S/he communicates with publishers and funders to ensure that any copyright concerns around the scholarly publications are addressed and that compliance with any funder mandates related to open access are adequately met. The Scholarly Communications Liaison maintains and updates the web site with pertinent information related to Rice's activities and services that support digital scholarship and supervises student workers. The Scholarly Communications Liaison is responsible for outreach by authoring white papers, developing and distributing publicity materials, developing and offering workshops and presentations related to scholarly communications. **Responsibilities** * Serves as the primary point of contact for addressing issues related to copyright for articles that are made available as open access resources in Rice's institutional repository. * Receive published articles from faculty and deposit them into the institutional repository. * Interpret, communicate and apply the Rice Open Access policy to scholarly publications, resolving discrepancies between publisher copyrights and permissions and Rice's policies. * Recommend, develop, implement and manage internal processes and procedures to ensure practices related to the publication of scholarly communications comply with Rice's policies and publisher agreements with faculty. * Educate Rice faculty about the open access policy and copyright. * Liaise with the publisher to communicate Rice's open access policy, clarify permissions related to research publications and resolve issues with publisher agreements that faculty sign when publishing articles. * Obtain legal counsel from the Office of General Counsel as needed. * Work with other staff in the Center for Digital Scholarship and other library departments as needed to fulfill this duty. * Develop and offer training to Rice faculty and staff on issues related to scholarly communications and copyright. This will include education about Rice's open access policy, answering questions about copyright as it relates to Rice's open access policy, instructions for entering research publications into the institutional repository, workshops on scholarly communications, updates of the library website, information about funding agency policies related to scholarly communications, and description of services offered by the library to the Rice community to support their scholarly communications needs. * Maintain and update the web site with pertinent information related to Rice's activities and services that support digital scholarship. * Reach out to the university community by authoring white papers, developing and distributing publicity materials, and offering workshops and presentations related to scholarly communications. * Digitize print materials such as Rice theses and dissertations, perform quality control on digitized materials, update metadata for digital content in the institutional repository. * Supervise student workers who are supporting open access to faculty publications at Rice. **Requirements:** Education Required: * Bachelor's Degree * Experience may not be substituted for the education requirement. Experience Required: * 3 years * Work experience in a university setting * A PhD degree may be substituted for up to 2 years of the experience requirement. Other Skills Required: * Understanding of faculty needs and perspective * Understanding of the academic environment * Ability to update web pages, edit wikis and create spreadsheets * Strong written and verbal communication skills and a demonstrated ability to work collaboratively with others **Education Preferred:** * Degree in information science or communications **Experience Preferred** * Five years of work experience in a university setting; experience in a similar position For more information see[https://jobs.rice.edu/applicants/j sp/shared/position/JobDetails_css.jsp?postingId=158574](https://jobs.rice.edu/ applicants/jsp/shared/position/JobDetails_css.jsp?postingId=158574)
Re: [CODE4LIB] A bibliography in XMP inside PDF
You can include XMP packets in pdftex files using the \pdfcatalog primitive ; there are a couple of macro packages that add a little bit of a wrapper: xmpincl basically just wraps and inserts the contents of an external RDF+XML file as the value of /Metadata . http://www.ctan.org/tex-archive/macros/latex/contrib/xmpincl/ There is no standard ontology for citation data; you might look at BIBO http://purl.org/ontology/bibo/ or SPAR - http://purl.org/spar/ Simon On Mon, Jun 11, 2012 at 5:07 AM, Ben Companjen b...@companjen.name wrote: Hi all, I have been wondering whether it is possible to put a bibliography (list of references in a journal article, for example) in an XMP packet in a PDF document. Having read about efforts to get the references from a document by parsing text, I thought it might make sense (and be possible) to put a machine-readable version in the document. I haven't found anyone who had done this (yet), so I started wondering why. Is it too obvious to write about? Or is it impossible? Eventually I'd like to have it automatically created at the same time BibTeX creates the 'normal' bibliography for my pdfLaTeX document. And if the PDF/A standard allows it, have it in such a way that it complies to the long-term archival PDF standard. It appears, however, that PDF/A-1 allows the XMP description of the document itself to have only one author [1], which makes me doubt the use of PDF/A in general. If I manage to get it working, could it be a topic of interest for the Code4Lib Journal? Regards, Ben [1] www.pdfa.org/wp-content/**uploads/2011/08/tn0003_** metadata_in_pdfa-1_2008-03-**182.pdfhttp://www.pdfa.org/wp-content/uploads/2011/08/tn0003_metadata_in_pdfa-1_2008-03-182.pdf
Re: [CODE4LIB] Studying the email list (Charcuterie Spectrum)
I hear Roy Tennant talked Chuck Norris' fists into not punching him in the face. That's how smart Roy Tennant is. On Tue, Jun 5, 2012 at 1:53 PM, Frumkin, Jeremy frumk...@u.library.arizona.edu wrote: Is Roy Tennant smarter than Chuck Norris is tough? -- jaf Sent from my iPad On Jun 5, 2012, at 1:51 PM, Roy Tennant roytenn...@gmail.com wrote: Roy Tennant is too smart to have an official position on this. Best to work it out yourselves. :-) Roy On Jun 5, 2012, at 1:06 PM, Ethan Gruber ewg4x...@gmail.com wrote: The begs the question, what is the official Roy Tennant position on baloney vs. bologna? May I suggest a viaf-like resource for food, in which I may prefer the baloney label while allowing my data to be cross-searchable with bologna records? Is there an RDF ontology for this??? On Tue, Jun 5, 2012 at 4:02 PM, Kevin S. Clarke kscla...@gmail.com wrote: On Tue, Jun 5, 2012 at 3:55 PM, BWS Johnson abesottedphoe...@yahoo.com wrote: Bacon == Seal of Approval Bologna == Seal of Disapproval Salami == Seal of No Approval Needed This has some serious flaws. I'm concerned about the relationships between the desirability of the bespoke seals as they relate to the appeal of the meats themselves. While yea, bacon is nearly universal in its appeal, that one seems on the mark. Alas, bologna as the seal of disapproval might fall a bit short. While one might jump to proffer spam in its place, Hawai'ians quite like spam, leaving us all in a bit of a quandry. Olive loaf, perhaps? And while salame is a most excellent meat, perhaps fois gras more aptly conveys the aboutness of not giving a damn about one's approval or lack thereof. What say you cataloguing mafia? Surely we must honour the aboutness of meat and approval lest we needs OCLC to intervene more often than is strictly necessary in our mortal affairs. I'm vegan now, but having eaten it as a child, may I suggest chicken livers for the Seal of Disapproval? Blech! And, as a vegan, I'd stretch bounds of the Seal of No Approval Needed to tempeh. That seems appropriate. Fwiw... Kevin