Re: [CODE4LIB] Code4Lib NE
Hiya, Thanks for organizing this and putting this together! Eben and I from the Boston Public Library will both try to attend. :) Please note the following is a general note that isn't directed at Code4Lib NE specifically and rather an attempt to get some off-list communication started for events in 2016. With the Northeast Hydra regional meeting occurring on May 7th (at Brown) and the Northeast Fedora User Group meeting on May 11th and 12th (at Yale), it appears we have all accidentally packed all of our gatherings into the same month. This doesn't even include the Massachusetts focused Digital Commonwealth conference on the previous month (April 2nd). I'm unsure if having all of our regional get-togethers within the same timeframe is a concern with anyone else but I'd guess they all would be more effective if we spread them out a tad. Could be useful if those involved in planning these regional events all ended up on some low-traffic listsrv designed just to help keep each other informed of dates we are considering holding our tech gatherings? Would at least help us on the Hydra/Fedora side to better space things out. (Could still end up with the same compact regional schedule due to when people can hold things but still better than all planning dates separately). I can be reached at sander...@bpl.org and can start a group email that includes those of us that have been planning the Hydra/Fedora events to work out how this might be done? Thanks! Sincerely,Steven AndersonWeb Services - Digital Library Repository developer617-859-2393sander...@bpl.org Date: Wed, 11 Mar 2015 10:42:08 -0400 From: matt.j.bernha...@gmail.com Subject: Re: [CODE4LIB] Code4Lib NE To: CODE4LIB@LISTSERV.ND.EDU To add to Matt's comment - that date and venue are now confirmed: Friday, May 29, 2015 MIT Campus Other details are coming - everything will be posted to the code4lib wiki, an the URL announced on this list and elsewhere. The planning group is coordinating our activities in a Google Group, which you can join here: https://groups.google.com/forum/#!forum/code4lib-ne Thanks, Matt Bernhardt On Wed, Mar 11, 2015 at 10:29 AM, Matthew Sherman matt.r.sher...@gmail.com wrote: We just were discussing it this morning. We will have more details on the wiki soon. On Wed, Mar 11, 2015 at 10:28 AM, Whitni Watkins whitni.watk...@gmail.com wrote: Wiki Update: A tentative date of Friday May 29, 2015 on the MIT campus in Cambridge, MA has been discussed. More details will be provided in February. Is there any further confirming information on Code4Lib NE this May? Thanks! Whitni Watkins
[CODE4LIB] OR2015 NEWS: Registration Opens; Speakers from Mozilla and Google Announced
We are pleased to announce that registration is now open for the 10th International Conference on Open Repositories, to be held on June 8-11, 2015 in Indianapolis, Indiana, United States of America. Full registration details and a link to the registration form may be found at: http://www.or2015.net/registration OR2015 is co-hosted by Indiana University Bloomington Libraries, University of Illinois at Urbana-Champaign Library, and Virginia Tech Libraries. *OR2015 Registration and Fees:* An early registration fee of $450 USD will be available until May 8. After May 8, the registration fee will increase to $500 USD. This registration fee covers participation in general conference sessions, workshops, and interest group sessions, as well as the conference dinner on Wednesday, June 10 and poster reception on Tuesday, June 9. For a draft outline of the conference schedule, please see: http://www.or2015.net/program/schedule-at-a-glance Participants may register online at: http://www.or2015.net/registration. If you have any questions about registering for OR2015, please contact the Conference Registrar at iuco...@indiana.edu. Any other questions about the conference may be directed to the conference organizing committee by using the form at: http://www.or2015.net/contact-us *Hotel Reservations:* The OR2015 conference will take place at the Hyatt Regency Indianapolis hotel, conveniently located in the heart of downtown Indianapolis. Special room rates at the Hyatt starting at $159 USD per night have been negotiated for conference attendees and will be available for booking through May 16. More information on hotel reservations and travel is available at: http://www.or2015.net/conference-hotel-and-travel *Keynote and Featured Speakers:* Reflecting the significant milestone of the 10th Open Repositories conference and this year's theme of Looking Back, Moving Forward: Open Repositories at the Crossroads, we are pleased to announce the conference's two plenary speakers: Kaitlin Thaney will be giving the opening keynote talk on the morning of Tuesday, June 9. Kaitlin is director of the Mozilla Science Lab, an open science initiative of the Mozilla Foundation focused on innovation, best practice and skills training for research. Prior to Mozilla, she served as the Manager of External Partnerships at Digital Science, a technology company that works to make research more efficient through better use of technology. Kaitlin also advises the UK government on infrastructure for data intensive science and business, serves as a Director for DataKind UK, and is the founding co-chair for the Strata Conference series in London on big data. Prior to Mozilla and Digitial Science, Kaitlin managed the science program at Creative Commons, worked with MIT and Microsoft, and wrote for the Boston Globe. You can learn more about the Science Lab at http://mozillascience.org and follow Kaitlin online at @kaythaney. Anurag Acharya will be the featured speaker at the plenary session on the morning of Wednesday, June 10, presenting on Indexing repositories: pitfalls and best practices. Anurag is a Distinguished Engineer at Google and creator of Google Scholar, and he previously led the indexing group at Google. He has a Bachelors in Computer Science from the Indian Institute of Technology, Kharagpur and a PhD in Computer Science from Carnegie Mellon. Prior to joining Google, he was a post-doctoral researcher at the University of Maryland, College Park and an assistant professor at the University of California, Santa Barbara. We look forward to seeing you at OR2015! Jon Dunn, Julie Speer, and Sarah Shreeves OR2015 Conference Organizing Committee Holly Mercer, William Nixon, and Imma Subirats OR2015 Program Co-Chairs
Re: [CODE4LIB] Code4Lib NE
To add to Matt's comment - that date and venue are now confirmed: Friday, May 29, 2015 MIT Campus Other details are coming - everything will be posted to the code4lib wiki, an the URL announced on this list and elsewhere. The planning group is coordinating our activities in a Google Group, which you can join here: https://groups.google.com/forum/#!forum/code4lib-ne Thanks, Matt Bernhardt On Wed, Mar 11, 2015 at 10:29 AM, Matthew Sherman matt.r.sher...@gmail.com wrote: We just were discussing it this morning. We will have more details on the wiki soon. On Wed, Mar 11, 2015 at 10:28 AM, Whitni Watkins whitni.watk...@gmail.com wrote: Wiki Update: A tentative date of Friday May 29, 2015 on the MIT campus in Cambridge, MA has been discussed. More details will be provided in February. Is there any further confirming information on Code4Lib NE this May? Thanks! Whitni Watkins
[CODE4LIB] Job: Digital Studio Technology Specialist at New York University
Digital Studio Technology Specialist New York University New York City The Digital Studio is NYU's gateway to digital services supporting scholarship and teaching. The Digital Studio Technology Specialist will manage the Digital Studio facility, provide client service, technical knowledge, technical development, and communications expertise in support of state-of-the-art computer technology used by faculty and researchers for teaching and research. Advise and consult with faculty and researchers in the design, development, and implementation of information technology services for teaching and research. Develop and deliver training programs including conducting seminars, training sessions, and workshops. Help support scholarly computing, in particular digital humanities, in collaboration with Digital Scholarship Services and Data Services. **Key Responsibilities:** The Digital Studio's mission is to provide a friendly, non-intimidating, and human-centered service connecting the NYU community to the resources they need for research and teaching. The Digital Studio Technology Specialist will: • Provide client service in the Digital Studio, working directly with clients (in-person, email, phone, etc.). • Develop a technology environment conducive to effective client work by evaluating user needs, and planning and implementing systems to meet those needs: research and make recommendations for the purchase of hardware and software; think creatively about the configuration of workstations, work spaces, workflows, etc.; troubleshoot soft/hardware; contact and negotiate with vendors regarding hardware and software purchases; supervise installation of hardware, software, and peripherals • Supervise Digital Studio support staff including hiring, training, and evaluating performance. Identify and prioritize staff assignments to ensure deadlines are met and review work for accuracy. • Work closely with faculty, scholars, and other Libraries and ITS staff to: evaluate current technology used by faculty and scholars; ascertain the academic technology needs of the university; and develop and support standardized, core technology and person-to-person services for teaching and research. • Advise and educate faculty and scholars in the availability of information technology resources for teaching and research across Libraries, Information Technology Services, and NYU. • Develop and deliver training programs including conducting seminars, training sessions, and workshops. Identify, analyze, and evaluate training needs. Arrange for instructors, facilities and materials, as required. Consult with faculty, researchers, and ITS staff on training interests and opportunities. • Research and test future information technologies for teaching and research **Qualifications Required:** • Bachelor's degree and 3 years relevant experience or an equivalent combination. • Must include: good interpersonal and communication skills; client service experience and orientation; experience supporting information technology and training; interest in and a strong desire to learn about new information technologies; willingness to undertake responsibility. • Experience in one of the following areas: Digital audio/video encoding/streaming, imaging, website design, instructional design. Note: the Digital Studio is a growing service and open hours will be expanding. The successful applicant will be willing to work possible evening and weekend hours. **Preferred:** Master's degree and experience in an academic environment, preferably in Libraries and/or IT. Application and resume must include a cover letter that reflects how your experience fits the position description. New York University Libraries: Libraries at New York University serve the schools 40,000 students and faculty and contain more than 5 million volumes. The Libraries supports NYUs vision to become the first true Global Network University by collaborating and providing services to our global academic centers and portal campuses in Abu Dhabi and Shanghai. New York University Libraries is a member of the Association of Research Libraries, the OCLC Research Library Partnership, and the HathiTrust. The Libraries participates in a variety of consortia and collaborates closely with Columbia University Libraries and the New York Public Library through the Manhattan Research Library Consortium. For the NYU Libraries Mission and Strategic Plan go to http://library.nyu.edu/about/Strategic_Plan.pdf To Apply: Interested candidates are requested to submit a letter of application that specifically states how background and experiences are relevant to the position responsibilities and qualifications; current resume; and the names, addresses, and telephone numbers of three references. Please apply through NYU's application management system. Please click on the following link or copy and paste it onto your browser.
[CODE4LIB] Job: Digital Infrastructure Librarian at University of Idaho
Digital Infrastructure Librarian University of Idaho Moscow This is a new position at the University of Idaho Library. Come work with a great team of librarians in one of the most beautiful parts of the country. We are working on many exciting digital projects and we hope to find someone who would be excited to work on a broad array of projects. **- About the Position** The University of Idaho Library invites applications from innovative and service-oriented individuals for the position of Digital Infrastructure Librarian. The person in this position will work closely with the Head of Technical Services, members of the Data and Digital Services unit, the staff of the Technical Services Department, and the Northwest Knowledge Network (the university's data management service center). An ideal candidate will have strong analytical skills, an agile and flexible approach to working with technology, and the ability to adapt to an evolving environment. The University of Idaho library currently has established [digital collections](http://www.lib.uidaho.edu/digital/index.html), [extensive geospatial data resources](http://insideidaho.org/), and is implementing an [institution-wide instance of VIVO](http://vivo.nkn.uidaho.edu/). As a member of the Orbis Cascade Alliance, the library collaborates extensively with thirty-six other academic libraries in the Pacific Northwest. As a member of the University of Idaho faculty, the successful candidate is expected to participate in planning and governance, to be able to work comfortably in a shared decision-making environment, and to be active professionally in research, outreach and professional service. An ideal candidate will be intellectually curious with a desire for continuous learning, enthusiastically contribute to the University's research, teaching and outreach programs, explore emerging technologies with regard to potential library applications, and be committed to the ideals of faculty service. **Job Duties:** Teaching and Learning: The successful applicant will support the University of Idaho's mission by making information resources available, findable, and usable by: providing support for digital and data initiatives in the Library, primarily for metadata and associated technologies; developing, implementing, and refining metadata, ontologies, and controlled vocabularies to enhance information organization, discovery, and use; efficiently managing large sets of metadata records; overseeing interoperability of library systems (e.g., harvesting records for display in the library's discovery interface); and serving as a resource to library employees for questions concerning database maintenance, metadata standards, interoperability of systems, and workflow. There are also Scholarly, Outreach, and Service portions of the job duties. For the full listing, go here:[https://uidaho.peopleadmin.c om/postings/8132](https://uidaho.peopleadmin.com/postings/8132) Preferred Qualifications * Graduate degree in library science from an ALA-accredited institution or equivalent * Coursework, training, or experience with Dublin Core or discipline-specific descriptive metadata schemas, e.g. FGDC-CSDGM or DDI. * Coursework, training, or experience with XML and XSLT. * Ability to work effectively, independently, and collaboratively in a collegial environment. * Ability to meet requirements for promotion and tenure. * Excellent oral, interpersonal, and written communication skills. Desired Qualifications * Non-MARC metadata coursework beyond the basic or introductory level. * Informatics, research data management, digital archives, or GIS background. * Demonstrated ability to use scripting language(s) to transform metadata. * Experience with HTML/CSS and other web technologies. * Experience with metadata on a professional or paraprofessional level (preferably in an academic library). * Experience implementing, maintaining, and configuring library applications and systems. * Familiarity with METS and MODS. * Experience working in an academic library, particularly with data management systems or institutional repositories. * Research interests that could lead to scholarly publication. Brought to you by code4lib jobs: http://jobs.code4lib.org/job/19906/ To post a new job please visit http://jobs.code4lib.org/
Re: [CODE4LIB] Data Lifecycle Tracking Documentation Tools
At my previous job we often had to ingest and support several versions of ingest files from multiple outside data providers. One approach we took was to use the supportable pattern. Basically we would create a singleton component per file/version, with two methods required. 1) Boolean supports(file) - singleton did whatever necessary to see if it could support that file 2) void process(file) - singleton did its magic transforming. We ended up transforming them into objects that we then stored in our database, your requirements might be different. What was really nice about this is was that it made it super easy to test (unit test), as well as have spring framework automatically create a list of these singletons during startup and simply loop over each singleton per file. When performance became an issue it was also easy to put this in a thread since the singletons didn't keep state. I know for xml it can conform to a specific schema. Do you have any formal schema for the files you are ingesting? If so, is there any way to enforce they follow their schema? As for needing a human to quality review the ingest, if after looping through all singletons and none applying you know it needs human review. The code could put that file in a list and present it via some web interface (restful or GUI based). As for the wiki, we found that a good front page that linked to the most used pages was the best. From what I recall mediawiki was ok at search but not the best. We had the understanding that if ANYONE found something wrong or duplicate, it was up to them to fix it (including new members of the team). But even given the worst case of it being disorganized, I would still take it over just a select few people knowing it. Communication is key, so if you are finding it becoming a mess perhaps that is a better indication that there is some communication gap. To combat this we made it mandatory to be in the dev chat room on our IM client. People would often post links to the wiki in the chat room, which allowed anyone to mention if it was a duplicate. Anyways, these are just my ramblings as a developer. Take it with a grain of salt, your mileage may vary. John -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of davesgonechina Sent: Tuesday, March 10, 2015 10:35 PM To: CODE4LIB@LISTSERV.ND.EDU Subject: Re: [CODE4LIB] Data Lifecycle Tracking Documentation Tools Hi John, Good question - we're taking in XLS, CSV, JSON, XML, and on a bad day PDF of varying file sizes, each requiring different transformation and audit strategies, on both regular and irregular schedules. New batches often feature schema changes requiring modification to ingest procedures, which we're trying to automate as much as possible but obviously require a human chaperone. Mediawiki is our default choice at the moment, but then I would still be looking for a good workflow management model for the structure of the wiki, especially since in my experience wikis are often a graveyard for the best intentions. Dave On Tue, Mar 10, 2015 at 8:10 PM, Scancella, John j...@loc.gov wrote: Dave, How are you getting the metadata streams? Are they actual stream objects, or files, or database dumps, etc? As for the tools, I have used a number of the ones you listed below. I personally prefer JIRA (and it is free for non-profit). If you are ok if editing in wiki syntax I would recommend mediaWiki (it is what powers Wikipedia). You could also take a look at continuous deployment technologies like Virtual Machines (virtualbox), linux containers (docker), and rapid deployment tools (ansible, salt). Of course if you are doing lots of code changes you will want to test all of this continually (Jenkins). John Scancella Library of Congress, OSI -Original Message- From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf Of davesgonechina Sent: Tuesday, March 10, 2015 6:05 AM To: CODE4LIB@LISTSERV.ND.EDU Subject: [CODE4LIB] Data Lifecycle Tracking Documentation Tools Hi all, One of my projects involves harvesting, cleaning and transforming steady streams of metadata from numerous publishers. It's an infinite loop but every cycle can be a little bit or significantly different. Many issue tracking tools are designed for a linear progression that ends in deployment, not a circular workflow, and I've not hit upon a tool or use strategy that really fits. The best illustration I've found so far of the type of workflow I'm talking about is the DCC Curation Lifecycle Model http://www.dcc.ac.uk/sites/default/files/documents/publications/DCCLif ecycle.pdf . Here are some things I've tried or thought about trying: - Git comments - Github Issues - MySQL comments - Bash script logs - JIRA - Trac - Trello - Wiki - Unfuddle - Redmine - Zendesk - Request Tracker -
Re: [CODE4LIB] Code4Lib NE
We just were discussing it this morning. We will have more details on the wiki soon. On Wed, Mar 11, 2015 at 10:28 AM, Whitni Watkins whitni.watk...@gmail.com wrote: Wiki Update: A tentative date of Friday May 29, 2015 on the MIT campus in Cambridge, MA has been discussed. More details will be provided in February. Is there any further confirming information on Code4Lib NE this May? Thanks! Whitni Watkins
[CODE4LIB] Code4Lib NE
Wiki Update: A tentative date of Friday May 29, 2015 on the MIT campus in Cambridge, MA has been discussed. More details will be provided in February. Is there any further confirming information on Code4Lib NE this May? Thanks! Whitni Watkins