[CODE4LIB] Fwd: new W3C CSV on the Web specs, now at Candidate Recommendation stage - please implement!
For those who have to work with output from spreadsheets. And I wonder if this doesn't interact with OpenRefine output as well.

kc

Forwarded Message
Subject: new W3C CSV on the Web specs, now at Candidate Recommendation stage - please implement!
Resent-Date: Tue, 28 Jul 2015 14:33:21 +
Resent-From: public-voc...@w3.org
Date: Tue, 28 Jul 2015 15:32:52 +0100
From: Dan Brickley dan...@google.com
To: public-...@w3.org, semantic-...@w3.org Web semantic-...@w3.org, W3C Web Schemas Task Force public-voc...@w3.org

Hi!

Short version: Please see http://www.w3.org/blog/news/archives/4830 for the Candidate Recommendation specs from W3C's CSV on the Web group - https://www.w3.org/2013/csvw/wiki/Main_Page

Long version: These are the 4 docs,

Model for Tabular Data and Metadata on the Web: an abstract model for tabular data, and how to locate metadata that enables users to better understand what the data holds; this specification also contains non-normative guidance on how to parse CSV files
http://www.w3.org/TR/2015/CR-tabular-data-model-20150716/

Metadata Vocabulary for Tabular Data: a JSON-based format for expressing metadata about tabular data to inform validation, conversion, display and data entry for tabular data
http://www.w3.org/TR/2015/CR-tabular-metadata-20150716/

Generating JSON from Tabular Data on the Web: how to convert tabular data into JSON
http://www.w3.org/TR/2015/CR-csv2json-20150716/

Generating RDF from Tabular Data on the Web: how to convert tabular data into RDF
http://www.w3.org/TR/2015/CR-csv2rdf-20150716/

See the blog post for more links including an extensive set of test cases, our GitHub repo and the mailing list for feedback.

Also note that the approach takes CSV as its central stereotypical use case but should apply to many other tabular data-sharing approaches too (e.g. most obviously tab separated). So if you prefer tab-separated files to comma-separated, do please take a look!
The Model spec defines that common model, the metadata document defines terminology for talking about instances of that model, and the last two specs apply this approach to the problem of mapping tables into JSON and/or RDF.

The group expects to satisfy the implementation goals (i.e., at least two independent implementations for each of the test cases) by October 30, 2015.

Please take a look, and pass this along to other groups who may be interested.

cheers,
Dan
for the CSVW WG

p.s. since I'm writing I'll indulge myself and share my personal favourite part, which is the ability (in the csv2rdf doc) to map from rows in a table via templates into RDF triples. This is a particularly interesting/important facility and worth some attention. Normally I wouldn't enthuse over (yet another) new RDF syntax but the ability to map tabular data into triples via out-of-band mappings is very powerful. BTW the group gave some serious consideration to applying R2RML here (see docs and GitHub/wiki for details); however, given the subtle differences between SQL and CSV environments, we have taken a different approach. Anyway please take a look!
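To make the p.s. concrete, here is a rough sketch of the general idea of mapping table rows to triples through templates kept outside the data file. It is not an implementation of the csv2rdf specification: the function name `rows_to_triples`, the `example.org` URIs, and the use of Python's `str.format` as the template mechanism are all assumptions made for illustration.

```python
import csv
import io

# Hypothetical templates, kept separate from the CSV itself ("out-of-band").
ABOUT_TEMPLATE = "http://example.org/book/{id}"
PROPERTY_TEMPLATE = "http://example.org/vocab#{column}"


def rows_to_triples(csv_text, about_template=ABOUT_TEMPLATE,
                    property_template=PROPERTY_TEMPLATE):
    """Turn each non-empty cell into a (subject, predicate, object) tuple."""
    reader = csv.DictReader(io.StringIO(csv_text))
    triples = []
    for row in reader:
        # The subject URI is built from the row's own cell values.
        subject = about_template.format(**row)
        for column, value in row.items():
            if not value:
                continue  # skip empty or missing cells
            predicate = property_template.format(column=column)
            triples.append((subject, predicate, value))
    return triples


sample = "id,title\n1,Moby Dick\n2,Emma\n"
for triple in rows_to_triples(sample):
    print(triple)
```

The point of the sketch is only that the CSV stays untouched while the mapping lives in a separate description; the specs define their own template syntax and metadata format for this.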
[CODE4LIB] Job: Web Developer at Skokie Public Library
Web Developer
Skokie Public Library
Skokie

Put your web development skills to work for an award-winning library and a thriving, diverse community. As a key member of the Virtual Community Engagement department, you'll help build user-centered websites and services to support the library's mission and respond to the community's needs.

Our ideal candidate will be familiar with best practices in front-end web development, modern tools like Gulp and Sass, responsive and user-centered design, version control, and frameworks like Django. You should also be comfortable with the command line, servers, databases, and deployment tools.

We're looking for candidates who are familiar with the following:

* Modern web development with HTML, CSS, and JavaScript
* Responsive and user-centered web design
* Version control with Git and GitHub
* Building sites with frameworks like Django
* Command line, servers, databases, and deployment
* Working with third-party APIs and web services

Our ideal candidate will thrive working in a collaborative environment with other team members and be willing to juggle multiple projects and learn on the job. You'll be a full participant in the design and development process from user research to deployment.

You need a bachelor's degree, preferably in Computer Science or a related area. An understanding of and familiarity with libraries is a plus.

Salary range: $53,274.00 - $79,911.00 Annually

Apply online at: [http://agency.governmentjobs.com/skokielibrary/default.cfm](http://agency.governmentjobs.com/skokielibrary/default.cfm)

Brought to you by code4lib jobs: http://jobs.code4lib.org/job/22107/
To post a new job please visit http://jobs.code4lib.org/
[CODE4LIB] Looking for Ideas on Line Breaks in OCR Text
Hi Code4Lib folks,

I was wondering if anyone had some experience cleaning up OCR text. In particular, I am trying to figure out how to deal with the random line breaks that come from OCR. I am trying to parse out a bibliography with regex. I think I've figured out which queries I need to run to break it up so I can make it into a tab-delimited text file, but I noticed that the text does the classic OCR thing of inserting line breaks where they physically are on the page. This will obviously be a bit of an issue, since it would break each annotation into a bunch of lines rather than leaving it as one block that I can manipulate into a database. So I am wondering if anyone who has worked with OCR text before has a suggested way to clean up those line breaks without doing 300+ pages by hand? Any thoughts would be welcome.

Matt Sherman
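One possible approach to the line-break problem described above, sketched in Python. It assumes that the OCR output keeps a blank line between bibliography entries and that a line ending in a hyphen is a word split across lines; neither assumption comes from the original post, and the function name `unwrap_ocr_text` is made up for this example.

```python
import re


def unwrap_ocr_text(text: str) -> str:
    """Join page-layout line breaks while keeping blank-line entry breaks."""
    # Normalize Windows and old Mac line endings to "\n".
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Rejoin words hyphenated across a line break: "biblio-\ngraphy".
    text = re.sub(r"(\w)-\n(\w)", r"\1\2", text)
    # Replace a lone newline (not part of a blank-line gap) with a space.
    text = re.sub(r"(?<!\n)\n(?!\n)", " ", text)
    # Collapse runs of spaces and tabs left over from the joins.
    text = re.sub(r"[ \t]+", " ", text)
    return text.strip()


sample = (
    "Smith, J. A biblio-\ngraphy of early\nprinting.\n\n"
    "Jones, K. Another\nentry."
)
print(unwrap_ocr_text(sample))
```

The hyphen rule will also merge genuine compounds that happen to break at a line end (for example "tab-" followed by "delimited"), so it is worth spot-checking a sample of pages before running it over the whole text.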
[CODE4LIB] Thomson Reuters and Impact Factors
Hi all, If I wanted to subscribe to up-to-date impact factor information from Thomson Reuters, which product would I need to purchase (JCR, InCites, ESI, etc.) and is there a general ballpark for price? Thanks! Dave
[CODE4LIB] Registration now open: VuFind Summit 2015
Hello, everyone - Registration is now open for the 2015 VuFind Summit held Monday October 12 and Tuesday October 13, 2015 at Villanova University (in Villanova, PA). Registration will be $45 for two days of events, with breakfast/lunch included. You can register here: http://vustuff.org/vustuff/vufind-summit-2015-registration As usual, the event will be a combination of structured talks, planning sessions and free-form hacking. Registration is open until October 1, 2015. We also have an open call for talk proposals which runs until September 18, 2015. Speakers get free registration! Additionally, thanks to a generous donation by Agilent Technologies, Inc., we may be able to assist some attendees with expenses. Please contact me directly if you would like to attend but need financial assistance. I hope to see some of you in October. More information: https://vufind.org/wiki/vufind_summit_2015 - Demian
[CODE4LIB] Koha Educator Job Opening - ByWater Solutions
Hello all, ByWater Solutions is looking for a new Koha Educator. Full job ad here: http://bywatersolutions.com/2015/08/03/koha-educator-job/ Thanks a bunch! Nicole C. Engard VP of Community Outreach ByWater Solutions
[CODE4LIB] Job: Technology Innovation and Strategy Senior Software Engineer at University of Texas at Austin
Technology Innovation and Strategy Senior Software Engineer
University of Texas at Austin
Austin

**The real story:** Developer for the Digital Preservation Network project. http://www.dpn.org/ Texas is one of the five founding nodes of the Digital Preservation Network, and we need a software engineer who can work both locally and with developers across the country. Duties will eventually also include work just for UT Austin, but the first 12-18 months will likely be getting the DPN up and running across North America.

**Purpose** Provide software development in support of digital curation activities at the University of Texas Libraries.

**Essential Functions** Perform software development and analysis including testing, debugging and training. Perform systems design, analysis, and integration with Library infrastructure. Conduct project and status meetings with Library staff and the campus community.

**Marginal/Incidental functions** Other related functions as assigned.

**Required qualifications** Possesses and applies comprehensive knowledge of application programming analysis. Demonstrated ability to analyze, test, and debug complex code. Strong verbal and written communication skills. Strong interpersonal skills. Strong organizational skills with attention to detail. Skilled at training technical staff. Demonstrated ability to work effectively, both independently and collaboratively, in a team-oriented environment. Bachelor's degree or equivalent experience. Three years of experience in application programming, analysis and system design. Experience with object oriented and procedural programming paradigms. Experience with SQL and RDBMS systems. Professional experience with programming languages such as Java, PHP, Ruby, and Python. Professional experience with web technologies including XML, XSLT, and REST. Professional experience working on Unix systems. Equivalent combination of relevant education and experience may be substituted as appropriate.
**Preferred Qualifications** More than five years of experience in application programming, analysis and system design. Experience with metadata standards related to digital libraries and institutional repositories. Experience with digital preservation standards and practices. Experience in a digital library environment. Experience with Agile software development methodologies. Experience with user interface design. Demonstrated potential for leadership in university, state, national, and international programs and initiatives. Evidence of excellent written and verbal communication and problem solving skills. Evidence of initiative, creative energy and resourcefulness. Proven ability to work collaboratively and efficiently in a fast-paced and evolving technological and team-oriented environment. Proven ability to be flexible in work assignments and environments.

**Working conditions**
May work around standard office conditions
Repetitive use of a keyboard at a workstation
Use of manual dexterity
Lifting and moving
Work performed on current multiple projects under pressure of deadlines or time limitations.
Ability to work under stress, work as a team member and independently.
Commitment to provide excellent customer service.

Brought to you by code4lib jobs: http://jobs.code4lib.org/job/22086/
To post a new job please visit http://jobs.code4lib.org/
[CODE4LIB] Job: Digital Collections Manager at State Library of North Carolina
Digital Collections Manager
State Library of North Carolina
Raleigh

Apply for this position at [http://agency.governmentjobs.com/northcarolina/default.cfm?action=viewJob&jobID=1209118](http://agency.governmentjobs.com/northcarolina/default.cfm?action=viewJob&jobID=1209118)

The State Library of North Carolina's Government Heritage Library (GHL) is looking for an experienced digital librarian who is passionate about digital preservation, committed to ensuring that digital government publications remain available to the public into the future, has innovative ideas, and has a strong customer service ethic. While this is a non-supervisory position, it coordinates the GHL's digital preservation activities, and actively participates in digital collection development. The position is in the Content and Information Delivery Services branch of the GHL.

Job Class Title: Library Professional
Working Title: Library Digital Collections Manager
Position Number: 60083837
Department: Dept of Cultural Resources
Recruitment Range: $34,361 - $57,870
Salary Grade / Salary Grade Equivalent: 74
Competency Level: Advanced
Appointment Type: Permanent Full-Time
Pos. Location: Wake County, North Carolina
Opening Date: Mon. 08/03/15
Closing Date/Time: Fri. 08/21/15 5:00 PM Eastern Time

Description of Work: The NC Government Heritage Library (GHL) seeks an enthusiastic, innovative, and detail-oriented Digital Collections Manager (Librarian) to support the GHL's work to preserve and provide access to the valuable resources of North Carolina government. As part of the Content and Information Delivery Services branch, this position coordinates the GHL's access and preservation repositories, digital preservation education and outreach initiatives, and specialized digital projects. Develops and delivers in-person and online training. Develops policies, guidelines, best practices, and workflows regarding digital information management.
Researches, tests, and makes recommendations regarding digital preservation and access tools as they concern born-digital and digitized publications. Serves as the liaison between vendors and the GHL and State Archives for select systems, including CONTENTdm and DuraCloud. Position works closely with staff throughout the GHL and State Archives on collaborative projects and initiatives, including the NC Digital Collections, Digital Preservation Education site, and State Government Website Archiving Access Program. Continuously works to improve access to digital materials, and advises on copyright and intellectual property rights, as appropriate. Supports the research and development of grants concerning digital information management and digital projects. Provides in-person and virtual customer assistance, which includes occasional Saturday work.

Knowledge, Skills and Abilities / Competencies: Demonstrated thorough knowledge of and hands-on experience with current practices, policies, and standards in library science and digital information practices and curation concepts such as the OAIS framework, distributed storage, open-source tools for validation, normalization, and authentication. Thorough knowledge of and experience with digital library collection management, including collection identification and selection, file structures, metadata schemes, controlled vocabularies, and web interfaces. Knowledge and understanding of content and digital asset management systems (e.g. Drupal, CONTENTdm), digital preservation tools and systems, and web scripting languages (e.g. HTML, PHP, JavaScript). Knowledge and understanding of intellectual rights and copyright laws. Strong written and verbal skills and demonstrated ability to communicate complex technical issues to a variety of audiences. Strong customer service skills. Experience developing and delivering in-person training and/or creating online tutorials.
Demonstrated ability to cultivate strong working relationships, to act as a lead on partnerships and collaborations, and to work independently as well as in a collaborative environment. Prefer experience with responsive web design, linked data, usability testing, and reference and customer service.

Minimum Education and Experience Requirements: Master's degree in library and information science from an appropriately accredited (ALA) institution or regionally accredited program in North Carolina. Experience as a professional librarian in the area of digital libraries, web services, or library systems and technology support preferred.

Supplemental and Contact Information: **APPLICATION MUST BE COMPLETED ONLINE USING THE WEBSITE PROVIDED BELOW** [http://agency.governmentjobs.com/northcarolina/default.cfm?action=viewJob&jobID=1209118](http://agency.governmentjobs.com/northcarolina/default.cfm?action=viewJob&jobID=1209118)

To apply for this position, please click the APPLY link on the URL provided above. In order to receive credit for your work history and