[CODE4LIB] Fwd: new W3C CSV on the Web specs, now at Candidate Recommendation stage - please implement!

2015-08-03 Thread Karen Coyle
For those who have to work with output from spreadsheets. And I wonder 
if this doesn't interact with OpenRefine output as well.


kc

 Forwarded Message 
Subject: new W3C CSV on the Web specs, now at Candidate Recommendation 
stage -  please implement!

Resent-Date: Tue, 28 Jul 2015 14:33:21 +
Resent-From: public-voc...@w3.org
Date: Tue, 28 Jul 2015 15:32:52 +0100
From: Dan Brickley dan...@google.com
To: public-...@w3.org, semantic-...@w3.org Web semantic-...@w3.org, 
W3C Web Schemas Task Force public-voc...@w3.org


Hi! Short version: Please see
http://www.w3.org/blog/news/archives/4830 for the Candidate
Recommendation specs from W3C's CSV on the Web group -
https://www.w3.org/2013/csvw/wiki/Main_Page

Long version:

These are the 4 docs,

Model for Tabular Data and Metadata on the Web—an abstract model for
tabular data, and how to locate metadata that enables users to better
understand what the data holds; this specification also contains
non-normative guidance on how to parse CSV files
http://www.w3.org/TR/2015/CR-tabular-data-model-20150716/

Metadata Vocabulary for Tabular Data—a JSON-based format for
expressing metadata about tabular data to inform validation,
conversion, display and data entry for tabular data
http://www.w3.org/TR/2015/CR-tabular-metadata-20150716/

Generating JSON from Tabular Data on the Web—how to convert tabular
data into JSON
http://www.w3.org/TR/2015/CR-csv2json-20150716/

Generating RDF from Tabular Data on the Web—how to convert tabular
data into RDF
http://www.w3.org/TR/2015/CR-csv2rdf-20150716/

See the blog post for more links including an extensive set of test
cases, our GitHub repo and the mailing list for feedback. Also note
that the approach takes CSV as its central stereotypical use case but
should apply to many other tabular data-sharing approaches too (e.g.
most obviously tab separated). So if you prefer tab-separated files to
comma-separated, do please take a look! The Model spec defines that
common model, the metadata document defines terminology for talking
about instances of that model, and the last two specs apply this
approach to the problem of mapping tables into JSON and/or RDF.

The group expects to satisfy the implementation goals (i.e., at least
two, independent implementations for each of the test cases) by
October 30, 2015. Please take a look, and pass this along to other
groups who may be interested.

cheers,

Dan

for the CSVW WG


p.s. since I'm writing I'll indulge myself and share my personal
favourite part, which is the ability (in the csv2rdf doc) to map from
rows in a table via templates into RDF triples. This is a particularly
interesting/important facility and worth some attention. Normally I
wouldn't enthuse over (yet another) new RDF syntax but the ability to
map tabular data into triples via out-of-band mappings is very
powerful. BTW the group gave some serious consideration to applying
R2RML here (see docs and github/wiki for details), however given the
subtle differences between SQL and CSV environments we have taken a
different approach. Anyway please take a look!


[CODE4LIB] Job: Web Developer at Skokie Public Library

2015-08-03 Thread jobs
Web Developer
Skokie Public Library
Skokie

Put your web development skills to work for an award-winning library and a
thriving, diverse community. As a key member of the Virtual Community
Engagement department, you'll help build user-centered websites and services
to support the library's mission and respond to the community's needs. Our
ideal candidate will be familiar with best practices in front-end web
development, modern tools like Gulp and Sass, responsive and user-centered
design, version control, and frameworks like Django. You should also be
comfortable with the command line, servers, databases, and deployment tools.

  
We're looking for candidates who are familiar with the following:

  * Modern web development with HTML, CSS, and JavaScript
  * Responsive and user-centered web design
  * Version control with Git and Github
  * Building sites with frameworks like Django
  * Command line, servers, databases, and deployment
  * Working with third-party APIs and web services
  
Our ideal candidate will thrive working in a collaborative environment with
other team members and be willing to juggle multiple projects and learn on the
job. You'll be a full participant in the design and development process from
user research to deployment.

  
You need a bachelor's degree, preferably in Computer Science or a related
area. An understanding of and familiarity with libraries is a plus.

  
Salary range:$53,274.00 - $79,911.00 Annually

  
Apply online at:[http://agency.governmentjobs.com/skokielib
rary/default.cfm](http://agency.governmentjobs.com/skokielibrary/default.cfm)



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/22107/
To post a new job please visit http://jobs.code4lib.org/


[CODE4LIB] Looking for Ideas on Line Breaks in OCR Text

2015-08-03 Thread Matt Sherman
Hi Code4Lib folks,

I was wondering if anyone had some experience cleaning up OCR text.
Particularly I am trying to figure out how I can deal with the random
line breaks that come from OCR.  I am trying to parse out a
bibliography with regex.  I think I've figured out which queries I
need to run to break it up so I can make it into a tab delimited text
file but I noticed that the text does the classic thing of OCR
inserting line breaks where they physically are on the page.  This
will obviously be a bit of an issue since it would break the
annotation into a bunch of lines rather than leaving it one block so I
can manipulate it into a database.  So I am wondering if anyone who
has worked with OCR text before has a suggested way to clean up those
line breaks without doing 300 + pages by hand?  Any thoughts would be
welcome.

Matt Sherman


[CODE4LIB] Thomson Reuters and Impact Factors

2015-08-03 Thread davesgonechina
Hi all,

If I wanted to subscribe to up-to-date impact factor information from
Thomson Reuters, which product would I need to purchase (JCR, InCites, ESI,
etc.) and is there a general ballpark for price?

Thanks!
Dave


[CODE4LIB] Registration now open: VuFind Summit 2015

2015-08-03 Thread Demian Katz
Hello, everyone -

Registration is now open for the 2015 VuFind Summit held Monday October 12 and 
Tuesday October 13, 2015 at Villanova University (in Villanova, PA). 
Registration will be $45 for two days of events, with breakfast/lunch included. 
You can register here:

http://vustuff.org/vustuff/vufind-summit-2015-registration

As usual, the event will be a combination of structured talks, planning 
sessions and free-form hacking.

Registration is open until October 1, 2015. We also have an open call for talk 
proposals which runs until September 18, 2015. Speakers get free registration!

Additionally, thanks to a generous donation by Agilent Technologies, Inc., we 
may be able to assist some attendees with expenses. Please contact me directly 
if you would like to attend but need financial assistance.

I hope to see some of you in October.

More information:

https://vufind.org/wiki/vufind_summit_2015

- Demian


[CODE4LIB] Koha Educator Job Opening - ByWater Solutions

2015-08-03 Thread Nicole Engard
Hello all,

ByWater Solutions is looking for a new Koha Educator.  Full job ad here:
http://bywatersolutions.com/2015/08/03/koha-educator-job/

Thanks a bunch!
Nicole C. Engard
VP of Community Outreach
ByWater Solutions


[CODE4LIB] Job: Technology Innovation and Strategy Senior Software Engineer at University of Texas at Austin

2015-08-03 Thread jobs
Technology Innovation and Strategy Senior Software Engineer
University of Texas at Austin
Austin

**The real story: **  
Developer for the Digital Preservation Network project.
http://www.dpn.org/

Texas is one of the five founding nodes of the Digital Preservation Network,
and we need a software engineer who can work both locally and with developers
acorss the country. Duties will eventually also include
work just for UT Austin, but the first 12 -18 months will likely be getting
the DPN up and running across North America.

  
  
**Purpose**  
Provide software development in support of digital curation activities at the
University of Texas Libraries.

  
**Essential Functions**  
Perform software development and analysis including testing, debugging and
training. Perform systems design, analysis, and integration with Library
infrastructure. Conduct project and status meetings with Library staff and the
campus community.

**Marginal/Incidental functions**  
Other related functions as assigned.

  
**Required qualifications**  
Possesses and applies comprehensive knowledge of application programming
analysis. Demonstrated ability to analyze, test, and debug complex code.
Strong verbal and written communication skills. Strong interpersonal skills.
Strong organizational skills with attention to detail. Skilled at training
technical staff. Demonstrated ability to work effectively, both independently
and collaboratively, in a team-oriented environment. Bachelor's degree or
equivalent experience. Three years of experience in application programming,
analysis and system design. Experience with object oriented and procedural
programming paradigms. Experience with SQL and RDBMS systems. Professional
experience with programming languages such as Java, PHP, Ruby, and Python.
Professional experience with web technologies including XML, XSLT, and REST.
Professional experience working on Unix systems. Equivalent combination of
relevant education and experience may be substituted as appropriate.

  
**Preferred Qualifications**  
More than five years of experience in application programming, analysis and
system design. Experience with metadata standards related to digital libraries
and institutional repositories. Experience with digital preservation standards
and practices. Experience in a digital library environment. Experience with
Agile software development methodologies. Experience with user interface
design. Demonstrated potential for leadership in university, state, national,
and international programs and initiatives. Evidence of excellent written and
verbal communication and problem solving skills. Evidence of initiative,
creative energy and resourcefulness. Proven ability to work collaboratively
and efficiently in a fast-paced and evolving technological and team-oriented
environment. Proven ability to be flexible in work assignments and
environments.

  
**Working conditions**  
May work around standard office conditions Repetitive use of a keyboard at a
workstation Use of manual dexterity Lifting and moving Work performed on
current multiple projects under pressure of deadlines or time limitations.
Ability to work under stress, work as a team member and independently.
Commitment to provide excellent customer service.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/22086/
To post a new job please visit http://jobs.code4lib.org/


[CODE4LIB] Job: Digital Collections Manager at State Library of North Carolina

2015-08-03 Thread jobs
Digital Collections Manager
State Library of North Carolina
Raleigh

Apply for this position at [http://agency.governmentjobs.com/northcarolina/def
ault.cfm?action=viewJobjobID=1209118](http://agency.governmentjobs.com/northc
arolina/default.cfm?action=viewJobjobID=1209118)

  
The State Library of North Carolina's Government  Heritage Library (GHL) is
looking for an experienced digital librarian who is passionate about digital
preservation, committed to ensuring that digital government publications
remain available to the public into the future, has innovative ideas, and has
a strong customer service ethic.

  
While this is a non-supervisory position, it coordinates the GHL's digital
preservation activities, and actively participates in digital collection
development. The position is in the Content and Information Delivery Services
branch of the GHL.

  
Job Class Title: Library Professional

Working Title:
Library Digital Collections Manager

Position Number: 60083837

Department: Dept of Cultural Resources

Recruitment Range:
$34,361 - $57,870

Salary Grade / Salary Grade Equivalent: 74

Competency Level: Advanced

Appointment Type:
Permanent Full-Time

Pos. Location: Wake
County, North Carolina

Opening Date: Mon. 08/03/15

Closing Date/Time: Fri. 08/21/15 5:00 PM Eastern Time

  
Description of Work:

The NC Government  Heritage Library (GHL) seeks an enthusiastic, innovative,
and detail-oriented Digital Collections Manager (Librarian) to support the
GHL's work to preserve and provide access to the valuable resources of North
Carolina government. As part of the Content and Information Delivery Services
branch, this position coordinates the GHL's access and preservation
repositories, digital preservation education and outreach initiatives, and
specialized digital projects. Develops and delivers in-person and online
training. Develops policies, guidelines, best practices, and workflows
regarding digital information management. Researches, tests, and makes
recommendations regarding digital preservation and access tools as they
concern born-digital and digitized publications. Serves as the liaison between
vendors and the GHL and State Archives for select systems, including CONTENTdm
and DuraCloud. Position works closely with staff throughout the GHL and State
Archives on collaborative projects and initiatives, including the NC Digital
Collections, Digital Preservation Education site, and State Government Website
Archiving  Access Program. Continuously works to improve access to digital
materials, and advises on copyright and intellectual property rights, as
appropriate. Supports the research and development of grants concerning
digital information management and digital projects. Provides in-person and
virtual customer assistance, which includes occasional Saturday work.

  
Knowledge, Skills and Abilities / Competencies:

Demonstrated thorough knowledge of and hands-on experience with current
practices, policies, and standards in library science and digital information
practices and curation concepts such as the OAIS framework, distributed
storage, open-source tools for validation, normalization, and authentication.
Thorough knowledge of and experience with digital library collection
management, including collection identification and selection, file
structures, metadata schemes, controlled vocabularies, and web interfaces.
Knowledge and understanding of content and digital asset management systems
(e.g. Drupal, CONTENTdm), digital preservation tools and systems, and web
scripting languages (e.g. HTML, PHP, Javascript). Knowledge and understanding
of intellectual rights and copyright laws. Strong written and verbal skills
and demonstrated ability to communicate complex technical issues to a variety
of audiences. Strong customer service skills. Experience developing and
delivering in-person training and/or creating online tutorials. Demonstrated
ability to cultivate strong working relationships, to act as a lead on
partnerships and collaborations, and to work independently as well as in a
collaborative environment. Prefer experience with responsive web design,
linked data, usability testing, and reference and customer service.

  
Minimum Education and Experience Requirements:

Master's degree in library and information science from an appropriately
accredited (ALA) institution or regionally accredited program in North
Carolina. Experience as a professional librarian in the area of digital
libraries, web services, or library systems and technology support preferred.

  
Supplemental and Contact Information:

**APPLICATION MUST BE COMPLETED ONLINE USING THE WEBSITE PROVIDED BELOW**  
[http://agency.governmentjobs.com/northcarolina/default.cfm?action=viewJobjob
ID=1209118](http://agency.governmentjobs.com/northcarolina/default.cfm?action=
viewJobjobID=1209118)

  
To apply for this position, please click the APPLY link on the URL provided
above. In order to receive credit for
your work history and