[CODE4LIB] OAI-PMH to JSON conversion

2014-06-20 Thread Mitar
Hi!

I made a node.js module which helps with OAI-PMH to JSON conversion:

https://github.com/peerlibrary/node-xml4js

It makes such nice JSON:

https://github.com/peerlibrary/node-xml4js/blob/master/tests/arxiv/GetRecord-arXivRaw.json


Mitar

-- 
http://mitar.tnode.com/
https://twitter.com/mitar_m


[CODE4LIB] ALA 2014 announcement --ALCTS CRS Holdings Information Committee

2014-06-20 Thread Violeta Ilik
***Please excuse the cross posting***


Please join the ALCTS CRS Holdings Information Committee at the ALA Annual 
Conference in Las Vegas on Saturday, June 28th from 3:00-4:00 p.m. in the Las 
Vegas Convention Center, Room S229. We have three presentations demonstrating 
three different perspectives on the manipulation and use of holdings data:


MarcEdit and OCLC Integration

Terry Reese, the developer of the MarcEdit software program, will discuss 
MarcEdit and current integration opportunities, focusing on the new OCLC 
Metadata API service. He will demonstrate how MarcEdit has been able to utilize 
that service to provide deep integration to support direct and local 
bibliographic editing of WorldCat data directly within MarcEdit, as well as 
support batch holdings management through the interface.  The presentation will 
discuss the current capabilities, some gaps, and how interested individuals 
could get started.


BIBFRAME, ISSN, and the Future of Serials

Regina Reynolds, Coordinator, U.S. ISSN Center, Library of Congress shares her 
thoughts about the ISSN and the future of serials description and access as we 
transition from MARC21 to BIBFRAME.  The future of the ISSN is intimately bound 
to the future of CRs. How are these resources changing, and what are some 
impacts of those changes?  The presentation will include a high-level view of 
changes in continuing resources, challenges faced by BIBFRAME in modeling 
serials, and an introduction of a model for serials, PRESSoo, developed by the 
ISSN International Centre and the Bibliothèque nationale de France. Some 
potential benefits of a linked data approach to serials and other continuing 
resources will also be presented.


The AUDITOR and holdings verification

Josh Pyle, CTO and President of Dublin Six, will discuss holdings verification 
through the use of The AUDITOR software package. Installed on an institution’s 
network, The Auditor passively monitors all network traffic and makes libraries 
aware of every form of content that is being used and presents this information 
in COUNTER-style reports. The Auditor’s reports include content from every 
publisher on every platform, and also reports on content from open access or 
free information sources. Sophisticated click-stream analysis infers library 
holdings in real-time. Unexpected access denials, including broken links, are 
detected immediately.

We look forward to seeing you there!


On behalf of the ALCTS CRS Holdings Committee,

Violeta Ilik | Assistant Professor
Semantic Technologies Librarian
Office of Scholarly Communication
Texas AM University Libraries
5000 TAMU | College Station, TX 77843

Tel. 979.862.4661 Email: vilik at library.tamu.edu


Re: [CODE4LIB] OAI-PMH to JSON conversion

2014-06-20 Thread Maarten Brinkerink
Dear Mitar,

Looks really interesting! Is your tool open source?

Best,

Maarten

Op 20 jun. 2014, om 08:10 heeft Mitar mmi...@gmail.com het volgende 
geschreven:

 Hi!
 
 I made a node.js module which helps with OAI-PMH to JSON conversion:
 
 https://github.com/peerlibrary/node-xml4js
 
 It makes such nice JSON:
 
 https://github.com/peerlibrary/node-xml4js/blob/master/tests/arxiv/GetRecord-arXivRaw.json
 
 
 Mitar
 
 -- 
 http://mitar.tnode.com/
 https://twitter.com/mitar_m


[CODE4LIB] DLF Forum Call for Proposals - due Sunday!

2014-06-20 Thread Louisa Kwasigroch
The DLF Forum (http://www.diglib.org/forums/2014forum/) is an annual meeting 
where the digital library community comes together to discover better methods 
of working through sharing and collaboration. It serves as a resource and 
catalyst among digital library developers, project managers, and all who are 
invested in digital library issues.

We are currently seeking proposals for the 2014 DLF Forum program. The Program 
Planning Committee requests proposals within the broad framework of digital 
collections, infrastructure, resources, and organizational priorities. You do 
not need to be part of a member organization in order to submit a proposal.

The Forum traditionally has no overarching theme so that we can craft a program 
that speaks to current issues of interest to our community. We depend on 
contributors to focus proposals on action-oriented topics targeted towards a 
practitioner audience, considering the aspects of design, management, 
assessment, and collaboration

Suggested topical areas for 2014 include:

  *   Digital resources, including research data and archival collections
  *   User services / UX
  *   Systems architecture, both hardware and code

This is not a prescriptive list; we encourage you to be creative, 
collaborative, and collegial.

Proposals are due this Sunday, June 22. For more information and to submit your 
proposal, please visit http://www.diglib.org/forums/2014forum/cfp/.

Please share widely. Apologies for cross-posting.


Louisa Kwasigroch

Senior Program Associate - Digital Library Federation

Council on Library and Information Resources

www.diglib.org | www.clir.orghttp://www.clir.org/



2014 DLF Forumhttp://www.diglib.org/forums/2014forum/, Atlanta, October 27-29

Call for Proposalshttp://www.diglib.org/forums/2014forum/cfp/ – Due June 22

Registrationhttp://www.diglib.org/forums/2014forum/registration/ – Early bird 
until June 30


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Richard Wallis
Janifer has been following the thread and has asked me to post the
following on her behalf:

~Richard.

Here is a consolidated response:



Below is an extract from a paper that is in publication that illustrates
the model that is used in ISNI.  Many of the contributors to VIAF are also
adopting this approach.   Marrying the model with actual data is discussed
below the extract.



Meryl Streep has only one public identity that she uses for her creative
works.  VIAF includes Streep, Meryl (Mary Louise) as the name form on her
birth certificate is Streep, Mary Louise.  Her married name Gummer, Mary
Louise, is mentioned in Wikipedia and could also be considered as a name
variant.  Changes in the preferred forenames and change of name due to
marriage are not considered as constituting a change of identity as there
is not necessarily a wish by the person to separate his or her creative
output corresponding with a name change.  Researchers, for example will
mention the ensemble of their publications in their curriculum vitae where
they have been published over a lifetime involving name changes.

Alternative name forms that do not usually indicate a change in identity
can take variant forms including names in different scripts or
transliterated with different schemes, changes due to change of sex,
marriage and deed poll.

Kingsley Amis in the figure above is an example of an author writing under
3 different identities, writing as himself and writing under two different
pseudonyms.  His works attributed to each of the three different identities
are clearly separated.  In this case, the association of the pseudonyms
with the real person is public but that is not always the case.  For
example, J.K. Rowling chose to create a pseudonym Robert Galbraith (35) so
that she could write in a different genre and have her works in the new
genre judged without any influence of the successful Harry Potter
series.When the relationship between a pseudonym and a real person is
or becomes public, the works tend to become associated with the real person
as well as the pseudonym; for example they may be republished or released
in collected works.  One example is John Wyndham, the main pen name of John
Wyndham Parkes Lucas Beynon Harris (10 July 1903 – 11 March 1969).  Under
an earlier pen name, John Beynon, he wrote Secret People but this is now
published under John Wyndham.   As a general rule, separate authority
records should be made for each identity used as a public identity.  This
allows the history of works to be accurately traced, enabling catalogs to
respond to questions such as “under which identity was the work conceived
and first published?”  The authority records related to each of the public
identities should be linked where the relationship is public.
Encyclopedias such as Wikipedia tend to have only one entry for a person,
grouping all public identities.  This entails the need for one-to-many
relationships in linking especially from library authority records to
encyclopedias.



ISNI has a suite of programs that detects pseudonyms coded as name variants
and changes them into related name and generates related identity records.
It is a while since it was run and will be re-run in the next few weeks.
This should change Currer Bell into a related name of Charlotte Brontë .

The Sefton / Waddell I will fix manually.  The End user comment function in
the ISNI public interface enables anyone to submit a query or request for
change on a record.  This generates an email to the Quality Team at the
British Library and Bibliothèque nationale de France and their staff
perform regular manual fixes.




On 20 June 2014 04:58, Stuart Yeates stuart.yea...@vuw.ac.nz wrote:

 In wikipedia, the principal representation for alternative names for
 entities are 'redirects'. The redirect from Catherine Sefton to Martin
 Waddell can be found at https://en.wikipedia.org/w/
 index.php?title=Catherine_Seftonredirect=no (and yes, being a wiki it's
 editable).

 That redirect is annotated that this is a  redirect From an alternative
 name (as opposed to a common spelling mistake or something else) and From
 a printworthy page title (which says to use this redirect when building
 (cross-) indexes etc.).

 To create a link from the Catherine Sefton to an authority control
 system (as distinct from the Martin Waddell link), the redirect can be
 editted include an Authority control template (see
 https://en.wikipedia.org/wiki/Template:Authority_control ), which is the
 same template used for full articles.

 cheers
 stuart




 On 06/19/2014 08:53 PM, Owen Stephens wrote:

 An aside but interesting to see how some of this identity stuff seems to
 be playing out in the wild now. Google for Catherine Sefton:

 https://www.google.co.uk/search?q=catherine+sefton

 The Knowledge Graph displays information about Martin Waddell. Catherine
 Sefton is a pseudonym of Martin Waddell. It is impossible to know, but the
 most likely source of this knowledge 

Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Eric Lease Morgan
On Jun 20, 2014, at 10:31 AM, Richard Wallis richard.wal...@dataliberate.com 
wrote:

 ISNI has a suite of programs that detects pseudonyms coded as name variants
 and changes them into related name and generates related identity records.
 It is a while since it was run and will be re-run in the next few weeks.
 This should change Currer Bell into a related name of Charlotte Brontë .


Please humor me as I ask this question again. What is the difference between 
ISNI and other identifiers systems (like ORCID, etc.)? What distinguishes one 
from another? As a librarian, why should I care? Was as a faculty 
member/scholar, why should I care? Under what context is one identifier 
expected to be used instead of another? Maybe a picture/graph is in order:

  authority control simple pointer
 +-+--+
  VIAF   |X|  |
 +-+--+
  ORCID  | | X|
 +-+--+
   ISNI  | |  |
 +-+--+

—
Eric Lease Morgan


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Richard Wallis
Hi Eric,

What distinguishes one from another?

The communities behind them, the [often overlapping] communities they
are intended to serve, and the technical implementation.

As a librarian, why should I care?

I would, as a non-librarian, suggest that once you are happy with
the ‘authority’ of them, you shouldn’t have to care. Ideally, we are not
there yet, systems should be flexible and accommodating enough to link to
any appropriate authority.

I will probably get flamed for over generalisation here but - VIAF is
an aggregation of National Libraries Authority files.  - ISNI is a more
publisher focused but similar effort.  - OCID comes from and and tries to
serve individual academic institutions, their researchers and falsity
authors.


  authority control |simple identifier |Linked Data capability
 +-+--+--+
  VIAF   |X|X |  X   |
 +-+--+--+
  ORCID  | | X|  |
 +-+--+--+
   ISNI  |X| X|X |
 +-+--+--+

~Richard


On 20 June 2014 15:42, Eric Lease Morgan emor...@nd.edu wrote:

 On Jun 20, 2014, at 10:31 AM, Richard Wallis 
 richard.wal...@dataliberate.com wrote:

  ISNI has a suite of programs that detects pseudonyms coded as name
 variants
  and changes them into related name and generates related identity
 records.
  It is a while since it was run and will be re-run in the next few weeks.
  This should change Currer Bell into a related name of Charlotte Brontë .


 Please humor me as I ask this question again. What is the difference
 between ISNI and other identifiers systems (like ORCID, etc.)? What
 distinguishes one from another? As a librarian, why should I care? Was as a
 faculty member/scholar, why should I care? Under what context is one
 identifier expected to be used instead of another? Maybe a picture/graph is
 in order:

   authority control simple pointer
  +-+--+
   VIAF   |X|  |
  +-+--+
   ORCID  | | X|
  +-+--+
ISNI  | |  |
  +-+--+

 —
 Eric Lease Morgan




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Renate Morgenstern

will be there just now.

Renate
On Fri, 20 Jun 2014 15:56:28 +0100
 Richard Wallis richard.wal...@dataliberate.com wrote:

  Hi Eric,

  What distinguishes one from another?

  The communities behind them, the [often overlapping] communities they
  are intended to serve, and the technical implementation.

  As a librarian, why should I care?

  I would, as a non-librarian, suggest that once you are happy with
  the ‘authority’ of them, you shouldn’t have to care. Ideally, we are not
  there yet, systems should be flexible and accommodating enough to link to
  any appropriate authority.

  I will probably get flamed for over generalisation here but - VIAF is
  an aggregation of National Libraries Authority files.  - ISNI is a more
  publisher focused but similar effort.  - OCID comes from and and tries to
  serve individual academic institutions, their researchers and falsity
  authors.


   authority control |simple identifier |Linked Data capability
  +-+--+--+
   VIAF   |X|X |  X   |
  +-+--+--+
   ORCID  | | X|  |
  +-+--+--+
ISNI  |X| X|X |
  +-+--+--+

  ~Richard


  On 20 June 2014 15:42, Eric Lease Morgan emor...@nd.edu wrote:


  On Jun 20, 2014, at 10:31 AM, Richard Wallis 
  richard.wal...@dataliberate.com wrote:

   ISNI has a suite of programs that detects pseudonyms coded as name
  variants
   and changes them into related name and generates related identity
  records.
   It is a while since it was run and will be re-run in the next few weeks.
   This should change Currer Bell into a related name of Charlotte Brontë .


  Please humor me as I ask this question again. What is the difference
  between ISNI and other identifiers systems (like ORCID, etc.)? What
  distinguishes one from another? As a librarian, why should I care? Was as a
  faculty member/scholar, why should I care? Under what context is one
  identifier expected to be used instead of another? Maybe a picture/graph is
  in order:

authority control simple pointer
   +-+--+
VIAF   |X|  |
   +-+--+
ORCID  | | X|
   +-+--+
 ISNI  | |  |
   +-+--+

  —
  Eric Lease Morgan





  --
  Richard Wallis
 Founder, Data Liberate
  http://dataliberate.com
  Tel: +44 (0)7767 886 005

  Linkedin: http://www.linkedin.com/in/richardwallis
  Skype: richard.wallis1
  Twitter: @rjw


Renate Morgenstern
Tel. 2072607


[CODE4LIB] Job: Library Technology Services Manager at Arlington Public Library System

2014-06-20 Thread jobs
Library Technology Services Manager
Arlington Public Library System
Arlington

The Arlington Public Library, a department of the City of Arlington, TX (pop.
375,438), now the seventh largest city in Texas, is seeking an innovative and
results-oriented Library Technology Services Manager. Located in the
Dallas/Fort Worth Metroplex, the Arlington Public Library has a Central
Library and six branch libraries and is in the process of designing an
exciting new Central Library facility, projected to begin service in
2017. The Library Technology Services Manager is
responsible for directing the day to day operation of library technology
systems, including the functioning of the Library's Polaris ILS, public
computing services (pay for print and computer reservations) and the software
and hardware systems within the department that support the provision of
library service, including purchase and maintenance of equipment and
contract/license maangement for hardware and software, as directed by the
City's Information Technology Department. Successful
candidates will have experience in investigating, evaluating and recommending
new technology applicable to the Library's technology infrastructure,
especially technologies that effectively facilitate self-service and with
short-term planning for technology projects. The ability to
work collaboratively with the City's Information Technology department in
managing technology infrastructure is important. The Arlington Public Library
offers a progressive and creative environment for self-directed librarians who
are interested in building and sustaining strong community partnerships and
planning and implementing unique programs of services that meet community
needs. We are seeking librarians who are adaptable to change, motivated by
innovation, passionate about the impact libraries have on individuals,
families and communities and who want more than a career spent sitting at a
service desk. All of our librarians, regardless of their
supervisory classification are project managers, service and program
designers, technology geeks, marketers, educators and mentors.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/15245/
To post a new job please visit http://jobs.code4lib.org/


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Karen Coyle
I concur with Richard's analysis[1]. Each identifier type serves a 
different community. In particular, ORCID identifiers will tend to 
identify faculty and researchers whose sole output is journal articles 
-- thus who would not normally appear in a library authority file. The 
ISNI is sometimes seen as an interloper from the publishing community, 
but most likely is integrated into the publisher workflow (e.g. writing 
checks to authors).


Like Richard, I don't see anything to worry about. You can use one, 
some, or all of the identifiers based on your needs. So a faculty 
digital repository may need to used ORCIDs because there are authors who 
are only identified by those. (Repositories are beginning to require 
ORCIDs for deposit.) The same repository can also use LCNA for some 
authors -- you are in no way limited to one identifier per person or 
system. If you are hoping to pull in author data in your library catalog 
from wiki/DB/pedia, then you might favor the VIAF identifier, since this 
is being linked to the pedia world.


This seems to me to be quite similar to other data and metadata choices 
that we make: define your use case, then choose the data that meets that 
need.


kc
[1] One possible difference is that I would consider ORCID a viable URI 
for linked data purposes, although at the moment ORCID does not export 
its data in RDF. All of the identifiers listed below are HTTP URIs.


On 6/20/14, 7:56 AM, Richard Wallis wrote:

Hi Eric,

What distinguishes one from another?

The communities behind them, the [often overlapping] communities they
are intended to serve, and the technical implementation.

As a librarian, why should I care?

I would, as a non-librarian, suggest that once you are happy with
the ‘authority’ of them, you shouldn’t have to care. Ideally, we are not
there yet, systems should be flexible and accommodating enough to link to
any appropriate authority.

I will probably get flamed for over generalisation here but - VIAF is
an aggregation of National Libraries Authority files.  - ISNI is a more
publisher focused but similar effort.  - OCID comes from and and tries to
serve individual academic institutions, their researchers and falsity
authors.


   authority control |simple identifier |Linked Data capability
  +-+--+--+
   VIAF   |X|X |  X   |
  +-+--+--+
   ORCID  | | X|  |
  +-+--+--+
ISNI  |X| X|X |
  +-+--+--+

~Richard


On 20 June 2014 15:42, Eric Lease Morgan emor...@nd.edu wrote:


On Jun 20, 2014, at 10:31 AM, Richard Wallis 
richard.wal...@dataliberate.com wrote:


ISNI has a suite of programs that detects pseudonyms coded as name

variants

and changes them into related name and generates related identity

records.

It is a while since it was run and will be re-run in the next few weeks.
This should change Currer Bell into a related name of Charlotte Brontë .


Please humor me as I ask this question again. What is the difference
between ISNI and other identifiers systems (like ORCID, etc.)? What
distinguishes one from another? As a librarian, why should I care? Was as a
faculty member/scholar, why should I care? Under what context is one
identifier expected to be used instead of another? Maybe a picture/graph is
in order:

   authority control simple pointer
  +-+--+
   VIAF   |X|  |
  +-+--+
   ORCID  | | X|
  +-+--+
ISNI  | |  |
  +-+--+

—
Eric Lease Morgan






--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
m: 1-510-435-8234
skype: kcoylenet


Re: [CODE4LIB] OAI-PMH to JSON conversion

2014-06-20 Thread Mitar
Hi!

Yes. Check the LICENSE file in the repository. BSD licensed.


Mitar

On Fri, Jun 20, 2014 at 6:09 AM, Maarten Brinkerink
mbrinker...@beeldengeluid.nl wrote:
 Dear Mitar,

 Looks really interesting! Is your tool open source?

 Best,

 Maarten

 Op 20 jun. 2014, om 08:10 heeft Mitar mmi...@gmail.com het volgende 
 geschreven:

 Hi!

 I made a node.js module which helps with OAI-PMH to JSON conversion:

 https://github.com/peerlibrary/node-xml4js

 It makes such nice JSON:

 https://github.com/peerlibrary/node-xml4js/blob/master/tests/arxiv/GetRecord-arXivRaw.json


 Mitar

 --
 http://mitar.tnode.com/
 https://twitter.com/mitar_m



-- 
http://mitar.tnode.com/
https://twitter.com/mitar_m


[CODE4LIB] Job: Library Systems and Data Strategist at University of Wisconsin-Madison

2014-06-20 Thread jobs
 Library Systems and Data Strategist
University of Wisconsin-Madison
Madison

Hello,

  
I want to point out to you a new job posting for a Data Strategist at the
University of Wisconsin - Madison. The posting is at:

  
http://www.ohr.wisc.edu/WebListing/Unclassified/PVLSummary.aspx?pvl_num=80033

  
Note that we are moving to Alma from Voyager for our library catalog. We also
have a Blacklight-based library catalog interface at:

  
http://search.library.wisc.edu/

  
We are also a Primo site.

  
The Data Strategist will work closely with library and technical staff,
integrated library system vendors, application developers, researchers and
campus partners to further the discovery and delivery of Library and other
data. The Data Strategist will leverage deep knowledge of Library and other
data to develop strategies for organizing and integrating many disparate data
types. The Data Strategist will develop and maintain a thorough understanding
of the functional, operational, and service needs of the Library as well as a
provide a creative, dynamic perspective on provisioning data and services
through other systems.

  
First review of applications will take place on 7/17/2014 and will continue
until the position is filled. Please email resume, cover letter and contact
information for three references to perick...@library.wisc.edu.



Brought to you by code4lib jobs: http://jobs.code4lib.org/job/15251/
To post a new job please visit http://jobs.code4lib.org/


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Eric Lease Morgan
On Jun 20, 2014, at 10:56 AM, Richard Wallis richard.wal...@dataliberate.com 
wrote:

  authority control|simple identifier |Linked Data capability
 +-+--+--+
  VIAF   |X|X |  X   |
 +-+--+--+
  ORCID  | |X |  |
 +-+--+--+
   ISNI  |X|X |  X   |
 +-+--+--+


Increasingly I like linked data, and consequently, here is clarification and a 
question. ORCID does support RDF, but only barely. It can output FOAF-like 
data, but not bibliographic. Moreover, it is experimental, at best:

  curl -L -H 'accept: application/rdf+xml' http://orcid.org/-0002-9952-7800

In what ways does ISNI support linked data? 

—
Eric Morgan


[CODE4LIB] software for a glossary

2014-06-20 Thread Eric Lease Morgan
Do you know of a Web-based tool or piece of desktop software that would let a 
professor post a text in a frame, then highlight words or phrases and link them 
to a glossary? A quick-and-dirty web page (possibly attached) and link below 
illustrates the idea:

  http://dh.crc.nd.edu/tmp/glossary.html

—
Eric Morgan


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Richard Wallis
In what ways does ISNI support linked data?

See: http://www.isni.org/how-isni-works#HowItWorks_LinkedData

~Richard


On 20 June 2014 18:57, Eric Lease Morgan emor...@nd.edu wrote:

 On Jun 20, 2014, at 10:56 AM, Richard Wallis 
 richard.wal...@dataliberate.com wrote:

   authority control|simple identifier |Linked Data capability
  +-+--+--+
   VIAF   |X|X |  X   |
  +-+--+--+
   ORCID  | |X |  |
  +-+--+--+
ISNI  |X|X |  X   |
  +-+--+--+


 Increasingly I like linked data, and consequently, here is clarification
 and a question. ORCID does support RDF, but only barely. It can output
 FOAF-like data, but not bibliographic. Moreover, it is experimental, at
 best:

   curl -L -H 'accept: application/rdf+xml'
 http://orcid.org/-0002-9952-7800

 In what ways does ISNI support linked data?

 —
 Eric Morgan




-- 
Richard Wallis
Founder, Data Liberate
http://dataliberate.com
Tel: +44 (0)7767 886 005

Linkedin: http://www.linkedin.com/in/richardwallis
Skype: richard.wallis1
Twitter: @rjw


Re: [CODE4LIB] software for a glossary

2014-06-20 Thread Chris Markman
I saw a project at the Hacking Journalism hackathon @ MIT recently that
would get the job done:
http://hackingjournalism.challengepost.com/submissions/24271-inline

They were thinking more along the lines of a crowdsource solution for
science news but I'm sure it could adapted to fit your needs!

—Chris


On Fri, Jun 20, 2014 at 2:28 PM, Eric Lease Morgan emor...@nd.edu wrote:

 Do you know of a Web-based tool or piece of desktop software that would
 let a professor post a text in a frame, then highlight words or phrases and
 link them to a glossary? A quick-and-dirty web page (possibly attached) and
 link below illustrates the idea:

   http://dh.crc.nd.edu/tmp/glossary.html

 —
 Eric Morgan



Re: [CODE4LIB] software for a glossary

2014-06-20 Thread Tom Keays
The Web Ahead podcast had an episode that covered the current state of
web annotation. Something there might work.
http://5by5.tv/webahead/60

Crossing the thread over to linked author data, this item made me laugh.
http://w3cmemes.tumblr.com/post/76273506486/dave-started-reviewing-open-annotations-today

On Fri, Jun 20, 2014 at 2:28 PM, Eric Lease Morgan emor...@nd.edu wrote:
 Do you know of a Web-based tool or piece of desktop software that would let a 
 professor post a text in a frame, then highlight words or phrases and link 
 them to a glossary? A quick-and-dirty web page (possibly attached) and link 
 below illustrates the idea:

   http://dh.crc.nd.edu/tmp/glossary.html

 —
 Eric Morgan


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Karen Coyle

On 6/20/14, 11:38 AM, Richard Wallis wrote:

In what ways does ISNI support linked data?

See: http://www.isni.org/how-isni-works#HowItWorks_LinkedData


 accessible by a persistent URI in the form 
isni-url.oclc.nl/isni/000134596520 (for example)  and soon also in 
the form isni.org/isni/000134596520. 


Odd. I assume that whoever wrote that on their page just forgot the 
http://; part of those strings. Right?


kc




~Richard


On 20 June 2014 18:57, Eric Lease Morgan emor...@nd.edu wrote:


On Jun 20, 2014, at 10:56 AM, Richard Wallis 
richard.wal...@dataliberate.com wrote:


  authority control|simple identifier |Linked Data capability
 +-+--+--+
  VIAF   |X|X |  X   |
 +-+--+--+
  ORCID  | |X |  |
 +-+--+--+
   ISNI  |X|X |  X   |
 +-+--+--+


Increasingly I like linked data, and consequently, here is clarification
and a question. ORCID does support RDF, but only barely. It can output
FOAF-like data, but not bibliographic. Moreover, it is experimental, at
best:

   curl -L -H 'accept: application/rdf+xml'
http://orcid.org/-0002-9952-7800

In what ways does ISNI support linked data?

---
Eric Morgan






--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
m: 1-510-435-8234
skype: kcoylenet


Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Joe Hourcle
On Jun 20, 2014, at 4:30 PM, Karen Coyle wrote:

 On 6/20/14, 11:38 AM, Richard Wallis wrote:
 In what ways does ISNI support linked data?
 
 See: http://www.isni.org/how-isni-works#HowItWorks_LinkedData
 
  accessible by a persistent URI in the form 
 isni-url.oclc.nl/isni/000134596520 (for example)  and soon also in the 
 form isni.org/isni/000134596520. 
 
 Odd. I assume that whoever wrote that on their page just forgot the http://; 
 part of those strings. Right?

People think I'm being pedantic when I bitch about the protocol
missing for printed materials (flyers, business cards, etc) ...
but in this case, it's a definite violation of RFC 3986:

  1.1.1.  Generic Syntax
 Each URI begins with a scheme name, as defined in Section 3.1, that
 refers to a specification for assigning identifiers within that
 scheme.  As such, the URI syntax is a federated and extensible naming
 system wherein each scheme's specification may further restrict the
 syntax and semantics of identifiers using that scheme.

Now, it's possible that this whole we don't need to bother with
http://; thing has spilled into the CMS building community, and
they're actively stripping it out.  From their page, I think they're
using Drupal, but the horrible block of HTML that this was in is
blatantly MS Word's 'save as HTML' foulness:

  h2span lang=EN-USa name=HowItWorks_LinkedData/aLinked 
Data/span/h2
  p class=MsoNormalspan lang=EN-USLinked data is part of the ISNI-IA’s 
strategy to make ISNIs freely available and widely diffused.nbsp; Each 
assigned ISNI is accessible by a persistent URI in the form 
isni-url.oclc.nl/isni/000134596520 (for example) nbsp;and soon also in the 
form isni.org/isni/000134596520.nbsp;/span/p
  p class=MsoNormalspan lang=EN-USComing soon:nbsp; ISNI core metadata 
in RDF triples.nbsp; The RDF triples will be embedded in the public web pages 
and the format will be available via the persistent URI and the SRU search 
API./span/p
  p class=MsoNormalspan lang=EN-USnbsp;/span/p

-Joe



 
 On 20 June 2014 18:57, Eric Lease Morgan emor...@nd.edu wrote:
 
 On Jun 20, 2014, at 10:56 AM, Richard Wallis 
 richard.wal...@dataliberate.com wrote:
 
  authority control|simple identifier |Linked Data capability
 +-+--+--+
  VIAF   |X|X |  X   |
 +-+--+--+
  ORCID  | |X |  |
 +-+--+--+
   ISNI  |X|X |  X   |
 +-+--+--+
 
 Increasingly I like linked data, and consequently, here is clarification
 and a question. ORCID does support RDF, but only barely. It can output
 FOAF-like data, but not bibliographic. Moreover, it is experimental, at
 best:
 
   curl -L -H 'accept: application/rdf+xml'
 http://orcid.org/-0002-9952-7800
 
 In what ways does ISNI support linked data?
 
 ---
 Eric Morgan
 
 
 
 
 -- 
 Karen Coyle
 kco...@kcoyle.net http://kcoyle.net
 m: 1-510-435-8234
 skype: kcoylenet


Re: [CODE4LIB] Excel to XML (for a Drupal Feeds import)

2014-06-20 Thread Ryan Engel
Thanks for everyone's ideas.  I think Shaun's solution is exactly what 
I'm trying to do.




Shaun Ellis mailto:sha...@princeton.edu
June 16, 2014 at 10:36 PM
Ryan, it sounds like you simply want to pull two relational tables 
into drupal using referenced entities in a one (question) to many 
(answers) relationship?


This can be accomplished, albeit unintuitively (it is Drupal 
afterall), by using the Feeds CSV parser with Feeds Tamper, and I 
would skip what seems like an unnecessary conversion to XML step.


First, you import your questions and set a GUID target (typically your 
db id).  Then import your answers -- each should have a reference to 
the id mapped using the Entity Reference by Feeds GUID.  I have done 
it in reverse order too, in which case you'd import all your answers 
first. In this case, your second (questions) import needs to contain a 
single column of delimited answer GUIDs into a single column.  Use the 
Feeds Tamper explode plugin on that field during mapping as explained 
(rather vaguely) here:


http://drupal.stackexchange.com/questions/32234/how-to-use-feeds-module-to-import-multi-value-fields 



If that's not what you're trying to do, can you clarify?

-Shaun



Joshua Welker mailto:wel...@ucmo.edu
June 16, 2014 at 2:35 PM
Sorry, the last line got messed up by outlook.

#now save the whole thing as an xml file

with open('myfile.xml', 'wb') as file
ElementTree(rootNode).write(file)


Josh Welker


-Original Message-
From: Joshua Welker [mailto:wel...@ucmo.edu]
Sent: Monday, June 16, 2014 2:32 PM
To: Code for Libraries
Subject: RE: [CODE4LIB] Excel to XML (for a Drupal Feeds import)

This should be quite doable in most programming languages with
out-of-the-box tools and no tricky parsing code. The gist is to save in
Excel as a delimited text file (tab is a good choice), then have your 
script
ingest the document and turn it into an array, and then turn the array 
into

XML. In Python, it could be something like the code below (not tested but
the principles should be sound):

import 'csv'
from elementtree.ElementTree import Element, SubElement

#create a list
mylist = []

#open your delimited file with a csv reader with open('myfile.txt', 
'rb') as

textfile:
reader = csv.reader( textfile, delimiter='\t', quotechat='') #this
assumes your file is tab-delimited (\t)

#loop through rows in your file and save each row as a key/value pair
(dictionary)
for row in textfile:
fields = {
'field1': row[0]
'field2': row[1]
'field3': row[2]
'field4': row[3]
}

#append this row to our master list
mylist.append( fields )


#create an xml root node
rootNode = Element(XmlRoot)

#loop through our list of rows from the text file and create xml 
nodes for

row in mylist:
rowNode = Element(record)

#loop through all the fields on this row and turn them into xml nodes
for fieldName, fieldValue in row:
fieldNode = Element(fieldName)
fieldNode.text = fieldValue

#append each field node to the parent row node
rowNode.append(fieldNode)

#append each row node to the document root node
rootNode.append(rowNode)

#now save the whole thing as an xml file with open('myfile.xml', 'wb') as
file
ElementTree(rootNode).write(file)



Josh Welker

-Original Message-
From: Code for Libraries [mailto:CODE4LIB@LISTSERV.ND.EDU] On Behalf 
Of Kyle

Banerjee
Sent: Monday, June 16, 2014 1:04 PM
To: CODE4LIB@LISTSERV.ND.EDU
Subject: Re: [CODE4LIB] Excel to XML (for a Drupal Feeds import)

I'd just do this the old fashioned way. Awk is great for problems like 
this.

For example, if your file is tab delimited, the following should work

awk '{FS=\t}{if ($2 != ) question = $2;}{print $1,question,$3}''
yourfile

In the example above, I just print the fields but you could easily encase
them in tags.

kyle


Joshua Welker mailto:wel...@ucmo.edu
June 16, 2014 at 2:32 PM
This should be quite doable in most programming languages with
out-of-the-box tools and no tricky parsing code. The gist is to save in
Excel as a delimited text file (tab is a good choice), then have your 
script
ingest the document and turn it into an array, and then turn the array 
into

XML. In Python, it could be something like the code below (not tested but
the principles should be sound):

import 'csv'
from elementtree.ElementTree import Element, SubElement

#create a list
mylist = []

#open your delimited file with a csv reader
with open('myfile.txt', 'rb') as textfile:
reader = csv.reader( textfile, delimiter='\t', quotechat='') #this
assumes your file is tab-delimited (\t)

#loop through rows in your file and save each row as a key/value pair
(dictionary)
for row in textfile:
fields = {
'field1': row[0]
'field2': row[1]
'field3': row[2]
'field4': row[3]
}

#append this row to our master list
mylist.append( fields )


#create an xml root node
rootNode = Element(XmlRoot)

#loop through our list of rows from the text file and create xml nodes
for row in mylist:
rowNode = Element(record)

#loop through all the fields on this row and turn 

Re: [CODE4LIB] Is ISNI / ISO 27729:2012 a name identifier or an entity identifier?

2014-06-20 Thread Karen Coyle

On 6/20/14, 1:49 PM, Joe Hourcle wrote:



Now, it's possible that this whole we don't need to bother with
http://; thing has spilled into the CMS building community, and
they're actively stripping it out.


I actually had the editors of an ALA publication remove http://; 
whenever it preceded www because they were convinced that you only 
needed http://; when there was no www at the front of the domain 
name. I had to fight to get the http://; put back in.  (I believe the 
excuse was that it took up space.)



 From their page, I think they're
using Drupal, but the horrible block of HTML that this was in is
blatantly MS Word's 'save as HTML' foulness:

   h2span lang=EN-USa name=HowItWorks_LinkedData/aLinked 
Data/span/h2
   p class=MsoNormalspan lang=EN-USLinked data is part of the ISNI-IA’s strategy to make ISNIs 
freely available and widely diffused.nbsp; Each assigned ISNI is accessible by a persistent URI in the form 
isni-url.oclc.nl/isni/000134596520 (for example) nbsp;and soon also in the form 
isni.org/isni/000134596520.nbsp;/span/p
   p class=MsoNormalspan lang=EN-USComing soon:nbsp; ISNI core metadata in RDF 
triples.nbsp; The RDF triples will be embedded in the public web pages and the format will be available via the persistent URI 
and the SRU search API./span/p
   p class=MsoNormalspan lang=EN-USnbsp;/span/p


Which is so foul that Dreamweaver has a Clean up Word HTML command in 
its menu. But you still end up with pretty bad HTML.


kc



-Joe




On 20 June 2014 18:57, Eric Lease Morgan emor...@nd.edu wrote:


On Jun 20, 2014, at 10:56 AM, Richard Wallis 
richard.wal...@dataliberate.com wrote:


  authority control|simple identifier |Linked Data capability
 +-+--+--+
  VIAF   |X|X |  X   |
 +-+--+--+
  ORCID  | |X |  |
 +-+--+--+
   ISNI  |X|X |  X   |
 +-+--+--+

Increasingly I like linked data, and consequently, here is clarification
and a question. ORCID does support RDF, but only barely. It can output
FOAF-like data, but not bibliographic. Moreover, it is experimental, at
best:

   curl -L -H 'accept: application/rdf+xml'
http://orcid.org/-0002-9952-7800

In what ways does ISNI support linked data?

---
Eric Morgan




--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
m: 1-510-435-8234
skype: kcoylenet


--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
m: 1-510-435-8234
skype: kcoylenet


Re: [CODE4LIB] software for a glossary

2014-06-20 Thread Karen Coyle

On 6/20/14, 1:16 PM, Tom Keays wrote:


Crossing the thread over to linked author data, this item made me laugh.
http://w3cmemes.tumblr.com/post/76273506486/dave-started-reviewing-open-annotations-today


The only thing worse would be:
OMG IT'S FULL OF OWL

kc



On Fri, Jun 20, 2014 at 2:28 PM, Eric Lease Morgan emor...@nd.edu wrote:

Do you know of a Web-based tool or piece of desktop software that would let a 
professor post a text in a frame, then highlight words or phrases and link them 
to a glossary? A quick-and-dirty web page (possibly attached) and link below 
illustrates the idea:

   http://dh.crc.nd.edu/tmp/glossary.html

—
Eric Morgan


--
Karen Coyle
kco...@kcoyle.net http://kcoyle.net
m: 1-510-435-8234
skype: kcoylenet