I’m not for bubble gum and duct tape. But I also realize that when I’ve got a
hammer everything begins to look like a nail.
After having made those two ambiguous statements I would ask myself, “What is
the problem I am trying to solve?” If you want to make your data available in a
linked data
http://wiki.code4lib.org/index.php/Midwest
I attended Code4Lib Midwest last week, and from my perspective it was a roaring
success. Kudos to Kyle Felker, Erin Fisher, Eric Kunnen, Kristin Meyer, Matthew
Reidsma, and anybody else I may have missed for putting on such a
well-organized and
I’m surprised and impressed with the way Code4Lib Midwest is shaping up —
http://wiki.code4lib.org/index.php/Midwest Cool! —ELM
From my server’s MOTD, and I thought it was fun:
Here I sit, broken-hearted,
All logged in, but work unstarted.
First net.this and net.that,
And a hot buttered bun for net.fat.
The boss comes by, and I play the game,
Then I turn back to net.flame.
Is there a cure (I need your views),
For
Does anybody here have any experience with the Elsevier API Program? [1]
Apparently, through Elsevier’s TDM (text and data mining) API a person can get
the full text of Elsevier content, after being granted an access key. As per
their instructions, I used the following curl command to try to
On Jul 14, 2014, at 5:29 PM, Ben Companjen ben.compan...@dans.knaw.nl wrote:
This will not answer your question, but LIBER has a response to the TDM API
policies:
http://libereurope.eu/news/over-40-signatories-ask-elsevier-to-withdraw-tdm-policy/
Peter Murray-Rust posted quite a few
I have been a card-carrying librarian since 1987, but I have been writing
computer programs since 1976. I became a librarian because I heard about the
“information explosion”, and I thought it would be a growth industry. —ELM
On Jul 1, 2014, at 9:12 AM, Katie konrad.ka...@gmail.com wrote:
Has anyone here experience in the world of natural language programming
(while applying information retrieval techniques)?
I'm currently trying to develop a tool that will:
1. take a pdf and extract the text (paying no
for making this
possible! And feel free to give me a shout if you have any questions.
Agenda
Publishing, Sharing, and Opening
* 8:30-9:30 Sylvia Southwick and Cory Lampert, UNLV, Librarians'
adventure into LODLAM
* 9:30-10:20 Eric Lease Morgan, Notre Dame, Publishing LOD
| |
+-+--+
ORCID | | X|
+-+--+
ISNI | | |
+-+--+
—
Eric Lease Morgan
On Jun 20, 2014, at 10:56 AM, Richard Wallis richard.wal...@dataliberate.com
wrote:
authority control|simple identifier |Linked Data capability
+-+--+--+
VIAF |X|X | X |
Do you know of a Web-based tool or piece of desktop software that would let a
professor post a text in a frame, then highlight words or phrases and link them
to a glossary? A quick-and-dirty web page (possibly attached) and link below
illustrates the idea:
On Jun 16, 2014, at 6:44 AM, Johan Oomen joo...@beeldengeluid.nl wrote:
2.805.000 Euros Funding for Future Memory Standards!
The PREFORMA call for tender has been published on June 12th, 2014. Proposal
submission deadline: August 12th, 2014. Budget: 2.805.000 EUR. —
Is ORCID an acronym, and if it is then what does it stand for? —ELM
On Jun 10, 2014, at 4:07 PM, Masamitsu, Pam masamit...@llnl.gov wrote:
http://orcid.org/faq-page#n110 - ORCID is an acronym, short for Open
Researcher and Contributor ID.
Resolved. Thanks! —ELM
ORDID and ResearcherID and Scopus, oh my!
It is just me, or are there an increasing number of unique identifiers popping
up in Library Land? A person can now be identified with any one of a number of
URIs such as:
* ORCID - http://orcid.org/-0002-9952-7800
* ResearcherID -
* ORCID - http://orcid.org/-0002-9952-7800
* ResearcherID - http://www.researcherid.com/rid/F-2062-2014
* Scopus - http://www.scopus.com/authid/detail.url?authorId=25944695600
* VIAF - http://viaf.org/viaf/26290254
* LC - http://id.loc.gov/authorities/names/n94036700
* ISNI -
On Jun 4, 2014, at 3:32 PM, Jodi Schneider jschnei...@pobox.com wrote:
However, I believe that ISNI is bridging between these various sources --
certainly including LC and VIAF [1], and also ORCID [2].
[1] http://www.isni.org/content/data-contributors
[2] From
C4L is not a democracy but an anarchy.
Sometimes. We vote on conference locations. We vote on keynote talks. We vote
for presentations. Everybody had multiple opportunities to voice their opinion.
I think this vote should count too. —ELM
of the time.
* Apparently “administratativia” is a word of my own design because a search of
it in Google returns only postings I’ve written. No wonder my spell checker
doesn’t like it.
[1] description - http://dh.crc.nd.edu/sandbox/mailing-lists/code4lib/
—
Eric Lease Morgan
$full_ads++
--
http://orcid.org/-0002-9952-7800
On May 16, 2014, at 3:46 PM, Andreas Orphanides akorp...@ncsu.edu wrote:
Library Electronic Resources Specialist
Raritan Valley Community College
Branchburg Township, New Jersey
ColdFusion, EZproxy, JavaScript, Personal computer hardware
http://jobs.code4lib.org/job/13115
I’d rather see the
Here is the tiniest bit of mailing list administratativa: the list now contains
close two 2,800 subscribers, and based on my antidotal observations, it
subscription size increase by approximately five new subscriptions per week. I
think we have a strong, vibrant, and growing community. —ELM
On May 8, 2014, at 12:36 PM, Jodi Schneider jschnei...@pobox.com wrote:
Eric -- are you still the list owner? j...@code4lib.org already uses Job: as
a prefix -- so I would suggest adding Job as a topic, setting
Default-Topics= Job,OTHER (unless all-caps is requisite?) If this works,
On Apr 17, 2014, at 9:13 PM, Miles Fidelman mfidel...@meetinghouse.net
wrote:
Give up and let chaos reign supreme?
Yep! That's what I would do. -- ELM
Do usability studies to demonstrate to others how things can be improved. --ELM
On Apr 7, 2014, at 11:25 PM, Richard Wallis richard.wal...@dataliberate.com
wrote:
You are correct that in the xmlns namespace definitions in the xml
http://viaf.org/viaf/231063554/rdf.xml references a namespace that is no
longer used in the output. There is a single commented out reference
Can somebody here point me to an RDF file describing the VIAF ontology? Does an
authoritative list of VIAF classes and properties exist, and if so, where can I
get a copy? —Eric Lease Morgan
On Apr 7, 2014, at 10:52 PM, Richard Wallis richard.wal...@dataliberate.com
wrote:
Is this what you are looking for: http://viaf.org/viaf/data/
Alas, no, not really. I’m looking for an RDF file listing the classes and
properties used by VIAF. VIAF can return RDF for specific entities, as in
know of others? Am I
missing something significant?
—
Eric Lease Morgan
A few years ago I did a bit of analysis against the feed —
http://infomotions.com/blog/2011/03/constant-chatter-at-code4lib/ —ELM
I’ve done a tiny bit of investigation, and as alluded to previously, there
seems to be a higher concentration of Code4Lib-like activity around Bologna and
Padua. And this sort of “event” is intended to be smaller and more informal
rather than larger and more structured. It requires a time, a
I wonder whether there are enough people and enough interest to organize a
Code4Lib Italy event. Hmmm... —Eric Lease Morgan
On Mar 6, 2014, at 1:37 PM, Ethan Gruber ewg4x...@gmail.com wrote:
Let me ask a more direct question. If participating in linked data is a
“good thing”, then how do you — anybody here — suggest archivists (or
librarians or museum curators) do that starting today? —Eric Morgan
I think that
, something like D2RQ
could be put on top of the ArchiveSpace database to expose the underlying
content as RDF. What do you think?
[1] D2RQ - http://d2rq.org
—
Eric Lease Morgan
University of Notre Dame
On Mar 6, 2014, at 9:47 AM, Mark A. Matienzo mark.matie...@gmail.com wrote:
ArchivesSpace has a REST backend API, and requests yield a response in
JSON. As one option, I'd investigate to publish linked data as JSON-LD.
Some degree of mapping would be necessary, but I imagine it would be
Let me ask a more direct question. If participating in linked data is a “good
thing”, then how do you — anybody here — suggest archivists (or librarians or
museum curators) do that starting today? —Eric Morgan
Does anybody here work in Rome (Italy). I see a “busman’s holiday” in my
future. —Eric Lease Morgan, University of Notre Dame
/archive/2004/200312/0012.html
—
Eric Lease Morgan
So, we're shutting it [Code4Lib] down?
We interrupt this program for an important statement:
Before things get out of hand and rumors start flying, there are
no plans about shutting down the mailing list. An individual
simply wanted to be unsubscribed, and that has been done.
Now back
Correct. It is all but a waste of time. --ELM
/projects/loc-recollect/
[2] ViewShare - http://viewshare.org/about/community/
—
Eric Lease Morgan
Finally, 1600 properties... good luck with that.
ROTFL!!! ―ELM
On Jan 20, 2014, at 11:12 AM, Robert Sanderson azarot...@gmail.com wrote:
Have you considered the LOCAH work in mapping EAD into Linked Data?
* http://archiveshub.ac.uk/locah/
* http://data.archiveshub.ac.uk/
Robert, yes, and these are and were very good examples. ‘Ahead of their time,
On Jan 19, 2014, at 10:58 AM, Karen Coyle li...@kcoyle.net wrote:
If you were to select a set of RDF ontologies intended to be used in the
linked data of archival descriptions, then what ontologies would you select?
There is an ontology for archival description developed by the Europeana
A couple of days ago I wrote:
If you were to select a set of RDF ontologies intended to be used in the
linked data of archival descriptions, then what ontologies would you
select?
And in response Ben Companjen ben.compan...@dans.knaw.nl wrote the following
post, which I think is absolutely
If you were to select a set of RDF ontologies intended to be used in the linked
data of archival descriptions, then what ontologies would you select?
For simplicity's sake, RDF ontologies are akin to the fields in MARC records or
the entities in EAD/XML files. Articulated more accurately, they
On Jan 17, 2014, at 2:54 PM, Emily Lynema emilylyn...@gmail.com wrote:
And with that, registration is now closed!… Waitlist:
https://docs.google.com/a/ncsu.edu/forms/d/1Vuo7g7xbNeGCQywwngkAt4OMdX0k3wsoiwCrxJtdx6k/viewform
Geesh! That was fast. I’m impressed. —Eric Morgan
This is a test message.
This is only a test of the Emergency Test Messaging Service.
This is only a message.
In case of a real emergency you would be instructed to
Contact your nearest Code4Lib subscriber.
This is only test message.
—
Eric “Test Message” Morgan
I have created an initial pile of RDF, mostly.
I am in the process of experimenting with linked data for archives. My goal is
to use existing (EAD and MARC) metadata to create RDF/XML, and then to expose
this RDF/XML using linked data principles. Once I get that far I hope to slurp
up the
I have successfully been able to being the systematic transformation process of
EAD and MARC to RDF/XML, and consequently been able to literally illustrate the
resulting triples. [1, 2] From the blog posting [3]:
The resulting images are huge, and the astute/diligent reader
will see a
suffers from a chicken-and-egg problem. By implementing my simple
recipe, I believe I will be making it easier for the community to lay an egg.
—
Eric Lease Morgan
On Dec 5, 2013, at 1:17 PM, Kevin Ford k...@3windmills.com wrote:
Frankly, I don't see how you can generate RDF that anybody would want to
use from XSLT: where would your URIs come from? What, exactly, are you
modeling?
-- Our experience getting to good, URI rich RDF has been basically a
On Dec 4, 2013, at 10:29 PM, Corey A Harper corey.har...@nyu.edu wrote:
Have you had a look at Ed Chamberlain's work on COMET:
https://github.com/edchamberlain/COMET
It's been a while since I've run this, but if I remember correctly, it was
fairly easy-to-use.
Thank you for the pointer. I
On Dec 4, 2013, at 10:29 PM, Corey A Harper corey.har...@nyu.edu wrote:
Also, though much older, I seem to remember the Simile MARC RDFizer being
a pretty straightforward one to run:
http://simile.mit.edu/wiki/MARC/MODS_RDFizer
MODS aficionados will point to some problems with some of it's
On Dec 5, 2013, at 6:54 AM, Eric Lease Morgan emor...@nd.edu wrote:
http://simile.mit.edu/wiki/MARC/MODS_RDFizer
...The distribution includes a possibly cool stylesheet — mods2rdf.xslt.
Ah ha! The MODS_RDFizer’s mods2rdf.xslt file functioned very well against one
of my MODS files
it to my attention.
[1] demo - http://demo.librecat.org
—
Eric Lease Morgan
When exposing sets of MARC records as linked data, do you think it is better to
expose them in batch (collection) files or as individual RDF serializations? To
bastardize the Bard — “To batch or not to batch? That is the question.”
Suppose I am a medium-sized academic research library. Suppose
RDFizer - http://simile.mit.edu/wiki/MARC/MODS_RDFizer
[5] ead2rdf.xsl - http://data.archiveshub.ac.uk/xslt/ead2rdf.xsl
—
Eric Lease Morgan
On Dec 5, 2013, at 11:17 AM, Mark A. Matienzo mark.matie...@gmail.com wrote:
I’m hoping to articulate and implement a simple and functional method for
exposing EAD and MARC metadata as linked data.
Isn't the point of this to expose archival description as linked data? What
about description
On Dec 5, 2013, at 11:33 AM, Mark A. Matienzo mark.matie...@gmail.com wrote:
At the very least, these applications (ArchivesSpace,
Archivists’ Toolkit, etc.) can regularly and systematically export their
data as EAD, and the EAD can be made available as linked data.
Wouldn't it make more
/tmp/pamphlets.rdf
[5] modsrdf.xsl -
http://www.loc.gov/standards/mods/modsrdf/xsl-files/modsrdf.xsl
[6] BIBFRAME Tools - http://bibframe.org/tools/transform/start
—
Eric Lease Morgan
in the process of writing a guidebook on the topic of linked data and
archives. In the guidebook I will elaborate on this recipe and provide
instructions for its implementation. [1]
[1] guidebook - http://sites.tufts.edu/liam/
--
Eric Lease Morgan
University of Notre Dame
On Nov 19, 2013, at 8:48 AM, Robert Forkel xrotw...@googlemail.com wrote:
while I also think this is not rocket surgery, I'd like to point out that
trial (and potentially error) as suggested by your go back to step #1
instructions is not a good solution to coming up with URIs. I think once
On Nov 19, 2013, at 9:41 AM, Karen Coyle li...@kcoyle.net wrote:
Eric, I think this skips a step - which is the design step in which you
create a domain model that uses linked data as its basis. RDF is not a
serialization; it actually may require you to re-think the basic
structure of your
On Nov 19, 2013, at 9:54 AM, Aaron Rubinstein arubi...@library.umass.edu
wrote:
I think you’ve hit the nail on the head here, Karen. I would just
add, or maybe reassure, that this does not necessarily require
rethinking your existing metadata but how to translate that
On Nov 19, 2013, at 11:09 AM, Karen Coyle li...@kcoyle.net wrote:
Eric, if you want to leap into the linked data world in the fastest,
easiest way possible, then I suggest looking at microdata markup, e.g.
schema.org. [1] …
[1] http://schema.org
I don’t advocate this as the fastest,
On Nov 18, 2013, at 3:04 PM, Ingraham Dwyer, Andy adw...@library.ohio.gov
wrote:
https://www.google.com/search?q=site:nd.edutbm=isch
Very nice!! —Eric Morgan
create a Web-based
interface to query many different z39.50 targets and provide on-the-fly text
mining analysis against the results.
In short, I learned a great many things.
—
Eric Lease Morgan
University of Notre Dame
#!/usr/bin/perl
# nytimes-search.pl - rudimentary z39.50 client to query
How do I authenticate to a z39.50 server using yaz-client?
There is a ProQuest database/index I want to search via their z39.50 interface.
I have a valid username/password combination that I can sucessfully use to
configure and search with my EndNote z39.50 client. How do I supply a similar
, then I could do a lot more. Jena seems like a good option. So
does Openlink Virtuoso.
What experience do y'all have with these tools, and do you know how to import
RDF/XML into them?
--
Eric Lease Morgan
On Nov 7, 2013, at 12:01 PM, Scherbak, Loren scherb...@si.edu wrote:
Put another way, instead of trying to force people to do the best and
most perfect bibliographic search, allow them to do broad searches and
then provide supplementary tools enabling the reader to examine the
results. It is
Yes, I'm going to get sucked into this vi vs emacs argument for nostalgia's
sake...
ROTFL, because that is exactly what I was thinking. “Vi is better. No, emacs.
You are both wrong; it is all about BBedit!” Each tool whether they be editors,
email clients, or RDF serializations all have their
What are some of the more popular and useful bibliographic databases/indexes
with well-structured output?
If it were easy (trivial) for our readers to gets sets of well-structured data
out of our bibliographic databases, then it would be relatively easy for us to
write software enabling
I am of two minds when it comes to Linked Data and the Semantic Web.
Libraries and many other professions have been encoding things for a long time,
but encoding the description of a book (MARC) or marking up texts (TEI), is not
the same as encoding knowledge — a goal of the Semantic Web. The
Cool input. Thank you. I believe I have tweaked my assertions:
1. The Declaration of Independence was written by Thomas Jefferson
rdf:RDF
xmlns:rdf=http://www.w3.org/1999/02/22-rdf-syntax-ns#;
xmlns:dc=http://purl.org/dc/elements/1.1/;
rdf:Description
On Nov 3, 2013, at 6:07 PM, Robert Sanderson azarot...@gmail.com wrote:
Currently your assertion is that the creator /of a web page/ is Jefferson,
which is clearly false.
The page (...) is a transcription of the Declaration of Independence.
The Declaration of Independence is written by
On Nov 3, 2013, at 6:07 PM, Robert Sanderson azarot...@gmail.com wrote:
And it's not very hard given the right mindset -- its just a fully expanded
relational database, where the identifiers are URIs. Yes, it's not 1st
year computer science, but it is 2nd or 3rd year rather than post
How can I write an RDF serialization enabling me to express the fact that the
United States Declaration Of Independence was written by Thomas Jefferson and
Thomas Jefferson was a male? (And thus asserting that the Declaration of
Independence was written by a male.)
Suppose I have the
On Oct 15, 2013, at 10:44 AM, Eric Lease Morgan emor...@nd.edu wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT --http://bit.ly/1bJRyh8
On Oct 14, 2013, at 7:56 AM, Nicolas Franck nicolas.fra...@ugent.be wrote:
Could this also be done
On Oct 30, 2013, at 11:24 AM, Andrew Darby darby.li...@gmail.com wrote:
Hello, all. This is perhaps a bit off-topic, but I was wondering how many
of you have a dedicated usability person as part of your development team…
I do not think we have a usability person, per se. The position is sort
On Oct 16, 2013, at 10:56 AM, Robert Haschart rh...@virginia.edu wrote:
The abstract extraction routine I have been working on does use
tesseract internally for doing OCR when it encounters a document that
doesn't have usable full-text. I agree that tesseract is not that easy
to install,
On Oct 13, 2013, at 6:21 PM, David Friggens frigg...@waikato.ac.nz wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT --http://bit.ly/1bJRyh8
PDF2TXT extracts the text from an OCRed PDF document
The file I tried was digital native
On Oct 14, 2013, at 1:48 AM, Penelope Campbell
penelope.campb...@facs.nsw.gov.au wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT -- http://bit.ly/1bJRyh8
As a small special library (solo librarian) in an Australian State
Government
On Oct 14, 2013, at 7:56 AM, Nicolas Franck nicolas.fra...@ugent.be wrote:
Could this also be done by Apache Tika? Or do I miss a crucial point?
http://tika.apache.org/1.4/gettingstarted.html
Nicolas, this looks VERY promising! It seemingly can extract the OCR from a PDF
document as well
On Oct 14, 2013, at 4:49 PM, Robert Haschart rh...@virginia.edu wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT --http://bit.ly/1bJRyh8
Although based on some subsequent messages where you mention tesseract
maybe I misunderstood and
On Oct 12, 2013, at 8:31 AM, Nicolas Franck nicolas.fra...@ugent.be wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT -- http://bit.ly/1bJRyh8
If you're looking for the pdf in question, search for IIPv105.pdf in your
log files.
It is a
On Oct 11, 2013, at 6:39 PM, Mark Pernotto mark.perno...@gmail.com wrote:
Putting my devil's advocate hat on, it doesn't parse foreign documents well
(I got it to break!). I also got inconsistent results feeding it PDF files
with tables embedded (but haven't been able to figure out what it is
On Oct 11, 2013, at 6:39 PM, Mark Pernotto mark.perno...@gmail.com wrote:
Just from a curiosity standpoint, what encoding is being utilized? I know
nothing about Perl. It seemed to have no problem parsing a dash (-) if it
was up against another character (2007-2012), but barfs when it's by
with it because it does not process things the
size of the Bible.
--
[cid:116F6092-2AB6-4E95-8199-25639542726A]
Eric Lease Morgan
Digital Initiatives Librarian
University of Notre Dame
Room 131, Hesburgh Libraries
Notre Dame, IN 46556
o: 574-631-8604
e: emor...@nd.edumailto:emor...@nd.edu
[cid
On Oct 11, 2013, at 11:57 AM, Peter Murray peter.mur...@lyrasis.org wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT --http://bit.ly/1bJRyh8
Very neat. I couldn't get the 'network diagram' link to work (from
On Oct 11, 2013, at 12:57 PM, Sean Hannan shan...@jhu.edu wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT -- http://bit.ly/1bJRyh8
Very cool. But, why only for a limited period of time?
Sean, thank you for your support. I'm making it
On Oct 11, 2013, at 1:49 PM, Matthew Sherman matt.r.sher...@gmail.com wrote:
For a limited period of time I am making publicly available a Web-based
program called PDF2TXT -- http://bit.ly/1bJRyh8
Very slick, good work. I can see where this tool can be very helpful. It
does have some
://github.com/ericleasemorgan/htrc
[2] Center's Data API - http://www.hathitrust.org/htrc/api-guide
--
[cid:116F6092-2AB6-4E95-8199-25639542726A]
Eric Lease Morgan
Digital Initiatives Librarian
University of Notre Dame
Room 130, Hesburgh Libraries
Notre Dame, IN 46556
o: 574-631-8604
e: emor
/liam-guidebook/
--
Eric Lease Morgan
On Sep 4, 2013, at 9:42 AM, Eric Lease Morgan emor...@nd.edu wrote:
I get the basic concepts of linked data. But what I don't understand is
why the idea has been around so long, yet there seems to be a dearth of
useful applications that live up to the hype. So, what I want to learn
about
On Sep 3, 2013, at 10:49 AM, Coombs,Karen coom...@oclc.org wrote:
Option 2. - Use HTTPful code library (http://phphttpclient.com/). This is a
well developed and supported code base which is designed specifically to
support REST interactions. It is easy to install via Composer or Phar, or
/181/190/
--
Eric Lease Morgan, Digital Initiatives Librarian
Hesburgh Libraries
University of Notre Dame
574/631-8604
From a different list I learned of the following WorldCat metadata API
webinar, and I thought some of us over here maybe interested too. From the
description:
The WorldCat Metadata API supports a variety of cataloging
functionality for libraries to catalog their collections in
WorldCat.
On Aug 15, 2013, at 11:11 AM, diego ferreyra temat...@r020.com.ar wrote:
Hi, we looking for a web tool to manage Authority Data with this general
desirable features:
- Web tool
- Open source
- Capabilities to expose web services or document API
- Manage data about persons and
101 - 200 of 566 matches
Mail list logo