[The following position announcement is being forwarded upon request. —ELM]
> We are currently hiring for the Applications Developer III position at the
> Howard-Tilton Memorial Library at Tulane University located in New Orleans,
> Louisiana.
>
> Please see the job details here:
As of this message, I’m putting the Code4Lib mailing list “on hold” while the
list’s configurations and archives get moved from one place to another. ‘More
soon, and this process will take at least a day. Please be patient. —Eric Lease
Morgan
> Alas, the Code4Lib mailing list software will most likely need to be migrated
> before the end of summer…
On Monday Wayne Graham (CLIR/DLF) and I are hoping to migrate the Code4Lib
mailing list to a different domain. We don’t think any archives, subscriptions,
nor preferences will get lost
meeting, we will be holding at least one
more webinar in 2016.
http://bit.ly/29FuUuY
Actually, the mass-editing of cataloging (MARC) data is something that is
particularly interesting to me these days. Hand-crafted metadata records are
nice, but increasingly unscalable.
—
Eric Lease Morgan
choose is
all but arbitrary. I’ll use some 9xx field, just to make things easy. I can
always (and easily) change it later.
[1] "Catholic Portal” - http://www.catholicresearch.net
—
Eric Lease Morgan
I’m looking for date fields.
Or more specifically, I have been given a pile o’ MARC records, and I will be
extracting for analysis the values of dates from MARC 260$c. From the resulting
set of values — which will include all sorts of string values ([1900], c1900,
190?, 19—, 1900, etc.) — I
On Jun 7, 2016, at 10:11 AM, Eric Lease Morgan <emor...@nd.edu> wrote:
>>> Alas, the Code4Lib mailing list software will most likely need to be
>>> migrated before the end of summer, and I’m proposing a number possible
>>> options for the lists continued existe
On Jun 14, 2016, at 8:01 PM, Coral Sheldon-Hess <co...@sheldon-hess.org> wrote:
> Now, there kind of is. By my count, we have 4 volunteers. Chad, Tom, Galen,
> and me. Anyone else?
Coral, please sign me up. I’d like to learn more. —Eric Lease Morgan
On Jun 9, 2016, at 7:55 PM, Coral Sheldon-Hess wrote:
> One note about what we're discussing: when we talk about just doing the
> regional events (and I mean beyond 2017, which will be a special case if a
> host city can't step in), we need to realize that we have a lot
her end of the first meeting. NASIG? DLF? ACRL?
Call it a symbiotic relationship.
—
Eric Lease Morgan
left out. That is what happens when formalization take place.
The regional conferences are good things. I call them franchises. The annual
meeting does not have to be a big deal, and the smaller it is, the less
financial risk there will be. Somebody will always come forward. It will just
happen.
—
Eric Lease Morgan
> I'm also interested in investigating how to formalize Code4Lib as an
> entity, for all of the reasons listed earlier in the thread…
-1 because I don’t think the benefits will outweigh the emotional and
bureaucratic expense. We already have enough rules.
—
ELM
] arscience - http://infomotions.com/blog/2008/07/arscience/
—
Eric Lease Morgan
On May 12, 2016, at 8:30 AM, Eric Lease Morgan <emor...@nd.edu> wrote:
>> Alas, the Code4Lib mailing list software will most likely need to be
>> migrated before the end of summer, and I’m proposing a number possible
>> options for the lists continued existence...
On Mar 24, 2016, at 10:29 AM, Eric Lease Morgan <emor...@nd.edu> wrote:
> Alas, the Code4Lib mailing list software will most likely need to be migrated
> before the end of summer, and I’m proposing a number possible options for the
> lists continued existence...
Our
minds would like to know.
Fun with authorities!? And, “What’s in a name anyway?"
[1] ISNI - http://isni.org
[2] some documentation - http://isni.org/how-isni-works
—
Eric Lease Morgan
Lost In Rome
he Python
Natural Language Toolkit (NLTK) module. [2] The noted chapter gives a pretty
good overview of the subject.
[1] NER - http://nlp.stanford.edu/software/CRF-NER.shtml
[2] NLTK chapter - http://www.nltk.org/book/ch07.html
* ‘Story of my life.
—
Eric Lease Morgan
On Apr 7, 2016, at 4:24 PM, Gregory Markus wrote:
>> from one of the New York Times stories on the Panama Papers: "The
>> ICIJ made a number of powerful research tools available to the
>> consortium that the group had developed for previous leak
>> investigations. Those
, “Whenever you have a hammer, everything
begins to look like a nail."
[1] an essay elaborating on the idea of use & understand -
http://infomotions.com/blog/2011/09/dpla/
—
Eric Lease Morgan
Artist- And Librarian-At-Large
On Apr 5, 2016, at 11:12 PM, Karen Coyle wrote:
> Eric, there were studies done a few decades ago using factual questions.
> Here's a critical round-up of some of the studies:
> http://www.jstor.org/stable/25828215 Basically, 40-60% correct, but possibly
> the questions
I sincerely wonder to what extent librarians give the reader
(patrons) the right -- correct -- answer to a (reference) question.
Such is a hypothesis that can be tested and measured. Please show me
non-antidotal evidence one way or the other. --ELM
On Mar 25, 2016, at 1:24 PM, Bethany Nowviskie wrote:
> Dear all — I’ve been getting this as a digest, so apologies that I’m only
> seeing the thread on the future of the mailing list now!
>
> CLIR/DLF is running the same version of ye olde LISTSERV as Notre Dame, to
>
Regarding the mailing list, here is what I propose to do:
1. Upgrade my virtual server to include more RAM and disk space.
2. Install and configure Mailman.
3. Ask people to subscribe to a bogus list so I/we can practice.
4. Evaluate.
5. If evaluation is successful, then migrate
y’all think? If we go with Option #2, then where might we host the
list, who might do the work, and what software might we use?
—
Eric Lease Morgan
Artist- And Librarian-At-Large
professional ethics. Too many librarians thought the implementation of
the idea challenged intellectual privacy. Alas.
—
Eric Lease Morgan
Artist- And Librarian—At-Large
(574) 485-6870
I’m curious. What is the difference between the WorldCat Discovery and WorldCat
Metadata APIs?
Given an OCLC number, I want to programmatically search WorldCat and get in
return a full bibliographic record compete with authoritative subject headings
and names. Which API should I be using?
—
l become
insignificant. The consequence will be a more holistic set of library
collections and services.
[1] I have elaborated on these ideas in a blog posting - http://bit.ly/1LDpXkc
—
Eric Lease Morgan
Code4Crotaia was alluded to in a blog posting. [1] code4crotaia++ Inquiring
mind would like to know more. Please tell us about Code4Crotaia, and don’t
hesitate to update http://wiki.code4lib.org with details?
[1] http://blog.okfn.org/2016/03/21/codeacross-opendataday-zagreb-2016/
—Eric Morgan
On Mar 2, 2016, at 9:48 AM, LeVan,Ralph wrote:
> …I've written so much bloat that didn't get used because a librarian was sure
> the system would fail without it….
I’m ROTFL because just a few minutes ago, while composing an informal essay on
the history of bibliographic
On Mar 2, 2016, at 9:30 AM, Tom Hutchinson <thutc...@swarthmore.edu> wrote:
> ...To be honest I feel like I still don’t even really know what libraries /
> librarians are yet.
Tom, when you find out, please tell the rest of us. ;-) —Eric Lease Morgan
On Feb 8, 2016, at 11:25 AM, Katherine Deibel wrote:
> From a disability accessibility perspective, magnification is not purely
> about text readability but making sure that all features of a
> website---images, interactive widgets, text, etc.---are of use to the user.
> Merely
Given an OCLC member code, such as BXM for Boston College, is it possible to
use some sort of OCLC API to search WorldCat (or some other database) and
return information about Boston College? —Eric Lease Morgan
> I have around 1400 xml files that I am trying to copy into one xml file so
> that I can then pull out three elements from each and put into a single csv
> file.
What are three elements you want to pull out of each XML file, and
what do you want the CSV file to look like?
Your XML files are
On Jan 14, 2016, at 10:32 AM, Ethan Gruber wrote:
>>> Part of this grant stipulates that open access books be made available
>>> in EPUB 3.0.1, so I got to work on a pipeline for dynamically serializing
>>> TEI into EPUB...
>>>
le thing
was the creation of the TEI files in the first place. After that, everything
was relatively easy.
[1] Alex Lite blog posting - http://bit.ly/eazpJY
[2] Alex Lite - http://infomotions.com/sandbox/alex-lite/
—
Eric Lease Morgan
Artist- And Librarian-At-Large
(A man in a trenc
On Jan 5, 2016, at 1:17 PM, Nate Hill wrote:
> metro.org/fellowship
>
> Our goal is to empower a small cohort of fellows to help solve
> cross-institutional problems and to spur innovation within our membership
> of libraries and archives in NYC and Westchester County
enable httpd_can_network_connect.
>
> —
> Michael Berkowski
> University of Minnesota Libraries
Michael, resolved, and thank you for the prompt and thorough reply.
Yes, SELinux was doing its job, and it was configured to disallow network
connections from httpd. After issuing the following command (which allows httpd
to make network connections) both my rsync- and wget-based CGI scripts worked
without modification:
setsebool http_can_network_connect on
Maybe I’ll add the -P option later. Yippie! Thank you.
—
Eric Lease Morgan
On Dec 26, 2015, at 8:14 PM, Childs, Riley wrote:
>> How do I modify the permissions of a file under the supervision of SELunix
>> so the file can be executed as a CGI script?
>>
>> I have two CGI scripts designed to do targeted crawls against remote
>> hosts. One script uses
missions? Maybe I need to use something like semanage (which doesn’t
exist on my system) to change the user apache’s permissions?
This is a level of the operating system of which I am unfamiliar.
—
Eric Lease Morgan
Obviously, the sorts of things outlined in the presentation above are real, and
they are really scary. Us developers need to take note: getting input from the
‘Net can be a really bad thing. —Eric Lease Morgan
On Nov 24, 2015, at 8:20 PM, Eric Lease Morgan <emor...@nd.edu> wrote:
>>> Do Dublin Core files exist, and if so, then can somebody show me one? Put
>>> another way, can you point me to a DTD or schema denoting Dublin Core XML?
>>> The closest I can co
[Forwarded upon request. —E “Lost In Venice” M ]
> From: "Fultz, Tamara"
> Subject: Question about posting
> Date: October 15, 2015 at 12:43:08 AM GMT+2
> To: "code4lib-requ...@listserv.nd.edu"
>
> Implementing BIBFRAME
> The UC
A Code4Lib Chicago meeting has been scheduled for Monday, November 23 from 8:30
to 5 o’clock at the University of Illinois-Chicago. [1] Sign up early. Sign up
often.
[1] meeting - http://wiki.code4lib.org/Code4Lib_Chicago
—
Eric Lease Morgan, Librarian-At-Large
On Sep 2, 2015, at 12:04 PM, Cary Gordon wrote:
> http://cod4lib.com
ROTFL!!! —Eric Morgan
On Aug 31, 2015, at 9:23 PM, Cary Gordon wrote:
> Perhaps this belongs on the Cod4lib list.
^^^
Yesterday, I didn’t quite understand the allusion to the East Coast, but now I
see that I lost an e in Code4Lib. Cod4Lib. That's pretty funny.
On Sep 1, 2015, at 9:42 AM, Eric Hellman wrote:
> As someone who feels that Code4Lib should welcome people who don't
> particularly identify as "coders", I would welcome a return to the previous
> title attribute.
1++ because I believe it is more about libraries than it is
On Aug 28, 2015, at 11:56 AM, Allan Berry wrote:
> The UIC Library would be happy to host the Code4Lib event, in November or
> early December.
The folks at University of Illinois-Chicago would like to sponsor a one-day
Cod4Lib event, and in order to determine the best date,
On Aug 10, 2015, at 11:38 AM, David Lacy david.l...@villanova.edu wrote:
The 2016 conference will be held from March 7 through March 10 in the Old
City District of Philadelphia. This location puts conference attendees
within easy walking distance of many of Philadelphia’s historical
As some of you in around Chicago may or may not know, there is a Code4Lib
Chicago group called chicode4lib. See the Google Group:
https://groups.google.com/forum/#!forum/chicode4lib
I’m simply trying to drum up business for the community.
—
Eric Lease Morgan
On Jul 22, 2015, at 6:49 PM, Peter Mangiafico pmangiaf...@stanford.edu wrote:
I am conducting a survey of software used for image analysis and metadata
enhancement. Examples include facial recognition, object identification,
similarity matching, and so on. The goal is to understand if it
objectives and deliverables.
For more detail, see: http://www.ecu.edu/cs-lib/about/job942036.cfm
—
Eric Lease Morgan
to the purpose and functionality of HathiTrust Research Center Workset
Browser.
[1] EBO-TCP Workset Browser -
https://github.com/ericleasemorgan/EEBO-TCP-Workset-Browser
—
Eric Lease Morgan, Librarian
University of Notre Dame
On Jun 18, 2015, at 12:02 PM, Matt Sherman matt.r.sher...@gmail.com wrote:
I am working with colleague on a side project which involves some scanned
bibliographies and making them more web searchable/sortable/browse-able.
While I am quite familiar with the metadata and organization aspects we
/2015/05/htrc-workset-browser/
—
Eric Lease Morgan
University of Notre Dame
On Jun 8, 2015, at 7:32 AM, Owen Stephens o...@ostephens.com wrote:
I’ve just seen another interesting take based (mainly) on data in the
TCP-EEBO release:
https://scalablereading.northwestern.edu/2015/06/07/shakespeare-his-contemporaries-shc-released/
It includes mention of
/channing/about.html
—
Eric Lease Morgan, Librarian
University of Notre Dame
to learn of other
people’s experience so I do not not re-invent the wheel (too many times). ‘Got
ideas?
—
Eric Lease Morgan
University Of Notre Dame
On Jun 5, 2015, at 8:20 AM, Ethan Gruber ewg4x...@gmail.com wrote:
Does anybody here have experience reading the SGML/XML files representing
the content of EEBO?
Are these in TEI? Back when I worked for the University of Virginia
Library, I did a lot of clean up work and migration of
On Jun 5, 2015, at 8:10 AM, Eric Lease Morgan emor...@nd.edu wrote:
Does anybody here have experience reading the SGML/XML files representing the
content of EEBO?
I ultimately found the EEBO files in the form of TEI, and then I was able to
transform one of them into VERY functional HTML5
for the digital humanities and text mining.
[1] HathiTrust Research Center - http://hathitrust.org/htrc
[2] blog posting describing the Browser -
http://blogs.nd.edu/emorgan/2015/05/htrc-workset-browser/
—
Eric Lease Morgan
University of Notre Dame
I believe I have created a repository of my HTRC Workset Browser code (shell
and Python scripts) on GitHub. [1] From the Quick Start section of the README:
1. Download the software putting the bin and etc directories in the same
directory.
2. Change to the directory where the bin and etc
On Jun 1, 2015, at 10:58 AM, davesgonechina davesgonech...@gmail.com wrote:
They just informed me I need a .edu address. Having trouble understanding
the use of the term public domain here.
Gung fhpx, naq fbhaqf ernyyl fbeg bs fghcvq!! --RYZ
On Jun 1, 2015, at 4:33 AM, davesgonechina davesgonech...@gmail.com wrote:
If your *institutional* email address is not on their whitelist (not sure
if it is limited to subscribing ones, they don't say) you cannot register
using the signup form, instead you can only request an account by
and text mining.
[1] HathiTrust Research Center - http://hathitrust.org/htrc
[2] blog posting describing the Browser - http://ntrda.me/1FUGP2g
—
Eric Lease Morgan
University of Notre Dame
On May 27, 2015, at 6:33 PM, Karen Coyle li...@kcoyle.net wrote:
In my copious spare time I have hacked together a thing I’m calling the
HathiTrust Research Center Workset Browser, a (fledgling) tool for doing
“distant reading” against corpora from the HathiTrust. [0, 1] ...
'Want to give
On May 26, 2015, at 11:30 AM, Eric Lease Morgan emor...@nd.edu wrote:
In my copious spare time I have hacked together a thing I’m calling the
HathiTrust Research Center Workset Browser, a (fledgling) tool for doing
“distant reading” against corpora from the HathiTrust. [0]
[0
/thoreau-catalog
[9] source code - http://ntrda.me/1Q8pPoI
[10] HathiTrust Research Center - https://sharc.hathitrust.org
—
Eric Lease Morgan, Librarian
University of Notre Dame
On May 18, 2015, at 9:23 PM, Galen Charlton g...@esilibrary.com wrote:
I have two scripts, attached. They do EXACTLY the same thing
in almost EXACTLY the same manner, but the Python script is
almost 25 times slower than the Perl script:
I'm no Python expert, but I think that the difference
NLTK
library, but now I’m not so sure. It is just me, or is Python really s l o o o
w ? Is there anything I can do to improve/optimize my Python code?
—
Eric Lease Morgan
#!/usr/bin/env python2
# json2catalog.py - create a catalog from a set of HathiTrust json files
# Eric Lease Morgan emor
If a peson could denote the characteristics of both the main (female) character
as well as the protagonist, then bits of natural language processing (text
mining) might be able to address this problem. —Eric “When You Have A Hammer,
Everything Begins To Look Like a Nail” Morgan
Code4Lib is now 3,082 subscribers strong. Yeah! Almost time to do some
analysis. —ELM
On Feb 26, 2015, at 9:48 AM, Owen Stephens o...@ostephens.com wrote:
I highly recommend Chapter 6 of the Linked Data book which details different
design approaches for Linked Data applications - sections 6.3
(http://linkeddatabook.com/editions/1.0/#htoc84) summarises the approaches as:
On Feb 25, 2015, at 3:12 PM, Sarah Weissman seweiss...@gmail.com wrote:
I am kind of new to this linked data thing, but it seems like the real
power of it is not full-text search, but linking through the use of shared
vocabularies. So if you have data about Jane Austen in your database and
On Feb 25, 2015, at 2:48 PM, Esmé Cowles escow...@ticklefish.org wrote:
In the non-techie library world, linked data is being talked about (perhaps
only in listserv traffic) as if the data (bibliographic data, for instance)
will reside on remote sites (as a SPARQL endpoint??? We don't know
On Feb 16, 2015, at 4:54 PM, Levy, Michael ml...@ushmm.org wrote:
I think you can accomplish what you want by using ICUFoldingFilterFactory
https://wiki.apache.org/solr/AnalyzersTokenizersTokenFilters#solr.ICUFoldingFilterFactory
which should simply perform ICU (cf
I know the documents I’m indexing are written in Spanish, and adding the
following filters to my field definition, I believe I have resolved my problem:
filter class=solr.LowerCaseFilterFactory/
filter class=solr.SnowballPorterFilterFactory language=Spanish /
In other words, my searchable
[The following announcement is being passed on by request. —ELM]
Assistant Manager – Websites and Online Engagement Digital Services
Vancouver Public Library (VPL) is seeking a dynamic, strategic, and creative
Assistant Manager to join the Digital Services department. Reporting to the
On Feb 10, 2015, at 11:46 AM, Erik Hatcher erikhatc...@mac.com wrote:
bin/post -c collection_name /path/to/file.doc
The almost trivial command to index a Word document in Solr, above, is most
certainly appealing, but I’m wondering about the underlying index’s schema.
Tika makes every effort
Can somebody point me to a good tutorial on how to index Word documents using
Solr?
I have a few hundred Microsoft Word documents I want to search. Through the use
of the Tika library it seems as if I ought to be able to index my Word
documents directly into Solr, but none of the tutorials I
It is a joy to manage this mailing list, and I say that with all sincerity.
—Eric Morgan
The replies received have all been very helpful. Thank you! —Eric M.
Does anybody here know how to extract circulation statistics from an library
catalog? Specifically, given a date range, are you able to create a list of the
most frequently borrowed books ordered by the number of times they’ve been
circulated?
I have a colleague who wants to digitize sets of
I’m curious, how large is LITA (Library and Information Technology
Association)? [0] How many members does it have?
Apparently it has around 3000 members this year. I found this on the ALA
membership statistics page:
http://www.ala.org/membership/membershipstats_files/divisionstats#lita
On Jan 5, 2015, at 11:25 AM, Sylvain Machefert smachef...@u-bordeaux3.fr
wrote:
Interesting and thank you. Code4Lib only needs fifty more subscribers to
equal LITA’s size. I think this just goes to show, with the advent of the
Internet, centralized authorities are not as necessary/useful as
On Jan 5, 2015, at 1:35 PM, Karen Coyle li...@kcoyle.net wrote:
1) Everyone should read at least the first chapters of the Allemang book,
Semantic Web for the Working Ontologist:
I’m curious, how large is LITA (Library and Information Technology
Association)? [0] How many members does it have?
[0] LITA - http://www.ala.org/lita/
—
ELM
It is so cool that we have “franchises”. —Eric Morgan
I don’t know about y’all, but it seems to me that things like linked data and
open access are larger trends in Europe than here in the United States. Is
there are larger commitment to sharing in Europe when compared to the United
States? If so, is this a factor based on the nonexistence of a
On Dec 17, 2014, at 9:52 AM, Nicola Carboni nic.carb...@gmail.com wrote:
I am collecting some resources (beginner level) in order to start using
Virtuoso (OpenSource Edition) for a project I am working with. I would like
to use it both for hosting triples and for its sponger (CSV to RDF). I
On Dec 17, 2014, at 10:10 AM, Mixter,Jeff mixt...@oclc.org wrote:
If you want to test out a bare-bones triple store, I would suggest 4Store
(http://4store.org/). It has pre-compiled installs for Unix and Unix-like
systems (although not Windows). It supports SPARQL 1.1 and is relatively easy
On Dec 9, 2014, at 8:25 AM, Kyle Banerjee kyle.baner...@gmail.com wrote:
I've just started a project that involves harvesting large numbers of
scanned PDF's and extracting information from the text from the OCR output.
The process I've started with -- use imagemagick to convert to tiff and
On Dec 8, 2014, at 12:57 PM, Dana Jemison dana.jemi...@ucop.edu wrote:
Looks like the recommended hotel is already filled up. Are there any other
options close by?
Mine is a unsolicited comment/endorsement for AirBnB as an additional source of
accommodations, if it does not hurt the
On Oct 23, 2014, at 11:45 AM, Joe Hourcle onei...@grace.nascom.nasa.gov
wrote:
I found this blog post talking about CrossRef's support:
http://www.crossref.org/CrossTech/2011/04/content_negotiation_for_crossr.html
But I know DataCite supports it to some extent too.
Does anyone know
Learning Unix is not necessarily the problem to solve. Instead it is means to
an end.
To my mind, there are number of skills and technologies a person needs to know
in order to provide (digital) library service. Some of those
skills/technologies include: indexing, content management
On Jul 14, 2014, at 5:00 PM, Eric Lease Morgan emor...@nd.edu wrote:
Does anybody here have any experience with the Elsevier API Program? [1]
[1] Elsevier API - http://www.developers.elsevier.com/cms/
I have had tiny success with the Elsevier API Program.
I first created an API key
+ --ELM
This is at test, and hopefully the only test of the Mr. Serials Process against
my Code4Lib mailing list archive. Delete me. —Eric
The following message about the Crowdsourcing Consortium for Libraries and
Archives project is being forwarded upon request. —ELM
From: Christina manzo.christ...@gmail.com
Subject: Please Distribute [Crowdsourcing Consortium for Libraries and
Archives project]
Date: August 27, 2014 at
...But there are few programmer projects that would require zero maintenance
once finished…
This is a bit out of context, but a Buddhist monk once said, “Software is never
done. If it were, then it would be called hardware.” —Eric Morgan
Albiet a bit late, I very recently learned that the DOAJ is asking journals
like ours (Code4Lib Journal) to resubmit our application to be in the
directory. [1] From a Nature article:
Now, following criticism of its quality-control checks, the
website [DOAJ] is asking all of the journals in
1 - 100 of 566 matches
Mail list logo