-- Forwarded message --
From: Mark A. Matienzo mark.matie...@gmail.com
Date: Wed, Oct 13, 2010 at 10:53 PM
Subject: [pymarc] Examples added to pymarc wiki
To: pym...@googlegroups.com
I just added a bunch of links to some Gists (i.e. code snippets hosted
on Github) that use pymarc
My apologies if you've seen this already...
//Ed
--
W3C Library Linked Data Incubator Group - http://www.w3.org/2005/Incubator/lld/
Call for Use Cases: Library Linked Data
Are you currently using linked data technology [1] for library-related data, or
considering doing it in the near future?
In case anyone is tracking the status of the Open Publication
Distribution System (OPDS), v1.0 of the specification was just
released yesterday [1]. OPDS is essentially a pattern for sharing
ebook URLs, and the metadata associated with them, using Atom. Hadrien
Gardeur has a nice list of the
On Mon, Aug 9, 2010 at 7:11 PM, Brian Tingle
brian.tingle.cdlib@gmail.com wrote:
Ed Summers was asking me during a break today about support for
embedded linked data in the HTML view of the EAC record. I have to
admit I'm a bit of a linked data skeptic, but I'd be interested to
explore
On Thu, Jul 8, 2010 at 9:48 AM, Jonathan Rochkind rochk...@jhu.edu wrote:
So dc:relation does sound like the right vocabulary element for generic
related web page page, thanks. Is the value of dc:relation _neccesarily_
a URI/URL? I hope so, because otherwise I'm not sure dc:relation is
d'oh, s/dcterms:related/dcterms:relation/ (thanks ksclarke). I also
meant to point out that rdfs:seeAlso is another option.
//Ed
On Thu, Jul 8, 2010 at 11:44 AM, Ed Summers e...@pobox.com wrote:
On Thu, Jul 8, 2010 at 9:48 AM, Jonathan Rochkind rochk...@jhu.edu wrote:
So dc:relation does sound
On Wed, Jul 7, 2010 at 7:00 PM, Doran, Michael D do...@uta.edu wrote:
Of course, subfield $3 values are not any kind of controlled vocabulary, so
it's hard to do much with them programmatically.
A few years ago I analyzed the subfield 3 values in the Library of
Congress data up at the Internet
On Tue, Jun 1, 2010 at 2:21 PM, LeVan,Ralph le...@oclc.org wrote:
I've been sensing a flaw in HTTP for some time now. It seems like you
ought to be able to do everything through a URL that you can using a
complete interface to HTTP. Specifically, I'd love to be able to
specify values for
On Tue, Jun 1, 2010 at 6:35 PM, Erik Hetzner erik.hetz...@ucop.edu wrote:
There is a time for a URI that can use content-negotiation (the Accept
header, etc.) to get, e.g., PDF, HTML, or plain text. As an example:
http://example.org/RFC1
And there is a time when we want to explicitly refer
Of possible interest to someone looking to help out with some
opensource hacking on a high profile project...
//Ed
-- Forwarded message --
From: Thomas Baker tba...@tbaker.de
Date: Thu, May 20, 2010 at 2:03 PM
Subject: DCMI's Vocabulary Management Tool published as an Open Source
Hi Karen,
On Thu, May 13, 2010 at 1:19 PM, Karen Coyle li...@kcoyle.net wrote:
but I would have expected to see a title data element listed somewhere. Is
dc assumed? Or is the bibliographic description scheme an open question?
The nice thing about Atom is that it allows you to layer in
As Kevin said, I think you can use the Atom feed to page backwards
through time. Basically this amounts to programatically following the
link rel=next links in the feed, applying creates, updates and
deletes as you go until you make it to Feb. 15, 2010.
Currently this would involve walking from:
On Tue, May 4, 2010 at 8:24 AM, Mike Taylor m...@indexdata.com wrote:
Oh, what is the solution when using it in RDF?
I've been using the Bibliographic Ontology myself:
http://bibliontology.com/
Lots of stuff in there for journals, etc ... and reuse of other
vocabularies like event, foaf,
On Fri, Apr 30, 2010 at 9:09 AM, Ross Singer rossfsin...@gmail.com wrote:
I actually think this lack of any specified response format is a large
factor in the stagnation of OpenURL as a technology. Since a resolver
is under no obligation to do anything but present a web page it's
difficult
On Tue, Apr 27, 2010 at 7:02 AM, Jakob Voss jakob.v...@gbv.de wrote:
If you want to put bibliographic metadata
into twitter annotations (good idea) you first need to clarify the basic
purpose of embedding this information. I see two of them:
I. Identification: To identify other tweets and
On Wed, Apr 28, 2010 at 4:17 AM, Jakob Voss jakob.v...@gbv.de wrote:
P.S: An example of a CSL record from the JavaScript client:
{
title: True Crime Radio and Listener Disenchantment with Network
Broadcasting, 1935-1946,
author: [ {
family: Razlogova,
given: Elena
} ],
On Wed, Apr 21, 2010 at 6:08 AM, Tim Spalding t...@librarything.com wrote:
I'm inclined to start adding it to the I'm talking about and I'm
adding links on LibraryThing. I imagine it could be easily added to
many library applications too—anywhere there is or could be a share
this on Twitter
whoops, forgot my footnote :-)
[1]
http://groups.google.com/group/twitter-api-announce/browse_thread/thread/fa5da2608865453
On Sun, Apr 11, 2010 at 12:00 AM, Tim Spalding t...@librarything.com wrote:
While the new draft is written in a much friendlier tone, and even has
some improvements, it also takes away concrete rights that libraries
had in the earlier drafts, including the right to consider fully
theirs
Hey Charles,
How do these jobs end up on your blog again? It seems like you have a
lot of good content. Given the number of jobs that are posted on here
and on your site I wonder if it might be feasible to create a little
alerting/subscription service for code4lib folks who want a technical
job
I just found out about a code4lib regional full day meeting in the DC
area at the National Agriculture Library in Beltsville, Maryland. I
figured it couldn't hurt to announce it on here in case you have any
interest in attending. Agenda is below.
asideIt would be nice if the organizers of
On Wed, Apr 7, 2010 at 7:30 PM, William Denton w...@pobox.com wrote:
A quick note to anyone who's interested in Code4Lib North (6-7 May in
Kingston, Ontario) and isn't on the mailing list for it:
http://wiki.code4lib.org/index.php/North
Details about the mailing list are there.
On Wed, Apr 7, 2010 at 10:43 PM, William Denton w...@pobox.com wrote:
So far there are just three people with ideas for talks (me, Walter Lewis,
Art Rhyno). Have the other local chapters found it works well to have more
time for informal stuff, or lightning talks, or Ask Anything like I see
I realize it's of limited utility compared to yet another web2.0 API,
but I think it would be good to see Works represented somehow in the
RDF Linked Data views...assuming they're not already.
//Ed
On Tue, Mar 30, 2010 at 1:22 PM, Karen Coyle li...@kcoyle.net wrote:
Open Library now has Works
On Tue, Mar 30, 2010 at 1:52 PM, Karen Coyle li...@kcoyle.net wrote:
Ed, thanks. I'll need you to be a bit more -v on this one: are you asking
for a an RDF option on the API, or that Works as a whole be represented as
linked data? The Open Library doesn't present itself as linked data, as you
On Mon, Mar 29, 2010 at 3:45 PM, Lars Aronsson l...@aronsson.se wrote:
Wikipedia has 720 links to www.rockhall.com,
which is a dream for any website,
http://en.wikipedia.org/wiki/Special:LinkSearch/www.rockhall.com
I had no idea that Wikipedia provided a live search of external links.
Up until
On Mon, Mar 29, 2010 at 4:34 PM, Jonathan Rochkind rochk...@jhu.edu wrote:
Hmm, just because a given page is linked to from a wikipedia page, can one
assume that the target of the link is about the same thing as the original
page? I'm not sure how often this assumption would be violated?
Nice work Bill! I particularly like your use of the link element to
enable auto-discovery of these resources:
link rel=canonical href=/Record/005550418
link rel=alternate type=application/marc href=/Record/005550418.mrc
link rel=alternate type=application/marc+xml
On Thu, Dec 10, 2009 at 11:11 AM, LeVan,Ralph le...@oclc.org wrote:
The file contains tab delimited records. The first column is the ID
number of the FAST record that the term comes from. The second column
is the MARC Authorities field that the term came from. The third column
is the term
On Tue, Dec 8, 2009 at 10:16 AM, Karen Coyle li...@kcoyle.net wrote:
Couple of things: first, what we have at id.loc.gov is NOT LCSH, but a copy
of the LC subject authority file. The entries in this file form the basis
for subject headings, most of which add facets to the authority entry when
On Tue, Dec 8, 2009 at 11:08 AM, Ethan Gruber ewg4x...@gmail.com wrote:
my interest is how I (or other people
developing their own autosuggest systems based on the subject headings) can
pull updates of the master authority XML file into my index of terms.
Take a look at the Atom feed for
In the interests of all-that-is-agile, and the HolidaySeason™ I took a
quick (imperfect) stab at providing some basic suggest functionality
at id.loc.gov which you can find documented in the OpenSearch
Description:
http://id.loc.gov/authorities/opensearch/
I used the OpenSearch Suggestions
On Mon, Dec 7, 2009 at 11:48 AM, Ethan Gruber ewg4x...@gmail.com wrote:
Ben, another problem with digestibility of the search results is that it's
not XHTML, and therefore not well-formed XML, making it impossible to
process with XPath.
What page did you find that wasn't valid XHTML? The JSON
On Mon, Dec 7, 2009 at 3:18 PM, LeVan,Ralph le...@oclc.org wrote:
That looks more like an old-style index browse than what I'd think of as a
suggestion. You've returned nothing until the user has typed enough
characters to restrict the number of index terms to be returned to a rational
On Mon, Dec 7, 2009 at 3:43 PM, LeVan,Ralph le...@oclc.org wrote:
For VIAF, rankings are calculated based on the number of institutions
that have controlled that name and the amount of attention the
institutions have given to that name (e.g. size of their respective name
authority records).
Just a quick note to let you know pymarc v2.41 is available:
http://pypi.python.org/pypi/pymarc/
The change in this release is functionality from Mark Matienzo for
supporting reading/writing MARC records with non-numeric tags.
Thanks Mark!
//Ed
Hi Brendan:
Ahh the lovely MARC-8 :-)
It's a fair bit of effort I think. One approach could be to porting
the MARC8-Unicode functionality from pymarc [1,2]. It's only one-way,
but that's normally what most sane people want to do anyhow.
Another approach would be to look into wrapping yaz-iconv
My apologies for the extremely short notice. There is a Vocabulary
Camp taking place at the Library of Congress all day tomorrow and part
of Saturday (Oct 30-31). All are welcome to attend. More information
is available at:
http://vocamp.org/wiki/VoCampDCOctober2009VoCampDCOctober2009
You should also feel free to discuss AquaBrowser on here too ... the
code4lib discussion isn't limited to opensource software.
//Ed
On Wed, Oct 21, 2009 at 4:32 PM, Kathryn Frederick
kfred...@skidmore.edu wrote:
Please excuse cross-posting.
I've set up an AquaBrowser Google Group to share
Nice work Ross! Users of rubymarc might like to see the performance
enhancements that motivated you to do the nokogiri integration:
http://paste.lisp.org/display/87529
!!!
//Ed
On Wed, Sep 23, 2009 at 10:51 PM, Ross Singer rossfsin...@gmail.com wrote:
Hi everybody,
Apologies for the
Hi Erik, all
On Tue, Sep 15, 2009 at 1:12 PM, Erik Hetzner erik.hetz...@ucop.edu wrote:
I might be misunderstanding you, but, I think that you are leaving out
the implicit dimension of time here - when was the URL referenced?
What can we use to represent the tuple URL, date, and how do we
On Thu, Aug 27, 2009 at 4:37 PM, Keith Jenkinsk...@cornell.edu wrote:
Maybe someone more familiar with PURL systems can tell me... is there
any way to harvest data from a PURL server, so that a backup/mirror
can be available?
This would be a great question for the purl-dev discussion list too:
On Thu, Aug 13, 2009 at 3:10 PM, Karen Coyleli...@kcoyle.net wrote:
Does it work for folks if this returns either a cover OR a blank? (1x1 jpg).
It may be awkward to test first for an actual cover. Also, if it's ok to not
test for a cover, does anyone have a preference over the blank or a 404
On Tue, Aug 11, 2009 at 10:40 AM, Karen Coyleli...@kcoyle.net wrote:
Ed, I have NO IDEA how you got to rdf/xml from the OL author link -- do
tell, and I'll take a look! There is no RDF/XML export template for authors,
but one could be created. The URI/URL is simply the address of the author
On Tue, Aug 11, 2009 at 11:12 AM, Coombs, Karen
Akacoo...@central.uh.edu wrote:
I hate to say it but as someone who is working with the WorldCat Search API
at this point I sure could create code that used Open Library data faster if
you used one of the same metadata formats that the WorldCat
Hi Chris:
Assuming your files are available in a directory like:
http://ia360943.us.archive.org/1/items/talis_openlibrary_contribution/
why not use the manifest file:
http://ia360943.us.archive.org/1/items/talis_openlibrary_contribution/talis_openlibrary_contribution_files.xml
It will
On Thu, Jul 23, 2009 at 4:13 PM, Lovins, Danieldaniel.lov...@yale.edu wrote:
I nominate Jonathan Zittrain [1], co-founder of Harvard's Berkman Center for
Internet and Society [2] (where David Weinberger is a fellow [3]) and author
of The Future of the Internet and How to Stop It [4].
+1
Hi Wayne:
My advice would be to find a real, do-able, and hopefully exciting
problem where you work (or elsewhere) that can be solved with a bit of
automation. Once you've found something to work on, fish around for
the right tools(s) to solve the problem. You can use your peers in
here or
Here's another reference to add to some of the great ones so far
(hadn't seen that Norvig one before Chris!). Whatever you think about
Perl, it's hard to argue with Larry Wall's 3 great virtues of a
computer programmer:
http://www.c2.com/cgi/wiki?LazinessImpatienceHubris
//Ed
On Wed, Jul 22, 2009 at 1:18 PM, Wayne Lamwing...@gmail.com wrote:
So i start browsing on the internet about what OAI, Solr and all related
thing without a direction, reading books on what Semantic web is and
subscribe some mailing list of interesting projects. But for a newbie like
me, it was
Nice work Eric! You are right to focus on making the data resolvable
first. Ross' advice to work on exposing and interlinking more of your
own data next, instead of trying to link out elsewhere, is good I
think.
I noticed from your compact (and elegant) little Perl script that you
are expecting
On Wed, Apr 1, 2009 at 2:12 PM, Reese, Terry
terry.re...@oregonstate.edu wrote:
FYI for the larger group. Since many members in the PNW simply cannot
travel to the larger C4L meeting due to budgetary restraints (this year,
and very likely the next), etc -- we will be starting up a PNW local
On Wed, Apr 1, 2009 at 6:14 AM, Mike Taylor m...@indexdata.com wrote:
As usual, an ounce of example is worth a ton of exposition, so:
Suppose I always keep a PDF of my latest paper at
http://www.miketaylor.org.uk/latest.pdf
for the benefit of people who want to keep an eye on my
On Wed, Apr 1, 2009 at 12:28 PM, Ray Denenberg, Library of Congress
r...@loc.gov wrote:
Even more to the point: there is no sound definition of dereference. To
dereference a resource means to retrieve a representation of it. There has
never been any agreement within the w3c of what constitutes
The TGN is still behind a pay-firewall right? Not that that means it
isn't legit conversation on here (because it is) -- but just curious
what the current state is.
//Ed
One other suggestion here. If the issue is that amplee, django-atompub
or flatatompub lack the ability to layer in non-atom markup you might
want to look at adding this functionality to your favorite among them,
and seeing if the changes could be rolled back into the project.
I realize this is
Hi Godmar:
Perhaps this will seem backwards, but rather than trying to adapt (or
create) a generic AtomPub implementation I've been using vanilla
django, and just rolling my own atom feed and service documents as
standard templates. The advantage here is that my code is custom
suited to my task
Wow, this sounds too good to be true. Perhaps this is premature, but
do you think there might be interest in hosting a code4lib2010 in the
Netherlands? (he asks selfishly).
I see you started a wiki page [1]. If at any point you want
nl.code4lib.org (or something) to point somewhere just say the
On Thu, Jan 22, 2009 at 1:11 PM, Walker, David dwal...@calstate.edu wrote:
Some of us can barely afford to get to the east coast of the United States,
let alone Europe. Not that you have to cater to us poor state university
folk, or anything. ;-)
There are university folk in the Netherlands
, Jonathan Rochkind rochk...@jhu.edu wrote:
Who's making rules?
It's bad to discuss things without your peers before voting on them? I
think it's a fine idea to discuss things before just blindly voting on them
without discussion. Leads to better informed votes, does it not?
Ed Summers wrote
On Fri, Dec 5, 2008 at 9:08 PM, Michael Ang [EMAIL PROTECTED] wrote:
From what I've been reading it sounds like abbr with title is more of a
problem than span with title. So maybe span with title isn't too much
of a problem in practice? I suspect that whether the span is empty doesn't
make a
On Fri, Dec 5, 2008 at 10:50 AM, Godmar Back [EMAIL PROTECTED] wrote: would
it.
BTW, I don't see why screen readers would stumble over this when the
child of the span is empty. Do they try to read empty text? And if
a COinS is processed, we fix up the title so tooltips show nicely.
Yeah, if
On Tue, Dec 2, 2008 at 3:11 PM, Godmar Back [EMAIL PROTECTED] wrote:
COinS are still needed, in particular in situations in which multiple
resources are displayed on a page (like, for instance, in the search
results pages of most online systems or on pages such as
http://citeulike.org, or in a
One thing to keep in mind when looking at unAPI, COinS and
Microformats in general is:
http://www.bbc.co.uk/blogs/radiolabs/2008/06/removing_microformats_from_bbc.shtml
//Ed
Hi Karen:
I definitely think adding COinS to OpenLibrary pages could make sense.
I'm curious what everyone's use case is. Is it mainly browser plugins
that can inject links to a relevant OpenURL router so that you can
find books in your local context? If so I think use of COinS in
OpenLibrary
On Mon, Dec 1, 2008 at 11:05 PM, Karen Coyle [EMAIL PROTECTED] wrote:
I asked about COinS because it's something I have vague knowledge of. (And I
assume it isn't too difficult to implement.) However, if there are other
services that would make a bigger difference, I invite you (all) to speak
Would be nice to add these ideas to the code4lib wiki somewhere...
On Wed, Nov 26, 2008 at 11:36 AM, Jay Luker [EMAIL PROTECTED] wrote:
I know its a bit early to start thinking of these, but in the spirit of
William Denton's talk proposal, What We Talk About When We Talk About FRBR,
I thought
Yeah, but really a boring list of ideas would work well for now I think.
On Wed, Nov 26, 2008 at 1:32 PM, Ranti Junus [EMAIL PROTECTED] wrote:
I wonder if we could make it something like this:
http://electroniclibrarian.org/erlwiki/Thought_cloud
ranti.
On Wed, Nov 26, 2008 at 12:29 PM, Ed
I hope it's not too late to get this proposal for a Linked Data
code4lib 2009 pre-conference in:
http://wiki.code4lib.org/index.php/LinkedData
//Ed
On Tue, Nov 11, 2008 at 4:13 PM, Jay Datema [EMAIL PROTECTED] wrote:
I suggest using http://wiki.code4lib.org.
Also, it would be good to set dates and locations for these meetings
collaboratively on this listserv, I think.
Sorry to be late getting back on this -- so is it ok to retire
Until Kevin Clarke happened to mention it in #code4lib I didn't know
that the list of proposals so far is available at:
http://library.brown.edu/code4libcon09/proposals/list/
So if you are looking for inspiration, or to see the pool of talks so
far check it out. Maybe this was mentioned
It's may be worth pointing out that drupal4lib is scheduled to happen
the day after code4lib ends, and Darien, CT is quote close
geographically to Providence, RI ... so you could theoretically jump
from code4lib to drupal4lib.
Nice planning!
//Ed
On Mon, Nov 17, 2008 at 5:13 PM, Amanda
Joanna:
Could I suggest an item for the agenda: whether the nyc.code4lib.org
DNS record is necessary. Jay Datema asked me to set this up maybe a
month ago--which I did. But I don't believe the group itself is using
the server that this hostname resolves to. If you want to have
nyc.code4lib.org
If you are interested in Hadoop's distributed filesystem HDFS [1] you
might also be interested in Tahoe [2].
The downside to things like Hadoop and Tahoe as compared with S3 are
that you have to manage the machines and services yourself, rather
than paying someone else to do it in the cloud. But
On Wed, Oct 29, 2008 at 10:30 AM, Jonathan Rochkind [EMAIL PROTECTED] wrote:
Can anyone reccommend any good sources on how to do 'release management' in
a small distributed open source project. Or in a small in-house not open
source project, for that matter. The key thing is not something
On Mon, Sep 22, 2008 at 4:39 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote:
To me, a committee of volunteers that anyone interested can be on _is_ a
community decision.
Well it all depends on how the committee is selected doesn't it? If
it's people who care enough to volunteer, and are
On Mon, Sep 29, 2008 at 6:01 PM, Jonathan Rochkind [EMAIL PROTECTED] wrote:
I thought I remembered something about Casey Bisson doing exactly that with
a grant/award he received? I forget what happened to it. A snapshot would
just be a snapshot of course, it wouldn't include records created or
How about we allow anyone to submit ideas, and use some of the $$ like
Roy suggested to get a professional one from someone--and then we vote
on all of them? I nominate Roy for coordinating the pro-design, and
the vote :-)
//Ed
Here are a few ideas., since you asked (I think?)
Stefano Mazzocchi: http://www.betaversion.org/~stefano/
Creator or Cocoon, one of the people behind the Simile project, now at
MetaWeb ... I think Stefano's experience with public speaking, open
source software, data processing and the web would
You are hereby invited to RepoCamp in Washington DC on July 25th.
http://barcamp.pbwiki.com/RepoCamp
RepoCamp is a one-day free and open event where folks who are
interested in managing and creating digital repository software and
their contents can gather and share ideas, innovations, trials
On Tue, Jun 24, 2008 at 4:31 PM, Hahn, Harvey [EMAIL PROTECTED] wrote:
Same experience here. I responded directly to Karen:
See http://catalog.sanfordberman.org/ -- it refers to licensing from
the Univ of Illinois Archives. When the data first became available a
number of years ago, I
I actually created a minimal RubyOnRails view on the Berman data a few
years ago.
http://catalog.sanfordberman.org
The src code for this is readily available if anyone is interested in
helping Madeline work on it:
http://inkdroid.org/bzr/berman/
The data has an interesting history, for
a conversation with Ed Summers quite a while ago and he encouraged me to go
ahead and request it be added but I never followed through.
Yikes, can someone add Steve's blog to the planet :-)
http://www.familymanlibrarian.com/
//Ed
-2.21 ./readloc.py
- Godmar
On Tue, Apr 29, 2008 at 8:20 AM, Ed Summers [EMAIL PROTECTED] wrote:
You could download a snapshot of the full LC back file at the Internet
Archive (kindly donated by Scriblio).
http://www.archive.org/details/marc_records_scriblio_net
You could download a snapshot of the full LC back file at the Internet
Archive (kindly donated by Scriblio).
http://www.archive.org/details/marc_records_scriblio_net
Then run a script using your favorite MARC parsing library (mine
currently is pymarc):
from pymarc import MARCReader
for
On Mon, Mar 31, 2008 at 1:12 PM, Dr R. Sanderson
[EMAIL PROTECTED] wrote:
my poor mailbox ;)
I now open up discussion of real smtp clients, and the pros/cons of
mail filtering ... Lets just get this all out of our system :-)
Yours,
//Ed
PS shouldn't the discussion list set the From header to
On Mon, Mar 31, 2008 at 1:54 PM, David Fiander [EMAIL PROTECTED] wrote:
Vi is just as programmable as emacs. It's possible to write a vi macro
that runs a turing machine.
Yeah, and XSLT is Turing complete but that doesn't mean it's actually
a good thing to use :-)
//Ed
PS does this thing go
I think Allen just didn't know about oss4lib:
http://tech.groups.yahoo.com/group/linuxinlibraries/message/858
Now he does, so maybe he goes forward with his plans to create a new
community--or maybe he doesn't. Either way I don't see that either is
a problem. On the one hand it's nice to work
asideSpeaking of the *.code4lib.org domain .. if there's a code4lib
service you'd like to offer on your own server and want it at the
code4lib domain let me know and I'll add an ARECORD for you to
DNS./aside
Just a quick heads up about the Simple Knowledge Organization System
(SKOS) Reference was just released as a W3C Public Working Draft [1].
I apologize if you've already seen this announcement elsewhere.
SKOS is aimed at making a range of knowledge organization systems
(classification schemes,
On Jan 23, 2008 6:34 PM, Doran, Michael D [EMAIL PROTECTED] wrote:
I'm wondering if we'll soon reach the point where the perl4lib list is not
worth subscribing to. I'm not looking for any definitive answer here... I
guess I'm just a little sad to see an old favorite lose its relevance. :-/
fwiw, my proposed solution was to use MARC::File::XML from the
marc-xml cpan module [1]
use Net::OAI::Harvester;
use MARC::File::SAX;
my $url = 'http://memory.loc.gov/cgi-bin/oai2_0';
my $harvester = Net::OAI::Harvester-new(baseURL = $url);
my $response =
Hey Eric:
N::O::H uses XML::SAX for XML parsing, which provides a standard
interface to multiple back end XML parsers, and also provides a
facility known as XML Filters [1].
Net::OAI::Record::OAI_DC is an example of a SAX filter which receives
SAX events for each metadata record in a response
Another option if you are in Perl land would be to take a look at Tim
Brody's HTTP::OAI library [1] which returns XML::DOM::Document objects
for record metadata, which you can walk around in and use to evaluate
xpaths:
--
use HTTP::OAI;
my $harvester = HTTP::OAI::Harvester-new(
To OASIS members, Public Announce Lists:
The OASIS Search Web Services TC has recently approved for public review the
following Discussion Document:
Search Web Services v1.0 Discussion Document
This document: Search Web Services Version 1.0 - Discussion Document - 2
November 2007, was prepared
On 10/22/07, Jakob Voss [EMAIL PROTECTED] wrote:
I doubt that archiving weblogs is that complicated [1]! You need a
harvester (partly implemented in many Feed-Reader), an archive (you
could start with just saving validated ATOM-Files), an index (Solr?) and
a reader (also already implemented in
Thanks for posting this Jakob. I was just reading RFC 5005 on the
train yesterday (literally) and the parallels between it and OAI-PMH
struck me as well. It's not quite clear to me how deleted records
would be handled with an atom archive feed. But I guess one could
assume if the identifier is no
On 9/25/07, Andrew Nagy [EMAIL PROTECTED] wrote:
This topic came up a few weeks ago on code4lib too, where were you Ed!? :)
Sorry Andrew, I was probably busy executing @blockparty in #code4lib :-)
I will echo something that Roy mentioned in the thread from a few weeks
back, would the LOC
It's funny this subject just came up on one of the open-library
discussion lists this week [1]. A whiles ago now Rob Sanderson, Brian
Rhea (University of Liverpool) and I pulled down the LC Classification
Outline pdf files, converted them to text, wrote a python munger to
convert the text into
There is a nice list of wiki farms at:
http://en.wikipedia.org/wiki/Comparison_of_Wiki_farms
//Ed
101 - 200 of 215 matches
Mail list logo