That example gil scott heron circle of stone returns a JHU record as
the first hit, but the item is his thesis and the url is found in the
Wikipedia article on Gil Scott-Heron, so it has a lot of pagerank
independently of the catalogue. I'm just curious: can you control for
that kind of
The U of Alberta Libraries have extended the search for a Digital
Preservation Officer/Librarian. This is a new position, open to those
with and without the MLIS, and to international applicants. The core of
it is policy and planning for our digital preservation activities, with
the first target
Apologies for cross-posting -
The University of Alberta Libraries seek a Digital Initiatives
Applications Librarian to work as a member of the Digital Initiatives
working group. The Libraries' digital initiatives are expanding and
diversifying quickly, presenting a need for constant innovation and
You could start with the exact title search as expressed in the Bath
Profile: http://www.collectionscanada.gc.ca/bath/tp-bath2.9-e.htm#a .
But you may well have to tinker to discover the combination that your
server will accept and interpret the way you want it to.
All the best,
Peter
And beyond Tesseract is Ocropus (http://code.google.com/p/ocropus/),
which uses Tesseract (and eventually other ocr engines) to generate
positional OCR in an HTML format. I wonder if you could process that
HTML slightly to put the TIFF in the background, then use an HTML to PDF
tool to generate
The announcement talks about a template to inform ... to guide ... to
influence I think such a project would do a lot for the
credibility of any open-source ILS that could gain its blessing.
Academic administrators currently tend to perceive these efforts as way
too bottom-up, and oriented to
Or just ask the speakers to download, sign and send in a new form?
Peter
-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of Dan Scott
Sent: Thursday, March 08, 2007 7:19 AM
To: CODE4LIB@listserv.nd.edu
Subject: Re: [CODE4LIB] Posting the conference video?
I've taken the liberty of adding a stalkers section at the end, for
those of us who will be there in spirit but not in the flesh.
Peter
-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of
Roy Tennant
Sent: Thursday, February 22, 2007 8:46 PM
To:
To: CODE4LIB@listserv.nd.edu
Subject: Re: [CODE4LIB] 2007 Conference Attendee List
I'm with you, Peter, in not being there. But I renamed the section to
lurkers, because stalkers just sounded too creepy...
-Keith
On 2/23/07, Binkley, Peter [EMAIL PROTECTED] wrote:
I've taken the liberty of adding
I was thinking of this as similar to the code4libcon process: put up
lots of proposals, set some kind of target size, and open up voting,
maybe with some tampering by the editor(s) to make sure the final
product has some balance. I had thought about an All-Star Game rule as
well: every blog in
That would be great. I've got a MODS-to-Solr xsl to share as well. Where
would be a good place to post these, along with relevant Solr schemas?
Peter
-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of
Andrew Nagy
Sent: Tuesday, February 13, 2007 9:18 AM
In hunting for data to help model subject faceting for MARC records,
I've just been looking at Bill Moen's Zinterop report
(http://www.unt.edu/zinterop/ZInterop2/Documents/ZInterop2FinalReport_we
m4Dec2005.pdf). It occurs to me that with all our various projects
working on indexing MARC records in
I get this error when I try to visit the tshirt page:
Fatal error: Call to undefined function format_name() in
/var/www/code4lib.org/htdocs/themes/sunflower/sunflower.theme on line
183
Peter
-Original Message-
From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED]
On Behalf Of Ross Singer
Those are impressive salaries! What's the cost of living like in
Washington?
Peter
-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of
Edward Summers
Sent: Monday, January 22, 2007 11:46 AM
To: CODE4LIB@listserv.nd.edu
Subject: [CODE4LIB] Information
This is why I think we should figure out smart ways to manage facets
independently of Lucene index fields. Solr populates a facet by setting
up a bitset for every value found in a given index field, and it uses
those bitsets to filter query result sets by deriving an intersection
set. We can
data from Voyager into XML?)
Peter,
What is a bitset? Is it a Java class or a more primitive data
structure?
Alan
On 19-Jan-07, at 2:32 PM, Binkley, Peter wrote:
This is why I think we should figure out smart ways to manage facets
independently of Lucene index fields. Solr populates a facet
I really think we need to look at ways to manage dynamic record-level data like
circ status separately from the bibliographic metadata. To display that info,
we can do a real-time lookup; but to use it in the faceted search interface we
need a smarter solution. If we can figure out how to
Wasn't there also some discussion at one point about holding it in
Venice?
Peter
-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of
Edward Summers
Sent: Thursday, January 04, 2007 7:40 AM
To: CODE4LIB@listserv.nd.edu
Subject: [CODE4LIB] code4lib london
As we move towards experimenting with a Solr-based opac I'm hoping to
persuade everyone involved that MODS is sufficient to drive the search
interface. Let MARC abide in the ILS, and become a mere spirit of malice
that gnaws itself in the shadows, but cannot again grow or take shape.
Peter
Re the eXist 1.1 development line: I'm tinkering with that now - tried
populating two different collections at the same time over webdav
connections from two different machines, and ended up with a corrupt db
(content from one source ended up in documents supposedly written by the
other). Darn.
I don't see any aggravation of this problem because of the hybrid
approach Bess is proposing. You've got enough flexibility in the way you
set up your Lucene index, and Lucene search results give you access to
the term weights for each hit, so you can tell which fields actually
matched.
There
Subject: Re: [CODE4LIB] code4lib lucene pre-conference
Binkley, Peter wrote:
There would probably be a lot of optimizations you could do within Solr
to help with this kind of thing. Art and I talked a little about this
at the ILS symposium: why not nestle the XML db inside Solr alongside
Lucene
And ideally make the editing functions accessible via APIs as well, so
that e.g. if I edit an item in Zotero in such a way that captures
FRBR-relevant information, Zotero can shoot off an update.
Peter
-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of
Tim
This was more or less what I was thinking of in my hackfest suggestion
to embed Lucene in a Firefox extension; but I hadn't thought of using it
to access pre-distributed Lucene indexes. That might be very handy.
(Though a Firefox-only approach probably isn't what Eric has in mind).
Would it be
And if you built the search interface in Cocoon (using its Lucene
transformer), you'd have Jetty built in, making the search application
bootable on any os with a jvm.
Peter
-Original Message-
From: Code for Libraries [mailto:[EMAIL PROTECTED] On Behalf Of
Edward Summers
Sent: Friday,
If you have stylesheets for the individual flavours, it would be easy to
include them into a single master stylesheet and run the appropriate one
based on a version attribute etc. And you could run the input through
Tidy first to guarantee well-formed xml.
Peter
-Original Message-
From:
One question is certainly, Who will this journal serve?
The more I think about it, the more I think the main justification for a
code4lib journal is to get our stuff noticed more. There are too many
enthusiastic Library 2.0 bloggers who spend their time talking about
non-library Web 2.0
I agree with Ed Corrado that the purpose of the peer-review process is
to improve the articles, not to give thumbs-up or thumbs-down. How about
making the review process consist of submitting an article into a wiki
(with proper discussion page etc.) and letting it simmer there for a
while before
Eric and Mike wrote:
Maybe I should draw search results from MyLibrary and not
swish-e to
display characters correctly? If I draw content from many global
sources, then how do I know what character set to use for display?
This is definitely the best thing to do. Search the
29 matches
Mail list logo