Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Erik Hatcher
On Jan 17, 2007, at 3:26 PM, Andrew Nagy wrote: One thing I am hoping that can come out of the preconference is a standard XSLT doc. I sat down with my metadata librarian to develop our XSLT doc -- determining what fields are to be searchable what fields should be left out to help speed up

Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Birkin James Diana
On Wed, 17 Jan 2007, Nathan Vack wrote: Sure isn't elegant, but as our Real Systems Guys don't want us to look at the production Oracle instance (performance worries), we've had pretty good luck screen-scraping holdings and status data, once we get a Bib ID. Ugly, but functional, and

Re: [CODE4LIB] Limiting by availability (was Re: [CODE4LIB] Getting data from Voyager into XML?)

2007-01-19 Thread Emily Lynema
At NCSU, we don't have any survey or focus group data about user interest in limiting by availability. But we do have an availability limit on our catalog search results page. It's a link at the top of the page that says 'limit results to currently available items'.

Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Erik Hatcher
Tod, Great information. I apologize for being a late comer to the game and bringing up FAQs. What about date normalization? One thing that must be considered when doing faceted browsing is that it works best with some pre-processed data, such as years rather than full dates. The question

Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Doran, Michael D
Since we can't SQL-query our own ILS data directly... I don't know why we tolerate such limitations in our contractual agreements. For Voyager ILS libraries the can't SQL-query is strictly a local limitation imposed by local IT fiat. Many, if not most of us, SQL-query to our hearts

[CODE4LIB] a few code4lib conference updates

2007-01-19 Thread LaJeunesse, Brad
Hi Folks, I have a few updates on the code4lib conference. First, we're closing in fast on our registration cap of 140. As of this morning, I believe there are 3 open slots left. After we reach 140 registrants, the conference center will maintain a waiting list in case someone cancels. Second,

[CODE4LIB] ISBD punctuation was [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Walter Lewis
Erik Hatcher wrote: [snip] I am, however, skeptical of a purely MARC - XSLT - Solr solution. The MARC data I've seen requires some basic cleanup (removing dots at the end of subjects, normalizing dates, etc) in order to be useful as facets. While XSLT is powerful, this type of data manipulation

Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Kevin S. Clarke
On 1/19/07, Jonathan Gorman [EMAIL PROTECTED] wrote: There's also been work done to provide libraries and the like in XSLT. EXSLT comes to mind right away. That's what I was wondering... and wondering whether CDL's date normalization library would be useful. Kevin

Re: [CODE4LIB] ISBD punctuation was [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread David J. Fiander
Walter Lewis wrote: Perhaps what Erik's put his finger on here is as good an excuse as any to raise the Death To ISBD Punctuation banner one more time. Some 60s/70s field termination punctuation rules are at the heart of most of the crud you're trying to scrape off these records. If ever

Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Roy Tennant
Finally getting caught up on this discussion, see http://www.cdlib.org/inside/diglib/datenorm/ if you want to mess around with our date normalization utility (it's in Java). Roy On 1/19/07 7:52 AM, Kevin S. Clarke [EMAIL PROTECTED] wrote: On 1/19/07, Jonathan Gorman [EMAIL PROTECTED] wrote:

Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Walker, David
One thing I am hoping that can come out of the preconference is a standard XSLT doc. Is there XSLT to do this sort of thing with dates available? I am, however, skeptical of a purely MARC - XSLT - Solr solution. Most XSLT processors do, of course, allow you to write your own extension

Re: [CODE4LIB] Limiting by availability (was Re: [CODE4LIB] Getting data from Voyager into XML?)

2007-01-19 Thread Binkley, Peter
This is why I think we should figure out smart ways to manage facets independently of Lucene index fields. Solr populates a facet by setting up a bitset for every value found in a given index field, and it uses those bitsets to filter query result sets by deriving an intersection set. We can

Re: [CODE4LIB] Limiting by availability (was Re: [CODE4LIB] Getting data from Voyager into XML?)

2007-01-19 Thread Binkley, Peter
It's a Java object. Solr actually replaces it with a different implementation called OpenBitSet: http://lucene.apache.org/solr/api/org/apache/solr/util/OpenBitSet.html , for performance. Solr's use of them in managing facets is discussed in the wiki:

Re: [CODE4LIB] a few code4lib conference updates

2007-01-19 Thread Nathan Vack
On Jan 19, 2007, at 9:51 AM, LaJeunesse, Brad wrote: I must strongly encourage everyone attending to bring fully-charged laptops and spare batteries (if you have them). The auditorium has 60 power outlets available, which gives us roughly a 2:1 ratio of outlets to people. Spare batteries

Re: [CODE4LIB] a few code4lib conference updates

2007-01-19 Thread Andrew Nagy
Nathan Vack wrote: On Jan 19, 2007, at 9:51 AM, LaJeunesse, Brad wrote: I must strongly encourage everyone attending to bring fully-charged laptops and spare batteries (if you have them). The auditorium has 60 power outlets available, which gives us roughly a 2:1 ratio of outlets to people.

[CODE4LIB] Solr indexing -- why XSL is the wrong choice

2007-01-19 Thread Casey Durfee
I think there are many good reasons why XSLT is absolutely the wrong tool for the job of indexing MARC records for Solr. 1) Performance/Speed: In my experience even just transforming from MARCXML to MODS takes a second or two (using the LoC stylesheet), due to the stylesheet's complexity

Re: [CODE4LIB] Getting data from Voyager into XML?

2007-01-19 Thread Tod Olson
On Jan 19, 2007, at 6:53 AM, Erik Hatcher wrote: Great information. I apologize for being a late comer to the game and bringing up FAQs. Well, I'm not certain Q is A-ed very F. What about date normalization? One thing that must be considered when doing faceted browsing is that it works

Re: [CODE4LIB] Solr indexing -- why XSL is the wrong choice

2007-01-19 Thread Casey Durfee
I'm perfectly willing to be persuaded to the 'light' side as well and I'm looking forward to learning more about your project as well, which is much more mature than mine at this point ... I'm just interested in something that works and is easily tweakable. I don't hold a lot of hope that a

[CODE4LIB] net::delicious

2007-01-19 Thread Eric Lease Morgan
Is it possible use the Net::Delicious Perl module (or even just the Delicious API) to query del.icio.us return a list of recommended tags for that URL. The Web-based interface to del.icio.us recommends sets of tags upon entering a URL, and I would like to capture these tags programatically for