Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Art Rhyno
>I'd agree... the nice thing about the WebDAV interface is that is >works with all these backends! Whether SVN, a native XML database, or >a file system... all have WebDAV interfaces. I've been very >interested over the years watching Art (and Ross) experimenting with >WebDAV and catalog records.

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Ross Singer
On 11/27/06, Kevin S. Clarke <[EMAIL PROTECTED]> wrote: WebDAV and catalog records. Have there been any code4lib proposals along these lines? I'd vote for that. And if I get around to writing the voting module, I'd make sure you could vote for that one twice... Seriously, please don't get h

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Kevin S. Clarke
On 11/27/06, Erik Hatcher <[EMAIL PROTECTED]> wrote: My ideal solution is WebDAV interface with commit hooks into Solr. With Subversion behind the scenes of the WebDAV interface, or an XML database, or file system or whatever. :) I'd agree... the nice thing about the WebDAV interface is that i

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Kevin S. Clarke
On 11/27/06, Andrew Nagy <[EMAIL PROTECTED]> wrote: My only concern about lucene is the lack of a standard query language. I went down the native XML database path because of XQuery and XSL, does something like lucene and solr offer a strong query language? Is it a standard? What if someone de

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Kevin S. Clarke
On 11/27/06, Bess Sadler <[EMAIL PROTECTED]> wrote: At UVa we have been burned several times by poor search performance of XML native databases. In light of this, we're starting to look at the database and the index as separate but cooperative pieces of the application. That way you can use solr

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Erik Hatcher
On Nov 27, 2006, at 5:49 PM, Andrew Nagy wrote: My only concern about lucene is the lack of a standard query language. I went down the native XML database path because of XQuery and XSL, does something like lucene and solr offer a strong query language? Is it a standard? What if someone develop

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Erik Hatcher
On Nov 27, 2006, at 5:04 PM, Jonathan Rochkind wrote: Bess Sadler wrote: application. That way you can use solr / lucene for search, faceted browse, etc, and your XML database only for known item retrieval, which it is generally able to do without performance issues. I'm hopping up and down wai

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Erik Hatcher
On Nov 27, 2006, at 6:12 PM, Binkley, Peter wrote: Fair point, and that's how my current solr-based project works. I'm thinking I would like the other advantages of an XML db: the ability to run xqueries, batch updates, etc., alongside the Lucene searching. And I want them integrated under the h

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Erik Hatcher
On Nov 27, 2006, at 5:46 PM, Binkley, Peter wrote: You've got enough flexibility in the way you set up your Lucene index, and Lucene search results give you access to the term weights for each hit, It does? so you can tell which fields actually matched. You can? I'm curious how you're doin

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Erik Hatcher
On Nov 27, 2006, at 5:19 PM, Andrew Nagy wrote: BTW, have you played with Hadoop? I guess it's something like the open-source attempt to google's search algorithm. I would be curious about implementing hadoop across a few servers to store the marcxml records. Hadoop is not a search engine. I

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Casey Durfee
Lucene has a pretty well-specified search syntax which is unlikely to change all that much, even though it's not a standard. It's not perfect, but I think it's pretty good. Overview here: http://lucene.apache.org/java/docs/queryparsersyntax.html I believe Solr adds a bit to the standard

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Binkley, Peter
Fair point, and that's how my current solr-based project works. I'm thinking I would like the other advantages of an XML db: the ability to run xqueries, batch updates, etc., alongside the Lucene searching. And I want them integrated under the hood in Solr so that people smarter than me will mainta

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Andrew Nagy
Binkley, Peter wrote: There would probably be a lot of optimizations you could do within Solr to help with this kind of thing. Art and I talked a little about this at the ILS symposium: why not nestle the XML db inside Solr alongside Lucene? Solr could then manage the indexing of the contents of

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Andrew Nagy
Casey Durfee wrote: I am writing a Solr-powered OPAC right now and have not had any performance problems (either indexing or querying) using Solr for both data storage and search. You can indicate in Solr whether you want particular data fields to be stored, indexed or both. So I stick the

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Binkley, Peter
I don't see any aggravation of this problem because of the hybrid approach Bess is proposing. You've got enough flexibility in the way you set up your Lucene index, and Lucene search results give you access to the term weights for each hit, so you can tell which fields actually matched. There woul

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Andrew Nagy
Bess Sadler wrote: Hi, Andrew. Since this will be an all-day event, the session would be starting first thing in the morning on Feb 27. I'm thinking 9am, but I haven't confirmed that with anyone else. I'm just flying by the seat of my pants here. I wouldn't be able to make this then due to tim

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Casey Durfee
I am writing a Solr-powered OPAC right now and have not had any performance problems (either indexing or querying) using Solr for both data storage and search. You can indicate in Solr whether you want particular data fields to be stored, indexed or both. So I stick the entire MARC record in

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Jonathan Rochkind
Bess Sadler wrote: application. That way you can use solr / lucene for search, faceted browse, etc, and your XML database only for known item retrieval, which it is generally able to do without performance issues. I'm hopping up and down waiting for someone to take this approach with an ILS, so p

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Bess Sadler
On Nov 27, 2006, at 3:54 PM, Andrew Nagy wrote: Could you please describe the time for when this might occur as I have already booked my flight into Atlanta for late in the afternoon so I would need to change that if you plan on having the session earlier in the day. Hi, Andrew. Since this wil

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Andrew Nagy
Bess Sadler wrote: Enough people are interested in ILS related topics that it might be worth forming groups around specific ILS products. If you are one of these people, email the list if you're interested in setting up such a thing. Bess, this sounds like a great conversation. You can count

Re: [CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Edward Summers
On Nov 27, 2006, at 2:37 PM, Bess Sadler wrote: Comments? Suggestions? Sounds great! If you get a chance add a story to http://code4lib.org when things get solidified. //Ed

Re: [CODE4LIB] code4lib pre-conference: lucene and solr workshop

2006-11-27 Thread Doran, Michael D
I responded to Bess regarding question #1. (A "yes, I am interested.") > 2. What kind of format would folks like to see for this event? My personal preference is not to travel with a laptop, plus I'm old-school, so I would favor more of a lecture/presentation format. > 3. Some folks on #code4li

[CODE4LIB] code4lib lucene pre-conference

2006-11-27 Thread Bess Sadler
Since the request I sent out last week, I've received quite a lot of email expressing interest in a lucene pre-conference, but it hasn't been an overwhelming amount. Based on this, I think it's safe to reserve the smaller, wi-fi enabled room that we've been discussing, and to plan for a max of 40

[CODE4LIB] LibXML tutorial?

2006-11-27 Thread Thomas Dowling
Can anyone point me to a gentle, example-laden tutorial for using Perl's LibXML module? The few examples I can find online all point back to a single O'Reilly example that does precisely nothing when I try it. -- Thomas Dowling [EMAIL PROTECTED]