Re: Lucene in the Humanities

2005-02-19 Thread Paul Elschot
Erik, On Saturday 19 February 2005 01:33, Erik Hatcher wrote: On Feb 18, 2005, at 6:37 PM, Paul Elschot wrote: On Friday 18 February 2005 21:55, Erik Hatcher wrote: On Feb 18, 2005, at 3:47 PM, Paul Elschot wrote: Erik, Just curious: it would seem easier to use multiple fields

Re: Scalability of Lucene indexes

2005-02-19 Thread Andy
Hi Bryan, How big is your index? Also, what is the advantage of binding a user to a server? Thanks. Andy --- Bryan McCormick [EMAIL PROTECTED] wrote: Hi Chris, I'm responsible for the webshots.com search index and we've had very good results with Lucene. It currently indexes over 100

Re: Lucene in the Humanities

2005-02-19 Thread Erik Hatcher
On Feb 19, 2005, at 3:52 AM, Paul Elschot wrote: By lowercasing the querytext and searching in title_lc ? Well sure, but how about this query: title:Something AND anotherField:someOtherValue QueryParser, as-is, won't be able to do field-name swapping. I could certainly apply that

Re: Lucene vs. in-DB-full-text-searching

2005-02-19 Thread Steven J. Owens
On Fri, Feb 18, 2005 at 04:45:50PM -0500, Mike Rose wrote: I can comment on this since I'm in the middle of excising Oracle text searching and replacing it with Lucene in one of my projects. Interesting, particularly as it's from somebody who's already tried an existing in-db fulltext

Re: Lucene in the Humanities

2005-02-19 Thread Paul Elschot
On Saturday 19 February 2005 11:02, Erik Hatcher wrote: On Feb 19, 2005, at 3:52 AM, Paul Elschot wrote: By lowercasing the querytext and searching in title_lc ? Well sure, but how about this query: title:Something AND anotherField:someOtherValue QueryParser, as-is, won't be

MultiFieldQueryParser 1.8 isn't parsing phrases

2005-02-19 Thread Ben
Hi When I try to search for phrases using the MultiFieldQueryParser v1.8 from CVS, it gives me NullPointerException. Using the following keyword works: title:IBM backs linux However, it gives me the exception if I use the following keyword: IBM backs linux Any idea why? I am using this

Re: Search Performance

2005-02-19 Thread sergiu gordea
Michael Celona wrote: My index is changing in real time constantly... in this case I guess this will not work for me. Any suggestions... Using a singleton pattern for your index searcher makes sense anyway... I don't think that you change the index after each search. The computing

Re: MultiFieldQueryParser 1.8 isn't parsing phrases

2005-02-19 Thread Daniel Naber
On Saturday 19 February 2005 15:26, Ben wrote: When I try to search for phrases using the MultiFieldQueryParser v1.8 from CVS, it gives me NullPointerException. This has just been fixed in SVN (I assume you mean SVN, CVS still exists but is read only and probably not updated anymore).

Re: MultiFieldQueryParser 1.8 isn't parsing phrases

2005-02-19 Thread Ben
Thanks On Sat, 19 Feb 2005 16:09:49 +0100, Daniel Naber [EMAIL PROTECTED] wrote: On Saturday 19 February 2005 15:26, Ben wrote: When I try to search for phrases using the MultiFieldQueryParser v1.8 from CVS, it gives me NullPointerException. This has just been fixed in SVN (I assume

Re: Scalability of Lucene indexes

2005-02-19 Thread Praveen Peddi
We are doing the exact same thing. We didn't test with so many documents. The most we have tested so far is 3 million documents with a 3GB file size. I would be interested in seeing how you maintained replicated indices that are in sync. The way we did it was to run the indexer on each server independently.

Mail Archive Broken?

2005-02-19 Thread Owen Densmore
I just beamed into the archive: http://mail-archives.apache.org/eyebrowse/SummarizeList?listId=30 ..and it only has through Feb 1! What's up? Owen

Re: Scalability of Lucene indexes

2005-02-19 Thread Bryan McCormick
Our index is currently about 40GB. The advantage of binding a user is that once a search is performed, caching within Lucene and in the application is very effective if subsequent searches go back to the same box. Our initial searches are usually in the sub-100 ms range while subsequent

JavaLobby Lucene presentation

2005-02-19 Thread Erik Hatcher
I recorded a "Meet Lucene" presentation at JavaLobby. It is a multimedia Flash video, just over 20 minutes long, that shows slides with my voice recorded over them (you can jump to specific slides). Check it out here: http://www.javalobby.org/members-only/eps/meet-lucene/index.html?