Re: NIOFSDirectory

2008-12-05 Thread Glen Newton
Understood. Thanks! :-) -glen 2008/12/4 John Wang <[EMAIL PROTECTED]>: > NIOFSDirectory.getDirectory simply calls the static method on the parent > class: FSDirectory.getDirectory. > Which returns an instance of FSDirectory. > > IMO: NIOFSDirectory solves concurrent read problems, generally you d

Re: Sorting documents without a query

2008-12-05 Thread John Wang
The obvious way is to use MatchAllDocsQuery with Sort parameters on the searcher, e.g. searcher.search(new MatchAllDocsQuery(),sort); If you only care about one sort spec (i.e. no secondary sort to break ties), it may be faster to just traverse the term table, since that is already sorted. -John
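To illustrate why walking the term table can stand in for a sort, here is a minimal sketch that does not use Lucene's API at all. A java.util.SortedMap plays the role of the term dictionary for one field, and the class and method names (TermOrderTraversal, docsInTermOrder) are hypothetical. It assumes each document has exactly one value in the field; a document without a value would be missing from the result, and a document with several values would show up more than once.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.SortedMap;

// Illustration only: a SortedMap stands in for a field's term dictionary.
// This is NOT Lucene code; the names here are hypothetical.
class TermOrderTraversal {

    // Returns document ids ordered by their term. The map is already
    // sorted by term, so one pass over its entries yields sorted output
    // without any comparison-based sort of the documents themselves.
    static List<Integer> docsInTermOrder(SortedMap<String, List<Integer>> termToDocs) {
        List<Integer> ordered = new ArrayList<>();
        for (Map.Entry<String, List<Integer>> entry : termToDocs.entrySet()) {
            // All documents sharing a term are ties; they are appended
            // in the order they were stored for that term.
            ordered.addAll(entry.getValue());
        }
        return ordered;
    }
}
```

Ties stay in posting order here, which matches the "no secondary sort" case described above.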

Re: NIOFSDirectory

2008-12-05 Thread Wouter Heijke
This is 2.9 code. For 2.4 you're stuck with the system property. Wouter > Thanks! > -John > > On Thu, Dec 4, 2008 at 2:16 PM, Yonik Seeley <[EMAIL PROTECTED]> wrote: > >> Details in the bug: >> https://issues.apache.org/jira/browse/LUCENE-1451 >> >> Use this constructor to create an instance of N

Save big arrays in lucene document

2008-12-05 Thread Zender00
Hello, I have to store 2 arrays (1 int array, 1 String array (approx. length per element: 30 chars)) with a count of 100-10,000 elements in a Lucene document. At this time, I implode each array (joined with ';' as the separator) and save the value as a string. This works perfectly for small array-sizes (e.g. l
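The implode/explode step described above could look roughly like the following sketch. It uses only the Java standard library; the class and method names (ArrayFieldCodec, joinInts, and so on) are hypothetical and not taken from the original message. It assumes that no String element contains ';' and rejects input that does, since such a value could not be split back correctly.

```java
import java.util.Arrays;
import java.util.stream.Collectors;

// Hypothetical helper: packs arrays into one ';'-separated string
// (suitable for a single stored field value) and unpacks them again.
class ArrayFieldCodec {
    private static final String SEP = ";";

    static String joinInts(int[] values) {
        return Arrays.stream(values)
                .mapToObj(i -> Integer.toString(i))
                .collect(Collectors.joining(SEP));
    }

    static int[] splitInts(String stored) {
        if (stored.isEmpty()) {
            return new int[0]; // an empty array was stored as ""
        }
        return Arrays.stream(stored.split(SEP, -1))
                .mapToInt(Integer::parseInt)
                .toArray();
    }

    static String joinStrings(String[] values) {
        for (String value : values) {
            // Assumption: elements never contain the separator.
            if (value.contains(SEP)) {
                throw new IllegalArgumentException("element contains separator: " + value);
            }
        }
        return String.join(SEP, values);
    }

    static String[] splitStrings(String stored) {
        if (stored.isEmpty()) {
            return new String[0]; // note: [""] and [] both encode to ""
        }
        // limit -1 keeps trailing empty elements instead of dropping them
        return stored.split(SEP, -1);
    }
}
```

With around 10,000 elements of about 30 characters each, the joined String value is roughly 300 KB, so the cost of this approach grows with the array size.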

Re Lucene analyzers

2008-12-05 Thread Diego Cassinera
In the analyzer, the function public final TokenStream tokenStream(String fieldName, Reader reader) usually creates a bunch of new filters. From what I have seen in most of these filters, none of them use class member variables. Has anybody tried making them static to avoid the creation of new objects?

Deleting documents from index

2008-12-05 Thread Gunnar Wurl
Hi, I am trying to delete documents from an index. When using the IndexReader#deleteDocuments(Term) method this works fine. But when using the IndexWriter#deleteDocuments(Term) it is not working, although I am using the same term. When using the IndexWriter I tried to commit and optimize but

Re: Design guidance - search strategy

2008-12-05 Thread Erick Erickson
Glad it's working, but it's not my book, that's Erik Hatcher not Erick Erickson. Erik: Do I get a commission? On Thu, Dec 4, 2008 at 9:48 PM, Ian Vink <[EMAIL PROTECTED]> wrote: > It works. > For those using Lucene.NET here is an example of a Filter that takes a list > of IDs for books: > >

Re: Save big arrays in lucene document

2008-12-05 Thread Karsten F.
Hi Zender, please take a look at http://www.nabble.com/Lucene-vs.-Database-td19755932.html#a19757274 You shouldn't use Lucene fields to store such huge data. At least not a Lucene field in your main search index. You can use Lucene as a repository, but I would advise you to use an extra index for

Re: Deleting documents from index

2008-12-05 Thread Erick Erickson
How do you know it's failing? If you're searching could it be that you need to re-open the underlying indexreader when you delete using IndexWriter? Best Erick On Fri, Dec 5, 2008 at 8:42 AM, Gunnar Wurl <[EMAIL PROTECTED]> wrote: > Hi, > > I am trying to delete documents from an index. When usi

Re: Deleting documents from index

2008-12-05 Thread Gunnar Wurl
Thanks for your answer, Erick. I just found out that my problem occurred because of a user (me) error. I used two different workspaces and indexes. Sorry. Erick Erickson wrote: How do you know it's failing? If you're searching could it be that you need to re-open the underlying indexreader wh

Re: Deleting documents from index

2008-12-05 Thread Erick Erickson
Been there, done that. On Fri, Dec 5, 2008 at 9:17 AM, Gunnar Wurl <[EMAIL PROTECTED]> wrote: > Thanks for your answer Erick.. > > I just found out that my problem occurred because of a user (me) error. Used > two different workspaces and indexes.. sorry. > > Erick Erickson wrote: > > How do y

RE: SnowballAnalyzer and AlphaNumeric

2008-12-05 Thread Steven A Rowe
Hi Sam, On 12/04/2008 at 8:21 PM, samd wrote: > Where can I get the Lucene source for the Snowball implementation? > I need to be able to search for words that are alphanumeric > and this does not work with the current SnowballAnalyzer. Lucene-java's source is available through its revision contr

Re: Design guidance - search strategy

2008-12-05 Thread Otis Gospodnetic
Yeah, I think we'll have to start paying the commission fee! ;) Otis -- Sematext -- http://sematext.com/ -- Lucene - Solr - Nutch - Original Message > From: Erick Erickson <[EMAIL PROTECTED]> > To: java-user@lucene.apache.org > Sent: Friday, December 5, 2008 8:37:20 AM > Subject: Re:

Re: Slow queries with lots of hits

2008-12-05 Thread Tim Sturge
I think we're going to cheat with this one. Two options: 1) Add a term to documents with a high enough setBoost(). Add this term to queries that are insufficiently restrictive. This way only high scoring documents will be considered. 2) sort the index data slightly before build time by boost (so

TopDocs - Get all docs?

2008-12-05 Thread Ian Vink
Is there an easy way to get all the documents in the index? Kinda like this: TopDocs everything = ???.GetAllDocuments();

Re: TopDocs - Get all docs?

2008-12-05 Thread Grant Ingersoll
See the MatchAllDocsQuery in the search package. -Grant On Dec 5, 2008, at 5:57 PM, Ian Vink wrote: Is there an easy way to get all the documents in the index? Kinda like this: TopDocs everything = ???.GetAllDocuments(); -- Grant Ingersoll Lucene Helpful Hints: http

Lucene evaluation (XML)

2008-12-05 Thread Gustavo Corral
Hello all, I want to do some kind of evaluation of my Lucene application, but the problem is that I'm indexing XML documents with a specific DTD and I have just a small collection. Is there a way to run a quality benchmark using the benchmark package in Lucene with my own collection and queries? I