Re: Strange sort error

2005-04-14 Thread Yonik Seeley
I haven't tried it, but I think the fix should be easy... never throw that exception. Either check for null before the loop, or in the loop. Original code for native int sorting: TermEnum termEnum = reader.terms (new Term (field, )); try { if (termEnum.term() == null)

IOException: No such path or directory

2005-04-14 Thread Luis Medina
Hi Everyone, The company I work for uses Lucene search 2 of their sites. Each site's configuration is (almost) an mirror image of the other. The only difference here is the content. We use a servlet to start up a Lucene mantainance utility that keeps the indexes up to date. This servlet is set to

RE: Update performance/indexwriter.delete()?

2005-04-14 Thread Peter Veentjer - Anchor Men
-Oorspronkelijk bericht- Van: Roy Klein [mailto:[EMAIL PROTECTED] Verzonden: donderdag 14 april 2005 15:40 Aan: java-user@lucene.apache.org Onderwerp: Update performance/indexwriter.delete()? I've got an application that will be doing constant updates to an index. I've looked into

getting the number of occurrences within a document

2005-04-14 Thread Pablo Gomes Ludermir
Hello all, I would like to get the following information from the index: 1. Given a term, how many times the term occurs in each document. Something like a triple: Term, Doc1, Freq , Term, Doc2, Freq, Term2, Docx, Freq, ... Is possible to do that? Regards, Pablo -- Pablo Gomes Ludermir

RE: getting the number of occurrences within a document

2005-04-14 Thread Pasha Bizhan
Hi, From: Pablo Gomes Ludermir [mailto:[EMAIL PROTECTED] I would like to get the following information from the index: 1. Given a term, how many times the term occurs in each document. Something like a triple: Term, Doc1, Freq , Term, Doc2, Freq, Term2, Docx, Freq, ... Is possible to

Re: getting the number of occurrences within a document

2005-04-14 Thread Paul Libbrecht
Le 14 avr. 05, à 17:15, Pablo Gomes Ludermir a écrit : I would like to get the following information from the index: 1. Given a term, how many times the term occurs in each document. Something like a triple: Term, Doc1, Freq , Term, Doc2, Freq, Term2, Docx, Freq, ... Is possible to do that? Luke

Reverting QueryParser ?

2005-04-14 Thread Paul Libbrecht
Hi, I am currently evaluating the need for an elaborate query data-structure (to be exchanged over XML-RPC) as opposed to working with plain strings. One thing that would heavily vote for strings would be to have query objects returned by Query-parser reconvertible to a string (and

Re: getting the number of occurrences within a document

2005-04-14 Thread Andy Roberts
On Thursday 14 Apr 2005 15:15, Pablo Gomes Ludermir wrote: Hello all, I would like to get the following information from the index: 1. Given a term, how many times the term occurs in each document. Something like a triple: Term, Doc1, Freq , Term, Doc2, Freq, Term2, Docx, Freq, ... Is

Re: Hungarian notation analyzer and phrase queries

2005-04-14 Thread Doug Cutting
Paul Smith wrote: So it sounds like there isn't a perfect solution, but I think the best tradeoff for me is to put them all in the same position unless anyone has more input on the subject? If they're all at the same position you can still use slop to match the phrase. So if 'power', 'query'

Re: Reverting QueryParser ?

2005-04-14 Thread Erik Hatcher
On Apr 14, 2005, at 11:32 AM, Paul Libbrecht wrote: Hi, I am currently evaluating the need for an elaborate query data-structure (to be exchanged over XML-RPC) as opposed to working with plain strings. One thing that would heavily vote for strings would be to have query objects

Re: Update performance/indexwriter.delete()?

2005-04-14 Thread Yonik Seeley
An IndexReader is required to, given a term, find the document number to mark deleted. Yeah, most the time it makes sense to do deletions off the IndexReader. There are times, however, when it would be nice for deletes to be able to be concurrent with adds. Q: can docids change after an add()

Re: Reverting QueryParser ?

2005-04-14 Thread Doug Cutting
Paul Libbrecht wrote: I am currently evaluating the need for an elaborate query data-structure (to be exchanged over XML-RPC) as opposed to working with plain strings. I'd opt for both. For example: search boolean-query required query-parser analyzer=...java based

RE: Update performance/indexwriter.delete()?

2005-04-14 Thread Roy Klein
Hi, I guess I didn't ask my question very well. I do understand that you can only do a delete via a reader based on the current sources, what I don't understand is why the delete function couldn't be incorporated into a writer, so that updates could be all done within the context of a writer?

Re: Update performance/indexwriter.delete()?

2005-04-14 Thread Doug Cutting
Roy Klein wrote: I think this is a better way of asking my original questions: Why was this designed this way? In order to optimize updates. Can it be changed to optimize updates? Updates are fastest when additions and deletions are separately batched. That is the design. Doug

Re: IOException: No such path or directory

2005-04-14 Thread Daniel Naber
On Thursday 14 April 2005 16:44, Luis Medina wrote: primarily reporting lock issues (except no lock files were found in the directory). With that directory, do you mean the index directory? The lock files are not there, but in /tmp (by default). It's only okay to remove the lock file

Boosting not working?

2005-04-14 Thread Martin May
I have a bunch of documents in my index, some of which have values for a certain field while others don't. I'd like the ones that do have a value to always show up before the ones who don't when sorting by relevance. I tried to accomplish this by check whether there are values for the field, and

Re: Boosting not working?

2005-04-14 Thread Martin May
I've got the book (which is great, btw). I used Luke to get explanations of the results, but I don't see any boosts in the explanations. Martin On Thu, 2005-04-14 at 13:24 -0700, Otis Gospodnetic wrote: I'd look a the output of Explain to see how ranking score is calculated Look at this:

Re: Boosting not working?

2005-04-14 Thread Erik Hatcher
On Apr 14, 2005, at 4:32 PM, Martin May wrote: I've got the book (which is great, btw). I used Luke to get explanations of the results, but I don't see any boosts in the explanations. The index-time boosts are folded into the field normalization factor, so you won't see boost by itself. That

Re: Strange sort error

2005-04-14 Thread Daniel Naber
On Thursday 14 April 2005 16:28, Yonik Seeley wrote: I haven't tried it, but I think the fix should be easy... never throw that exception. As Lucene does not have the concept of a warning I think it should throw exceptions when someone tries to do something that doesn't make sense (even if