Re: An arguable bug in Lucene 1.9.1

2007-02-06 Thread Paul Elschot
Gentlemen, Have a look here: https://issues.apache.org:443/jira/browse/LUCENE-413 This was fixed in 2.0. Regards, Paul Elschot On Tuesday 06 February 2007 01:38, [EMAIL PROTECTED] wrote: > I am seeing this issue as well with the exact same stack trace using > spanQueries. Does anyone know if

upgrading from Lucene 1.4.3 to Lucene 2.0

2007-02-06 Thread [EMAIL PROTECTED]
Hi all ! Which performance improvements can I expect when upgrading from Lucene 1.4.3 to Lucene 2.0 ? Thanks in advance ! Regards, Claus - To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PRO

Retrieving exact matches

2007-02-06 Thread Mile Rosu
Hello, I have been looking in the documentation but haven't found a solution to this: is there a way to retrieve only the record "picasso" when the query is picasso and not the records: "picasso","picasso pablo" ie a 100% match of the query ? Thank you, Mile Rosu ---

Swedish, Norwegian or Danish Analyzers?

2007-02-06 Thread rubdabadub
Hi: Didn't find any of the nordic (Swedish, Danis etc..) language analyzers under svn/contrib..Trying to do a quick check here.. I am interested in only analyzers .. Thanks for any pointer. - To unsubscribe, e-mail: [EMAIL PROT

Re: Swedish, Norwegian or Danish Analyzers?

2007-02-06 Thread rubdabadub
Sorry I meant to say publicly available ones :-) On 2/6/07, rubdabadub <[EMAIL PROTECTED]> wrote: Hi: Didn't find any of the nordic (Swedish, Danis etc..) language analyzers under svn/contrib..Trying to do a quick check here.. I am interested in only analyzers .. Thanks for any pointer. ---

Re: Swedish, Norwegian or Danish Analyzers?

2007-02-06 Thread karl wettin
6 feb 2007 kl. 14.19 skrev rubdabadub: Didn't find any of the nordic (Swedish, Danis etc..) language analyzers under svn/contrib..Trying to do a quick check here.. I am interested in only analyzers .. Is this what you are looking for? https://svn.apache.org/repos/asf/lucene/java/trunk/contri

Re: Retrieving exact matches

2007-02-06 Thread Erick Erickson
One way to do this is to store the fields UN_TOKENIZED (and be careful to index and search with compatible Analyzers!!) Also, watch your capitalization. But this is a bit tricky, since if you index UN_TOKENIZED, you would never get a match for your second document above unless you searched on "pic

Re: Swedish, Norwegian or Danish Analyzers?

2007-02-06 Thread rubdabadub
On 2/6/07, karl wettin <[EMAIL PROTECTED]> wrote: 6 feb 2007 kl. 14.19 skrev rubdabadub: > Didn't find any of the nordic (Swedish, Danis etc..) language > analyzers under svn/contrib..Trying to do a quick check here.. I am > interested in only analyzers .. Is this what you are looking for? ht

query problem: grouping "with same field"

2007-02-06 Thread poeta simbolista
Hi all, First of all thanks for this forum, I have read stuff that helped me a lot. Second, I would like to query a Lucene index in this way: I want to do operations on groups of entries. These entries I want them to be grouped by a field, (called websiteid), so the queries will only go to those

Re: query problem: grouping "with same field"

2007-02-06 Thread Erick Erickson
Isn't this just a field on your document and an AND in the query with the websiteID in question? Or are you saying you want to group results by multiple website IDs in a single query? As in webid1 resulta resultb resultc webid2 resulte resultf resultg etc. If the latter, search the archiv

deviations

2007-02-06 Thread poeta simbolista
Hi, I would like to query results, regarding results. For example, get those results whose field "size" is 2 standard deviations from the median. It looks like sth I should do myself, processing results first, getting the median, and then calculate that threshold and perform a new query. Is it h

Re: deviations

2007-02-06 Thread Grant Ingersoll
Have a look at the FieldSelector stuff, assuming you are using trunk or some recent nightly build. It _may_ help w/ what you are doing. Also, I would think you could calculate the median at indexing time (or just after), but I'm not sure I fully understand what you are trying to do, so ta

Newbie question...

2007-02-06 Thread Don . Reese
Hi, Sorry if this has been explained before, but could not find anything using the user archive search... I understand the concept of the fields. What is not clear is if an document must have all fields that are used in the index or not. Now let me clarify that some, lets say that there are

Re: Newbie question...

2007-02-06 Thread Doron Cohen
Hi Don, No, Lucene does not require all documents to have the same set of fields. It is different from a database table - it is all very flexible. So doc1 can have fields f1, f2, and doc2 can have fields f2 f3, and doc3 can have only field f4, and doc4 can have no fields at all (likely that doc4 w

Re: Newbie question...

2007-02-06 Thread Don . Reese
Thanks, that is as I suspected, but it was not clear. -- Don Reese BIPT PO Box 20159 5971 Cattleridge Blvd Ste 101 Sarasota, FL 34276 [EMAIL PROTECTED] 941-358-8004 x139 Doron Cohen <[EMAIL PROTECTED]> 02/06/2007 01:51 PM Please respond to java-user@lucene.apache.org To java-user@lucene.apac

Re: upgrading from Lucene 1.4.3 to Lucene 2.0

2007-02-06 Thread Daniel Naber
On Tuesday 06 February 2007 13:21, [EMAIL PROTECTED] wrote: > Which performance improvements can I expect when upgrading from Lucene > 1.4.3 to Lucene 2.0 ? This is difficult to say, but you can update to Lucene 1.9 probably without doing any changes to your code and then make a performance test

Using Nutch database with Lucene

2007-02-06 Thread mmoser
Ok, here is my problem. We have created a .NET implementation of Lucene using the .NET Library for Lucene. We were previously only bringing in content from the database and indexing it. We now want to keep that in place and use Nutch or some web crawler to go out and get content from the list of u