Re: Adding another factor to Lucene search

2009-03-03 Thread liat oren
Hi, Thanks for the information. Can MoreLikeThis help address the similarity of words according to an index of scores between words that I have? Best, Liat 2009/3/2 Ian Lea ian@gmail.com: Hi, Document.setBoost(float boost) where boost is either your score as is, or a value based on

not updating caching

2009-03-03 Thread sandyg
Hi All, I had indexed some records and we found that some searches returned no hits for some reason. So we made changes and indexed again, and now we get hits for the query from the command prompt, but this search is not updated in the cache on the server. If I close and restart the server it works

Re: not updating caching

2009-03-03 Thread Otis Gospodnetic
I have a feeling you are using Solr or some other server and not straight Lucene. To turn off Solr caching, comment it out from solrconfig.xml (but you'll need to restart after that). If this is indeed about Solr, please use solr-user list instead of this one. Otis -- Sematext --

Re: Analyze other language using English Analyzer

2009-03-03 Thread Otis Gospodnetic
Ganesh, You should really try to detect the language of the document and then, based on the language identified, pick the appropriate analyzer. See Snowball analyzers in Lucene contrib. See http://www.sematext.com/product-language-identifier.html Otis -- Sematext -- http://sematext.com/ --

Tomcat Threads are BLOCKED after some time

2009-03-03 Thread damu_verse
Hi All, We have used Lucene as our search engine, and all our applications are deployed on Tomcat and running with a thread pool size of 200. Java Version - 1.6.0-rc, Lucene Version - 2.3.2, Tomcat Version - 6.0.14, OS - Red Hat Enterprise Linux ES release 4 (Nahant Update 5), kernel -

Re: Tomcat Threads are BLOCKED after some time

2009-03-03 Thread damu_verse
We have observed the same issue for 3 days in a row ... And it has become really urgent... If anybody has an idea about this issue, please help me. Thanks in advance -damu damu_verse wrote: Hi All, We have used Lucene as our search engine and all our applications are deployed

Re: Tomcat Threads are BLOCKED after some time

2009-03-03 Thread Yonik Seeley
Hmmm, if this is some sort of deadlock, we may need a thread dump of all of the threads. Does your app with Lucene 2.4 have the same problem? Is it possible to try a newer JVM version (like 1.6.0_12)? -Yonik http://www.lucidimagination.com On Tue, Mar 3, 2009 at 8:13 AM, damu_verse
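One way to collect the thread dump requested above is `jstack <pid>` or `kill -3 <pid>` against the Tomcat process. As a hedged alternative, the sketch below gathers similar information from inside the JVM through the standard `java.lang.management` API, which is available on Java 6. The class name `ThreadDumpSketch` and its method names are illustrative assumptions, not part of Lucene or Tomcat.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;

// Illustrative helper (hypothetical name): in-process thread dump and deadlock check.
public class ThreadDumpSketch {

    /** Returns a text dump of all live threads, including lock details where the JVM supports them. */
    public static String dumpAllThreads() {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        // Only request monitor/synchronizer details if this JVM can provide them.
        ThreadInfo[] infos = mx.dumpAllThreads(
                mx.isObjectMonitorUsageSupported(),
                mx.isSynchronizerUsageSupported());
        StringBuilder out = new StringBuilder();
        for (ThreadInfo info : infos) {
            // ThreadInfo.toString() may shorten deep stacks; use getStackTrace() for full frames.
            out.append(info.toString());
        }
        return out.toString();
    }

    /** Returns how many threads the JVM reports as deadlocked (0 if none). */
    public static int countDeadlockedThreads() {
        ThreadMXBean mx = ManagementFactory.getThreadMXBean();
        long[] ids = mx.isSynchronizerUsageSupported()
                ? mx.findDeadlockedThreads()          // monitors and java.util.concurrent locks
                : mx.findMonitorDeadlockedThreads();  // object monitors only
        return ids == null ? 0 : ids.length;          // null means no deadlock was found
    }

    public static void main(String[] args) {
        System.out.println("Deadlocked threads: " + countDeadlockedThreads());
        System.out.println(dumpAllThreads());
    }
}
```

A count of zero does not rule out threads that are merely BLOCKED behind one slow lock holder, so the full dump is still the more useful artifact to post to the list.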

RE: Tomcat Threads are BLOCKED after some time

2009-03-03 Thread Uwe Schindler
Does this also happen if you do not use ParallelMultiSearcher? Can you replace it with a normal MultiSearcher or, alternatively, a simple IndexSearcher on a MultiReader? - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.thetaphi.de eMail: u...@thetaphi.de -Original Message-

Re: How to index Named Entities

2009-03-03 Thread Grant Ingersoll
Have a look at the TeeTokenFilter and the SinkTokenizer. You could extend/implement those to have a lookup in your list, and then when you have a match, add the token to the Sink, which then allows you to index a separate field containing your named entities. The TeeTF and SinkTok are

Re: Tomcat Threads are BLOCKED after some time

2009-03-03 Thread damu_verse
Hi, Thanks for the reply. We have not tested this against the versions mentioned (both java-1.6.12 and lucene-2.4), and moreover we cannot move to those versions right away... So we need a solution for this particular version only. Thanks, regards damu damu_verse

Re: Indexing synonyms for multiple words

2009-03-03 Thread Michael McCandless
Actually, the start position of each token is stored in the normal Lucene index (in the *.prx files), not using payloads. Payloads are entirely for per-token extensibility (ie, core Lucene doesn't use them by default): you'd have to create your own analyzer to attach payloads to tokens, and

Why do range queries work on fields only ?

2009-03-03 Thread Raymond Balmès
Hi all, The range query only works on fields (using a string compare)... is there any reason why it is not possible on the words of the document? The following query [stringa TO stringb] would just give the list of documents which contain words between those two strings. -RB-

RE: Why do range queries work on fields only ?

2009-03-03 Thread Steven A Rowe
Hi Raymond, On 3/3/2009 at 12:04 PM, Raymond Balmès wrote: The range query only works on fields (using a string compare)... is there any reason why it is not possible on the words of the document? The following query [stringa TO stringb] would just give the list of documents which contains

Re: Why do range queries work on fields only ?

2009-03-03 Thread Raymond Balmès
Just a simplified view of my problem: A document contains the terms index01 blabla index02 xxx yyy index03 ... index10. I have the terms indexed in the collection. I now want to look for documents which contain [page04 TO page08]... looks like I can't do it -RB- On Tue, Mar 3, 2009 at 6:33 PM,

Re: Why do range queries work on fields only ?

2009-03-03 Thread Raymond Balmès
Sorry, [index04 TO index08]. On Tue, Mar 3, 2009 at 7:18 PM, Raymond Balmès raymond.bal...@gmail.com wrote: Just a simplified view of my problem: A document contains the terms index01 blabla index02 xxx yyy index03 ... index10. I have the terms indexed in the collection. I now want to look
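To make the question concrete: a Lucene range query compares the indexed terms of a field in string order, so the words of a tokenized field are themselves candidates for a range. The sketch below is a toy model of that idea using a sorted set as the term dictionary; it does not use the Lucene API, and `TermRangeSketch`, `sampleDictionary` and `termsInRange` are hypothetical names.

```java
import java.util.Arrays;
import java.util.NavigableSet;
import java.util.TreeSet;

// Toy model (not Lucene): a sorted term dictionary and an inclusive term range over it.
public class TermRangeSketch {

    /** Terms from the example document in the thread, kept in string order. */
    public static NavigableSet<String> sampleDictionary() {
        return new TreeSet<String>(Arrays.asList(
                "index01", "blabla", "index02", "xxx", "yyy", "index03", "index04",
                "index05", "index06", "index07", "index08", "index09", "index10"));
    }

    /** Returns every term t with lower <= t <= upper, like [lower TO upper]. */
    public static NavigableSet<String> termsInRange(NavigableSet<String> dictionary,
                                                    String lower, String upper) {
        return dictionary.subSet(lower, true, upper, true);
    }

    public static void main(String[] args) {
        // prints [index04, index05, index06, index07, index08]
        System.out.println(termsInRange(sampleDictionary(), "index04", "index08"));
    }
}
```

Because the comparison is lexicographic, the zero padding in index04 ... index10 matters: without it, "index10" would sort before "index4".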

sloppyFreq question

2009-03-03 Thread Peter Keegan
The DefaultSimilarity class defines sloppyFreq as: public float sloppyFreq(int distance) { return 1.0f / (distance + 1); } For a 'SpanNearQuery', this reduces the effect of the term frequency on the score as the number of terms in the span increases. So, for a simple phrase query (using
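The formula quoted above can be tried in isolation. The sketch below copies only the one-line body from the post into a standalone class (the name `SloppyFreqSketch` is an assumption for illustration) and prints the factor for a few distances. What counts as `distance` depends on the scorer that calls the method, so the inputs here are just sample values.

```java
// Standalone copy of the formula quoted in the post; not the Lucene class itself.
public class SloppyFreqSketch {

    /** Same body as the sloppyFreq quoted above: the factor shrinks as distance grows. */
    public static float sloppyFreq(int distance) {
        return 1.0f / (distance + 1);
    }

    public static void main(String[] args) {
        int[] sampleDistances = {0, 1, 3, 9};
        for (int d : sampleDistances) {
            // distance 0 -> 1.0, 1 -> 0.5, 3 -> 0.25, 9 -> 0.1
            System.out.println("distance=" + d + " sloppyFreq=" + sloppyFreq(d));
        }
    }
}
```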

Re: Why do range queries work on fields only ?

2009-03-03 Thread Erick Erickson
But it's still not clear what the semantics of your search are. Do you mean I want index04, index05, index06, index07 and index08 to appear in that order in the field? If so you can use one of the SpanQuery family, specifying a large slop and respecting order. But this could get clumsy with large

RE: Why do range queries work on fields only ?

2009-03-03 Thread Steven A Rowe
Hi Raymond, On 3/3/2009 at 1:19 PM, Raymond Balmès wrote: On Tue, Mar 3, 2009 at 7:18 PM, Raymond Balmès raymond.bal...@gmail.comwrote: Just a simplified view of my problem : A document contains the terms index01 blabla index02 xxx yyy index03 ... index10. I have the terms indexed in

Optimum way to find all document without particular field

2009-03-03 Thread Allahbaksh Mohammedali Asadullah
Hi, What is the optimum way in which I can find all the documents which have a particular field in them? For example, I want to find all the documents in which the field text is not null. Regards, Allahbaksh Allahbaksh Mohammedali Asadullah, Software Engineering Technology Labs, Infosys Technologies

Re: Optimum way to find all document without particular field

2009-03-03 Thread zhaoxueqing
All the documents? How about creating a new index that holds only the field you want to find? As in a database, where every document is like a table: you cannot search a field across all tables directly, but you can build a view using a union, or a new table that holds only the field you need. -

Re: Optimum way to find all document without particular field

2009-03-03 Thread Ganesh
Allahbaksh, I don't think Lucene can filter on null and not-null values. If the field value is null, index the field with the value NULL. To find all the documents in which the field is not null, search for documents whose field does not contain the value NULL. Regards Ganesh - Original
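The sentinel approach described above can be sketched with a toy inverted index. This is plain Java rather than Lucene code, and `NullSentinelSketch`, `add` and `docsWithValue` are hypothetical names: a missing value is indexed as the term NULL, and the not-null documents are all documents minus the postings for NULL.

```java
import java.util.Collections;
import java.util.HashMap;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Toy inverted index for a single field (not Lucene): null values are indexed under a sentinel term.
public class NullSentinelSketch {
    static final String NULL_TOKEN = "NULL"; // sentinel value suggested in the post

    private final Map<String, Set<Integer>> postings = new HashMap<>();
    private final Set<Integer> allDocs = new TreeSet<>();

    /** Indexes one document; a null field value is stored as the sentinel term. */
    public void add(int docId, String fieldValue) {
        allDocs.add(docId);
        String term = (fieldValue == null) ? NULL_TOKEN : fieldValue;
        postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docId);
    }

    /** Returns ids of documents whose field is not null: all documents minus the sentinel postings. */
    public Set<Integer> docsWithValue() {
        Set<Integer> result = new TreeSet<>(allDocs);
        result.removeAll(postings.getOrDefault(NULL_TOKEN, Collections.emptySet()));
        return result;
    }

    public static void main(String[] args) {
        NullSentinelSketch index = new NullSentinelSketch();
        index.add(1, "hello");
        index.add(2, null);
        index.add(3, "world");
        System.out.println(index.docsWithValue()); // prints [1, 3]
    }
}
```

Two caveats apply under this scheme. A document whose real value is the string NULL would be indistinguishable from a missing one, so the sentinel should be a token that cannot occur in the data. In Lucene itself a purely negative query matches nothing on its own, so the exclusion is usually paired with a match-all clause such as MatchAllDocsQuery.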