Re: range and content query

2004-09-20 Thread Morus Walter
Chris Fraschetti writes: can someone assist me in building or deny the possibility of combining a range query and a standard query? say for instance i have two fields i'm searching on... one being a field with an epoch date associated with the entry, and the content so how can I make

Re: indexes won't close on windows

2004-09-20 Thread sergiu gordea
Hi Fred, I think that we can help you if you provide us your code, and the context in which it is used. We need to see how you open and close the searcher and the reader, and what operations you are doing on the index. All the best, Sergiu Fred Toth wrote: Hi, I have built a nice lucene

Re: range and content query

2004-09-20 Thread Chris Fraschetti
I've more or less figured out the query string required to get a range of docs.. say date[0 TO 10], assuming my dates are from 1 to 10 (for the sake of this example) ... my query has results that I don't understand. if i do from 0 TO 10, then I only get results matching 0,1,10 ... if i do 0 TO

Re: range and content query

2004-09-20 Thread Morus Walter
Chris Fraschetti writes: I've more or less figured out the query string required to get a range of docs.. say date[0 TO 10], assuming my dates are from 1 to 10 (for the sake of this example) ... my query has results that I don't understand. if i do from 0 TO 10, then I only get results

Re: range and content query

2004-09-20 Thread Chris Fraschetti
very correct you are. changing the format of the numbers when i index them and when i do the range fixed my problem.. thanks much. On Mon, 20 Sep 2004 09:08:50 +0200, Morus Walter [EMAIL PROTECTED] wrote: Chris Fraschetti writes: I've more or less figured out the query string required to get
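
The behaviour described in this thread (a range of 0 TO 10 matching only 0, 1 and 10) is what plain string comparison gives. Below is a minimal sketch of the usual workaround, assuming the fix was zero-padding the numbers to a fixed width at both index and query time; the snippet above does not spell out the exact format. The class and method names are hypothetical, and no Lucene calls are used: the comparison only imitates how terms are ordered.

```java
import java.util.Locale;

public class PaddedRange {
    // Hypothetical helper: left-pad a non-negative number with zeros to a fixed width.
    static String pad(long value, int width) {
        return String.format(Locale.ROOT, "%0" + width + "d", value);
    }

    // Inclusive range check using string (lexicographic) order,
    // which is how a range over text terms compares its bounds.
    static boolean inRange(String term, String lower, String upper) {
        return term.compareTo(lower) >= 0 && term.compareTo(upper) <= 0;
    }

    public static void main(String[] args) {
        for (int i = 0; i <= 10; i++) {
            // Unpadded: only 0, 1 and 10 fall inside ["0", "10"].
            boolean raw = inRange(Integer.toString(i), "0", "10");
            // Padded to width 2: every value falls inside ["00", "10"].
            boolean padded = inRange(pad(i, 2), pad(0, 2), pad(10, 2));
            System.out.println(i + " raw=" + raw + " padded=" + padded);
        }
    }
}
```

The width has to be large enough for the biggest value that will ever be indexed (an epoch date needs more digits than this toy example), and the same padding must be applied to the range bounds in the query.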

Re: indexes won't close on windows

2004-09-20 Thread Fred Toth
Hi Sergiu, My searches take place in tomcat, in a struts action, in a single method Abbreviated code: IndexReader reader = null; IndexSearcher searcher = null; reader = IndexReader.open(indexName); searcher = new IndexSearcher(reader); // code to do a

Re: indexes won't close on windows

2004-09-20 Thread Otis Gospodnetic
Fred, I won't get into the details here, but you shouldn't (have to) open a new IndexReader/Searcher on each request (I'll assume the code below is from your Action's execute method). You should cache and re-use IndexReaders (and IndexSearchers). There may be a FAQ entry regarding that, I'm not

Re: indexes won't close on windows

2004-09-20 Thread sergiu gordea
Hi Fred, That's right, there are many references to this kind of problem in the lucene-user list. These suggestions were already made, but I'll list them once again: 1. One way to use the IndexSearcher is to use your code, but I don't encourage users to do that IndexReader reader =

Re: NGramSpeller contribution -- Re: combining open office spellchecker with Lucene

2004-09-20 Thread Morus Walter
David Spencer writes: could you put the current version of your code on that website as a java Weblog entry updated: http://searchmorph.com/weblog/index.php?id=23 thanks Great suggestion and thanks for that idiom - I should know such things by now. To clarify the issue, it's just

Re: indexes won't close on windows

2004-09-20 Thread Fred Toth
Hi Otis, I understand about reusing readers and searchers, but I was working on the do the simplest thing that can possibly work theory for starters, in part because I wanted to be sure that I could recreate the index safely as needed. I should emphasize that I developed for weeks on linux without

Re: indexes won't close on windows

2004-09-20 Thread Fred Toth
Hi Sergiu, Thanks for your suggestions. I will try using just the IndexSearcher(String...) and see if that makes a difference in the problem. I can confirm that I am doing a proper close() and that I'm checking for exceptions. Again, the problem is not with the search function, but with the

Re[2]: indexes won't close on windows

2004-09-20 Thread Maxim Patramanskij
Hello Fred, When you recreate an index from scratch (with the last IndexWriter constructor argument set to true), all IndexReaders must be closed, because IndexWriter tries to delete all files in the directory where your index is being created. If you have any IndexReader open at that time, then

Re: indexes won't close on windows

2004-09-20 Thread sergiu gordea
Fred Toth wrote: Hi Sergiu, Thanks for your suggestions. I will try using just the IndexSearcher(String...) and see if that makes a difference in the problem. I can confirm that I am doing a proper close() and that I'm checking for exceptions. Again, the problem is not with the search function,

RE: indexes won't close on windows

2004-09-20 Thread Jirí Kuhn
Hi, I guess you have answered yourself. I can imagine that Tomcat was serving your servlet with constructed index searcher while your command line application wanted to recreate the index. Are you protected against this situation? Jiri. -Original Message- From: Fred Toth

Re: Running OutOfMemory while optimizing and searching

2004-09-20 Thread John Z
Doug Thank you for confirming this. ZJ Doug Cutting [EMAIL PROTECTED] wrote: John Z wrote: We have indexes of around 1 million docs and around 25 searchable fields. We noticed that without any searches performed on the indexes, on startup, the memory taken up by the searcher is roughly 7

Too many boolean clauses

2004-09-20 Thread Shawn Konopinsky
Hello There, Due to the fact that the [# TO #] range search works lexicographically, I am forced to build a rather large boolean query to get range data from my index. I have an ID field that contains about 500,000 unique ids. If I want to query all records with ids [1-2000], I build a boolean

Re: Too many boolean clauses

2004-09-20 Thread Paul Elschot
On Monday 20 September 2004 18:27, Shawn Konopinsky wrote: Hello There, Due to the fact that the [# TO #] range search works lexicographically, I am forced to build a rather large boolean query to get range data from my index. I have an ID field that contains about 500,000 unique ids. If I

Similarity scores: tf(), lengthNorm(), sumOfSquaredWeights().

2004-09-20 Thread Paul Elschot
After last week's discussion on idf() of the similarity score computation I looked into the score computation a bit deeper. In the DefaultSimilarity tf() is the sqrt() and lengthNorm() is the inverse of sqrt(). That means that the factor (docTf * docNorm) actually implements the square root of
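
As a small numeric check of the observation above, here is a sketch that uses only the two formulas named in the message: tf() as the square root of the term frequency and lengthNorm() as the inverse square root of the number of terms in the field. The class and method names are hypothetical stand-ins rather than Lucene's own classes, and the sketch ignores that a real index stores norms in a lossy compressed form.

```java
public class TfNormSketch {
    // tf() as described above: square root of the term frequency.
    static double tf(double freq) {
        return Math.sqrt(freq);
    }

    // lengthNorm() as described above: inverse square root of the field length in terms.
    static double lengthNorm(int numTerms) {
        return 1.0 / Math.sqrt(numTerms);
    }

    // The same product written as one square root: sqrt(freq / numTerms).
    static double sqrtOfRatio(double freq, int numTerms) {
        return Math.sqrt(freq / numTerms);
    }

    public static void main(String[] args) {
        // A term occurring 4 times in a 16-term field: 2 * 0.25 = 0.5 = sqrt(4/16).
        System.out.println(tf(4) * lengthNorm(16));
        System.out.println(sqrtOfRatio(4, 16));
    }
}
```

Read this way, the product depends only on the ratio of term frequency to field length, so doubling both leaves it unchanged.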

RE: Too many boolean clauses

2004-09-20 Thread Shawn Konopinsky
Hey Paul, Thanks for the quick reply. Excuse my ignorance, but what do I do with the generated BitSet? Also - we are using a pooling feature which contains a pool of IndexSearchers that are used and tossed back each time we need to search. I'd hate to have to work around this and open up an

RE: indexes won't close on windows - solved

2004-09-20 Thread Fred Toth
All, Many thanks for your help and comments. I found a bug in my code where, in obscure circumstances, the indexes were being left open. Now fixed, thanks to everyone's help. Fred At 10:30 AM 9/20/2004, you wrote: Hi, I guess you have answered yourself. I can imagine that Tomcat was

Re: Too many boolean clauses

2004-09-20 Thread Paul Elschot
On Monday 20 September 2004 20:54, Shawn Konopinsky wrote: Hey Paul, Thanks for the quick reply. Excuse my ignorance, but what do I do with the generated BitSet? You can return it in the bits() method of the object implementing your org.apache.lucene.search.Filter
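
To make the BitSet idea concrete, here is a rough sketch of how such a bit set could be filled for a numeric id range instead of building one boolean clause per id. It is not the code from this thread: the idByDoc array is a hypothetical stand-in for however the ids are read from the index, and wiring the result into a Filter's bits() method is left out so the example runs with the JDK alone.

```java
import java.util.BitSet;

public class IdRangeBits {
    // Set one bit per document number whose numeric id lies in [lo, hi].
    // idByDoc[doc] is assumed to hold the id of document number doc.
    static BitSet bitsForRange(int[] idByDoc, int lo, int hi) {
        BitSet bits = new BitSet(idByDoc.length);
        for (int doc = 0; doc < idByDoc.length; doc++) {
            int id = idByDoc[doc];
            if (id >= lo && id <= hi) {
                bits.set(doc);
            }
        }
        return bits;
    }

    public static void main(String[] args) {
        int[] idByDoc = {5, 2500, 1999, 2000, 1};
        // Documents 0, 2, 3 and 4 have ids inside [1, 2000].
        System.out.println(bitsForRange(idByDoc, 1, 2000));
    }
}
```

Because the comparison is numeric, the range does not depend on how the ids sort as strings, and the size of the result does not run into any clause limit.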

Re: Too many boolean clauses

2004-09-20 Thread Paul Elschot
On Monday 20 September 2004 20:54, Shawn Konopinsky wrote: Hey Paul, ... Also - we are using a pooling feature which contains a pool of IndexSearchers that are used and tossed back each time we need to search. I'd hate to have to work around this and open up an IndexReader for this

Highlighting PDF file after the search

2004-09-20 Thread Balasubramanian . Vijay
Hello, I can successfully index and search the PDF documents, however i am not able to highlight the searched text in my original PDF file (ie: like dtSearch highlights on original file) I took a look at the highlighter in sandbox, compiled it and have it ready. I am wondering if this

Re: Highlighting PDF file after the search

2004-09-20 Thread David Spencer
[EMAIL PROTECTED] wrote: Hello, I can successfully index and search the PDF documents, however i am not able to highlight the searched text in my original PDF file (ie: like dtSearch highlights on original file) I took a look at the highlighter in sandbox, compiled it and have it ready. I am

Re: Highlighting PDF file after the search

2004-09-20 Thread Balasubramanian . Vijay
Thanks David. I'll give that a shot and let you know. Vijay Balasubramanian DPRA Inc., 214 665 7503 David Spencer

RE: Highlighting PDF file after the search

2004-09-20 Thread Bruce Ritchie
From: [EMAIL PROTECTED] I can successfully index and search the PDF documents, however i am not able to highlight the searched text in my original PDF file (ie: like dtSearch highlights on original file) I took a look at the highlighter in sandbox, compiled it and have it ready. I am

Problems with Lucene + BDB (Berkeley DB) integration

2004-09-20 Thread Christian Rodriguez
Hi everyone, I am trying to use the Lucene + BDB integration from the sandbox (http://cvs.apache.org/viewcvs.cgi/jakarta-lucene-sandbox/contributions/db/). I installed C Berkeley DB 4.2.52 and I have the Lucene jar file. I have an example program that indexes 4 small text files in a directory

Re: Problems with Lucene + BDB (Berkeley DB) integration

2004-09-20 Thread Andy Goodell
I used BDB + lucene successfully using the lucene 1.3 distribution, but it broke in my application with the 1.4 distribution. The 1.4 dist uses a different file system by default, the cluster file system, so maybe that is the source of the issues. good luck, andy g On Mon, 20 Sep 2004 19:36:51

Re: Use of SortComparator.getComparable() ?

2004-09-20 Thread Tea Yu
Dear all, I'm recently implementing a sort logic that leverages an external index, however, I'm confused by the newComparator() and getComparable() in SortComparator. It seems natural to me that IndexSearcher - FieldSortedHitQueue - factory.newComparator(). However, what's the use of

WildCardQuery

2004-09-20 Thread Raju, Robinson (Cognizant)
Is there a limitation in Lucene when it comes to wildcard search? Is it a problem if we use less than 3 characters along with a wildcard (*)? It gives me a Too Many Clauses error if I try using 45*, *34, *3, etc. It doesn't happen if '?' is used instead of '*'. The intriguing thing is, that it