Re: new version of NewMultiFieldQueryParser

2004-10-28 Thread sergiu gordea
Bill Janssen wrote: I'm not sure this solution is very robust Thanks, but I'm pretty sure it *is* robust. Can you please offer a specific critique? Always happy to learn and improve :-). Try to see the behavior if you want to have a single term query juat something like: robust

Re: Locks and Readers and Writers

2004-10-28 Thread Christoph Kiehl
[EMAIL PROTECTED] wrote: I'm getting: java.io.IOException: Lock obtain timed out I have a writer service that opens the index to delete and add docs. I have a reader service that opens the index for searching only. AFAIK you should never open an IndexWriter and an IndexReader at the same time.

RE: Indexing process causes Tomcat to stop working

2004-10-28 Thread iouli . golovatyi
before scewing tomcat too much... 1.make it sure both indexing and reading processes use the same locking directory (i.e. set it explicitly, take a look in wiky how to) 2. try to execute queries from command line and see what happends 3. in case your queries use sorting, there is a memory leak

Re: Locks and Readers and Writers

2004-10-28 Thread Morus Walter
Christoph Kiehl writes: AFAIK you should never open an IndexWriter and an IndexReader at the same time. You should use only one of them at a time but you may open as many IndexSearchers as you like for searching. You cannot open an IndexSearcher without opening an IndexReader (explicitly

Search.jhtml ?

2004-10-28 Thread Willy De Waele
Hi, I'm new using lucene. I downloaded lucene 1.4.2 and added the 2 jar files to the classpath. Executing the demos as a bat file (Windows) is working fine, but using lucene as a web 'application' is not working ... Since I'm using netbeans, I startup the

RE: Indexing process causes Tomcat to stop working

2004-10-28 Thread James Tyrrell
From: [EMAIL PROTECTED] Hello! before scewing tomcat too much... A little late but probably good advice thankfully it hasn't gone wrong 1.make it sure both indexing and reading processes use the same locking directory (i.e. set it explicitly, take a look in wiky how to) working on this not so

RE: Indexing process causes Tomcat to stop working

2004-10-28 Thread Armbrust, Daniel C.
You want version 1.4.2, not version 1.4. The website makes it hard to find 1.4.2, because the mirrors have not been updated yet. Get 1.4.2 here: http://cvs.apache.org/dist/jakarta/lucene/v1.4.2/ My queries do use sorting! So I have placed the 1.4 final jar onto my classpath and have started

RE: Searchable Solutions Please

2004-10-28 Thread gwithers
A quick pointer.. What you want to look at is using a stemming implementation. Look, for example, at the FAQ and docs related to the PorterStemFilter and writing A customer analyzer (http://lucene.sourceforge.net/cgi-bin/faq/faqmanager.cgi?file=chapter.index ingtoc=faq#q17). There is a lot of

Searching for a phrase that contains quote character

2004-10-28 Thread Will Allen
I am having this same problem, but cannot find any help! I have a keyword field that sometimes includes double quotes, but I am unable to search for that field because the escape for a quote doesnt work! I have tried a number of things: myfield:lucene is \cool\ AND myfield:lucene is

Re: Searching for a phrase that contains quote character

2004-10-28 Thread Justin Swanhart
Have you tried making a term query by hand and testing to see if it works? Term t = new Term(field, this is a \test\); PhraseQuery pq = new PhraseQuery(t); ... On Thu, 28 Oct 2004 12:02:48 -0400, Will Allen [EMAIL PROTECTED] wrote: I am having this same problem, but cannot find any help!

Re: Negative boosting?

2004-10-28 Thread Jason Haruska
Hi Terry, I know this is an old message on the list but it does not look like anyone responded to your request. I had to do negative boosting for my search functionality as well so I'd like to share the modification to QueryParser.jj to make it work. Find your Boost TOKEN and change it to: Boost

Re: Searching for a phrase that contains quote character

2004-10-28 Thread Erik Hatcher
On Oct 28, 2004, at 1:03 PM, Justin Swanhart wrote: Have you tried making a term query by hand and testing to see if it works? Term t = new Term(field, this is a \test\); PhraseQuery pq = new PhraseQuery(t); That's not accurate API, but add you used pq.add(t), it still would presume that text

RE: Searching for a phrase that contains quote character

2004-10-28 Thread Will Allen
I am using a NullAnalyzer for this field. -Original Message- From: Erik Hatcher [mailto:[EMAIL PROTECTED] Sent: Thursday, October 28, 2004 2:00 PM To: Lucene Users List Subject: Re: Searching for a phrase that contains quote character On Oct 28, 2004, at 1:03 PM, Justin Swanhart

Re: Searching for a phrase that contains quote character

2004-10-28 Thread Daniel Naber
On Thursday 28 October 2004 19:03, Justin Swanhart wrote: Have you tried making a term query by hand and testing to see if it works? Term t = new Term(field, this is a \test\); PhraseQuery pq = new PhraseQuery(t); That's not a proper PharseQuery, it searches for *one* term this is a test

Re: Search.jhtml ?

2004-10-28 Thread Daniel Naber
On Thursday 28 October 2004 15:01, Willy De Waele wrote: Executing the demos as a bat file (Windows) is working fine, but using lucene as a web 'application' is not working ... I think that Search.jhtml is totally outdated, please try src/jsp instead. Regards Daniel --

Searching against index in memory

2004-10-28 Thread Ravi
If I have a document set of 10,000 docs and my merge factor is 1000, for every 1000 documents, Lucene creates a new segment. By the time Lucene indexes 4500 documents, index will have 4000 documents on the disk and index for 500 documents is stored in memory. How can I search against this index at

Lots Of Interest in Lucene Desktop

2004-10-28 Thread Kevin A. Burton
I've made a few passive mentions of my Lucene http://jakarta.apache.org/lucene Desktop prototype here on PeerFear in the last few days and I'm amazed how much feedback I've had. People really want to start work on an Open Source desktop search based on Lucene.

Re: Negative boosting?

2004-10-28 Thread Jason Haruska
You'll have to run tests but it shouldn't. All it does is change the NUMBER token to accept an optional - in front of a number. So, existing queries with no negative numbers should not be impacted. On Thu, 28 Oct 2004 13:50:47 -0400, Terry Steichen [EMAIL PROTECTED] wrote: Jason,

Re: Searching for a phrase that contains quote character

2004-10-28 Thread Justin Swanhart
absolutely correct. sorry about that. shouldn't code before coffee :) On Thu, 28 Oct 2004 20:16:16 +0200, Daniel Naber [EMAIL PROTECTED] wrote: On Thursday 28 October 2004 19:03, Justin Swanhart wrote: Have you tried making a term query by hand and testing to see if it works? Term

Re: Searching for a phrase that contains quote character

2004-10-28 Thread Erik Hatcher
On Oct 28, 2004, at 2:02 PM, Will Allen wrote: I am using a NullAnalyzer for this field. Which means that each field is added exactly as-is as a single term? Then trying the PhraseQuery directly is a good first step - if you can get that to work then you can move on to making QueryParser work

RE: Searching for a phrase that contains quote character

2004-10-28 Thread Will Allen
The nullanalyzer overrides the isTokenChar method to simply return true in the tokenizer class (http://issues.apache.org/eyebrowse/[EMAIL PROTECTED]msgId=1703655). The situation is that it seems lucene does not expect you to escape characters that exist inside of a quoted string. So my search

Searching for a path

2004-10-28 Thread Bill Tschumy
I have a need to search an index for documents that were taken ffrom particulars files in the filesystem. Each document in the index has a field named url that is created using: doc.add(Field.Text(url, urlStr)); I understand this is both stored and indexed. My search works if I do

Re: Searching for a path

2004-10-28 Thread Daniel Naber
On Friday 29 October 2004 00:22, Bill Tschumy wrote: I get zero hits. Why are these not equivalent? I think it has something to do with the fact that the url needs to be quoted so I search for an exact match. When you manually build the query there's no need to have quotes around it. Can you

Re: Searching for a path

2004-10-28 Thread Bill Tschumy
I have tried that and it doesn't work either. I have also tried using a PhraseQuery rather than TermQuery. On Oct 28, 2004, at 5:29 PM, Daniel Naber wrote: On Friday 29 October 2004 00:22, Bill Tschumy wrote: I get zero hits.  Why are these not equivalent?  I think it has something to do with

Faster highlighting with TermPositionVectors

2004-10-28 Thread markharw00d
Thanks to the recent changes (see CVS) in TermFreqVector support we can now make use of term offset information held in the Lucene index rather than incurring the cost of re-analyzing text to highlight it. I have created a class ( see http://www.inperspective.com/lucene/TokenSources.java )

Re: new version of NewMultiFieldQueryParser

2004-10-28 Thread Bill Janssen
Try to see the behavior if you want to have a single term query juat something like: robust .. and print out the query string ... Sure, that works fine. For instance, if you have the three default fields title, authors, and contents, the one-word search robust expands to title:foobar

Ability to apply document age with the score?

2004-10-28 Thread Kevin A. Burton
Lets say I have an index with two documents. They both have the same score but one was added 6 months ago and the other was added 2 minutes ago. I want the score adjusted based on the age so that older documents have a lower score. I don't want to sort by document age (date) because if one

Re: Faster highlighting with TermPositionVectors

2004-10-28 Thread Fred Toth
Hi, We are very interested in highlighting, but haven't gotten around to reviewing the state of the highlighting mechanisms. Could someone possibly give me the big picture on highlighting? What code is available? How does it work? What are the current issues? Many thanks, Fred At 07:16 PM

LUCENE INDEX STATISTICS

2004-10-28 Thread Karthik N S
Hi Guys Apologies. Can some body provide approximate Statics about the following factor for Developement and Deployment of Lucene [ it may be usefull for Pro's Developers ] a) Creation Indexing 1) X [ Say 100 Million ] of number of documents Y [ Kilobytes ] with

RE: Faster highlighting with TermPositionVectors

2004-10-28 Thread Bruce Ritchie
Mark, Thanks to the recent changes (see CVS) in TermFreqVector support we can now make use of term offset information held in the Lucene index rather than incurring the cost of re-analyzing text to highlight it. I have created a class ( see