query parser

2006-03-08 Thread Raghavendra Prabhu
I want to use the query parser to parse my query string, but the default field should be a group of fields that are all searched. Can anyone let me know how to do this? For example, if my query is "new books", then "new" should be searched in different fields (content and title), books should be sear

Re: query parser

2006-03-08 Thread Rainer Dollinger
Take a look at the class MultiFieldQueryParser, I think it does exactly what you want. GR, Rainer Raghavendra Prabhu wrote: > I want to use query parser to parse my query string > > But the default field should be a group of fields with different fields > where it is searched on > > Can any on

RE: Get only count

2006-03-08 Thread anton
Does this mean that the collect method can be called for a document with score <= 0? -Original Message- From: Yonik Seeley [mailto:[EMAIL PROTECTED] Sent: Tuesday, March 07, 2006 6:35 PM To: java-user@lucene.apache.org Subject: Re: Get only count Importance: High On 3/7/06, [EMAIL PROTECTED]

Re: query parser

2006-03-08 Thread Raghavendra Prabhu
Hi Rainer, Thanks. I have one more question. How do I set different boosts for each field using the query parser? Rgds Prabhu On 3/8/06, Rainer Dollinger <[EMAIL PROTECTED]> wrote: > > Take a look at the class MultiFieldQueryParser, I think it does exactly > w

Re: Scoring with FunctionQueries?

2006-03-08 Thread Sebastian Marius Kirsch
On Tue, Mar 07, 2006 at 06:19:53PM -0800, Chris Hostetter wrote: > once you've tried the suggestions above, can you make send out a > selfcontained JUnit test showing the problems? Thanks, Chris, glad you agree that it doesn't work as you expect it to work. I will try your suggestions and send in

Re: Lucene 1.9.1 and timeToString() apparent incompatibility with 1.4.3

2006-03-08 Thread Chris Hostetter
: Thanks Chris for making it clear, I had read the comment but I had not : understood that it implied incompatibility. But will the code be preserved : in Lucene 2.0, in light of the comment contained in the Lucene 1.9.1 : announcement ? I don't really know, it's currently being discussed in LUCE

Re: Lucene 1.9.1 and timeToString() apparent incompatibility with 1.4.3

2006-03-08 Thread George Washington
thanks Chris, I think I'll opt for re-creating the index now, using the new 1.9.1 code. Sooner or later, it seems to me, the deprecated code will be removed anyway. Better facing the pain now than later, makes it possible for me to take advantage of the new date resolution features. Even though

MuliField Query Parser

2006-03-08 Thread Raghavendra Prabhu
Hi, I need different boosts for the fields we define in the multi-field query parser. How can this be accomplished? Rgds Prabhu

RangeQuery and RangeFilter

2006-03-08 Thread Anton Potehin
Which is faster, RangeQuery or RangeFilter?

Re: MuliField Query Parser

2006-03-08 Thread Rainer Dollinger
You could try to inherit from MultiFieldQueryParser: public class BoostableMultiFieldQueryParser extends MultiFieldQueryParser { // TODO: add constructors of super class public static Query parse(String query, String[] fields, BooleanClause.Occur[] flags,Analyzer analyzer, f

Re: RangeQuery and RangeFilter

2006-03-08 Thread mark harwood
See http://wiki.apache.org/jakarta-lucene/FilteringOptions --- Anton Potehin <[EMAIL PROTECTED]> wrote: > What faster RangeQuery or RangeFilter ?

1.4.3 and 64bit support? out of memory??

2006-03-08 Thread zzzzz shalev
hi all, i've been trying to load a 6GB index on linux (16GB RAM) but am having no success. i wrote a program that allocates memory and it was able to allocate as much RAM as i requested (stopped at 12GB); however, i am receiving the following stack trace: JVMDUMP013I

Lucene Scoring

2006-03-08 Thread msftblows
Anyone have a doc or something that would allow me to explain this to execs? A "Lucene Scoring for Dummies" idea...explaining a math algo to an exec or someone with no knowledge is not that easy :) Thanks! -Joe

Re: Get only count

2006-03-08 Thread Paul Elschot
On Wednesday 08 March 2006 09:25, [EMAIL PROTECTED] wrote: > Signifies this that method collect can be called for document with score <= > 0 ? The collect() method is called after next() on the top level Scorer has returned true. In between score() is called on that Scorer to provide the score v
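The call order described in this reply (next() returns true, then score() is read, then collect() is invoked) can be sketched with a count-only collector. This is a minimal illustration, not Lucene code: `ToyScorer`, `ToyHitCollector`, `CountingCollector` and `ToySearch` are hypothetical stand-ins invented here, and the loop is an assumption about the general shape of the iteration rather than the library's actual implementation.

```java
/** Hypothetical stand-in for a scorer that iterates over matching documents. */
interface ToyScorer {
    boolean next();   // advance to the next match; false when exhausted
    int doc();        // id of the current match
    float score();    // score of the current match
}

/** Hypothetical stand-in for a hit collector. */
interface ToyHitCollector {
    void collect(int doc, float score);
}

/** Counts hits and ignores the score, for callers that only need a total. */
final class CountingCollector implements ToyHitCollector {
    private int count;

    @Override
    public void collect(int doc, float score) {
        count++;
    }

    int count() {
        return count;
    }
}

final class ToySearch {
    private ToySearch() {}

    /** Drives the scorer: next(), then score(), then collect() for each match. */
    static void search(ToyScorer scorer, ToyHitCollector collector) {
        while (scorer.next()) {
            float score = scorer.score();
            collector.collect(scorer.doc(), score);
        }
    }
}
```

Note that this toy loop does not filter on the score value, so a collector that wants to skip non-positive scores would have to check for that itself. Whether a real scorer ever reports such scores is not something the sketch settles.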

Re: 1.4.3 and 64bit support? out of memory??

2006-03-08 Thread Dan Armbrust
z shalev wrote: hi all, i've been trying to load a 6GB index on linux (16GB RAM) but am having no success. i wrote a program that allocates memory and it was able to allocate as much RAM as i requested (stopped at 12GB) Was your program that got up to 12GB of memory written

RE: Lucene Scoring

2006-03-08 Thread Pasha Bizhan
Hi, > From: [EMAIL PROTECTED] [mailto:[EMAIL PROTECTED] > Anyone have a doc or something that would allow me to explain > this to execs? A "Lucene Scoring for Dummies" > idea...explaining math algo to a exec or someone with no > knowledge is not that easy :) http://lucene.apache.org/java/d

Re: 1.4.3 and 64bit support? out of memory??

2006-03-08 Thread zzzzz shalev
yes, 100% Dan Armbrust <[EMAIL PROTECTED]> wrote: z shalev wrote: > hi all, > > i've been trying to load a 6GB index on linux (16GB RAM) but am having no > success. > > i wrote a program that allocates memory and it was able to allocate as much > RAM as i requested (stopped at 12GB)

Does Lucene support on-disk search?

2006-03-08 Thread Xiaocheng Luan
Hi, I heard that Lucene loads the index into memory to do a search, which does not sound quite right to me. I will not be surprised if Lucene is smart enough to load the index into memory when it is feasible, but I'd be surprised if it ALWAYS loads the index into memory to do the search, which I

Re: Lucene Scoring

2006-03-08 Thread markharw00d
[EMAIL PROTECTED] wrote: Anyone have a doc or something that would allow me to explain this to execs? Roughly speaking: * Documents containing *all* the search terms are good * Matches on rare words are better than for common words * Long documents are not as good as short ones * Documents wh
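The rules of thumb listed in this message can be made concrete with a toy scoring function. The sketch below is a simplified illustration only: the class `ToyRelevance` and its formula are assumptions made for this example (loosely in the spirit of tf-idf with a length penalty), not Lucene's actual scoring code.

```java
import java.util.List;

/** Toy relevance score illustrating the four rules of thumb; not Lucene's formula. */
final class ToyRelevance {
    private ToyRelevance() {}

    /**
     * @param queryTerms the search terms
     * @param doc        the document to score, as a list of tokens
     * @param corpus     all documents, used to measure how rare a term is
     */
    static double score(List<String> queryTerms, List<String> doc, List<List<String>> corpus) {
        if (queryTerms.isEmpty() || doc.isEmpty()) {
            return 0.0;
        }
        double sum = 0.0;
        int matched = 0;
        for (String term : queryTerms) {
            long tf = doc.stream().filter(term::equals).count();
            if (tf == 0) {
                continue;
            }
            matched++;
            long docFreq = corpus.stream().filter(d -> d.contains(term)).count();
            // Rare terms (low docFreq) get a larger weight than common ones.
            double idf = 1.0 + Math.log((double) corpus.size() / (docFreq + 1));
            // More mentions help, with diminishing returns.
            sum += Math.sqrt(tf) * idf;
        }
        // Documents matching all query terms get the full sum.
        double coord = (double) matched / queryTerms.size();
        // Longer documents are penalized.
        double lengthNorm = 1.0 / Math.sqrt(doc.size());
        return coord * lengthNorm * sum;
    }
}
```

Each factor maps to one bullet: `coord` rewards matching all terms, `idf` rewards rare words, `lengthNorm` penalizes long documents, and the square root of the term count rewards repeated mentions.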

Re: Does Lucene support on-disk search?

2006-03-08 Thread Grant Ingersoll
Lucene _can_ load the index into memory, but it doesn't have to, if you want further details see the Javadocs on RAMDirectory versus FSDirectory. I think you will find it has good performance on a few gigs of data. Results, of course, vary based on what you are asking it to do and what kind o

Re: 1.4.3 and 64bit support? out of memory??

2006-03-08 Thread Chris Hostetter
: i am receiving the following stack trace: : : JVMDUMP013I Processed Dump Event "uncaught", detail "java/lang/OutOfMemoryError". : Exception in thread "main" java.lang.OutOfMemoryError : at org.apache.lucene.index.TermInfosReader.readIndex(TermInfosReader.java:82) is it possible th

Re: Lucene Scoring

2006-03-08 Thread Chris Hostetter
: Roughly speaking: : : * Documents containing *all* the search terms are good : * Matches on rare words are better than for common words : * Long documents are not as good as short ones : * Documents which mention the search terms many times are good Be wary of the distinction between "term" and

Re: 1.4.3 and 64bit support? out of memory??

2006-03-08 Thread zzzzz shalev
hey chris, i will check and let you know just to make sure, basically i see the OS allocating memory (up to about 4GB) while loading the indexes to memory and then crashing on the TermInfosReader class. what i noticed was that the crash occurred when lucene tried to create a Term arra

Re: 1.4.3 and 64bit support? out of memory??

2006-03-08 Thread Daniel Noll
z shalev wrote: hey chris, i will check and let you know just to make sure, basically i see the OS allocating memory (up to about 4GB) while loading the indexes to memory and then crashing on the TermInfosReader class. what i noticed was that the crash occurred when lucene tried to create a

Lucene Ranking/scoring

2006-03-08 Thread Yang Sun
Hi, Just wondering how I can rank search result by a combination of fields. I know there is a multi-field sort, but it is just a sorting method. It is sorted by the first field and then the second field ... What I need is a weighted combination. For example, I want to assign a weight of 2 to title

Re: Lucene Ranking/scoring

2006-03-08 Thread Yonik Seeley
Hi Yang, Boosting works at query time as well as index time. If you are using the QueryParser, specify boosts like so: title:foo^2 abstract:foo^1.5 date:mydate^3 If you are building queries programmatically, then use the Query.setBoost() method. That will boost relative to how a non-boosted query w
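The field:term^boost syntax shown in this reply can also be generated from a plain user query before it is handed to a query parser. The sketch below is one possible approach and not part of Lucene: `BoostedQueryStrings` and `expand` are hypothetical names, the helper splits only on whitespace, and it does not escape query-syntax special characters or handle quoted phrases.

```java
import java.util.Map;
import java.util.StringJoiner;

/** Hypothetical helper (not a Lucene class): expands each term across fields with boosts. */
final class BoostedQueryStrings {
    private BoostedQueryStrings() {}

    /**
     * Turns a plain query such as "new books" into a query string in which every
     * term is searched in every field, each clause carrying that field's boost.
     * Assumes terms contain no characters that need escaping.
     */
    static String expand(String userQuery, Map<String, Float> fieldBoosts) {
        StringJoiner out = new StringJoiner(" ");
        for (String term : userQuery.trim().split("\\s+")) {
            if (term.isEmpty()) {
                continue; // blank input yields an empty query string
            }
            StringJoiner perTerm = new StringJoiner(" ", "(", ")");
            for (Map.Entry<String, Float> e : fieldBoosts.entrySet()) {
                perTerm.add(e.getKey() + ":" + term + "^" + e.getValue());
            }
            out.add(perTerm.toString());
        }
        return out.toString();
    }
}
```

Passing a LinkedHashMap keeps the fields in a predictable order. With boosts of 2.0 for title and 1.0 for content, the query "new books" becomes (title:new^2.0 content:new^1.0) (title:books^2.0 content:books^1.0), which can then be given to an ordinary query parser.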

Atomic index/search for a phrase

2006-03-08 Thread Urvashi Gadi
Hi All, I am trying to index and search a phrase (multiple words separated by spaces). How should I index it so that it remains atomic? I have observed that if I index the phrase as a keyword, Lucene doesn't let me retrieve the phrase in a search. Please advise. Urvashi

RE: Lucene Ranking/scoring

2006-03-08 Thread Yang Sun
Hi Yonik, Thanks very much for your suggestion. The query boost works great for keyword matching. But in my case, I need to rank the results by date and title. For example, title:foo^2 abstract:foo^1.5 date:2004^3 will only boost the document with date=2004. What I need is boosting the "distance" f

RangeQuery, FilterdQuery and HitCollector

2006-03-08 Thread Youngho Cho
Hello, I would like to use a Filter for a RangeQuery (to avoid a potential TooManyClauses exception) and found that the wiki page http://wiki.apache.org/jakarta-lucene/FilteringOptions says FilteredQuery is the best option. Interestingly, though, when I used that option with a HitCollector, FilteredQuery te

RE: 1.4.3 and 64bit support? out of memory??

2006-03-08 Thread Andrew Schetinin
Hi, That's somewhat strange, if I remember correctly the index size was 6 Gb, wasn't it? I saw posts from people working with tens of Gb indexes. And we worked with index of 8 Gb in 32-bit JVM (on Windows 2000) with as little as 700 Mb of max memory allowed to JVM. Are there too many documents/t