At 05:17 PM 6/4/2001 -0400, Matthew Schmidt wrote:
>Lucene Developers and Users,
>     I have been assigned to look at the possibilty of using Lucene for 
> indexing a large amount of news articles and accessing the index in 
> through a search interface.  Several things have drawn us to Lucene, 
> namely its speed and its price, but there are a few things we need 
> answered before we can go ahead with any implementation.  Any help on the 
> following questions would be much appreciated.
>
>1) Does lucene support queries which include > or <.  Eg. dateField  < 
>some date or all id's > some id.

Yes, as long as dates are stored in com.lucene.document.DateField format, 
you can use com.lucene.search.DateFilter to specify a date range in your query.

>
>2) Can lucene handle large lists of queries. Comparable to a SQL query of 
>where text="hi" in (some list of ids)

I don't quite understand what you mean by a large list of queries. Do you 
mean a single query with a large number of conditions? or simultaneous 
processing of a large number of queries? Perhaps you can clarify more here?

>
>3) We need a way to order the index in a way so that the most recent 
>articles are listed first and therefore retrieved first.  Because there is 
>the possiblity of several hundred thousand returned articles, this 
>ordering needs to take place when new articles are added to the index, not 
>on retrieval.  Is this possible?

Quite possibly. Someone has mentioned before on this mailing list that 
lucene search documents in chronological order that they were stored. 
However, I have not investigated this myself so you might want to perform 
some test to verify the validity of this claim.

>
>4)  Finally, we need to short-circuit returns.  Namely, if the user wants 
>just the 50 most recent articles, we need to just go get the first 50 and 
>then stop, not wasting processor time looking in the next million.

I'm quite certain this is not built into lucene (Someone correct me if I am 
wrong). However you have the lucene source code. It shouldn't be too tough 
to add that piece of functionality. ;)

>
>If anyone can help me answer these, please let me know.
>
>     Matthew Schmidt
>     <mailto:[EMAIL PROTECTED]>[EMAIL PROTECTED]
>


_______________________________________________
Lucene-users mailing list
[EMAIL PROTECTED]
http://lists.sourceforge.net/lists/listinfo/lucene-users

Reply via email to