On Wednesday, December 10, 2003, at 04:07 PM, julien gerard wrote:
I'm attempting to optimize a fuzzy search on a big index with ~4.400.000 Documents ( lucene's meanning ) in 600.000 sub-categories (Simple Text.Keyword type a field ).

My purpose is to limit the amount of documents on wich the fuzzy search with levenhstein disance is performed ( an user cannot search on the 600.000 sub-categories but on 1 to 3 max )

the classics lucenes ways to do that are not adapted to my case :
- multiple indexes : having 600.000 indexes is a nightmare for maintenance.
- QueryFilter is not adapted because it's the fuzzy search which is in The QueryFilter and the number of different request is too important, so I cannot reuse the same.
- The BooleanQuery with 'AND' parameter is also not adapted because the two search are executed and after the results are merged.



QueryFilter would do the trick if you instead used the query you handed to it to be the one to single out a "sub-category". It would limit the documents searched to just the sub-category, and the fuzzy search would be done using IndexSearcher.search, only handing it the filter then as well.


Will this scheme work for you?

Erik


--------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]



Reply via email to