Hi,

I'm attempting to optimize a fuzzy search on a big index with ~4.400.000 Documents ( 
lucene's meanning ) in 600.000 sub-categories (Simple Text.Keyword type a field ).

My purpose is to limit the amount of documents on wich the fuzzy search with 
levenhstein disance is performed ( an user cannot search on the 600.000 sub-categories 
but on 1 to 3 max )

the classics lucenes ways to do that are not adapted to my case :
- multiple indexes : having 600.000 indexes is a nightmare for maintenance.
- QueryFilter is not adapted because it's the fuzzy search which is in The QueryFilter 
and the number of different request is too important, so I cannot reuse the same.
- The BooleanQuery with 'AND' parameter is also not adapted because the two search are 
executed and after the results are merged.

So ( Ahhhhh!!! ) my first question is :
is there any way to do fuzzy search on a subset of the index that I've not seen yet?

Is this solution does not exist? Which solution could I implemented to perform this 
kind of search? 
I could implemented a FuzzyFilter but I'll need to access to each document, wich is 
time consuming. 

I know that solution cost a lot of memory usage, which has already been discuted on 
this list, but in my case this way is the only I can see to decrease the execution 
time.

regards,
Julien.

PS. : Sorry for my poor english.



---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to