Hi, I'm trying to optimize a fuzzy search on a big index with about 4,400,000 documents (in the Lucene sense) spread over 600,000 sub-categories (stored in a simple Keyword-type field).
My goal is to limit the number of documents on which the fuzzy search with Levenshtein distance is performed (a user never searches all 600,000 sub-categories, only 1 to 3 at most). The classic Lucene ways of doing that do not fit my case:

- Multiple indexes: having 600,000 indexes is a maintenance nightmare.
- QueryFilter does not fit, because the fuzzy search is what would sit inside the QueryFilter, and the number of distinct requests is too large for me to reuse the same filter.
- A BooleanQuery with 'AND' does not fit either, because both searches are executed in full and the results are merged only afterwards.

So (Ahhhhh!!!) my first question is: is there any way to do a fuzzy search on a subset of the index that I have not seen yet? If no such solution exists, what could I implement to perform this kind of search?

I could implement a FuzzyFilter, but I would need to access each document, which is time consuming. I know that this solution uses a lot of memory, which has already been discussed on this list, but in my case it is the only way I can see to decrease the execution time.

Regards,
Julien.

PS: Sorry for my poor English.
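The FuzzyFilter idea described above (pick the sub-categories first, then compute the Levenshtein distance only for the values inside them) can be sketched roughly as follows. This is plain Java and deliberately uses no Lucene calls: the class name SubsetFuzzySearch, its methods, and the in-memory map are all hypothetical, and a real filter would have to read the values from the index instead.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;

// Hypothetical helper, NOT part of Lucene. It only illustrates
// "restrict to the sub-categories first, then run the edit-distance test".
class SubsetFuzzySearch {

    // Assumed layout: sub-category -> values of the fuzzy-searched field.
    private final Map<String, List<String>> valuesByCategory = new HashMap<>();

    void add(String category, String value) {
        valuesByCategory.computeIfAbsent(category, k -> new ArrayList<>()).add(value);
    }

    // Standard two-row dynamic-programming Levenshtein distance.
    static int levenshtein(String a, String b) {
        int[] prev = new int[b.length() + 1];
        int[] curr = new int[b.length() + 1];
        for (int j = 0; j <= b.length(); j++) {
            prev[j] = j; // distance from the empty prefix of a
        }
        for (int i = 1; i <= a.length(); i++) {
            curr[0] = i;
            for (int j = 1; j <= b.length(); j++) {
                int cost = a.charAt(i - 1) == b.charAt(j - 1) ? 0 : 1;
                curr[j] = Math.min(Math.min(curr[j - 1] + 1, prev[j] + 1),
                                   prev[j - 1] + cost);
            }
            int[] tmp = prev; // the row just computed becomes "prev"
            prev = curr;
            curr = tmp;
        }
        return prev[b.length()];
    }

    // Only values in the requested sub-categories are ever compared.
    List<String> search(Set<String> categories, String query, int maxDistance) {
        List<String> hits = new ArrayList<>();
        for (String category : categories) {
            List<String> values =
                valuesByCategory.getOrDefault(category, Collections.<String>emptyList());
            for (String value : values) {
                if (levenshtein(query, value) <= maxDistance) {
                    hits.add(value);
                }
            }
        }
        return hits;
    }
}
```

With 1 to 3 sub-categories per request, the distance computation touches only the values stored under those keys, at the price of keeping the category-to-values map in memory, which is the memory cost mentioned above.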
