I have a couple questions regarding performance of lucene. First off my environment:
Data 1-10M Documents 5 - 30 fields < 10B 1-3 Fields 1KB - 500KB I have three types of queries: Query 1 : 85% usage 1-2 phrase terms i.e. +id:"651" +id2:"241" sorting by an arbitrary field normally the date 5-20 security terms 5k-1M results can never return stale data Query 2: 13% 10 full wildcard terms i.e. *search* sorting is optional 0-200 results 20-200 security terms can return slightly stale data Query 3: 2% 1-20 mixed terms sorting is optional 0-200 results 20-200 security terms can return slightly stale data 1) Does re-opening an IndexSearcher flush all of the caches (filter and sort) ? 2) What is the overhead of opening an IndexSearcher? What does it depend on? 3) What is the recommended approach for updating and refreshing the index where there is 1 update for every 5 queries? 4) Is query 1 better off done using a database as I would have to re-open the IndexSeacher every couple of queries? 5) What would perform better Solr or Lucence? When is it better to use one or the other? 6) What else should I look out for? 7) Why is refreshing an IndexSearcher not supported? Any help is greatly appreciated Thanks Moshe -- View this message in context: http://www.nabble.com/Performance-Questions-tf4405513.html#a12568500 Sent from the Lucene - Java Users mailing list archive at Nabble.com. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]