I can't claim it's the best, but I'd say Solr or Katta.
-Joey

On Jun 3, 2011 8:57 PM, "Mark Kerzner" <[email protected]> wrote:
> Hi,
>
> I need to store, say, 10M-100M documents, with each document having say 100
> fields, like author, creation date, access date, etc., and then I want to
> ask questions like
>
> give me all documents whose author is like abc**, and creation date any time
> in 2010 and access date in 2010-2011, and so on, perhaps 10-20 conditions,
> matching a list of some keywords.
>
> What's best, Lucene, Katta, HBase CF with secondary indices, or plain scan
> and compare of every record?
>
> Thanks a bunch!
>
> Mark
