On Sun, Mar 22, 2009 at 11:35 PM, Lance Norskog <goks...@gmail.com> wrote: > LinkedIn open-sourced a pile of DocSet compression implementations as > "Lucene-Ext", or "kamikaze": > http://code.google.com/p/lucene-ext/wiki/Kamikaze > > Has anyone looked at using these in Solr?
The big question would be "use for what?" DocSets are often used for fast intersections when doing faceting.... most forms of compression would greatly impact performance. Although due to other Lucene changes/advances, I've considered moving from a HashDocSet to a sorted list of docids. These DocSets could implement skipTo() and be directly usable as filters because of that, but would be slower for random ID lookup and slower to get the intersection of a small and a large set. -Yonik http://www.lucidimagination.com