Now that we talk about it, it would totally make sense for the CollocDriver to optionally spit out a (serialized) BloomFilter at the end of its processing. You can even do it in parallel and then OR the separate pieces together...
-jake On May 27, 2010 12:09 PM, "Drew Farris" <[email protected]> wrote: On Thu, May 27, 2010 at 2:59 PM, Jake Mannix <[email protected]> wrote: > Ditto this. I though... Not that I know of. There are a couple implementations in hbase too, not sure how similar these are to the one in hadoop: http://svn.apache.org/repos/asf/hbase/trunk/src/main/java/org/apache/hadoop/hbase/util/ ByteBloomFilter and DynamicByteBloomFilter
