I am building a data cube on top of HBase. All access to the data is by 
map/reduce jobs. I want to build a scanner that matches based on the set 
intersection of bloom filters. First I run a map/reduce job on table A. For 
every row I match in table A, I add the row key to a bloom filter. I then to a 
map/reduce job on table B. 
-----------------
Sincerely,
David G. Boney
Chair, Austin ACM SIGKDD
[email protected]
http://www.meetup.com/Austin-ACM-SIGKDD/
http://tech.groups.yahoo.com/group/austinsigkdd/

Reply via email to