I am wondering if anyone has thought about using HBase on other file systems like "Gluster". I think Gluster may offer much faster performance without exorbitant cost. With Gluster, you would have to fetch the data from the "Storage Bricks" and process it in your own environment. This allows the servers that are used as storage nodes very cheap.
I think with Gluster you can fetch data from any of the nodes. I would imagine this would be a little slower than Server Attached Storage, I think have a 10 Gbps network, and enough network bandwidth may make this a non-issue. Any comments? No, I do not work for Gluster. I have just started researching this, so I have not fact checked it adequately. Kevin