People in our organization are mostly familiar with Hadoop and use only that. HBase is not in use yet, and I'm the one venturing into it. Because most of them know Hadoop, everyone assumes we would need a lot more storage.
I understand the general use case for Hadoop is storing giant raw log files from web servers and other servers, which are huge and fill up disks quickly. If all we are storing is processed data (events) directly in HBase tables, I would think the space needed should not be that much. I've done some small benchmark testing (300,000 records, less than 100 bytes per record), and HBase doesn't take up much disk space (looking at raw disk usage). As the table gets larger it will, of course, but we have flexibility on how long to keep data around.

Thoughts?

Thanks,
Venkatesh
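
For anyone wanting to sanity-check the numbers above, a back-of-envelope estimate might look like the sketch below. Only the record count and the 100-byte upper bound come from the benchmark; the per-record overhead (row key, column family, qualifier, and timestamp stored alongside each value) is a placeholder guess, and the replication factor assumes the HDFS default of 3. The function name is hypothetical, and the estimate ignores compression, write-ahead logs, and temporary space used during compactions.

```python
def estimate_raw_disk_bytes(records, payload_bytes, overhead_bytes, replication):
    """Rough raw-disk estimate: (payload + overhead) per record, times replication."""
    return records * (payload_bytes + overhead_bytes) * replication


RECORDS = 300_000      # from the benchmark in the post
PAYLOAD_BYTES = 100    # upper bound from the post ("less than 100 bytes per record")
OVERHEAD_BYTES = 50    # ASSUMPTION: placeholder for key/family/qualifier/timestamp overhead
REPLICATION = 3        # ASSUMPTION: HDFS default replication factor

total = estimate_raw_disk_bytes(RECORDS, PAYLOAD_BYTES, OVERHEAD_BYTES, REPLICATION)
print(f"Estimated raw disk usage: {total / 1024**2:.1f} MiB")
```

Plugging in the expected event rate and retention period instead of the benchmark size would give a first rough figure to compare against measured raw disk usage on the cluster.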
