HBase is a distributed, consistent, sorted key value store. The "sorted" bit 
allows for range scans in addition to the point gets that all K/V stores 
support. Nothing more, nothing less.

It happens to store its data in HDFS by default, and we provide convenient 
input and output formats for map reduce.

      From: Michael Segel <[email protected]>
 To: [email protected] 
 Sent: Monday, June 1, 2015 5:32 PM
 Subject: Re: Hbase vs Cassandra
   
The point is that HBase is part of the Hadoop ecosystem. Not a stand alone 
database like Cassandra. 

This is one thing that gets lost when people want to compare NoSQL databases / 
data stores. 

As to Big Data without Hadoop? Well, there’s spark on mesos … :-P
And there are other Big Data systems out there but are not as well known. 
Lexus/Nexus had their proprietary system that they’ve been trying to sell … 




> On Jun 1, 2015, at 5:29 PM, Vladimir Rodionov <[email protected]> wrote:
> 
>>> The key issue is that unless you need or want to use Hadoop, you
> shouldn’t be using HBase. Its not a stand alone product or system.
> 
> Hello, what is use case of a big data application w/o Hadoop?
> 
> -Vlad
> 
> On Mon, Jun 1, 2015 at 2:26 PM, Michael Segel <[email protected]>
> wrote:
> 
>> Saying Ambari rules is like saying that you like to drink MD 20/20 and
>> calling it a fine wine.
>> 
>> Sorry to all the Hortonworks guys but Amabari has a long way to go…. very
>> immature.
>> 
>> What that has to do with Cassandra vs HBase? I haven’t a clue.
>> 
>> The key issue is that unless you need or want to use Hadoop, you shouldn’t
>> be using HBase. Its not a stand alone product or system.
>> 
>> 
>> 
>> 
>>> On May 30, 2015, at 7:40 AM, Serega Sheypak <[email protected]>
>> wrote:
>>> 
>>> 1. No killer features comparing to hbase
>>> 2.terrible!!! Ambari/cloudera manager rulezzz. Netflix has its own tool
>> for
>>> Cassandra but it doesn't support vnodes.
>>> 3. Rumors say it fast when it works;) the reason- it can silently drop
>> data
>>> you try to write.
>>> 4. Timeseries is a nightmare. The easiest approach is just replicate data
>>> to hdfs, partition it by hour/day and run spark/scalding/pig/hive/Impala
>>> 
>>> пятница, 29 мая 2015 г. пользователь Ajay написал:
>>> 
>>>> Hi,
>>>> 
>>>> I need some info on Hbase vs Cassandra as a data store (in general plus
>>>> specific to time series data).
>>>> 
>>>> The comparison in the following helps:
>>>> 1: features
>>>> 2: deployment and monitoring
>>>> 3: performance
>>>> 4: anything else
>>>> 
>>>> Thanks
>>>> Ajay
>>>> 
>> 
>> 


  

Reply via email to