HBase is a distributed, consistent, sorted key-value store. The "sorted" bit
allows for range scans in addition to the point gets that all K/V stores
support. Nothing more, nothing less.
It happens to store its data in HDFS by default, and we provide convenient
input and output formats for MapReduce.
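
To make the "sorted" point concrete, here is a minimal in-memory sketch of
the idea. It is a toy model, not the HBase client API: the SortedKV class,
its method names, and the "sensor#hour" row keys are all made up for
illustration.

```python
import bisect


class SortedKV:
    """Toy in-memory sorted key-value store (hypothetical, not the HBase API)."""

    def __init__(self):
        self._keys = []   # row keys, always kept in sorted order
        self._data = {}   # row key -> value

    def put(self, key, value):
        # Insert the key at its sorted position the first time it is seen.
        if key not in self._data:
            bisect.insort(self._keys, key)
        self._data[key] = value

    def get(self, key):
        # Point get: the operation every key-value store supports.
        return self._data.get(key)

    def scan(self, start, stop):
        # Range scan over [start, stop): cheap only because keys are sorted.
        lo = bisect.bisect_left(self._keys, start)
        hi = bisect.bisect_left(self._keys, stop)
        return [(k, self._data[k]) for k in self._keys[lo:hi]]


store = SortedKV()
store.put("sensor1#2015-06-01T17", 21.5)
store.put("sensor1#2015-06-01T18", 22.0)
store.put("sensor2#2015-06-01T17", 19.0)

point = store.get("sensor1#2015-06-01T18")   # single-row lookup
rows = store.scan("sensor1#", "sensor1#~")   # all rows with the sensor1# prefix
```

The scan returns only the sensor1 rows, in key order, without touching
sensor2. That is the property a hash-partitioned store cannot give you
without reading everything.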
From: Michael Segel <[email protected]>
To: [email protected]
Sent: Monday, June 1, 2015 5:32 PM
Subject: Re: Hbase vs Cassandra
The point is that HBase is part of the Hadoop ecosystem, not a standalone
database like Cassandra.
This is one thing that gets lost when people want to compare NoSQL databases /
data stores.
As to Big Data without Hadoop? Well, there’s Spark on Mesos … :-P
There are also other Big Data systems out there, but they are not as well known.
LexisNexis had their proprietary system that they’ve been trying to sell …
> On Jun 1, 2015, at 5:29 PM, Vladimir Rodionov <[email protected]> wrote:
>
>>> The key issue is that unless you need or want to use Hadoop, you
> shouldn’t be using HBase. It’s not a standalone product or system.
>
> Hello, what is the use case of a big data application w/o Hadoop?
>
> -Vlad
>
> On Mon, Jun 1, 2015 at 2:26 PM, Michael Segel <[email protected]>
> wrote:
>
>> Saying Ambari rules is like saying that you like to drink MD 20/20 and
>> calling it a fine wine.
>>
>> Sorry to all the Hortonworks guys, but Ambari has a long way to go… very
>> immature.
>>
>> What does that have to do with Cassandra vs HBase? I haven’t a clue.
>>
>> The key issue is that unless you need or want to use Hadoop, you shouldn’t
>> be using HBase. It’s not a standalone product or system.
>>
>>> On May 30, 2015, at 7:40 AM, Serega Sheypak <[email protected]> wrote:
>>>
>>> 1. No killer features compared to HBase.
>>> 2. Terrible!!! Ambari/Cloudera Manager rulezzz. Netflix has its own tool
>>> for Cassandra, but it doesn't support vnodes.
>>> 3. Rumor has it that it's fast when it works ;) The reason: it can
>>> silently drop data you try to write.
>>> 4. Time series is a nightmare. The easiest approach is to just replicate
>>> the data to HDFS, partition it by hour/day, and run
>>> Spark/Scalding/Pig/Hive/Impala.
>>>
>>> On Friday, May 29, 2015, Ajay wrote:
>>>
>>>> Hi,
>>>>
>>>> I need some info on HBase vs Cassandra as a data store (in general plus
>>>> specific to time series data).
>>>>
>>>> A comparison on the following would help:
>>>> 1: features
>>>> 2: deployment and monitoring
>>>> 3: performance
>>>> 4: anything else
>>>>
>>>> Thanks
>>>> Ajay
>>>>
>>
>>