Our experience of Cassandra+Hadoop is good.
We have a 16-node Cassandra cluster storing 110m users, plus a 5-node
Hadoop cluster. We can scan through all rows in about 2.5 hours.
Dave
On Thursday, 20 January 2011, David G. Boney
dbon...@semanticartifacts.com wrote:
Hi All
I want to use Apache Cassandra to store information (like first name, last
name, gender, address) about 2 million people. I then need to perform
analytics and reporting on that data.
is need to store information about 2 million people in MySQL and then
transfer that information into
Cassandra is not a good solution for data mining type problems, since it
doesn't have ad-hoc queries. Cassandra is designed to maximize throughput,
which is not usually a problem for data mining.
On Thu, Jan 20, 2011 at 2:07 PM, Surender Singh suriait2...@gmail.com wrote:
Hi All
I want to use
David
Please tell me any solution for it.
Thanks and regards
Surender Singh
On Thu, Jan 20, 2011 at 6:05 PM, David Boxenhorn da...@lookin2.com wrote:
Cassandra is not a good solution for data mining type problems, since it
doesn't have ad-hoc queries. Cassandra is designed to maximize
I don't think the below statement accurately describes data mining or using
Cassandra for data mining. All the techniques I am familiar with for either
data mining or machine learning, of which data mining is a subset, make one or
more sequential scans through the data to abstract statistics or