Re: Use Cassandra to store 2 million records of persons

2011-01-21 Thread Dave Gardner
Our experience of Cassandra+Hadoop is good. We have a 16-node Cassandra cluster storing 110 million users, plus a 5-node Hadoop cluster. We can scan through all rows in about 2.5 hours. Dave On Thursday, 20 January 2011, David G. Boney dbon...@semanticartifacts.com wrote: I don't think the below

Use Cassandra to store 2 million records of persons

2011-01-20 Thread Surender Singh
Hi All, I want to use Apache Cassandra to store information (such as first name, last name, gender, and address) about 2 million people. I then need to perform analytics and reporting on that data. Is there a need to store information about 2 million people in MySQL and then transfer that information into

Re: Use Cassandra to store 2 million records of persons

2011-01-20 Thread David Boxenhorn
Cassandra is not a good solution for data mining type problems, since it doesn't have ad-hoc queries. Cassandra is designed to maximize throughput, which is not usually a problem for data mining. On Thu, Jan 20, 2011 at 2:07 PM, Surender Singh suriait2...@gmail.com wrote: Hi All I want to use

Re: Use Cassandra to store 2 million records of persons

2011-01-20 Thread Surender Singh
David, please suggest a solution for it. Thanks and regards, Surender Singh On Thu, Jan 20, 2011 at 6:05 PM, David Boxenhorn da...@lookin2.com wrote: Cassandra is not a good solution for data mining type problems, since it doesn't have ad-hoc queries. Cassandra is designed to maximize

Re: Use Cassandra to store 2 million records of persons

2011-01-20 Thread David G. Boney
I don't think the below statement accurately describes data mining or using Cassandra for data mining. All the techniques I am familiar with for either data mining or machine learning, of which data mining is a subset, make one or more sequential scans through the data to abstract statistics or