At that scale, and given that it's a batch job, I would go with the bulk
loading tool.
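
As a rough sketch of the first half of that workflow (parsing the text file
before handing rows to the SSTable writer described in the linked article),
something like the following would stream the input one row at a time rather
than loading 25-40 GB into memory. The input layout is an assumption, since
the file format was not given: tab-separated `row_key<TAB>column<TAB>value`
lines, already grouped by row key. The name `iter_rows` is hypothetical. It is
in Python only for brevity; the SSTable writing and sstableloader steps are
not shown.

```python
import csv
import io
from itertools import groupby
from operator import itemgetter


def iter_rows(lines, delimiter="\t"):
    """Yield (row_key, {column: value}) pairs from an iterable of text lines.

    Assumed (hypothetical) input layout: row_key<TAB>column<TAB>value, with
    all lines for a given row key adjacent to each other. Only one row is
    held in memory at a time, so the file size does not matter.
    """
    reader = csv.reader(lines, delimiter=delimiter)
    # Skip blank or malformed lines instead of aborting a long batch job.
    records = (rec for rec in reader if len(rec) == 3)
    for key, group in groupby(records, key=itemgetter(0)):
        yield key, {column: value for _, column, value in group}


if __name__ == "__main__":
    # Tiny in-memory stand-in for a large file opened with open(path, newline="").
    sample = io.StringIO("k1\tc1\tv1\nk1\tc2\tv2\nk2\tc1\tv3\n")
    for row_key, columns in iter_rows(sample):
        print(row_key, columns)
```

Each yielded row would then be passed to whatever writes the sstables, and the
resulting directory streamed into the cluster with sstableloader.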
Cheers
-
Aaron Morton
Freelance Developer
@aaronmorton
http://www.thelastpickle.com
On 19/10/2011, at 3:32 AM, Mike Rapuano wrote:
> We are not currently live but are testing with Cassandra. I'm looking for
> recommendations on the most efficient way to load text files over 25 GB in
> size into Cassandra (version 0.8.6). Our application may require us to load
> 2-3 text files of 25-40 GB each a few times a week into our 3-node
> cluster. I was reading this article on DataStax:
> http://www.datastax.com/dev/blog/bulk-loading
>
> Is it most efficient to create the sstables and then use sstableloader, or
> does anyone have other recommendations for bulk loading data? We are new to
> Cassandra and trying to work within generally accepted practices.
>
> Thanks
> Mike