On Fri, Mar 25, 2011 at 1:41 PM, Jeffrey Wang <jw...@palantir.com> wrote:
> I don't think it's Pig running out of memory, but rather Cassandra itself
> (the data doesn't even make it to Pig). get_range_slices() is called with a
> row batch size of 4096, the default, and it's fetching all of the columns in
> each row. If I have 10K columns in each row, that's a huge request, and
> Cassandra runs into memory pressure trying to serve it.

If your rows are that large, you should lower the batch size to something
appropriate for them.

-Brandon
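
For a sense of scale, here is a rough back-of-the-envelope sketch of the request size described above and of how one might pick a smaller batch size. The 4096-row default and the 10K columns per row come from the quoted message; the helper names and the 100,000-column budget are hypothetical, chosen only for illustration, and the right budget depends on column sizes and the heap available to Cassandra.

```python
# Figures taken from the quoted message.
DEFAULT_BATCH_ROWS = 4096   # default row batch size for get_range_slices()
COLUMNS_PER_ROW = 10_000    # "10K columns in each row"


def columns_per_request(batch_rows, columns_per_row):
    """Columns a single range-slice request returns when every column is fetched."""
    return batch_rows * columns_per_row


def batch_rows_for_budget(column_budget, columns_per_row):
    """Largest row batch that stays within column_budget, never less than 1.

    column_budget is a hypothetical per-request cap, not a Cassandra setting.
    """
    return max(1, column_budget // columns_per_row)


if __name__ == "__main__":
    # With the default batch size, one request covers about 41 million columns.
    print(columns_per_request(DEFAULT_BATCH_ROWS, COLUMNS_PER_ROW))

    # Assumed budget of 100,000 columns per request (illustrative only).
    print(batch_rows_for_budget(100_000, COLUMNS_PER_ROW))
```

The point is only that the batch size should shrink roughly in proportion to row width; how the batch size is actually set depends on the Cassandra and Pig versions in use.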