OK, nice tool, but I still can't see how much data each row occupies in the SSTable (or am I missing something?).
Note: given the SSTable format, where rows are strictly sequential and sorted, a feature like that doesn't seem very hard to implement anyway. Wouldn't it be possible to calculate it from the index files alone, without even needing to read the actual table?

On Tue, Jan 20, 2015 at 5:05 PM, Jens Rantil <jens.ran...@tink.se> wrote:
> Hi,
>
> Datastax comes with sstablekeys that does that. You could also use
> sstable2json script to find keys.
>
> Cheers,
> Jens
>
>
>
> On Tue, Jan 20, 2015 at 2:53 PM, Edson Marquezani Filho
> <edsonmarquez...@gmail.com> wrote:
>>
>> Hello, everybody.
>>
>> Does anyone know a way to list, for an arbitrary column family, all
>> the rows owned (including replicas) by a given node and the data size
>> (real size or disk occupation) of each one of them on that node?
>>
>> I would like to do that because I have data on one of my nodes growing
>> faster than the others, although rows (and replicas) seem evenly
>> distributed across the cluster. So, I would like to verify if I have
>> some specific rows growing too much.
>>
>> Thank you.
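
P.S. To make the index-only idea concrete, here is a rough Python sketch of what I have in mind. It does not parse the real index file format. It assumes the index entries have already been extracted somehow as (row key, byte offset into the data file) pairs in on-disk order, and that the total size of the data file is known. The function name and the sample values are made up for illustration.

```python
from typing import List, Tuple


def row_sizes_from_index(
    entries: List[Tuple[str, int]], data_file_size: int
) -> List[Tuple[str, int]]:
    """Estimate the size of each row from consecutive index offsets.

    entries: (row key, byte offset of the row in the data file), in
             on-disk order. ASSUMPTION: these were already extracted
             from the index; this sketch does not read any real file.
    data_file_size: total size of the data file in bytes, used to
             close off the last row.
    """
    sizes = []
    for i, (key, offset) in enumerate(entries):
        # A row ends where the next one starts (or at end of file).
        if i + 1 < len(entries):
            next_offset = entries[i + 1][1]
        else:
            next_offset = data_file_size
        sizes.append((key, next_offset - offset))
    return sizes


# Hypothetical index entries, not taken from a real SSTable.
entries = [("row-a", 0), ("row-b", 120), ("row-c", 4216)]
sizes = row_sizes_from_index(entries, data_file_size=5000)

# Largest rows first, to spot the ones growing too much.
largest = sorted(sizes, key=lambda kv: kv[1], reverse=True)
for key, size in largest:
    print(key, size)
```

Since the rows are sequential, the size of each row is just the distance to the next offset, so nothing beyond the index and the data file length would be needed. Running this against every SSTable of the column family on the suspect node and summing per key would give the per-row totals I am after, if the assumption about the offsets holds.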