On Fri, Feb 11, 2011 at 2:06 AM, Patrik Modesto <patrik.mode...@gmail.com> wrote:
> Hi all!
>
> I'm wondering whether the size of a column name matters for a large
> dataset in Cassandra (I mean lots of rows). For example, what if I have
> a row with 10 columns, each with a 10-byte value and a 10-byte name? Is
> half the row size taken up by the column names and the other half by the
> data (not counting storage overhead)? What if I have 10M of these rows?
> Is there a difference? Should I use 3-byte codes for column names to
> save memory/bandwidth?

Yes, column names take up space in storage and bandwidth, but the work to
make a compressible data file format would reduce that overhead:

https://issues.apache.org/jira/browse/CASSANDRA-674

-ryan
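
For a rough sense of scale, the numbers from the question can be worked through in a few lines of Python. This is only a back-of-the-envelope sketch: the helper name payload_bytes is made up for illustration, and it counts raw name and value bytes only, ignoring per-column and per-row storage overhead, replication, and any compression.

```python
def payload_bytes(rows, columns, name_len, value_len):
    """Return (name_bytes, value_bytes) of raw payload.

    Hypothetical helper: ignores all storage overhead, replication,
    and compression, exactly as the question does.
    """
    name_bytes = rows * columns * name_len
    value_bytes = rows * columns * value_len
    return name_bytes, value_bytes


ROWS = 10_000_000   # 10M rows, as in the question
COLUMNS = 10        # columns per row
VALUE_LEN = 10      # bytes per column value

# Compare the original 10-byte names with the proposed 3-byte codes.
for name_len in (10, 3):
    names, values = payload_bytes(ROWS, COLUMNS, name_len, VALUE_LEN)
    share = names / (names + values)
    print(f"{name_len}-byte names: {names / 1e6:.0f} MB of names, "
          f"{values / 1e6:.0f} MB of values, "
          f"names are {share:.0%} of the raw payload")
```

Under these assumptions, 10-byte names account for half of the raw payload (about 1 GB out of 2 GB), and 3-byte codes cut the name bytes to about 300 MB. The real on-disk figures will differ, since the overhead excluded here applies to every column.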