Using timestamp as key will cause your scan to largely hit one region. That may not be so good.
If you add something in front of the date, you may be able to spread your scan over several machines. On the other hand, your aggregation might be very small. In that case, the convenience of a time key might be enough to sufficient to make you prefer that implementation. How much data are you talking about aggregating each time you aggregate? On Tue, Apr 5, 2011 at 2:16 AM, Miguel Costa <[email protected]>wrote: > I want to have my data aggregated by day, so I would like to know wich is > the best option to query my data. To put The timestamp of the data on my > rowkey or to use timestamp of columns? > >
