Using timestamp as key will cause your scan to largely hit one region.  That
may not be so good.

If you add something in front of the date, you may be able to spread your
scan over several machines.

On the other hand, your aggregation might be very small.  In that case, the
convenience of a time key might be enough to sufficient to make you prefer
that implementation.

How much data are you talking about aggregating each time you aggregate?

On Tue, Apr 5, 2011 at 2:16 AM, Miguel Costa <[email protected]>wrote:

> I want to have my data aggregated by day, so I would like to know wich is
> the best option to query my data. To put The timestamp of the data on my
> rowkey or to use timestamp of columns?
>
>

Reply via email to