Hi Fabio,
I'm not sure whether super columns are fully supported in CassandraStorage
right now. Brandon (whom I've CCed) would know for sure. Also, I thought the
Pig bug that made it impossible to reach nested data structures had been
resolved. I think the ticket you commented on today was a duplicate of another
bug that has since been fixed.
Which versions of Pig and Cassandra are you using?
Jeremy
On Aug 17, 2011, at 10:09 AM, Fabio Souto wrote:
> Hi,
>
> I have some metrics stored in a Cassandra super column, and the subcolumns are
> the timestamps of each metric. I'm loading the metrics in Pig with this line:
>
> all_metrics = LOAD 'cassandra://keyspace/metrics' USING CassandraStorage() AS
> (metric_key, metrics_bag: bag {metric: tuple(timestamp, columns: bag {record:
> tuple(name:chararray, value:chararray)})});
>
> I just want to access the timestamp subcolumn to get the most recent value
> (using MAX). I tried:
>
> metric_status = FOREACH all_metrics GENERATE metric_key,
> metrics_bag.timestamp;
> dump metric_status;
>
> but I'm getting empty values, like this:
>
> (key1,{(),(),()})
> (key2,{(),()})
> ...