[ https://issues.apache.org/jira/browse/CASSANDRA-2658?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Brandon Williams reopened CASSANDRA-2658:
-----------------------------------------
> Pig + CassandraStorage should work when trying to cast data after it's loaded
> -----------------------------------------------------------------------------
>
> Key: CASSANDRA-2658
> URL: https://issues.apache.org/jira/browse/CASSANDRA-2658
> Project: Cassandra
> Issue Type: Bug
> Affects Versions: 0.7.5
> Reporter: Jeremy Hanna
> Assignee: Brandon Williams
> Priority: Minor
> Labels: pig
>
> We currently do a lot with Pig + Cassandra, but one thing I've found is that
> it is very touchy about the data that comes from Cassandra. For example, if
> I try to do a SUM over data that has not been validated as a LongType in
> Cassandra, it fails. See this schema script for Cassandra:
> https://github.com/jeromatron/pygmalion/blob/master/cassandra/example_data.txt
> Remove the validation on num_heads, then try to SUM over that data, and Pig
> gives data type errors. (It breaks with the num_heads validation removed,
> with or without the default_validation class being set.)
> We currently do analysis over data that is either plain String (UTF8) data
> or data that we have validated, so it works for us. However, I've seen a
> couple of people trying to use Cassandra with Pig who have had issues because
> of this. One of the tenets of Pig is that it will eat anything, and it goes
> against that tenet if the load/store function interferes. So in essence, I
> think this is a big deal for those wanting to use Pig with Cassandra in the
> ways that Pig is normally used.