[
https://issues.apache.org/jira/browse/CASSANDRA-7241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010299#comment-14010299
]
Brandon Williams commented on CASSANDRA-7241:
---------------------------------------------
So, I took a different tack and ignored these errors, and instead focused on
ThriftColumnFamilyDataTypeTest, which fails with:
{noformat}
[junit] Testcase: testCassandraStorageDataType(org.apache.cassandra.pig.ThriftColumnFamilyDataTypeTest): Caused an ERROR
[junit] org.apache.pig.data.DefaultDataBag cannot be cast to org.apache.pig.data.Tuple
[junit] java.lang.ClassCastException: org.apache.pig.data.DefaultDataBag cannot be cast to org.apache.pig.data.Tuple
[junit] at org.apache.cassandra.pig.ThriftColumnFamilyDataTypeTest.testCassandraStorageDataType(ThriftColumnFamilyDataTypeTest.java:150)
{noformat}
After a lengthy, tricky, painful bisect, I landed back at CASSANDRA-5417. This
test fails 100% of the time, and given the error I don't see how it can
possibly be a timing issue. So I recreated this test using the cli and pig so
I could run it manually, and I got this:
{noformat}
org.apache.cassandra.serializers.MarshalException: Invalid UTF-8 bytes deadbeef
at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:43)
at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:26)
at org.apache.cassandra.db.marshal.AbstractType.compose(AbstractType.java:142)
at org.apache.cassandra.hadoop.pig.AbstractCassandraStorage.columnToTuple(AbstractCassandraStorage.java:131)
at org.apache.cassandra.hadoop.pig.CassandraStorage.getNext(CassandraStorage.java:256)
at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader.nextKeyValue(PigRecordReader.java:194)
at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:532)
at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:143)
at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
{noformat}
This is interesting, since the deadbeef column is BytesType (verified in the
cli), yet the trace shows it being deserialized as text, and the line in
AbstractCassandraStorage that throws is also from CASSANDRA-5417.
I'm left to conclude that, even if the problem is in pig, it's still
CASSANDRA-5417's fault :) I can attach the cli-ified script and a very simple
pig script to run against it if needed.
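For anyone wondering why a BytesType value like deadbeef trips a text
deserializer at all, here is a minimal standalone sketch using only the JDK.
It is not Cassandra's code and does not reproduce the Pig job; the class and
method names (DeadbeefUtf8Check, isValidUtf8) are made up for illustration. It
only shows that the four bytes 0xDE 0xAD 0xBE 0xEF are not well-formed UTF-8,
so any path that treats the column as text would be expected to reject them.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

// Hypothetical helper, not part of Cassandra or Pig.
public class DeadbeefUtf8Check {

    // Returns true only if the bytes decode as strict, well-formed UTF-8.
    public static boolean isValidUtf8(byte[] bytes) {
        try {
            StandardCharsets.UTF_8.newDecoder()
                    // REPORT makes the decoder throw instead of substituting U+FFFD.
                    .onMalformedInput(CodingErrorAction.REPORT)
                    .onUnmappableCharacter(CodingErrorAction.REPORT)
                    .decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // 0xDE 0xAD happens to form a valid two-byte sequence, but 0xBE is a
        // continuation byte with no lead byte, so the whole value is malformed.
        byte[] deadbeef = {(byte) 0xDE, (byte) 0xAD, (byte) 0xBE, (byte) 0xEF};
        System.out.println(isValidUtf8(deadbeef)); // prints false
        System.out.println(isValidUtf8("hello".getBytes(StandardCharsets.UTF_8))); // prints true
    }
}
```

So the bytes themselves are fine as a blob; the failure only makes sense if the
column is being composed with a text type instead of BytesType somewhere along
the way.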
> Pig test fails on 2.1 branch
> ----------------------------
>
> Key: CASSANDRA-7241
> URL: https://issues.apache.org/jira/browse/CASSANDRA-7241
> Project: Cassandra
> Issue Type: Bug
> Reporter: Alex Liu
> Assignee: Brandon Williams
> Fix For: 2.1 rc1
>
>
> Run ant pig-test on the cassandra-2.1 branch. Many tests fail. I
> traced it a little and found that the Pig test failures start from the
> https://github.com/apache/cassandra/commit/362cc05352ec67e707e0ac790732e96a15e63f6b
> commit.
> It looks like the storage changes broke the Pig tests.