[ https://issues.apache.org/jira/browse/CASSANDRA-7241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14010299#comment-14010299 ]

Brandon Williams commented on CASSANDRA-7241:
---------------------------------------------

So, I took a different tack and ignored these errors, and instead focused on 
ThriftColumnFamilyDataTypeTest, which fails with:

{noformat}
    [junit] Testcase: testCassandraStorageDataType(org.apache.cassandra.pig.ThriftColumnFamilyDataTypeTest):    Caused an ERROR
    [junit] org.apache.pig.data.DefaultDataBag cannot be cast to org.apache.pig.data.Tuple
    [junit] java.lang.ClassCastException: org.apache.pig.data.DefaultDataBag cannot be cast to org.apache.pig.data.Tuple
    [junit]     at org.apache.cassandra.pig.ThriftColumnFamilyDataTypeTest.testCassandraStorageDataType(ThriftColumnFamilyDataTypeTest.java:150)
{noformat}

After a lengthy, tricky, painful bisect, I landed back at CASSANDRA-5417.  This 
test fails 100% of the time, and given the error I don't see how it can 
possibly be a timing issue.  So I recreated this test using the cli and pig so 
I could run it manually, and I got this:

{noformat}
org.apache.cassandra.serializers.MarshalException: Invalid UTF-8 bytes deadbeef
        at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:43)
        at org.apache.cassandra.serializers.AbstractTextSerializer.deserialize(AbstractTextSerializer.java:26)
        at org.apache.cassandra.db.marshal.AbstractType.compose(AbstractType.java:142)
        at org.apache.cassandra.hadoop.pig.AbstractCassandraStorage.columnToTuple(AbstractCassandraStorage.java:131)
        at org.apache.cassandra.hadoop.pig.CassandraStorage.getNext(CassandraStorage.java:256)
        at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader.nextKeyValue(PigRecordReader.java:194)
        at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.nextKeyValue(MapTask.java:532)
        at org.apache.hadoop.mapreduce.MapContext.nextKeyValue(MapContext.java:67)
        at org.apache.hadoop.mapreduce.Mapper.run(Mapper.java:143)
        at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:764)
        at org.apache.hadoop.mapred.MapTask.run(MapTask.java:370)
        at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:212)
{noformat}

This is interesting, since the deadbeef column is BytesType (verified in the 
cli), and the line in AbstractCassandraStorage (ACS) that throws also comes 
from CASSANDRA-5417.
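
For illustration only, here is a minimal sketch of why the raw bytes 0xdeadbeef cannot pass a strict UTF-8 decode, which is consistent with the column being read through a text type rather than BytesType. This is not Cassandra's code: the class name Utf8Check and the helper isValidUtf8 are hypothetical, and only standard java.nio classes are used.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

class Utf8Check {
    // Hypothetical helper: returns true only if the bytes are well-formed UTF-8.
    static boolean isValidUtf8(byte[] bytes) {
        // REPORT makes the decoder throw instead of substituting U+FFFD.
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        // The four raw bytes from the exception message.
        byte[] raw = {(byte) 0xde, (byte) 0xad, (byte) 0xbe, (byte) 0xef};
        // 0xde 0xad is a valid two-byte sequence, but 0xbe is a continuation
        // byte with no lead byte before it, so strict decoding fails.
        System.out.println(isValidUtf8(raw)); // false

        // The ASCII string "deadbeef" is a different value and decodes fine.
        System.out.println(isValidUtf8("deadbeef".getBytes(StandardCharsets.UTF_8))); // true
    }
}
```

So a value like this is only safe to read as opaque bytes; any path that composes it with a UTF-8 text type would be expected to fail in the way the trace shows.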

I'm left to conclude that, if the problem is in pig, it's still 
CASSANDRA-5417's fault :)  I can attach the cli-ified script and very simple 
pig script to run against if needed.

> Pig test fails on 2.1 branch
> ----------------------------
>
>                 Key: CASSANDRA-7241
>                 URL: https://issues.apache.org/jira/browse/CASSANDRA-7241
>             Project: Cassandra
>          Issue Type: Bug
>            Reporter: Alex Liu
>            Assignee: Brandon Williams
>             Fix For: 2.1 rc1
>
>
> Run ant pig-test on the cassandra-2.1 branch. Many tests fail. I traced it a 
> little and found that the Pig tests start failing as of this commit:
> https://github.com/apache/cassandra/commit/362cc05352ec67e707e0ac790732e96a15e63f6b
> It looks like the storage changes broke the Pig tests.



--
This message was sent by Atlassian JIRA
(v6.2#6252)
