[
https://issues.apache.org/jira/browse/CASSANDRA-19727?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Francisco Guerrero updated CASSANDRA-19727:
-------------------------------------------
Bug Category: Parent values: Correctness(12982)Level 1 values:
Unrecoverable Corruption / Loss(13161)
Complexity: Low Hanging Fruit
Discovered By: User Report
Fix Version/s: NA
Severity: Low
Status: Open (was: Triage Needed)
> [Analytics] Bulk writer fails validation stage when writing to a cluster
> using RandomPartitioner
> ------------------------------------------------------------------------------------------------
>
> Key: CASSANDRA-19727
> URL: https://issues.apache.org/jira/browse/CASSANDRA-19727
> Project: Cassandra
> Issue Type: Bug
> Components: Analytics Library
> Reporter: Francisco Guerrero
> Assignee: Francisco Guerrero
> Priority: Normal
> Fix For: NA
>
>
> In bulk writer after writing SSTables, written data to SSTables is read back
> again to perform validations before shipping the files to the Cassandra
> instances. The logic to validate the SSTables assumes
> {{{}Murmur3Partitioner{}}}. This validation fails however when a bulk writer
> job is running against a cluster using the {{RandomPartitioner}} with the
> following stacktrace:
> {code}
> java.lang.RuntimeException: java.lang.IllegalStateException: Partitioner in
> ValidationMetadata does not match TableMetaData:
> org.apache.cassandra.dht.RandomPartitioner vs.
> org.apache.cassandra.dht.Murmur3Partitioner
> at
> org.apache.cassandra.spark.utils.Throwing.lambda$function$2(Throwing.java:84)
> at
> java.base/java.util.stream.ReferencePipeline$3$1.accept(ReferencePipeline.java:197)
> at
> java.base/java.util.HashMap$KeySpliterator.forEachRemaining(HashMap.java:1707)
> at
> java.base/java.util.stream.AbstractPipeline.copyInto(AbstractPipeline.java:509)
> at
> java.base/java.util.stream.AbstractPipeline.wrapAndCopyInto(AbstractPipeline.java:499)
> at
> java.base/java.util.stream.ReduceOps$ReduceOp.evaluateSequential(ReduceOps.java:921)
> at
> java.base/java.util.stream.AbstractPipeline.evaluate(AbstractPipeline.java:234)
> at
> java.base/java.util.stream.ReferencePipeline.collect(ReferencePipeline.java:682)
> at
> org.apache.cassandra.spark.data.BasicSupplier.openAll(BasicSupplier.java:44)
> at
> org.apache.cassandra.bridge.CassandraBridgeImplementation.getCompactionScanner(CassandraBridgeImplementation.java:248)
> at
> org.apache.cassandra.spark.data.DataLayer.openCompactionScanner(DataLayer.java:262)
> at
> org.apache.cassandra.spark.bulkwriter.SSTableWriter.validateSSTables(SSTableWriter.java:129)
> at
> org.apache.cassandra.spark.bulkwriter.SSTableWriter.close(SSTableWriter.java:113)
> at
> org.apache.cassandra.spark.bulkwriter.RecordWriter.finalizeSSTable(RecordWriter.java:224)
> at
> org.apache.cassandra.spark.bulkwriter.RecordWriter.write(RecordWriter.java:122)
> ... 15 more
> {code}
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]