Francisco Guerrero created CASSANDRA-19369:
----------------------------------------------
Summary: [Analytics] Use XXHash32 for digest calculation of
SSTables
Key: CASSANDRA-19369
URL: https://issues.apache.org/jira/browse/CASSANDRA-19369
Project: Cassandra
Issue Type: Improvement
Reporter: Francisco Guerrero
Assignee: Francisco Guerrero
During bulk writes, Cassandra Analytics calculates the MD5 checksum of every
SSTable it produces. During SSTable upload to Cassandra Sidecar, Cassandra
Analytics includes the {{content-md5}} header as part of the upload request.
This information is used by Cassandra Sidecar to validate the integrity of the
uploaded SSTable and prevent issues with bit flips and corrupted SSTables.
Recently, Cassandra Sidecar introduced [support for additional checksum
validations|https://issues.apache.org/jira/browse/CASSANDRASC-97] during
SSTable upload. Notably the XXHash32 digest support was added which offers for
more performant checksum calculations. This support now allows Cassandra
Analytics to use a more efficient digest algorithm that is friendlier on the
CPU usage of Sidecar and spark resources.
--
This message was sent by Atlassian Jira
(v8.20.10#820010)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]