Andres de la Peña created CASSANDRA-19007:
---------------------------------------------
Summary: Queries with multi-column replica-side filtering can miss rows
Key: CASSANDRA-19007
URL: https://issues.apache.org/jira/browse/CASSANDRA-19007
Project: Cassandra
Issue Type: Bug
Components: Consistency/Coordination
Reporter: Andres de la Peña
{{SELECT}} queries with multi-column replica-side filtering can miss rows if
the filtered columns are spread across out-of-sync replicas. This dtest
reproduces the issue:
{code:java}
@Test
public void testMultiColumnReplicaSideFiltering() throws IOException
{
    try (Cluster cluster = init(Cluster.build().withNodes(2).start()))
    {
        cluster.schemaChange(withKeyspace("CREATE TABLE %s.t (k int PRIMARY KEY, a int, b int)"));

        // insert a split row
        cluster.get(1).executeInternal(withKeyspace("INSERT INTO %s.t(k, a) VALUES (0, 1)"));
        cluster.get(2).executeInternal(withKeyspace("INSERT INTO %s.t(k, b) VALUES (0, 2)"));

        String select = withKeyspace("SELECT * FROM %s.t WHERE a = 1 AND b = 2 ALLOW FILTERING");
        Object[][] initialRows = cluster.coordinator(1).execute(select, ALL);
        assertRows(initialRows, row(0, 1, 2)); // not found!!
    }
}
{code}
This edge case affects queries using either {{ALLOW FILTERING}} or any index
implementation.
The protection mechanism added by CASSANDRA-8272/8273 won't deal with this
case, since it only solves single-column conflicts.
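
To illustrate why the row goes missing, here is a minimal toy model of the scenario in the dtest above. It is only a sketch and not Cassandra code: the class {{SplitRowSketch}} and its methods are hypothetical, plain maps stand in for the two replicas, and write timestamps are ignored because the two partial rows do not share any regular column. Each replica evaluates the full predicate against its local copy, so neither copy matches, whereas merging the copies first would produce a matching row.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy model only: maps stand in for replicas; none of this is Cassandra's API.
class SplitRowSketch
{
    // The query predicate from the dtest: a = 1 AND b = 2.
    // A column missing from the map models a column the replica has no value for.
    static boolean matches(Map<String, Integer> row)
    {
        return Integer.valueOf(1).equals(row.get("a"))
            && Integer.valueOf(2).equals(row.get("b"));
    }

    // Replica-side filtering: every replica applies the whole predicate to its
    // local copy and returns that copy only if it matches on its own.
    static List<Map<String, Integer>> filterOnReplicas(List<Map<String, Integer>> replicas)
    {
        List<Map<String, Integer>> returned = new ArrayList<>();
        for (Map<String, Integer> local : replicas)
            if (matches(local))
                returned.add(local);
        return returned;
    }

    // Merge first, then filter: combine the columns of all copies and apply
    // the predicate to the merged row (assumes no conflicting columns).
    static boolean mergeThenFilter(List<Map<String, Integer>> replicas)
    {
        Map<String, Integer> merged = new HashMap<>();
        for (Map<String, Integer> local : replicas)
            merged.putAll(local);
        return matches(merged);
    }

    public static void main(String[] args)
    {
        // The split row: node 1 only has column a, node 2 only has column b.
        Map<String, Integer> node1 = Map.of("k", 0, "a", 1);
        Map<String, Integer> node2 = Map.of("k", 0, "b", 2);
        List<Map<String, Integer>> replicas = List.of(node1, node2);

        // Neither local copy satisfies both conditions, so nothing is returned.
        System.out.println(filterOnReplicas(replicas).isEmpty()); // prints "true"
        // The merged row (0, 1, 2) does satisfy the predicate.
        System.out.println(mergeThenFilter(replicas)); // prints "true"
    }
}
```

In this model the only way to find the row is to look at the merged columns, which is what a single-column conflict check cannot guarantee when the two conditions are satisfied on different replicas.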