[
https://issues.apache.org/jira/browse/CASSANDRA-11646?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15257767#comment-15257767
]
Stefania commented on CASSANDRA-11646:
--------------------------------------
It's this line
[here|https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/io/sstable/CQLSSTableWriter.java#L171].
It used to be a decompose on the type and it is causing the increase in size.
We do have unit tests and they can read the sstables, see for example this one
[here|https://github.com/apache/cassandra/blob/trunk/test/unit/org/apache/cassandra/io/sstable/CQLSSTableWriterTest.java#L341].
Is it because
[{{rawAddRow}}|https://github.com/apache/cassandra/blob/trunk/src/java/org/apache/cassandra/io/sstable/CQLSSTableWriter.java#L235]
adjust the format or because we already support it for UD functions?
To return to a decompose on the type, we need to somehow convert
{{com.datastax.driver.core.UDTValue}} to a BB. I'm not sure what's the best way
to do this without exposing BBs to the users.
> SSTableWriter output discrepancy
> --------------------------------
>
> Key: CASSANDRA-11646
> URL: https://issues.apache.org/jira/browse/CASSANDRA-11646
> Project: Cassandra
> Issue Type: Bug
> Reporter: T Jake Luciani
> Assignee: Stefania
> Fix For: 3.6
>
>
> Since CASSANDRA-10624 there is a non-trivial difference in the size of the
> output in CQLSSTableWriter.
> I've written the following code:
> {code}
> String KS = "cql_keyspace";
> String TABLE = "table1";
> File tempdir = Files.createTempDir();
> File dataDir = new File(tempdir.getAbsolutePath() + File.separator +
> KS + File.separator + TABLE);
> assert dataDir.mkdirs();
> String schema = "CREATE TABLE cql_keyspace.table1 ("
> + " k int PRIMARY KEY,"
> + " v1 text,"
> + " v2 int"
> + ");";// with compression = {};";
> String insert = "INSERT INTO cql_keyspace.table1 (k, v1, v2) VALUES
> (?, ?, ?)";
> CQLSSTableWriter writer = CQLSSTableWriter.builder()
> .sorted()
> .inDirectory(dataDir)
> .forTable(schema)
> .using(insert).build();
> for (int i = 0; i < 10000000; i++)
> writer.addRow(i, "test1", 24);
> writer.close();
> {code}
> Pre CASSANDRA-10624 the data file is ~63MB. Post it's ~69MB
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)