sstable processing times

2020-10-23 Thread James A. Robinson
Hi folks, I'm running a job on an offline node to test how long it takes to run sstablesplit several large sstable. I'm a bit dismayed to see it took about 22 hours to process a 1.5 gigabyte sstable! I worry about the 32 gigabyte sstable that is my ultimate target to split. This is running on a

cassandra tracing's source_elapsed microseconds

2020-10-08 Thread James A. Robinson
Hi folks, I've been looking at various articles on the TRACING ON output of cassandra. I'm not finding a definitive description of what the output means. https://docs.datastax.com/en/dse/6.7/cql/cql/cql_reference/cqlsh_commands/cqlshTracing.html says "Note: The source_elapsed column value is the

Re: sstableloader - warning vs. failure?

2020-02-07 Thread James A. Robinson
Ok, thanks very much the answer! On Fri, Feb 7, 2020 at 9:00 PM Erick Ramirez wrote: > INFO [pool-1-thread-4] 2020-02-08 01:35:37,946 NoSpamLogger.java:91 - >> Maximum memory usage reached (536870912), cannot allocate chunk of 1048576 >> > > The message gets logged when SSTables are being cache

sstableloader - warning vs. failure?

2020-02-07 Thread James A. Robinson
Hi folks, When sstableloader hits a very large sstable cassandra may end up logging a message like this: INFO [pool-1-thread-4] 2020-02-08 01:35:37,946 NoSpamLogger.java:91 - Maximum memory usage reached (536870912), cannot allocate chunk of 1048576 The loading process doesn't abort, and the ss

Cassandra and UTF-8 BOM?

2019-10-29 Thread James A. Robinson
Hi folks, I'm looking at a table that has a primary key defined as "publisher_id text". I've noticed some of the entries have what appears to me to be a UTF-8 BOM marker and some do not. https://docs.datastax.com/en/archived/cql/3.3/cql/cql_reference/cql_data_types_c.html says text is a UTF-8 en

n00b q re UPDATE v. INSERT in CQL

2019-10-25 Thread James A. Robinson
Hi folks, I'm working on a clean-up task for some bad data in a cassandra db. The bad data in this case are values with mixed case that will need to be lowercased. In some tables the value that needs to be changed is a primary key, in other cases it is not. >From the reading I've done, the situa

snapshots and 'dot' prefixed _index directories

2019-10-01 Thread James A. Robinson
Hi folks, I took a nodetool snapshot of a keyspace in my cassandra 3.11 cluster and it included directories with a 'dot' prefix (often called a hidden file/directory). As an example: /var/lib/cassandra/data/impactvizor/tableau_notification-04bfb600291e11e7aeab31f0f0e5804b/snapshots/1569974640/.