Re: Collections (MAP) data in Column Family

2015-10-15 Thread Robert Coli
On Wed, Oct 14, 2015 at 10:03 PM, Saladi Naidu 
wrote:

> Thanks for the reply. yes this indeed due to range tombstones with MAP
> data, even after tombstone past the gc_grace_poeriod and compactions ran in
> cluster, still no change with tombstone data in SSTables. Do you or nayone
> in the group know how to delete this bad data from the cluster?
>

Other than applying the patch from the upthread tickets, you could :

1) stop node

2) for each sstable affected:
a) dump to JSON with sstable2json
b) remove all duplicate range tombstones from JSON
c) recreate sstable with same name via json2sstable

3) start node

However based on my understanding of this issue, they'll just build up
again until you are running a version with the upthread patches.

=Rob


Collections (MAP) data in Column Family

2015-10-14 Thread Saladi Naidu
We are running Apache Cassandra 2.1.9. In one of our Column Family, we have MAP 
type column. We are seeing unusual data size of the column family (SSTables) 
with few 1000's of rows, while debugging, I looked at one of the SSTable and I 
see some unusual data in it
Below is JSON of one Row Key data
1. There is usual Column name, Key-Value pair and TS for the MAP - all_products 
column name 2. After Key Value pair,  I see Cluster Column style data in MAP 
with a "t" marker in between, this is literally repeated millions of cells - 
all_products:_","all_products:!",1442797965371999,"t",1442797965
Any clues on what is happening here? I know "d" marker for marked for delete, 
"e" marker for TTL but dont know what "t" marker is for? 
[{"key": "55736100", "cells": [["","",1444101633184000],           
["active","false",1444101633184000],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442806687091999,"t",1442806687],           
["all_products:_","all_products:!",1443410022982999,"t",1443410022],           
["all_products:_","all_products:!",1443410595224999,"t",1443410595],           
["all_products:_","all_products:!",1443679978903999,"t",1443679978],           
["all_products:_","all_products:!",1444011801906999,"t",1444011801],           
["all_products:_","all_products:!",1444101633183999,"t",1444101633],           
["all_products:3135393730323130305f63735f435a","313539373032313030",1444101633184000],
           
["all_products:3135393730323130305f64655f4154","313539373032313030",1444101633184000],
           
["all_products:3135393730323130305f64655f4348","313539373032313030",1444101633184000],
           
["all_products:3135393730323130305f64655f4445","313539373032313030",1444101633184000],
           
["all_products:3135393730323130305f656e5f4348","313539373032313030",1444101633184000],
.["all_products:3233393238333430305f69745f4348","323339323833343030",1444101633184000],
           ["all_products:_","all_products:!",1442797965371999,"t",1442797965], 
          ["all_products:_","all_products:!",1442797965371999,"t",1442797965],  
         ["all_products:_","all_products:!",1442806687091999,"t",1442806687],   
        ["all_products:_","all_products:!",1442797965371999,"t",1442797965],    
       ["all_products:_","all_products:!",1442797965371999,"t",1442797965],     
      ["all_products:_","all_products:!",1442806687091999,"t",1442806687],      
     ["all_products:_","all_products:!",1442797965371999,"t",1442797965],       
    ["all_products:_","all_products:!",1442797965371999,"t",1442797965],        
   ["all_products:_","all_products:!",1442806687091999,"t",1442806687],         
  ["all_products:_","all_products:!",1442797965371999,"t",1442797965],          
 ["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442806687091999,"t",1442806687],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442806687091999,"t",1442806687],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442806687091999,"t",1442806687],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442806687091999,"t",1442806687],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442797965371999,"t",1442797965],           
["all_products:_","all_products:!",1442806687091999,"t",1442806687],

 Naidu Saladi 


Re: Collections (MAP) data in Column Family

2015-10-14 Thread Robert Coli
On Wed, Oct 14, 2015 at 11:24 AM, Saladi Naidu 
wrote:

> We are running Apache Cassandra 2.1.9. In one of our Column Family, we
> have MAP type column. We are seeing unusual data size of the column family
> (SSTables) with few 1000's of rows, while debugging, I looked at one of the
> SSTable and I see some unusual data in it
>
...

> 2. After Key Value pair,  I see Cluster Column style data in MAP with a
> "t" marker in between, this is literally repeated millions of cells -
> *all_products:_","all_products:!",1442797965371999,"t",1442797965*
>

(thx to driftx@IRC for the link to...)

https://issues.apache.org/jira/browse/CASSANDRA-10505 /
https://issues.apache.org/jira/browse/CASSANDRA-7953

" Range tombstones not purged during compaction / Duplicate range
tombstones "

It looks like you will need to wait for the next 2.1.x release for a fix. :)

=Rob