Seeking Cassandra speakers for the upcoming Distributed Data Day conference

2018-03-07 Thread Lynn Bender
Friends of Cassandra, We are currently seeking speakers for the upcoming Distributed Data Day conference in September. *http://distributeddataday.com * The conference will be held in San Francisco at Mission Bay Conference Center -- which was the site of the first

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Carlos Rolo
Great explanation, thanks Jeff! On 7 Mar 2018 17:49, "Javier Pareja" wrote: > Thank you for your time Jeff, very helpful.I couldn't find anything out > there about the subject and I suspected that this could be the case. > > Regarding the clustering key in this case: >

Re: Batch too large exception

2018-03-07 Thread Goutham reddy
Mkadek, Sorry for the late reply. Thanks for the insight that I am unknowingly using batch inserts (Spring Data Cassandra) using the repository.save where I am inserting a list of objects at one go. And Cassandra is treating it as Batch Inserts aborting because of size and write timeout exception.

Re: One time major deletion/purge vs periodic deletion

2018-03-07 Thread kurt greaves
The important point to consider is whether you are deleting old data or recently written data. How old/recent depends on your write rate to the cluster and there's no real formula. Basically you want to avoid deleting a lot of old data all at once because the tombstones will end up in new SSTables

Re: backup/restore cassandra data

2018-03-07 Thread Ben Slater
You should be able to follow the same approach(s) as restoring from a backup as outlined here: https://docs.datastax.com/en/cassandra/2.1/cassandra/operations/ops_backup_snapshot_restore_t.html#ops_backup_snapshot_restore_t Cheers Ben On Thu, 8 Mar 2018 at 17:07 onmstester onmstester

backup/restore cassandra data

2018-03-07 Thread onmstester onmstester
Would it be possible to copy/paste Cassandra data directory from one of nodes (which Its OS partition corrupted) and use it in a fresh Cassandra node? I've used rf=1 so that's my only chance! Sent using Zoho Mail

Re: system.size_estimates - safe to remove sstables?

2018-03-07 Thread Kunal Gangakhedkar
Thanks a lot, Chris. Will try it today/tomorrow and update here. Thanks, Kunal On 7 March 2018 at 00:25, Chris Lohfink wrote: > While its off you can delete the files in the directory yeah > > Chris > > > On Mar 6, 2018, at 2:35 AM, Kunal Gangakhedkar

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Jeff Jirsa
On Wed, Mar 7, 2018 at 7:13 AM, Carlos Rolo wrote: > Hi Jeff, > > Could you expand: "Tables without clustering keys are often deceptively > expensive to compact, as a lot of work (relative to the other cell > boundaries) happens on partition boundaries." This is something I

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Javier Pareja
Thank you for your time Jeff, very helpful.I couldn't find anything out there about the subject and I suspected that this could be the case. Regarding the clustering key in this case: Back in the RDBMS world, you will always assign a sequential (or as sequential as possible) clustering key to a

Re: Adding disk to operating C*

2018-03-07 Thread Rahul Singh
Are you putting both the commitlogs and the Sstables on the adds? Consider moving your snapshots often if that’s also taking up space. Maybe able to save some space before you add drives. You should be able to add these new drives and mount them without an issue. Try to avoid different number

Re: One time major deletion/purge vs periodic deletion

2018-03-07 Thread Ben Slater
I would say you are better off spreading out the deletes so compactions have the best chance of actually removing them from disk before they become a problem. You will likely need to pay close attempting to compaction strategy tuning. I don’t have any personal experience with it but you may also

Re: One time major deletion/purge vs periodic deletion

2018-03-07 Thread Rahul Singh
Charu, I am aware of what type of things you are trying to do and why. Not sure if DCS will solve your problem. Consider a process that identifies the data that needs to be deleted and sets a TTL on that row or cell sometime in the future such as 10 days. The process could be run daily ,

Re: Seed nodes of DC2 creating own versions of system keyspaces

2018-03-07 Thread Oleksandr Shulgin
On Tue, Mar 6, 2018 at 8:28 PM, Jeff Jirsa wrote: > > Sorry, I wasnt as precise as I should have been: > > In 3.0 and newer, a bootstrapping node will wait until it has schema > before it bootstraps. HOWEVER, we make the ssystem_auth/system_distributed, > etc keyspaces as a

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Jeff Jirsa
There is no limit The token range of murmur3 is 2^64, but Cassandra properly handles token overlaps (we use a key that’s effectively a tuple of the token/hash and the underlying key itself), so having more than 2^64 partitions won’t hurt anything in theory That said, having that many

Re: Cassandra Daemon not coming up

2018-03-07 Thread Rahul Singh
It’s possible that the schema supporting roles and users is corrupted. Do you have a backup of it? Another quick fix would be to potentially reset permissions on your data dirs and restart. You can also inspect using the offline Cassandra sstable reader to see if it’s unaffected. -- Rahul

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Rahul Singh
The range is 2*2^63 -- Rahul Singh rahul.si...@anant.us Anant Corporation On Mar 7, 2018, 6:06 AM -0500, Javier Pareja , wrote: > Hello all, > > I have been trying to find an answer to the following but I have had no luck > so far: > Is there any limit to the number of

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Javier Pareja
Thank you Rahul, but is it a good practice to use a large range here? Or would it be better to create partitions with more than 1 row (by using a clustering key)? >From a data query point of view I will be accessing the rows by a UID one at a time. F Javier Pareja On Wed, Mar 7, 2018 at 11:12

Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Javier Pareja
Hello all, I have been trying to find an answer to the following but I have had no luck so far: Is there any limit to the number of partitions that a table can have? Let's say a table has a partition key an no clustering key, is there a recommended limit on the number of values that this

Re: [External] Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Tom van der Woerdt
Hi Javier, When our users ask this question, I tend to answer "keep it above a billion". More partitions is better. I'm not aware of any actual limits on partition count. Practically it's almost always limited by the disk space in a server. Tom van der Woerdt Site Reliability Engineer

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Carlos Rolo
Hi Jeff, Could you expand: "Tables without clustering keys are often deceptively expensive to compact, as a lot of work (relative to the other cell boundaries) happens on partition boundaries." This is something I didn't know and highly interesting to know more about! -- Carlos Rolo On Wed, Mar

Re: Is there any limit in the number of partitions that a table can have

2018-03-07 Thread Javier Pareja
Thank you Jeff, So, if I understood your email correctly, there is no restriction but I should be using clustering for performance reasons. I am expecting to store 10B rows per year in this table and each row will have a user defined type with an approx size of 1500 bytes. The access to the data