Yes, we are using 256 vnodes.Keyspace is configured with
NetworkTopologyStrategy in 4 regions, with RF3.
Copying sstabes and running cleanup is a good idea.

On Sun, Sep 16, 2018 at 9:26 PM Dinesh Joshi <dinesh.jo...@yahoo.com.invalid>
wrote:

> It would be helpful to describe your setup - specifically are you using
> vnodes? How is the keyspace setup? One option would be to copy SSTables
> from the replicas and running clean up. That might actually be faster.
> Since the SSTables are compressed you should use a tool that copies without
> compressing the data stream in transit.
>
> Dinesh
>
> On Sep 16, 2018, at 2:07 AM, Vitali Dyachuk <vdjat...@gmail.com> wrote:
>
> Both stream throughput settings are set to 0, meaning that there is no
> stream throttling on the C* side. Yes, i see high cpu used by STREAM-IN
> thread, sstables are compressed up to 80%
> What about copying sstables with rsync and then running repair? Probably
> its not that simple, vut If the data is RF3 so one node should have all the
> key ranges and repair will not recalculate all the hashes?
>
> Vitali
>
> On Sun, Sep 16, 2018, 02:33 dinesh.jo...@yahoo.com.INVALID <
> dinesh.jo...@yahoo.com.invalid> wrote:
>
>> Its a long shot but do you have
>> stream_throughput_outbound_megabits_per_sec or
>> inter_dc_stream_throughput_outbound_megabits_per_sec set to a low value?
>>
>> You're right in that 3.0 streaming uses 1 thread for incoming and
>> outgoing connection each per peer. It not only reads the bytes off of the
>> channel but also deserializes the partitions on that same thread. If you
>> see high CPU use by STREAM-IN thread then your streaming is CPU bound. In
>> this situation a powerful CPU will definitely help. Dropping internode
>> compression and encryption will also help. Are your SSTables compressed?
>>
>> Dinesh
>>
>>
>> On Friday, September 14, 2018, 4:15:28 AM PDT, Vitali Dyachuk <
>> vdjat...@gmail.com> wrote:
>>
>>
>> None of these throttling are helpful for streaming if you have even a
>> 150-200 Mbit/s bandwidth which is affordable in any cloud. Tweaking network
>> tcp memory, window size etc does not help, the bottleneck is not the
>> network.
>> These are my findings on how streaming is limited in C* 3.0.*
>>
>> 1)  Streaming of the particular range which needs to be steamed to the
>> new node is limited with one 1 thread and no tweaking of cpu affinity etc
>> helps, probably the powerfull computing VM will help
>> 2) Disabling compression internode_compression and disabling compression
>> per table in our case helps a bit
>> 3) When streaming has been dropped there is no resume available for the
>> streaming range so it will start from the beginning
>>
>> One of the options could be to create snapshots of sstables on the source
>> node and just copy all sstable snapshots to new node and then run repair,
>> data is ~5TB, RF3 ?
>> How is it possible at all to stream data fast to a new node/nodes ?
>>
>> Vitali.
>>
>> On Wed, Sep 12, 2018 at 5:02 PM Surbhi Gupta <surbhi.gupt...@gmail.com>
>> wrote:
>>
>> Increase 3 throughput
>> Compaction throughput
>> Stream throughput
>> Interdcstream throughput (if rebuilding from another DC)
>>
>> Make all of the above to 0 and see if there is any improvement and later
>> set the value if u can’t leave these values to 0.
>>
>> On Wed, Sep 12, 2018 at 5:42 AM Vitali Dyachuk <vdjat...@gmail.com>
>> wrote:
>>
>> Hi,
>> I'm currently streaming data with nodetool rebuild on 2 nodes, each node
>> is streaming from different location. The problem is that it takes ~7 days
>> to stream 4Tb of data to 1 node, the speed on each side is ~150Mbit/s  so
>> it should take around
>> ~2,5 days . Although there are resources on the destnodes and in the
>> source regions.
>> I've increased stream throughput, but its only affects outbound
>> connections.
>> Tested with iperf the bandwidth is 600Mibt/s from both sides. Last week
>> i've changed the CS from ST to LC because of huge sstables and compaction
>> of them is still ongoing.
>> How does rebuild command works ? Does it calculate the range then request
>> the needed sstables from that node and start streaming ? How is it possible
>> to speed up the streaming ?
>>
>> Vitali.
>>
>>

Reply via email to