Re: Does "nodetool repair" need to be run on each node for a given table?

2017-03-15 Thread Thakrar, Jayesh
Thank you Eric for helping out.
The reason I sent the question a second time is because I did not see my 
question and the first reply from the usergroup.
After I sent the question a second time, I got a personal flame from somebody 
else too and so examined my "spam" folders and that's where I found both.
All the same, I got some very valuable pointers and am pasting here as a summary

https://docs.datastax.com/en/cassandra/2.1/cassandra/tools/toolsRepair.html

https://www.datastax.com/dev/blog/repair-in-cassandra

https://www.pythian.com/blog/effective-anti-entropy-repair-cassandra/

https://www.youtube.com/watch?v=1Sz_K8UID6E




On 3/15/17, 10:22 AM, "Eric Evans"  wrote:

On Tue, Mar 14, 2017 at 12:04 PM, daemeon reiydelle  
wrote:
> Am I unreasonable in expecting a poster to have looked at the 
documentation
> before posting? And that reposting the same query WITHOUT reading the
> documents (when pointed out to them) when asked to do so is not 
appropriate?

It may be reasonable to assume that people will make a good faith
effort to read the docs, but what is unreasonable is to presume that
an RTFM will do any good whatsoever.  In my experience, there is
almost always a more productive approach.

> Do we have a way to blackball such?

No, but no one is making you respond, either.

-- 
Eric Evans
john.eric.ev...@gmail.com





Re: Does "nodetool repair" need to be run on each node for a given table?

2017-03-15 Thread Eric Evans
On Tue, Mar 14, 2017 at 12:04 PM, daemeon reiydelle  wrote:
> Am I unreasonable in expecting a poster to have looked at the documentation
> before posting? And that reposting the same query WITHOUT reading the
> documents (when pointed out to them) when asked to do so is not appropriate?

It may be reasonable to assume that people will make a good faith
effort to read the docs, but what is unreasonable is to presume that
an RTFM will do any good whatsoever.  In my experience, there is
almost always a more productive approach.

> Do we have a way to blackball such?

No, but no one is making you respond, either.

-- 
Eric Evans
john.eric.ev...@gmail.com


Re: Does "nodetool repair" need to be run on each node for a given table?

2017-03-14 Thread Max Campos
For anyone with questions about how repair works, why repair, partial range 
repair, incremental vs. non-incremental repair, best practices for repair, etc. 
 I highly recommend this talk from Alexander Dejanovski, The Last Pickle at 
Cassandra Summit 2016:

https://www.youtube.com/watch?v=1Sz_K8UID6E 
<https://www.youtube.com/watch?v=1Sz_K8UID6E>

Alexander went a little too crazy on the auto-repeat animated GIFs, but the 
information is excellent — thanks Alexander if you’re out there - I really got 
a lot out of your talk.  :-)

- Max

> On Mar 14, 2017, at 12:48 pm, Thakrar, Jayesh  
> wrote:
> 
> Thank you for the links, Meg - very helpful!
>  
> From: Meg Mara mailto:mm...@digitalriver.com>>
> Date: Tuesday, March 14, 2017 at 2:08 PM
> To: "user@cassandra.apache.org <mailto:user@cassandra.apache.org>" 
> mailto:user@cassandra.apache.org>>
> Subject: RE: Does "nodetool repair" need to be run on each node for a given 
> table?
>  
> Yes, whichever node initiates “nodetool repair” becomes the co-ordinator node 
> for that repair operation and only the token ranges it owns are repaired. 
> Other nodes which own data in that “repaired token range” will also be 
> involved in this repair operation, create their merkle trees and send it to 
> coordinator for comparison.
>  
> There is a lot to explain when it comes to repair and its many options (-pr, 
> -inc, -par etc). I recommend the following two links which provide the 
> answers to all your questions.
>  
> https://www.datastax.com/dev/blog/repair-in-cassandra 
> <https://www.datastax.com/dev/blog/repair-in-cassandra>
> https://www.pythian.com/blog/effective-anti-entropy-repair-cassandra/ 
> <https://www.pythian.com/blog/effective-anti-entropy-repair-cassandra/>
>  
> Thanks,
> -  Meg Mara
>  
> From: Thakrar, Jayesh [mailto:jthak...@conversantmedia.com 
> <mailto:jthak...@conversantmedia.com>] 
> Sent: Tuesday, March 14, 2017 1:26 PM
> To: daemeon reiydelle mailto:daeme...@gmail.com>>; 
> user@cassandra.apache.org <mailto:user@cassandra.apache.org>
> Subject: Re: Does "nodetool repair" need to be run on each node for a given 
> table?
>  
> Thanks Daemon - it’s the documentation that prompted me to ask the question.
>  
> The doc points to further info on incremental, and would I be correct to 
> interpret that nodetool initiates a repair on a specific node (i.e. the node 
> specified with -h or the current node)?
>  
> http://www.datastax.com/dev/blog/more-efficient-repairs 
> <http://www.datastax.com/dev/blog/more-efficient-repairs>
>  
> 
>  
> From: daemeon reiydelle mailto:daeme...@gmail.com>>
> Date: Monday, March 13, 2017 at 5:02 PM
> To: mailto:user@cassandra.apache.org>>
> Subject: Re: Does "nodetool repair" need to be run on each node for a given 
> table?
>  
> I
> ​ find it helpful to read the manual first. After review, I would be happy to 
> answer specific questions.
> 
> https://docs.datastax.com/en/cassandra/2.1/cassandra/tools/toolsRepair.html 
> <https://docs.datastax.com/en/cassandra/2.1/cassandra/tools/toolsRepair.html>​
> 
> 
> ...
> 
> Daemeon C.M. Reiydelle
> USA (+1) 415.501.0198
> London (+44) (0) 20 8144 9872
>  
> On Mon, Mar 13, 2017 at 1:30 PM, Thakrar, Jayesh 
> mailto:jthak...@conversantmedia.com>> wrote:
> I understand that the nodetool command connects to a specific server and for 
> many of the commands, e.g. "info", "compactionstats", etc, the information is 
> for that specific node.
> While for some other commands like "status", the info is for the whole 
> cluster.
>  
> So is "nodetool repair" that operates at a single node level (i.e. repairs 
> the partitions contained on the target node?).
> If so, what is the recommended approach to doing repairs?
>  
> E.g. we have a large number of tables (20+), large amount of data (40+ TB) 
> and a number of nodes (40+).
> Do I need to iterate through each server AND each table?
>  
> Thanks,
> Jayesh
>  
>  
>  
>  



Re: Does "nodetool repair" need to be run on each node for a given table?

2017-03-14 Thread daemeon reiydelle
Am I unreasonable in expecting a poster to have looked at the documentation
before posting? And that reposting the same query WITHOUT reading the
documents (when pointed out to them) when asked to do so is not
appropriate? Do we have a way to blackball such?


*...*



*Daemeon C.M. ReiydelleUSA (+1) 415.501.0198London (+44) (0) 20 8144 9872*

On Mon, Mar 13, 2017 at 1:30 PM, Thakrar, Jayesh <
jthak...@conversantmedia.com> wrote:

> I understand that the nodetool command connects to a specific server and
> for many of the commands, e.g. "info", "compactionstats", etc, the
> information is for that specific node.
>
> While for some other commands like "status", the info is for the whole
> cluster.
>
>
>
> So is "nodetool repair" that operates at a single node level (i.e. repairs
> the partitions contained on the target node?).
>
> If so, what is the recommended approach to doing repairs?
>
>
>
> E.g. we have a large number of tables (20+), large amount of data (40+ TB)
> and a number of nodes (40+).
>
> Do I need to iterate through each server AND each table?
>
>
>
> Thanks,
>
> Jayesh
>
>
>
>
>
>
>


Does "nodetool repair" need to be run on each node for a given table?

2017-03-14 Thread Thakrar, Jayesh
I understand that the nodetool command connects to a specific server and for 
many of the commands, e.g. "info", "compactionstats", etc, the information is 
for that specific node.
While for some other commands like "status", the info is for the whole cluster.

So is "nodetool repair" that operates at a single node level (i.e. repairs the 
partitions contained on the target node?).
If so, what is the recommended approach to doing repairs?

E.g. we have a large number of tables (20+), large amount of data (40+ TB) and 
a number of nodes (40+).
Do I need to iterate through each server AND each table?

Thanks,
Jayesh





Re: Does "nodetool repair" need to be run on each node for a given table?

2017-03-13 Thread daemeon reiydelle
I
​ find it helpful to read the manual first. After review, I would be happy
to answer specific questions.

https://docs.datastax.com/en/cassandra/2.1/cassandra/tools/toolsRepair.html​


*...*



*Daemeon C.M. ReiydelleUSA (+1) 415.501.0198London (+44) (0) 20 8144 9872*

On Mon, Mar 13, 2017 at 1:30 PM, Thakrar, Jayesh <
jthak...@conversantmedia.com> wrote:

> I understand that the nodetool command connects to a specific server and
> for many of the commands, e.g. "info", "compactionstats", etc, the
> information is for that specific node.
>
> While for some other commands like "status", the info is for the whole
> cluster.
>
>
>
> So is "nodetool repair" that operates at a single node level (i.e. repairs
> the partitions contained on the target node?).
>
> If so, what is the recommended approach to doing repairs?
>
>
>
> E.g. we have a large number of tables (20+), large amount of data (40+ TB)
> and a number of nodes (40+).
>
> Do I need to iterate through each server AND each table?
>
>
>
> Thanks,
>
> Jayesh
>
>
>
>
>
>
>


Does "nodetool repair" need to be run on each node for a given table?

2017-03-13 Thread Thakrar, Jayesh
I understand that the nodetool command connects to a specific server and for 
many of the commands, e.g. "info", "compactionstats", etc, the information is 
for that specific node.
While for some other commands like "status", the info is for the whole cluster.

So is "nodetool repair" that operates at a single node level (i.e. repairs the 
partitions contained on the target node?).
If so, what is the recommended approach to doing repairs?

E.g. we have a large number of tables (20+), large amount of data (40+ TB) and 
a number of nodes (40+).
Do I need to iterate through each server AND each table?

Thanks,
Jayesh