[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-15 Thread Adam Hattrell (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795161#comment-13795161
 ] 

Adam Hattrell commented on CASSANDRA-6124:
--

Another +1, I have a user that would find this very useful.

 Ability to specify a DC to consume from when using ColumnFamilyInputFormat 
 externally
 -

 Key: CASSANDRA-6124
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6124
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Patricio Echague
Priority: Minor
  Labels: hadoop
 Fix For: 1.2.12

 Attachments: CASSANDRA-6124.diff, CASSANDRA-6124-v2.diff


 Our production environment looks like this:
 - 6 cassandra nodes (online DC)
 - 3 cassandra nodes (offline DC)
 - Hadoop cluster.
 We are interested in connecting to the offline DC from Hadoop (which is not 
 colocated with the Cassandra offline DC).
 I've tested this patch and it seems to work with our 1.2.5 deployment.
 Kindly review.



--
This message was sent by Atlassian JIRA
(v6.1#6144)


[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-15 Thread Jeremy Hanna (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795160#comment-13795160
 ] 

Jeremy Hanna commented on CASSANDRA-6124:
-

[~patricioe] what do you think of LOCAL_ONE?



Re: [jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-15 Thread Patricio Echagüe
Jeremy, LOCAL_ONE is definitely a workaround. We are currently running
with my patch in production to avoid establishing connections to other DCs,
which is the intention of the patch.



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-15 Thread Jason Brown (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795213#comment-13795213
 ] 

Jason Brown commented on CASSANDRA-6124:


It is starting to look like LOCAL_ONE is useful. We have some internal users 
who were startled to realize that, once we opened up a second datacenter, 
their 99%ile was not as consistent as before. I do have an alternate 
solution to that in my back pocket ... but +1 to building in a CL.LOCAL_ONE.



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-15 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795236#comment-13795236
 ] 

Brandon Williams commented on CASSANDRA-6124:
-

Created CASSANDRA-6202 for LOCAL_ONE.



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-15 Thread Jeremy Hanna (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795369#comment-13795369
 ] 

Jeremy Hanna commented on CASSANDRA-6124:
-

To add to the discussion on the ticket, [~patricioe] said on the dev list: 
"LOCAL_ONE is definitely a workaround. We are currently running with my patch 
in production to avoid establishing connections to other DCs, which is the 
intention of the patch."



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-07 Thread Brandon Williams (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788254#comment-13788254
 ] 

Brandon Williams commented on CASSANDRA-6124:
-

Worth noting this is another case where a CL.LOCAL_ONE would be useful (cc 
[~jjordan])

 Ability to specify a DC to consume from when using ColumnFamilyInputFormat 
 externally
 -

 Key: CASSANDRA-6124
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6124
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Patricio Echague
Priority: Minor
  Labels: hadoop
 Fix For: 1.2.11

 Attachments: CASSANDRA-6124.diff, CASSANDRA-6124-v2.diff


 Our production environment looks like this:
 - 6 cassandra nodes (online DC)
 - 3 cassandra nodes (offline DC)
 - Hadoop cluster.
 we are interested in connecting to the offline DC from hadoop (not colocated 
 with cassandra offline dc)
 I've tested this patch and seems to work with our 1.2.5 deployment.
 Kindly review.





[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-07 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788259#comment-13788259
 ] 

Jonathan Ellis commented on CASSANDRA-6124:
---

I'd favor LOCAL_ONE as a slightly more generally useful solution.



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-07 Thread Jeremiah Jordan (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788265#comment-13788265
 ] 

Jeremiah Jordan commented on CASSANDRA-6124:


+1 to LOCAL_ONE. I still think it is kind of silly, but I have been thinking of 
more and more fairly valid use cases for it...



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-04 Thread Piotr Kołaczkowski (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786030#comment-13786030
 ] 

Piotr Kołaczkowski commented on CASSANDRA-6124:
---

LGTM, but I'd name it cassandra.input.dc, not cassandra.input.dc.info.

 Ability to specify a DC to consume from when using ColumnFamilyInputFormat 
 externally
 -

 Key: CASSANDRA-6124
 URL: https://issues.apache.org/jira/browse/CASSANDRA-6124
 Project: Cassandra
  Issue Type: Improvement
  Components: Hadoop
Reporter: Patricio Echague
Priority: Minor
  Labels: hadoop
 Fix For: 1.2.11

 Attachments: CASSANDRA-6124.diff


 Our production environment looks like this:
 - 6 cassandra nodes (online DC)
 - 3 cassandra nodes (offline DC)
 - Hadoop cluster.
 we are interested in connecting to the offline DC from hadoop (not colocated 
 with cassandra offline dc)
 I've tested this patch and seems to work with our 1.2.5 deployment.
 Kindly review.





[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-04 Thread Jeremy Hanna (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786039#comment-13786039
 ] 

Jeremy Hanna commented on CASSANDRA-6124:
-

Interesting - most people usually use an IP within the datacenter they want and 
use CL.LOCAL_QUORUM, afaik.  That, or they use CL.ONE and rely on the nodes not 
getting overloaded, so that the dynamic snitch modifier doesn't route reads 
across DC boundaries.

This keeps it in the same DC regardless of the CL from what it looks like, 
which is convenient.
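
For reference, the two consistency levels mentioned above differ in how many replicas must answer a read. A minimal sketch in Python of that arithmetic, assuming the usual quorum rule of floor(RF / 2) + 1 applied to the local DC's replication factor. The function names and the replication factors are hypothetical and not taken from the ticket:

```python
def local_quorum(replication_factor):
    """Replicas that must respond for a quorum within one datacenter."""
    return replication_factor // 2 + 1


def required_responses(consistency, rf_by_dc, local_dc):
    """Number of replica responses a read waits for at the given level."""
    if consistency == "ONE":
        # A single replica is enough; it is not constrained to the local DC.
        return 1
    if consistency == "LOCAL_QUORUM":
        # Only replicas in the coordinator's own DC count toward the quorum.
        return local_quorum(rf_by_dc[local_dc])
    raise ValueError(f"unsupported consistency level: {consistency}")


# Hypothetical replication factors; the ticket does not state them.
rf_by_dc = {"ONLINE": 3, "OFFLINE": 3}

offline_quorum = required_responses("LOCAL_QUORUM", rf_by_dc, "OFFLINE")
offline_one = required_responses("ONE", rf_by_dc, "OFFLINE")
```

Under these assumed replication factors, LOCAL_QUORUM waits for two replicas in the offline DC, while ONE waits for a single replica that may live in either DC.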



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-04 Thread Patricio Echague (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786342#comment-13786342
 ] 

Patricio Echague commented on CASSANDRA-6124:
-

Thanks guys. I'll send the suggested modification soon. Should I send another 
patch against trunk in case this one doesn't apply?



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-04 Thread Jonathan Ellis (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786555#comment-13786555
 ] 

Jonathan Ellis commented on CASSANDRA-6124:
---

Patricio, have you tried Jeremy's approach?  I'd rather not add extra special 
cases if there is an alternative.



[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

2013-10-04 Thread Patricio Echague (JIRA)

[ 
https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786586#comment-13786586
 ] 

Patricio Echague commented on CASSANDRA-6124:
-

We just don't want to connect to the live nodes, for security and quality 
reasons. As I mentioned previously, we have two clusters: one dedicated to 
taking requests from live users (ONLINE DC) and one for all machine learning 
computation (OFFLINE DC).

I prefer the option of specifying the DC, since the code then avoids any 
connection to the ONLINE DC.

With CL.ONE, if a replica in our OFFLINE DC is down, it will use the ONLINE 
DC, and we are trying to avoid that. Thoughts on this front?

I'm OK with it if you think that keeping it simple is the best choice, in which 
case I'll have to build from source and apply my patch.
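
The difference between the two behaviours described above can be sketched as follows. This is an illustration in Python, not the code from the attached patch: the `Node` type, the `endpoints_for_split` helper and the addresses are hypothetical, and `cassandra.input.dc` is the property name suggested during review, used here only as a dictionary key.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    address: str
    datacenter: str
    alive: bool = True


def endpoints_for_split(replicas, allowed_dc=None):
    """Return the live replicas a client may connect to for one input split.

    With allowed_dc=None any live replica is acceptable, so a read can fall
    through to another DC. With allowed_dc set, replicas outside that DC are
    never returned, even when no replica inside it is alive.
    """
    live = [node for node in replicas if node.alive]
    if allowed_dc is None:
        return live
    return [node for node in live if node.datacenter == allowed_dc]


# Hypothetical job configuration and replica set for a single split.
job_conf = {"cassandra.input.dc": "OFFLINE"}
replicas = [
    Node("10.0.0.1", "ONLINE"),
    Node("10.0.1.1", "OFFLINE", alive=False),
]

unrestricted = endpoints_for_split(replicas)
restricted = endpoints_for_split(replicas, job_conf.get("cassandra.input.dc"))
```

In this sketch the unrestricted call falls back to the ONLINE node when the OFFLINE replica is down, whereas the restricted call returns an empty list, so the job would have to fail or wait instead of touching the ONLINE DC.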
