[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795161#comment-13795161 ]
Adam Hattrell commented on CASSANDRA-6124:
Another +1, I have a user that would find this very useful.

Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally

Key: CASSANDRA-6124
URL: https://issues.apache.org/jira/browse/CASSANDRA-6124
Project: Cassandra
Issue Type: Improvement
Components: Hadoop
Reporter: Patricio Echague
Priority: Minor
Labels: hadoop
Fix For: 1.2.12
Attachments: CASSANDRA-6124.diff, CASSANDRA-6124-v2.diff

Our production environment looks like this:
- 6 cassandra nodes (online DC)
- 3 cassandra nodes (offline DC)
- Hadoop cluster.

we are interested in connecting to the offline DC from hadoop (not colocated with cassandra offline dc)
I've tested this patch and seems to work with our 1.2.5 deployment. Kindly review.

--
This message was sent by Atlassian JIRA (v6.1#6144)
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795160#comment-13795160 ]
Jeremy Hanna commented on CASSANDRA-6124:
[~patricioe] what do you think of LOCAL_ONE?
Re: [jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
Jeremy, LOCAL_ONE is definitely a workaround. We are currently running with my patch in production to avoid establishing connections to other DCs, which is the intention of the patch.
On Oct 15, 2013 6:09 AM, Adam Hattrell (JIRA) j...@apache.org wrote:
> Another +1, I have a user that would find this very useful.
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795213#comment-13795213 ]
Jason Brown commented on CASSANDRA-6124:
It is starting to look like LOCAL_ONE is useful. We have some internal users who were startled to realize that once we opened up a second datacenter, their 99%ile was not as consistent as before. I do have an alternate solution to that in my back pocket ... but +1 to building in a CL.LOCAL_ONE
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795236#comment-13795236 ]
Brandon Williams commented on CASSANDRA-6124:
Created CASSANDRA-6202 for LOCAL_ONE.
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13795369#comment-13795369 ]
Jeremy Hanna commented on CASSANDRA-6124:
To add to the discussion on the ticket, [~patricioe] said on the dev list: "LOCAL_ONE is definitely a workaround. We are currently running with my patch in production to avoid establishing connections to other DCs, which is the intention of the patch."
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788254#comment-13788254 ]
Brandon Williams commented on CASSANDRA-6124:
Worth noting this is another case where a CL.LOCALONE would be useful (cc [~jjordan])

Key: CASSANDRA-6124
Fix For: 1.2.11
Attachments: CASSANDRA-6124.diff, CASSANDRA-6124-v2.diff
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788259#comment-13788259 ]
Jonathan Ellis commented on CASSANDRA-6124:
I'd favor LOCAL_ONE as a slightly more generally useful solution.
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13788265#comment-13788265 ]
Jeremiah Jordan commented on CASSANDRA-6124:
+1 to LOCAL_ONE, I still think it is kind of silly, but I have been thinking of more and more fairly valid use cases for it...
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786030#comment-13786030 ]
Piotr Kołaczkowski commented on CASSANDRA-6124:
LGTM, but I'd name it cassandra.input.dc, not cassandra.input.dc.info.

Key: CASSANDRA-6124
Fix For: 1.2.11
Attachments: CASSANDRA-6124.diff
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786039#comment-13786039 ]
Jeremy Hanna commented on CASSANDRA-6124:
Interesting - most people usually use an IP within the datacenter they want and use CL.LOCAL_QUORUM afaik. That or CL.ONE, relying on the nodes not getting overloaded so it wouldn't cross DC boundaries because of the dynamic snitch modifier. This keeps it in the same DC regardless of the CL from what it looks like, which is convenient.
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786342#comment-13786342 ]
Patricio Echague commented on CASSANDRA-6124:
Thanks guys. I'll send the suggested modification soon. Should I send another patch against trunk in case this one doesn't apply?
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786555#comment-13786555 ]
Jonathan Ellis commented on CASSANDRA-6124:
Patricio, have you tried Jeremy's approach? I'd rather not add extra special cases if there is an alternative.
[jira] [Commented] (CASSANDRA-6124) Ability to specify a DC to consume from when using ColumnFamilyInputFormat externally
[ https://issues.apache.org/jira/browse/CASSANDRA-6124?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13786586#comment-13786586 ]
Patricio Echague commented on CASSANDRA-6124:
We just don't want to connect to the live nodes for security and quality reasons. As I mentioned previously, we have two clusters: one dedicated to taking requests from live users (ONLINE DC) and one for all machine learning computation (OFFLINE DC). I prefer the option of specifying the DC since the code avoids any connection to the ONLINE DC. With CL.ONE, if a replica in our OFFLINE DC is down it will use the ONLINE DC, and we are trying to avoid that. Thoughts on this front? I'm ok with it if you think that keeping it simple is the best choice, in which case I'll have to build from sources and apply my patch.
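For readers following the thread, here is a minimal sketch of the idea being discussed: restricting the candidate endpoints to a single data center before any connection is attempted. It is not the attached patch and does not use Cassandra or Hadoop classes. The class name, the method name, and the endpoint-to-DC map are hypothetical, and the property name cassandra.input.dc is only the name suggested in the review comments.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

public class DcFilterSketch {
    // Assumption: property name as suggested in the thread, not a confirmed API.
    static final String INPUT_DC_KEY = "cassandra.input.dc";

    /**
     * Returns the endpoints whose data center matches the configured one.
     * If no data center is configured, every endpoint is returned.
     */
    public static List<String> filterByDc(Map<String, String> conf,
                                          Map<String, String> endpointToDc) {
        String wantedDc = conf.get(INPUT_DC_KEY);
        List<String> result = new ArrayList<>();
        for (Map.Entry<String, String> e : endpointToDc.entrySet()) {
            if (wantedDc == null || wantedDc.equals(e.getValue())) {
                result.add(e.getKey());
            }
        }
        return result;
    }

    public static void main(String[] args) {
        // Hypothetical topology: two online nodes and one offline node.
        Map<String, String> endpointToDc = new LinkedHashMap<>();
        endpointToDc.put("10.0.0.1", "ONLINE");
        endpointToDc.put("10.0.0.2", "ONLINE");
        endpointToDc.put("10.1.0.1", "OFFLINE");

        Map<String, String> conf = new LinkedHashMap<>();
        conf.put(INPUT_DC_KEY, "OFFLINE");

        // Only endpoints in the OFFLINE data center remain as candidates.
        System.out.println(filterByDc(conf, endpointToDc));
    }
}
```

With a filter like this, a job whose OFFLINE replicas are all down ends up with an empty candidate list instead of falling back to the ONLINE DC, which is the behavior the reporter asks for above.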