[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13779215#comment-13779215 ] Cyril Scetbon commented on CASSANDRA-6084: -- {quote}It's not the amount of connections so much as how fast they're trying to spawn and connect{quote} right :( java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13777190#comment-13777190 ] Cyril Scetbon commented on CASSANDRA-6084: -- Here are the logs you asked for : pig log - http://pastebin.com/JTfDfKd0 cassandra log - http://pastebin.com/vheZycpg java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=1377#comment-1377 ] Alex Liu commented on CASSANDRA-6084: - The following is the error from pig log {code} Caused by: java.util.concurrent.ExecutionException: java.io.IOException: failed connecting to all endpoints 127.0.0.1, connections: rpc_endpoint: 127.0.0.1, listen_endpoint: 127.0.0.1, host: 127.0.0.1, port: 9160 , errors: java.io.IOException: Unable to connect to server 127.0.0.1:9160 at org.apache.cassandra.hadoop.ConfigHelper.createConnection(ConfigHelper.java:575) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat.getSubSplits(AbstractColumnFamilyInputFormat.java:278) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat.access$200(AbstractColumnFamilyInputFormat.java:64) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat$SplitCallable.call(AbstractColumnFamilyInputFormat.java:224) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat$SplitCallable.call(AbstractColumnFamilyInputFormat.java:209) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) at java.lang.Thread.run(Thread.java:722) Caused by: org.apache.thrift.transport.TTransportException: java.net.SocketException: Connection reset by peer at org.apache.thrift.transport.TSocket.open(TSocket.java:183) at org.apache.thrift.transport.TFramedTransport.open(TFramedTransport.java:81) at org.apache.cassandra.thrift.TFramedTransportFactory.openTransport(TFramedTransportFactory.java:42) at org.apache.cassandra.hadoop.ConfigHelper.createConnection(ConfigHelper.java:570) ... 9 more Caused by: java.net.SocketException: Connection reset by peer at java.net.PlainSocketImpl.socketConnect(Native Method) at java.net.AbstractPlainSocketImpl.doConnect(AbstractPlainSocketImpl.java:339) at java.net.AbstractPlainSocketImpl.connectToAddress(AbstractPlainSocketImpl.java:200) at java.net.AbstractPlainSocketImpl.connect(AbstractPlainSocketImpl.java:182) at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:391) at java.net.Socket.connect(Socket.java:579) at org.apache.thrift.transport.TSocket.open(TSocket.java:178) ... 12 more at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:252) at java.util.concurrent.FutureTask.get(FutureTask.java:111) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat.getSplits(AbstractColumnFamilyInputFormat.java:187) ... 21 more {code} There is no error in Cassandra system log. It can't connection to 127.0.0.1:9160 sometimes when vnode is enable where there are more connection requests. Can it be something related to rpc_* settings in cassandra.yaml? {code rpc_server_type: sync # rpc_min_threads: 16 # rpc_max_threads: 2048 # rpc_send_buff_size_in_bytes: # rpc_recv_buff_size_in_bytes: rpc_keepalive: true {code} Note: the issue is gone when vnode is disable where there is much less rpc connections java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13777881#comment-13777881 ] Alex Liu commented on CASSANDRA-6084: - Can we add 3 re-try if it can't get the connection? java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13777948#comment-13777948 ] Brandon Williams commented on CASSANDRA-6084: - bq. Note: the issue is gone when vnode is disable where there is much less rpc connections I think perhaps it's outstripping the thrift server's ability to spawn new threads. Can you repro with vnodes and hsha? java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13778033#comment-13778033 ] Cyril Scetbon commented on CASSANDRA-6084: -- It seems to be a good idea [~brandon.williams] ! I've tested with both sync and hsha and for 20 tests I get no error with hsha versus 4 errors with sync. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13778041#comment-13778041 ] Brandon Williams commented on CASSANDRA-6084: - It sounds like we should just handle this as part of CASSANDRA-6091 then, since there's no need to make 256 connections to one machine. HSHA is a decent workaround for the interim. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13778049#comment-13778049 ] Alex Liu commented on CASSANDRA-6084: - Great! java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13778059#comment-13778059 ] Cyril Scetbon commented on CASSANDRA-6084: -- Just a question. If it's really caused by the number of connections, (not so big, right ?) don't you think it lacks a error message or exception which identify that there's no more memory or something else ? java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13778067#comment-13778067 ] Brandon Williams commented on CASSANDRA-6084: - It's not the amount of connections so much as how fast they're trying to spawn and connect. The couldn't get the split because something happened at the network level is about the best message you can get there on the client side. If you pushed logging up in o.a.c.thrift on the server you might get something useful, but that's kind of impractical. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776037#comment-13776037 ] Alex Liu commented on CASSANDRA-6084: - What's your ring info? Use node tool to describe the ring. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776049#comment-13776049 ] Cyril Scetbon commented on CASSANDRA-6084: -- you can find the ring information at http://pastebin.com/WBMWSBpi, and below is the status of the cluster : {code}[cyril:~] $ nodetool status Datacenter: datacenter1 === Status=Up/Down |/ State=Normal/Leaving/Joining/Moving -- AddressLoad Tokens Owns (effective) Host ID Rack UN 127.0.0.1 91.79 KB 256 100.0% 5ce4096f-f9e1-40d1-b492-09ae20cd218b rack1{code} Don't forget that it's not working *except for a few times* java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776062#comment-13776062 ] Alex Liu commented on CASSANDRA-6084: - Is vnode enable? It's not recommended to run hadoop/pig job when vnode is enabled. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776343#comment-13776343 ] Cyril Scetbon commented on CASSANDRA-6084: -- Yes vnodes are enabled. You didn't tell me it's not recommended at [CASSANDRA-5544|https://issues.apache.org/jira/browse/CASSANDRA-5544]. I know it will create too many split as compared to situation when vnodes are disabled, but I suppose it's not the source of errors I get, right ? java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776410#comment-13776410 ] Alex Liu commented on CASSANDRA-6084: - Let's disable the vnode to see whether we can reproduce the issue? java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776465#comment-13776465 ] Alex Liu commented on CASSANDRA-6084: - I will work on vnode support for hadoop/pig later once I have my other tasks done. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776714#comment-13776714 ] Cyril Scetbon commented on CASSANDRA-6084: -- I confirm that I can't reproduce the issue without vnodes. So, with vnodes it works sporadically whereas it always work without. Can you tell me if when you'll work on the vnode support ? in a few days ? a few weeks ? a few hours :) Thanks Alex java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776741#comment-13776741 ] Alex Liu commented on CASSANDRA-6084: - It may takes a few of weeks. I had been investigated it a few months back, and we decide not to support it in DSE. It looks like we should address it now. Basic idea is to combine a few vnode splits into a sudo big split, so it works like without vnodes. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776748#comment-13776748 ] Alex Liu commented on CASSANDRA-6084: - CASSANDRA-6091 is opened to better support vnode for hadoop/pig jobs. I am close this ticket. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776753#comment-13776753 ] Jonathan Ellis commented on CASSANDRA-6084: --- There's nothing in the split code that *shouldn't* work with vnodes, so I suspect that there's a bug that affects both and is just 256 or so times more likely to manifest with vnodes. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776784#comment-13776784 ] Alex Liu commented on CASSANDRA-6084: - Agreed. The connection error is from https://github.com/apache/cassandra/blob/cassandra-1.2/src/java/org/apache/cassandra/hadoop/AbstractColumnFamilyInputFormat.java#L272 , so it could be issue that the number of concurrent jobs fail to get the connections or some splits can't get the correct host. I will get some debug logs to see what's the exception. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776871#comment-13776871 ] Alex Liu commented on CASSANDRA-6084: - [~cscetbon] I can't reproduce the issue by enabling vnodes at my local. Do you mind applying the debug patch I attached to cassandra-1.2.10 tag and reproduce the issue by enabling vnodes? Please post the error logs, so we can investigate the issue more. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776901#comment-13776901 ] Alex Liu commented on CASSANDRA-6084: - Please also post cassandra system.log. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu Attachments: 6084_debug.txt see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13775575#comment-13775575 ] Alex Liu commented on CASSANDRA-6084: - [~cscetbon] Can you post your cassandra.yaml setting for listen_address and rpc_address? The following are from default settings {code} # Setting this to 0.0.0.0 is always wrong. listen_address: localhost # Note that unlike ListenAddress above, it is allowed to specify 0.0.0.0 # here if you want to listen on all interfaces but is not best practice # as it is known to confuse the node auto-discovery features of some # client drivers. rpc_address: localhost {code} It looks like your setting for listen_address is 0.0.0.0 which could lead to wrong behaviour. java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13775537#comment-13775537 ] Alex Liu commented on CASSANDRA-6084: - the error log is {code} 2013-09-20 10:36:50,877 [main] ERROR org.apache.pig.tools.pigstats.SimplePigStats - ERROR 2997: Unable to recreate exception from backend error: org.apache.pig.backend.executionengine.ExecException: ERROR 2118: Could not get input splits at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:288) at org.apache.hadoop.mapred.JobClient.writeNewSplits(JobClient.java:1054) at org.apache.hadoop.mapred.JobClient.writeSplits(JobClient.java:1071) at org.apache.hadoop.mapred.JobClient.access$700(JobClient.java:179) at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:983) at org.apache.hadoop.mapred.JobClient$2.run(JobClient.java:936) at java.security.AccessController.doPrivileged(Native Method) at javax.security.auth.Subject.doAs(Subject.java:415) at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1190) at org.apache.hadoop.mapred.JobClient.submitJobInternal(JobClient.java:936) at org.apache.hadoop.mapred.JobClient.submitJob(JobClient.java:910) at org.apache.hadoop.mapred.jobcontrol.Job.submit(Job.java:378) at org.apache.hadoop.mapred.jobcontrol.JobControl.startReadyJobs(JobControl.java:247) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43) at java.lang.reflect.Method.invoke(Method.java:601) at org.apache.pig.backend.hadoop20.PigJobControl.mainLoopAction(PigJobControl.java:157) at org.apache.pig.backend.hadoop20.PigJobControl.run(PigJobControl.java:134) at java.lang.Thread.run(Thread.java:722) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher$1.run(MapReduceLauncher.java:257) Caused by: java.io.IOException: Could not get input splits at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat.getSplits(AbstractColumnFamilyInputFormat.java:189) at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigInputFormat.getSplits(PigInputFormat.java:274) ... 20 more Caused by: java.util.concurrent.ExecutionException: java.io.IOException: failed connecting to all endpoints 0.0.0.0 at java.util.concurrent.FutureTask$Sync.innerGet(FutureTask.java:252) at java.util.concurrent.FutureTask.get(FutureTask.java:111) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat.getSplits(AbstractColumnFamilyInputFormat.java:185) ... 21 more Caused by: java.io.IOException: failed connecting to all endpoints 0.0.0.0 at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat.getSubSplits(AbstractColumnFamilyInputFormat.java:303) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat.access$200(AbstractColumnFamilyInputFormat.java:62) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat$SplitCallable.call(AbstractColumnFamilyInputFormat.java:222) at org.apache.cassandra.hadoop.AbstractColumnFamilyInputFormat$SplitCallable.call(AbstractColumnFamilyInputFormat.java:207) at java.util.concurrent.FutureTask$Sync.innerRun(FutureTask.java:334) at java.util.concurrent.FutureTask.run(FutureTask.java:166) at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1110) at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:603) at java.lang.Thread.run(Thread.java:722) {code} java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see:
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13775621#comment-13775621 ] Cyril Scetbon commented on CASSANDRA-6084: -- Here is my configuration : {code}[cyril:~] $ egrep ^(listen|rpc) /usr/local/etc/cassandra/cassandra.yaml |grep -v '#' listen_address: localhost rpc_address: localhost rpc_port: 9160 rpc_keepalive: true rpc_server_type: sync {code} You can see that I'm not using 0.0.0.0 java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13775720#comment-13775720 ] Mikhail Stepura commented on CASSANDRA-6084: BTW, what {{PIG_INITIAL_ADDRESS}} do you guys use? java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira
[jira] [Commented] (CASSANDRA-6084) java.io.IOException: Could not get input splits
[ https://issues.apache.org/jira/browse/CASSANDRA-6084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13776002#comment-13776002 ] Cyril Scetbon commented on CASSANDRA-6084: -- My environment : {code}export PIG_INITIAL_ADDRESS=localhost export PIG_RPC_PORT=9160 export PIG_PARTITIONER=org.apache.cassandra.dht.Murmur3Partitioner{code} java.io.IOException: Could not get input splits --- Key: CASSANDRA-6084 URL: https://issues.apache.org/jira/browse/CASSANDRA-6084 Project: Cassandra Issue Type: Bug Components: Hadoop Environment: Osx 10.8.5 Java 1.7 Cassandra 1.2.10 Pig 0.9.2/0.11.1 Reporter: Cyril Scetbon Assignee: Alex Liu see http://www.mail-archive.com/user@cassandra.apache.org/msg32414.html I've noticed that if I restart Cassandra I get more errors for the first minutes, although it's accessible through cqlsh without issue. I have tested on a 1-node (Osx Laptop) and 4-nodes (Ubuntu servers) and got the same error. I tried with version 1.2.6, 1.2.8, 1.2.9, 1.2.10 without success -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators For more information on JIRA, see: http://www.atlassian.com/software/jira