Re: node keeps dying

2014-09-25 Thread Vivek Mishra
Increase heap size with Cassandra and try
On 25/09/2014 3:02 am, Prem Yadav ipremya...@gmail.com wrote:

 BTW, thanks Michael.
 I am surprised why I didn't search for Cassandra oom before.
 I got some good links that discuss that. Will try to optimize and see how
it goes.


 On Wed, Sep 24, 2014 at 10:27 PM, Prem Yadav ipremya...@gmail.com wrote:

 Well its not the Linux OOM killer. The system is running with all
default settings.

 Total memory 7GB- Cassandra gets assigned 2GB
 2 core processors.
 Two rings with 3 nodes in each ring.

 On Wed, Sep 24, 2014 at 9:53 PM, Michael Shuler mich...@pbandjelly.org
wrote:

 On 09/24/2014 11:32 AM, Prem Yadav wrote:

 this is an issue that has happened a few times. We are using DSE 4.0


 I believe this is Apache Cassandra 2.0.5, which is better info for this
list.

 One of the Cassandra nodes is detected as dead by the opscenter even
 though I can see the process is up.

 the logs show heap space error:

   INFO [RMI TCP Connection(18270)-172.31.49.189] 2014-09-24
08:31:05,340
 StorageService.java (line 2538) Starting repair command #30766,
 repairing 1 ranges for keyspace keyspace
 ERROR [BatchlogTasks:1] 2014-09-24 08:48:54,780 CassandraDaemon.java
 (line 196) Exception in thread Thread[BatchlogTasks:1,5,main]
 java.lang.OutOfMemoryError: Java heap space
  at java.util.ArrayList.init(Unknown Source)


 OOM.

 System environment and configuration modification details might be
helpful for others to give you advice. Searching for cassandra oom gave
me a few good links to read, and knowing some details about your nodes
might be really helpful. Additionally, CASSANDRA-7507 [0] suggests that an
OOM leaving the process running in an unclean state is not desired, and the
process should be killed.

 Several of the search links provide details on how to capture and dig
around a heap dump to aid in troubleshooting.

 [0] https://issues.apache.org/jira/browse/CASSANDRA-7507
 --
 Kind regards,
 Michael





node keeps dying

2014-09-24 Thread Prem Yadav
Hi,
this is an issue that has happened a few times. We are using DSE 4.0
One of the Cassandra nodes is detected as dead by the opscenter even though
I can see the process is up.

the logs show heap space error:

 INFO [RMI TCP Connection(18270)-172.31.49.189] 2014-09-24 08:31:05,340
StorageService.java (line 2538) Starting repair command #30766, repairing 1
ranges for keyspace keyspace
ERROR [BatchlogTasks:1] 2014-09-24 08:48:54,780 CassandraDaemon.java (line
196) Exception in thread Thread[BatchlogTasks:1,5,main]
java.lang.OutOfMemoryError: Java heap space
at java.util.ArrayList.init(Unknown Source)
at
org.antlr.runtime.CommonTokenStream.init(CommonTokenStream.java:68)
at
org.antlr.runtime.CommonTokenStream.init(CommonTokenStream.java:72)
at
org.apache.cassandra.cql3.QueryProcessor.parseStatement(QueryProcessor.java:413)
at
org.apache.cassandra.cql3.QueryProcessor.getStatement(QueryProcessor.java:396)
at
org.apache.cassandra.cql3.QueryProcessor.processInternal(QueryProcessor.java:253)
at
org.apache.cassandra.db.BatchlogManager.process(BatchlogManager.java:355)
at
org.apache.cassandra.db.BatchlogManager.replayAllFailedBatches(BatchlogManager.java:179)
at
org.apache.cassandra.db.BatchlogManager$1.runMayThrow(BatchlogManager.java:97)
at
org.apache.cassandra.utils.WrappedRunnable.run(WrappedRunnable.java:28)
at
org.apache.cassandra.concurrent.DebuggableScheduledThreadPoolExecutor$UncomplainingRunnable.run(DebuggableScheduledThreadPoolExecutor.java:75)
at java.util.concurrent.Executors$RunnableAdapter.call(Unknown
Source)
at java.util.concurrent.FutureTask$Sync.innerRunAndReset(Unknown
Source)
at java.util.concurrent.FutureTask.runAndReset(Unknown Source)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(Unknown
Source)
at
java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(Unknown
Source)
at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source)
at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown
Source)
at java.lang.Thread.run(Unknown Source)


Any advice will be helpful

thanks


Re: node keeps dying

2014-09-24 Thread Michael Shuler

On 09/24/2014 11:32 AM, Prem Yadav wrote:

this is an issue that has happened a few times. We are using DSE 4.0


I believe this is Apache Cassandra 2.0.5, which is better info for this 
list.



One of the Cassandra nodes is detected as dead by the opscenter even
though I can see the process is up.

the logs show heap space error:

  INFO [RMI TCP Connection(18270)-172.31.49.189] 2014-09-24 08:31:05,340
StorageService.java (line 2538) Starting repair command #30766,
repairing 1 ranges for keyspace keyspace
ERROR [BatchlogTasks:1] 2014-09-24 08:48:54,780 CassandraDaemon.java
(line 196) Exception in thread Thread[BatchlogTasks:1,5,main]
java.lang.OutOfMemoryError: Java heap space
 at java.util.ArrayList.init(Unknown Source)


OOM.

System environment and configuration modification details might be 
helpful for others to give you advice. Searching for cassandra oom 
gave me a few good links to read, and knowing some details about your 
nodes might be really helpful. Additionally, CASSANDRA-7507 [0] suggests 
that an OOM leaving the process running in an unclean state is not 
desired, and the process should be killed.


Several of the search links provide details on how to capture and dig 
around a heap dump to aid in troubleshooting.


[0] https://issues.apache.org/jira/browse/CASSANDRA-7507
--
Kind regards,
Michael


Re: node keeps dying

2014-09-24 Thread Prem Yadav
Well its not the Linux OOM killer. The system is running with all default
settings.

Total memory 7GB- Cassandra gets assigned 2GB
2 core processors.
Two rings with 3 nodes in each ring.

On Wed, Sep 24, 2014 at 9:53 PM, Michael Shuler mich...@pbandjelly.org
wrote:

 On 09/24/2014 11:32 AM, Prem Yadav wrote:

 this is an issue that has happened a few times. We are using DSE 4.0


 I believe this is Apache Cassandra 2.0.5, which is better info for this
 list.

  One of the Cassandra nodes is detected as dead by the opscenter even
 though I can see the process is up.

 the logs show heap space error:

   INFO [RMI TCP Connection(18270)-172.31.49.189] 2014-09-24 08:31:05,340
 StorageService.java (line 2538) Starting repair command #30766,
 repairing 1 ranges for keyspace keyspace
 ERROR [BatchlogTasks:1] 2014-09-24 08:48:54,780 CassandraDaemon.java
 (line 196) Exception in thread Thread[BatchlogTasks:1,5,main]
 java.lang.OutOfMemoryError: Java heap space
  at java.util.ArrayList.init(Unknown Source)


 OOM.

 System environment and configuration modification details might be helpful
 for others to give you advice. Searching for cassandra oom gave me a few
 good links to read, and knowing some details about your nodes might be
 really helpful. Additionally, CASSANDRA-7507 [0] suggests that an OOM
 leaving the process running in an unclean state is not desired, and the
 process should be killed.

 Several of the search links provide details on how to capture and dig
 around a heap dump to aid in troubleshooting.

 [0] https://issues.apache.org/jira/browse/CASSANDRA-7507
 --
 Kind regards,
 Michael



Re: node keeps dying

2014-09-24 Thread Prem Yadav
BTW, thanks Michael.
I am surprised why I didn't search for Cassandra oom before.
I got some good links that discuss that. Will try to optimize and see how
it goes.


On Wed, Sep 24, 2014 at 10:27 PM, Prem Yadav ipremya...@gmail.com wrote:

 Well its not the Linux OOM killer. The system is running with all default
 settings.

 Total memory 7GB- Cassandra gets assigned 2GB
 2 core processors.
 Two rings with 3 nodes in each ring.

 On Wed, Sep 24, 2014 at 9:53 PM, Michael Shuler mich...@pbandjelly.org
 wrote:

 On 09/24/2014 11:32 AM, Prem Yadav wrote:

 this is an issue that has happened a few times. We are using DSE 4.0


 I believe this is Apache Cassandra 2.0.5, which is better info for this
 list.

  One of the Cassandra nodes is detected as dead by the opscenter even
 though I can see the process is up.

 the logs show heap space error:

   INFO [RMI TCP Connection(18270)-172.31.49.189] 2014-09-24 08:31:05,340
 StorageService.java (line 2538) Starting repair command #30766,
 repairing 1 ranges for keyspace keyspace
 ERROR [BatchlogTasks:1] 2014-09-24 08:48:54,780 CassandraDaemon.java
 (line 196) Exception in thread Thread[BatchlogTasks:1,5,main]
 java.lang.OutOfMemoryError: Java heap space
  at java.util.ArrayList.init(Unknown Source)


 OOM.

 System environment and configuration modification details might be
 helpful for others to give you advice. Searching for cassandra oom gave
 me a few good links to read, and knowing some details about your nodes
 might be really helpful. Additionally, CASSANDRA-7507 [0] suggests that an
 OOM leaving the process running in an unclean state is not desired, and the
 process should be killed.

 Several of the search links provide details on how to capture and dig
 around a heap dump to aid in troubleshooting.

 [0] https://issues.apache.org/jira/browse/CASSANDRA-7507
 --
 Kind regards,
 Michael