[jira] [Updated] (CASSANDRA-13523) StreamReceiveTask: java.lang.OutOfMemoryError: Map failed

2018-02-12 Thread Michael Shuler (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-13523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Shuler updated CASSANDRA-13523:
---------------------------------------
Environment: Cassandra 2.1.13, Ubuntu 14.04.5 LTS, Docker version 1.9.1, 
run as a container, 4 core server with 16GB memory.  (was: Ubuntu 14.04.5 LTS, 
Docker version 1.9.1, run as a container, 4 core server with 16GB memory.)

> StreamReceiveTask: java.lang.OutOfMemoryError: Map failed
> ---------------------------------------------------------
>
>         Key: CASSANDRA-13523
>         URL: https://issues.apache.org/jira/browse/CASSANDRA-13523
>     Project: Cassandra
>  Issue Type: Bug
>  Components: Streaming and Messaging
> Environment: Cassandra 2.1.13, Ubuntu 14.04.5 LTS, Docker version 
> 1.9.1, run as a container, 4 core server with 16GB memory.
>    Reporter: Matthew O'Riordan
>    Priority: Major
>      Labels: bug, crash
>     Fix For: 2.1.13
>
>
> During a nodetool repair -par on one of our keyspaces, Cassandra crashed due 
> to what seems like memory exhaustion within the JVM.  The machine itself had 
> plenty of available memory at the time and did not appear to be under any 
> significant load.
> In the system log, before the crash, the following was logged:
> {code}
> ...
> INFO  [AntiEntropySessions:55] 2017-05-10 18:18:20,627  RepairJob.java:163 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] requesting merkle trees for 
> stats_day_aggregates (to [/54.162.66.114, /54.236.226.76, /52.221.228.170, 
> /54.154.35.144, /54.154.96.213, /52.221.217.27])
> INFO  [ValidationExecutor:54] 2017-05-10 18:18:20,628  
> ColumnFamilyStore.java:905 - Enqueuing flush of stats_day_aggregates: 7018 
> (0%) on-heap, 0 (0%) off-heap
> INFO  [MemtableFlushWriter:13608] 2017-05-10 18:18:20,628  Memtable.java:347 
> - Writing Memtable-stats_day_aggregates@3469792(2.734KiB serialized bytes, 14 
> ops, 0%/0% of on/off-heap limit)
> INFO  [MemtableFlushWriter:13608] 2017-05-10 18:18:20,629  Memtable.java:382 
> - Completed flushing 
> /var/lib/cassandra/data/ably_production_0_stats/stats_day_aggregates-b6e29201e3d111e5bbf3091830ac5256/ably_production_0_stats-stats_day_aggregates-tmp-ka-43026-Data.db
>  (0.000KiB) for commitlog position ReplayPosition(segmentId=1491420635101, 
> position=24224955)
> INFO  [StreamReceiveTask:638] 2017-05-10 18:18:21,220  
> StreamResultFuture.java:180 - [Stream #0008cac1-35ad-11e7-b7e4-091830ac5256] 
> Session with /52.203.21.193 is complete
> INFO  [StreamReceiveTask:638] 2017-05-10 18:18:21,220  
> StreamResultFuture.java:212 - [Stream #0008cac1-35ad-11e7-b7e4-091830ac5256] 
> All sessions completed
> INFO  [StreamReceiveTask:638] 2017-05-10 18:18:21,221  
> StreamingRepairTask.java:96 - [repair #fe7e3320-35ac-11e7-b7e4-091830ac5256] 
> streaming task succeed, returning response to /52.221.217.27
> INFO  [CqlSlowLog-Writer-thread-0] 2017-05-10 18:18:26,230  
> CqlSlowLogWriter.java:151 - Recording statements with duration of 4844 in 
> slow log
> INFO  [Service Thread] 2017-05-10 18:18:26,233  GCInspector.java:258 - G1 Old 
> Generation GC in 4781ms.  G1 Eden Space: 131072 -> 0; G1 Old Gen: 
> 2774539816 -> 1830851216; G1 Survivor Space: 37748736 -> 0; 
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,237  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.162.66.114
> INFO  [StreamConnectionEstablisher:1] 2017-05-10 18:18:26,239  
> StreamCoordinator.java:209 - [Stream #0293bb60-35ad-11e7-b7e4-091830ac5256, 
> ID#0] Beginning stream session with /54.154.226.20
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,294  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.236.226.76
> INFO  [Service Thread] 2017-05-10 18:18:26,298  StatusLogger.java:51 - Pool 
> Name   Active   Pending  Completed   Blocked  All Time Blocked
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,344  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.154.35.144
> INFO  [CqlSlowLog-Writer-thread-0] 2017-05-10 18:18:26,344  
> CqlSlowLogWriter.java:151 - Recording statements with duration of 5035 in 
> slow log
> WARN  [GossipTasks:1] 2017-05-10 18:18:26,344  FailureDetector.java:258 - Not 
> marking nodes down due to local pause of 5109502584 > 50
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,344  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.154.96.213
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,344  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /52.221.228.170
> 

[jira] [Updated] (CASSANDRA-13523) StreamReceiveTask: java.lang.OutOfMemoryError: Map failed

2018-02-12 Thread Michael Shuler (JIRA)

 [ 
https://issues.apache.org/jira/browse/CASSANDRA-13523?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Michael Shuler updated CASSANDRA-13523:
---------------------------------------
Fix Version/s: (was: 2.1.13)
   2.1.x

> StreamReceiveTask: java.lang.OutOfMemoryError: Map failed
> ---------------------------------------------------------
>
>         Key: CASSANDRA-13523
>         URL: https://issues.apache.org/jira/browse/CASSANDRA-13523
>     Project: Cassandra
>  Issue Type: Bug
>  Components: Streaming and Messaging
> Environment: Cassandra 2.1.13, Ubuntu 14.04.5 LTS, Docker version 
> 1.9.1, run as a container, 4 core server with 16GB memory.
>    Reporter: Matthew O'Riordan
>    Priority: Major
>      Labels: bug, crash
>     Fix For: 2.1.x
>
>
> During a nodetool repair -par on one of our keyspaces, Cassandra crashed due 
> to what seems like memory exhaustion within the JVM.  The machine itself had 
> plenty of available memory at the time and did not appear to be under any 
> significant load.
> In the system log, before the crash, the following was logged:
> {code}
> ...
> INFO  [AntiEntropySessions:55] 2017-05-10 18:18:20,627  RepairJob.java:163 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] requesting merkle trees for 
> stats_day_aggregates (to [/54.162.66.114, /54.236.226.76, /52.221.228.170, 
> /54.154.35.144, /54.154.96.213, /52.221.217.27])
> INFO  [ValidationExecutor:54] 2017-05-10 18:18:20,628  
> ColumnFamilyStore.java:905 - Enqueuing flush of stats_day_aggregates: 7018 
> (0%) on-heap, 0 (0%) off-heap
> INFO  [MemtableFlushWriter:13608] 2017-05-10 18:18:20,628  Memtable.java:347 
> - Writing Memtable-stats_day_aggregates@3469792(2.734KiB serialized bytes, 14 
> ops, 0%/0% of on/off-heap limit)
> INFO  [MemtableFlushWriter:13608] 2017-05-10 18:18:20,629  Memtable.java:382 
> - Completed flushing 
> /var/lib/cassandra/data/ably_production_0_stats/stats_day_aggregates-b6e29201e3d111e5bbf3091830ac5256/ably_production_0_stats-stats_day_aggregates-tmp-ka-43026-Data.db
>  (0.000KiB) for commitlog position ReplayPosition(segmentId=1491420635101, 
> position=24224955)
> INFO  [StreamReceiveTask:638] 2017-05-10 18:18:21,220  
> StreamResultFuture.java:180 - [Stream #0008cac1-35ad-11e7-b7e4-091830ac5256] 
> Session with /52.203.21.193 is complete
> INFO  [StreamReceiveTask:638] 2017-05-10 18:18:21,220  
> StreamResultFuture.java:212 - [Stream #0008cac1-35ad-11e7-b7e4-091830ac5256] 
> All sessions completed
> INFO  [StreamReceiveTask:638] 2017-05-10 18:18:21,221  
> StreamingRepairTask.java:96 - [repair #fe7e3320-35ac-11e7-b7e4-091830ac5256] 
> streaming task succeed, returning response to /52.221.217.27
> INFO  [CqlSlowLog-Writer-thread-0] 2017-05-10 18:18:26,230  
> CqlSlowLogWriter.java:151 - Recording statements with duration of 4844 in 
> slow log
> INFO  [Service Thread] 2017-05-10 18:18:26,233  GCInspector.java:258 - G1 Old 
> Generation GC in 4781ms.  G1 Eden Space: 131072 -> 0; G1 Old Gen: 
> 2774539816 -> 1830851216; G1 Survivor Space: 37748736 -> 0; 
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,237  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.162.66.114
> INFO  [StreamConnectionEstablisher:1] 2017-05-10 18:18:26,239  
> StreamCoordinator.java:209 - [Stream #0293bb60-35ad-11e7-b7e4-091830ac5256, 
> ID#0] Beginning stream session with /54.154.226.20
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,294  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.236.226.76
> INFO  [Service Thread] 2017-05-10 18:18:26,298  StatusLogger.java:51 - Pool 
> Name   Active   Pending  Completed   Blocked  All Time Blocked
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,344  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.154.35.144
> INFO  [CqlSlowLog-Writer-thread-0] 2017-05-10 18:18:26,344  
> CqlSlowLogWriter.java:151 - Recording statements with duration of 5035 in 
> slow log
> WARN  [GossipTasks:1] 2017-05-10 18:18:26,344  FailureDetector.java:258 - Not 
> marking nodes down due to local pause of 5109502584 > 50
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,344  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /54.154.96.213
> INFO  [AntiEntropyStage:1] 2017-05-10 18:18:26,344  RepairSession.java:171 - 
> [repair #0330beb0-35ad-11e7-b7e4-091830ac5256] Received merkle tree for 
> stats_day_aggregates from /52.221.228.170
> INFO  [Service Thread] 2017-05-10 18:18:26,345  StatusLogger.java:66 - 
> MutationStage 0 0 1199223432 0
>  0
>