Hello Usergrid Team, We are suddenly facing "out of memory" exceptions in our Tomcat Severs, under low load conditions. Please note, our usergrid installations have been very stable over the last 6 months, and we have "not" seen such issues before. Our setup configuration is as below: Environment: Ubuntu 14.04, Tomcat 7, JDK 1.8.0_65 (Oracle); Cassandra version: 2.2.6 (DataStax); Usergrid version: 2.2.0 (Master branch, 3rd May, 2016)
I am pasting a few logs that have suddenly started showing up. ------------------------------------------------------------ Nov 09 16:15:26 catalina.out: 05:45:26,812 WARN EntityMappingParser:116 - Encountered 2 collections consecutively. N+1 dimensional arrays are unsupported, only arrays of depth 1 are supported ------------------------------------------------------------ Nov 09 17:22:12 catalina.out: 06:52:12,848 WARN AsyncEventServiceImpl:362 - No index operation messages came back from event processing for msg: ------------------------------------------------------------ Nov 09 17:39:56 catalina.out: 07:09:56,177 INFO transport:470 - [ip-10-0-2-128] failed to get local cluster state for [#transport#-3][ip-10-0-2-128][inet[/10.0.4.205:9300]], disconnecting... Nov 09 17:39:56 catalina.out: org.elasticsearch.transport.ReceiveTimeoutTransportException: [][inet[/10.0.4.205:9300]][cluster:monitor/state] request_id [11652] timed out after [5247ms] Nov 09 17:39:56 catalina.out: at org.elasticsearch.transport.TransportService$TimeoutHandler.run(TransportService.java:529) Nov 09 17:39:56 catalina.out: at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142) Nov 09 17:39:56 catalina.out: at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617) Nov 09 17:39:56 catalina.out: at java.lang.Thread.run(Thread.java:745) ------------------------------------------------------------ Nov 09 17:40:17 catalina.out: 07:10:17,557 WARN transport:415 - [ip-10-0-2-128] Received response for a request that has timed out, sent [10887ms] ago, timed out [3ms] ago, action [cluster:monitor/state], node [[bluedls__us-east-1a__db__10.0.4.63][T6OWiR1US9m5ABxHh0tW0w][ip-10-0-4-63][inet[/10.0.4.63:9300]]{zone=us-east-1__us-east-1a}], id [11678] ------------------------------------------------------------ Nov 09 17:43:05 catalina.out: 07:13:05,091 ERROR AbstractExceptionMapper:74 - com.netflix.hystrix.exception.HystrixRuntimeException 5XX Uncaught Exception (500) Nov 09 17:43:05 catalina.out: com.netflix.hystrix.exception.HystrixRuntimeException: ConsistentReplayCommand timed-out and fallback failed. .. Nov 09 17:43:05 catalina.out: Caused by: java.util.concurrent.TimeoutException .. Nov 09 17:43:05 catalina.out: Caused by: rx.exceptions.OnErrorThrowable$OnNextValue: OnError while emitting onNext value: org.apache.usergrid.persistence.collection.mvcc.stage.CollectionIoEvent.class .. Nov 09 17:43:05 catalina.out: 07:13:05,123 ERROR AbstractExceptionMapper:108 - Server Error (500): Nov 09 17:43:05 catalina.out: {"error":"hystrix_runtime","timestamp":1510229585122,"duration":0,"error_description":"ConsistentReplayCommand timed-out and fallback failed.","exception":"com.netflix.hystrix.exception.HystrixRuntimeException"} ------------------------------------------------------------ Our monitoring indicates there is no issue in cassandra and elasticseach clusters. Look forward to your help. Thanks Jaskaran