[ 
https://issues.apache.org/jira/browse/YARN-4772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15184981#comment-15184981
 ] 

Steve Loughran commented on YARN-4772:
--------------------------------------

tail of the logs.

Looking at the slf4j message, it may be that this has been happening in test 
teardown, so really it's that leveldb had been stopped while another thread was 
using it: a race condition on teardown.
{code}
2016-03-08 13:11:44,597 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:44,605 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:46,588 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:46,597 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:48,126 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:48,136 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:49,663 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:49,672 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:51,588 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:51,596 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:53,164 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:53,174 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:54,658 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:54,667 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:56,526 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:56,534 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:58,239 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:58,266 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:11:59,792 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:11:59,801 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:01,612 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:01,621 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:03,294 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:03,308 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:05,289 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:05,305 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:07,090 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:07,100 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:07,483 [ScalaTest-main-running-ScaleSuite] WARN  
org.mortbay.log (Slf4jLog.java:warn(76)) - 4 threads could not be stopped
2016-03-08 13:12:07,568 [ScalaTest-main-running-ScaleSuite] INFO  
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(EntityGroupFSTimelineStore.java:serviceStop(286)) - Stopping 
EntityGroupFSTimelineStore
2016-03-08 13:12:07,573 [ScalaTest-main-running-ScaleSuite] INFO  
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(EntityGroupFSTimelineStore.java:serviceStop(290)) - Waiting for executor to 
terminate
2016-03-08 13:12:08,661 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:08,672 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:10,248 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:10,257 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:12,027 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:12,035 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:13,575 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:13,588 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:15,578 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:15,589 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
2016-03-08 13:12:17,576 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.security.TimelineACLsManager 
(TimelineACLsManager.java:checkAccess(106)) - Verifying the access of stevel on 
the timeline entity { id: appattempt_1111_0000_000000, type: spark_event_v01 }
2016-03-08 13:12:17,576 [ScalaTest-main-running-ScaleSuite] WARN  
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(EntityGroupFSTimelineStore.java:serviceStop(295)) - Executor did not terminate
2016-03-08 13:12:17,586 [EntityLogPluginWorker #0] DEBUG 
org.apache.hadoop.yarn.server.timeline.EntityGroupFSTimelineStore 
(LogInfo.java:doParse(198)) - Adding 
appattempt_1111_0000_000000(spark_event_v01) to store
pthread lock: Invalid argument
/bin/sh: line 1:  8196 Abort trap: 6           java -Djava.awt.headless=true 
-Djava.io.tmpdir=/Users/stevel/Projects/Hortonworks/Projects/sparkwork/spark-timeline-integration/target/tmp
 -Dscale.test.jobs=100000 -Dspark.testing=1 -Dspark.ui.enabled=false 
-Dspark.ui.showConsoleProgress=false -Dspark.unsafe.exceptionOnMemoryLeak=true 
-Dbasedir=/Users/stevel/Projects/Hortonworks/Projects/sparkwork/spark-timeline-integration
 -ea -Xmx3g -Xss4096k -XX:MaxPermSize=512m -XX:ReservedCodeCacheSize=512m 
org.scalatest.tools.Runner -R 
'/Users/stevel/Projects/Hortonworks/Projects/sparkwork/spark-timeline-integration/target/classes
 
/Users/stevel/Projects/Hortonworks/Projects/sparkwork/spark-timeline-integration/target/test-classes'
 -w org.apache.spark.deploy.history.yarn.integration.ScaleSuite -o -f 
/Users/stevel/Projects/Hortonworks/Projects/sparkwork/spark-timeline-integration/target/surefire-reports/SparkTestSuite.txt
 -u 
/Users/stevel/Projects/Hortonworks/Projects/sparkwork/spark-timeline-integration/target/surefire-reports/.
{code}

> Overloaded leveljb can crash the ATS "pthread lock: Invalid argument"
> ---------------------------------------------------------------------
>
>                 Key: YARN-4772
>                 URL: https://issues.apache.org/jira/browse/YARN-4772
>             Project: Hadoop YARN
>          Issue Type: Bug
>          Components: timelineserver
>    Affects Versions: 2.8.0
>         Environment: OSX, scala history scale tests; Java 1.7.0_75-b13
>            Reporter: Steve Loughran
>
> while running scale tests with a few hundred thousand events attached to a 
> single timeline entity, the JVM crashed
> {code}
> pthread lock: Invalid argument
> /bin/sh: line 1:  8196 Abort trap: 6           
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to