[jira] [Assigned] (YARN-5688) Make allocation of opportunistic containers asynchronous

2021-01-11 Thread Sampada Dehankar (Jira)


 [ 
https://issues.apache.org/jira/browse/YARN-5688?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar reassigned YARN-5688:
--

Assignee: Sampada Dehankar  (was: Abhishek Modi)

> Make allocation of opportunistic containers asynchronous
> 
>
> Key: YARN-5688
> URL: https://issues.apache.org/jira/browse/YARN-5688
> Project: Hadoop YARN
>  Issue Type: Sub-task
>  Components: resourcemanager
>Reporter: Konstantinos Karanasos
>Assignee: Sampada Dehankar
>Priority: Major
>
> In the current implementation of the 
> {{OpportunisticContainerAllocatorAMService}}, we synchronously perform the 
> allocation of opportunistic containers. This results in "blocking" the 
> service at the RM when scheduling the opportunistic containers.
> The {{OpportunisticContainerAllocator}} should instead asynchronously run as 
> a separate thread.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Assigned] (YARN-9828) Add log line for app submission in RouterWebServices.

2019-09-25 Thread Sampada Dehankar (Jira)


 [ 
https://issues.apache.org/jira/browse/YARN-9828?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar reassigned YARN-9828:
--

Assignee: Sampada Dehankar  (was: Abhishek Modi)

> Add log line for app submission in RouterWebServices.
> -
>
> Key: YARN-9828
> URL: https://issues.apache.org/jira/browse/YARN-9828
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: Abhishek Modi
>Assignee: Sampada Dehankar
>Priority: Trivial
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Assigned] (YARN-9843) Test TestAMSimulator.testAMSimulator fails intermittently.

2019-09-25 Thread Sampada Dehankar (Jira)


 [ 
https://issues.apache.org/jira/browse/YARN-9843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar reassigned YARN-9843:
--

Assignee: Sampada Dehankar  (was: Abhishek Modi)

> Test TestAMSimulator.testAMSimulator fails intermittently.
> --
>
> Key: YARN-9843
> URL: https://issues.apache.org/jira/browse/YARN-9843
> Project: Hadoop YARN
>  Issue Type: Test
>Reporter: Abhishek Modi
>Assignee: Sampada Dehankar
>Priority: Major
>
> Stack trace for failure:
> java.lang.AssertionError: java.io.IOException: Unable to delete directory 
> /testptch/hadoop/hadoop-tools/hadoop-sls/target/test-dir/output4038286622450859971/metrics.
>  at org.junit.Assert.fail(Assert.java:88)
>  at 
> org.apache.hadoop.yarn.sls.appmaster.TestAMSimulator.deleteMetricOutputDir(TestAMSimulator.java:141)
>  at 
> org.apache.hadoop.yarn.sls.appmaster.TestAMSimulator.tearDown(TestAMSimulator.java:298)
>  at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>  at 
> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:62)
>  at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
>  at java.lang.reflect.Method.invoke(Method.java:498)
>  at 
> org.junit.runners.model.FrameworkMethod$1.runReflectiveCall(FrameworkMethod.java:50)
>  at 
> org.junit.internal.runners.model.ReflectiveCallable.run(ReflectiveCallable.java:12)
>  at 
> org.junit.runners.model.FrameworkMethod.invokeExplosively(FrameworkMethod.java:47)
>  at 
> org.junit.internal.runners.statements.RunAfters.evaluate(RunAfters.java:33)
>  at org.junit.runners.ParentRunner.runLeaf(ParentRunner.java:325)
>  at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:78)
>  at 
> org.junit.runners.BlockJUnit4ClassRunner.runChild(BlockJUnit4ClassRunner.java:57)
>  at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>  at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>  at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>  at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>  at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>  at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>  at org.junit.runners.Suite.runChild(Suite.java:128)
>  at org.junit.runners.Suite.runChild(Suite.java:27)
>  at org.junit.runners.ParentRunner$3.run(ParentRunner.java:290)
>  at org.junit.runners.ParentRunner$1.schedule(ParentRunner.java:71)
>  at org.junit.runners.ParentRunner.runChildren(ParentRunner.java:288)
>  at org.junit.runners.ParentRunner.access$000(ParentRunner.java:58)
>  at org.junit.runners.ParentRunner$2.evaluate(ParentRunner.java:268)
>  at org.junit.runners.ParentRunner.run(ParentRunner.java:363)
>  at 
> org.apache.maven.surefire.junit4.JUnit4Provider.execute(JUnit4Provider.java:365)
>  at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeWithRerun(JUnit4Provider.java:273)
>  at 
> org.apache.maven.surefire.junit4.JUnit4Provider.executeTestSet(JUnit4Provider.java:238)
>  at 
> org.apache.maven.surefire.junit4.JUnit4Provider.invoke(JUnit4Provider.java:159)
>  at 
> org.apache.maven.surefire.booter.ForkedBooter.invokeProviderInSameClassLoader(ForkedBooter.java:384)
>  at 
> org.apache.maven.surefire.booter.ForkedBooter.runSuitesInProcess(ForkedBooter.java:345)
>  at 
> org.apache.maven.surefire.booter.ForkedBooter.execute(ForkedBooter.java:126)
>  at org.apache.maven.surefire.booter.ForkedBooter.main(ForkedBooter.java:418)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7691) Add Unit Tests for ContainersLauncher

2018-01-02 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7691?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16309218#comment-16309218
 ] 

Sampada Dehankar commented on YARN-7691:


Thanks for the review and commit [~asuresh].

> Add Unit Tests for ContainersLauncher
> -
>
> Key: YARN-7691
> URL: https://issues.apache.org/jira/browse/YARN-7691
> Project: Hadoop YARN
>  Issue Type: Task
>Affects Versions: 2.9.1
>Reporter: Sampada Dehankar
>Assignee: Sampada Dehankar
> Fix For: 3.1.0, 2.9.1, 3.0.1
>
> Attachments: YARN-7691.001.patch, YARN-7691.002.patch
>
>
> We need to add more test in the recovry path.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7691) Add Unit Tests for Containers Launcher

2018-01-02 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7691:
---
Attachment: YARN-7691.002.patch

> Add Unit Tests for Containers Launcher
> --
>
> Key: YARN-7691
> URL: https://issues.apache.org/jira/browse/YARN-7691
> Project: Hadoop YARN
>  Issue Type: Task
>Affects Versions: 2.9.1
>Reporter: Sampada Dehankar
>Assignee: Sampada Dehankar
> Attachments: YARN-7691.001.patch, YARN-7691.002.patch
>
>
> We need to add more test in the recovry path.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7542) Fix issue that causes some Running Opportunistic Containers to be recovered as PAUSED

2018-01-01 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16307516#comment-16307516
 ] 

Sampada Dehankar commented on YARN-7542:


Thanks for submitting the patch [~asuresh]. 
The patch for test cases for ContainersLauncher has been uploaded to 
https://issues.apache.org/jira/browse/YARN-7691 .
Please have a look.

> Fix issue that causes some Running Opportunistic Containers to be recovered 
> as PAUSED
> -
>
> Key: YARN-7542
> URL: https://issues.apache.org/jira/browse/YARN-7542
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Arun Suresh
>Assignee: Sampada Dehankar
> Fix For: 3.1.0, 2.9.1, 3.0.1
>
> Attachments: YARN-7542.001.patch
>
>
> Steps to reproduce:
> * Start YARN cluster - Enable Opportunistic containers and set NM queue 
> length to something > 10. Also Enable work preserving restart
> * Start an MR job (without opportunistic containers)
> * Kill the NM and restart it again.
> * In the logs - it shows that some of the containers are in SUSPENDED state - 
> even though they are still running.
> [~sampada15] / [~kartheek], can you take a look at this ?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7691) Add Unit Tests for Containers Launcher

2018-01-01 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7691?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7691:
---
Attachment: YARN-7691.001.patch

> Add Unit Tests for Containers Launcher
> --
>
> Key: YARN-7691
> URL: https://issues.apache.org/jira/browse/YARN-7691
> Project: Hadoop YARN
>  Issue Type: Task
>Affects Versions: 2.9.1
>Reporter: Sampada Dehankar
>Assignee: Sampada Dehankar
> Attachments: YARN-7691.001.patch
>
>
> We need to add more test in the recovry path.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Comment Edited] (YARN-7542) NM recovers some Running Opportunistic Containers as SUSPEND

2017-12-28 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16305999#comment-16305999
 ] 

Sampada Dehankar edited comment on YARN-7542 at 12/29/17 6:02 AM:
--

Thanks [~asuresh]. Created YARN-7691 to track additional test cases for 
recovery path.


was (Author: sampada15):
Created YARN-7691 to track additional test cases for recovery path.

> NM recovers some Running Opportunistic Containers as SUSPEND
> 
>
> Key: YARN-7542
> URL: https://issues.apache.org/jira/browse/YARN-7542
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Arun Suresh
>Assignee: Sampada Dehankar
> Attachments: YARN-7542.001.patch
>
>
> Steps to reproduce:
> * Start YARN cluster - Enable Opportunistic containers and set NM queue 
> length to something > 10. Also Enable work preserving restart
> * Start an MR job (without opportunistic containers)
> * Kill the NM and restart it again.
> * In the logs - it shows that some of the containers are in SUSPENDED state - 
> even though they are still running.
> [~sampada15] / [~kartheek], can you take a look at this ?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7542) NM recovers some Running Opportunistic Containers as SUSPEND

2017-12-28 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16305999#comment-16305999
 ] 

Sampada Dehankar commented on YARN-7542:


Created YARN-7691 to track additional test cases for recovery path.

> NM recovers some Running Opportunistic Containers as SUSPEND
> 
>
> Key: YARN-7542
> URL: https://issues.apache.org/jira/browse/YARN-7542
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Arun Suresh
>Assignee: Sampada Dehankar
> Attachments: YARN-7542.001.patch
>
>
> Steps to reproduce:
> * Start YARN cluster - Enable Opportunistic containers and set NM queue 
> length to something > 10. Also Enable work preserving restart
> * Start an MR job (without opportunistic containers)
> * Kill the NM and restart it again.
> * In the logs - it shows that some of the containers are in SUSPENDED state - 
> even though they are still running.
> [~sampada15] / [~kartheek], can you take a look at this ?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Created] (YARN-7691) Add Unit Tests for Containers Launcher

2017-12-28 Thread Sampada Dehankar (JIRA)
Sampada Dehankar created YARN-7691:
--

 Summary: Add Unit Tests for Containers Launcher
 Key: YARN-7691
 URL: https://issues.apache.org/jira/browse/YARN-7691
 Project: Hadoop YARN
  Issue Type: Task
Affects Versions: 2.9.1
Reporter: Sampada Dehankar
Assignee: Sampada Dehankar


We need to add more test in the recovry path.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7542) NM recovers some Running Opportunistic Containers as SUSPEND

2017-12-28 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7542?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7542:
---
Attachment: YARN-7542.001.patch

> NM recovers some Running Opportunistic Containers as SUSPEND
> 
>
> Key: YARN-7542
> URL: https://issues.apache.org/jira/browse/YARN-7542
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Arun Suresh
>Assignee: Sampada Dehankar
> Attachments: YARN-7542.001.patch
>
>
> Steps to reproduce:
> * Start YARN cluster - Enable Opportunistic containers and set NM queue 
> length to something > 10. Also Enable work preserving restart
> * Start an MR job (without opportunistic containers)
> * Kill the NM and restart it again.
> * In the logs - it shows that some of the containers are in SUSPENDED state - 
> even though they are still running.
> [~sampada15] / [~kartheek], can you take a look at this ?



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Comment Edited] (YARN-7601) Incorrect container states recovered as LevelDB uses alphabetical order

2017-12-26 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16303705#comment-16303705
 ] 

Sampada Dehankar edited comment on YARN-7601 at 12/26/17 9:47 AM:
--

[~asuresh]: Can you please review this patch? Looks like this particular test 
[TestContainerSchedulerQueuing: testKillOnlyRequiredOpportunisticContainers] 
passes intermittently. After submitting the same patch again, it passed.


was (Author: sampada15):
[~asuresh]: Can you please review this patch?

> Incorrect container states recovered as LevelDB uses alphabetical order
> ---
>
> Key: YARN-7601
> URL: https://issues.apache.org/jira/browse/YARN-7601
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Sampada Dehankar
>Assignee: Sampada Dehankar
> Attachments: YARN-7601.001.patch, YARN-7601.002.patch
>
>
> LevelDB stores key-value pairs in the alphabetical order. Container id 
> concatenated by its state is used as key. So, even if container goes through 
> any states in its life cycle, the order of states for following values 
> retrieved from LevelDB is always going to be as below`:
> LAUNCHED
> PAUSED
> QUEUED
> For eg: If a container is LAUNCHED then PAUSED and LAUNCHED again, the 
> recovered container state is PAUSED currently instead of LAUNCHED.
> We propose to store the timestamp as the value while making call to 
>   
>   storeContainerLaunched
>   storeContainerPaused
>   storeContainerQueued
>   
> so that correct container state is recovered based on timestamps.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7601) Incorrect container states recovered as LevelDB uses alphabetical order

2017-12-26 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16303705#comment-16303705
 ] 

Sampada Dehankar commented on YARN-7601:


[~asuresh]: Can you please review this patch?

> Incorrect container states recovered as LevelDB uses alphabetical order
> ---
>
> Key: YARN-7601
> URL: https://issues.apache.org/jira/browse/YARN-7601
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Sampada Dehankar
>Assignee: Sampada Dehankar
> Attachments: YARN-7601.001.patch, YARN-7601.002.patch
>
>
> LevelDB stores key-value pairs in the alphabetical order. Container id 
> concatenated by its state is used as key. So, even if container goes through 
> any states in its life cycle, the order of states for following values 
> retrieved from LevelDB is always going to be as below`:
> LAUNCHED
> PAUSED
> QUEUED
> For eg: If a container is LAUNCHED then PAUSED and LAUNCHED again, the 
> recovered container state is PAUSED currently instead of LAUNCHED.
> We propose to store the timestamp as the value while making call to 
>   
>   storeContainerLaunched
>   storeContainerPaused
>   storeContainerQueued
>   
> so that correct container state is recovered based on timestamps.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7601) Incorrect container states recovered as LevelDB uses alphabetical order

2017-12-26 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7601:
---
Attachment: YARN-7601.002.patch

> Incorrect container states recovered as LevelDB uses alphabetical order
> ---
>
> Key: YARN-7601
> URL: https://issues.apache.org/jira/browse/YARN-7601
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Sampada Dehankar
>Assignee: Sampada Dehankar
> Attachments: YARN-7601.001.patch, YARN-7601.002.patch
>
>
> LevelDB stores key-value pairs in the alphabetical order. Container id 
> concatenated by its state is used as key. So, even if container goes through 
> any states in its life cycle, the order of states for following values 
> retrieved from LevelDB is always going to be as below`:
> LAUNCHED
> PAUSED
> QUEUED
> For eg: If a container is LAUNCHED then PAUSED and LAUNCHED again, the 
> recovered container state is PAUSED currently instead of LAUNCHED.
> We propose to store the timestamp as the value while making call to 
>   
>   storeContainerLaunched
>   storeContainerPaused
>   storeContainerQueued
>   
> so that correct container state is recovered based on timestamps.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7601) Incorrect container states recovered as LevelDB uses alphabetical order

2017-12-25 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7601:
---
Attachment: YARN-7601.001.patch

> Incorrect container states recovered as LevelDB uses alphabetical order
> ---
>
> Key: YARN-7601
> URL: https://issues.apache.org/jira/browse/YARN-7601
> Project: Hadoop YARN
>  Issue Type: Bug
>Reporter: Sampada Dehankar
>Assignee: Sampada Dehankar
> Attachments: YARN-7601.001.patch
>
>
> LevelDB stores key-value pairs in the alphabetical order. Container id 
> concatenated by its state is used as key. So, even if container goes through 
> any states in its life cycle, the order of states for following values 
> retrieved from LevelDB is always going to be as below`:
> LAUNCHED
> PAUSED
> QUEUED
> For eg: If a container is LAUNCHED then PAUSED and LAUNCHED again, the 
> recovered container state is PAUSED currently instead of LAUNCHED.
> We propose to store the timestamp as the value while making call to 
>   
>   storeContainerLaunched
>   storeContainerPaused
>   storeContainerQueued
>   
> so that correct container state is recovered based on timestamps.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Created] (YARN-7601) Incorrect container states recovered as LevelDB uses alphabetical order

2017-12-03 Thread Sampada Dehankar (JIRA)
Sampada Dehankar created YARN-7601:
--

 Summary: Incorrect container states recovered as LevelDB uses 
alphabetical order
 Key: YARN-7601
 URL: https://issues.apache.org/jira/browse/YARN-7601
 Project: Hadoop YARN
  Issue Type: Bug
Reporter: Sampada Dehankar
Assignee: Sampada Dehankar


LevelDB stores key-value pairs in the alphabetical order. Container id 
concatenated by its state is used as key. So, even if container goes through 
any states in its life cycle, the order of states for following values 
retrieved from LevelDB is always going to be as below`:

LAUNCHED
PAUSED
QUEUED

For eg: If a container is LAUNCHED then PAUSED and LAUNCHED again, the 
recovered container state is PAUSED currently instead of LAUNCHED.
We propose to store the timestamp as the value while making call to 
  
  storeContainerLaunched
  storeContainerPaused
  storeContainerQueued
  
so that correct container state is recovered based on timestamps.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7343) Add a junit test for ContainerScheduler recovery

2017-11-07 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16241895#comment-16241895
 ] 

Sampada Dehankar commented on YARN-7343:


Thanks [~asuresh]. Fixed the checkstyle issues. 

> Add a junit test for ContainerScheduler recovery
> 
>
> Key: YARN-7343
> URL: https://issues.apache.org/jira/browse/YARN-7343
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: kartheek muthyala
>Assignee: Sampada Dehankar
>Priority: Minor
> Attachments: YARN-7343.001.patch, YARN-7343.002.patch, 
> YARN-7343.003.patch
>
>
> With queuing at NM, Container recovery becomes interesting. Add a junit test 
> for recovering containers in different states. This should test the recovery 
> with the ContainerScheduler class that was introduced for enabling container 
> queuing on contention of resources. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7343) Add a junit test for ContainerScheduler recovery

2017-11-07 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7343:
---
Attachment: YARN-7343.003.patch

> Add a junit test for ContainerScheduler recovery
> 
>
> Key: YARN-7343
> URL: https://issues.apache.org/jira/browse/YARN-7343
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: kartheek muthyala
>Assignee: Sampada Dehankar
>Priority: Minor
> Attachments: YARN-7343.001.patch, YARN-7343.002.patch, 
> YARN-7343.003.patch
>
>
> With queuing at NM, Container recovery becomes interesting. Add a junit test 
> for recovering containers in different states. This should test the recovery 
> with the ContainerScheduler class that was introduced for enabling container 
> queuing on contention of resources. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7343) Add a junit test for ContainerScheduler recovery

2017-11-02 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16237113#comment-16237113
 ] 

Sampada Dehankar commented on YARN-7343:


[~asuresh] Please have a look at the patch submitted. Removed the powermock 
dependency. 

> Add a junit test for ContainerScheduler recovery
> 
>
> Key: YARN-7343
> URL: https://issues.apache.org/jira/browse/YARN-7343
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: kartheek muthyala
>Assignee: Sampada Dehankar
>Priority: Minor
> Attachments: YARN-7343.001.patch, YARN-7343.002.patch
>
>
> With queuing at NM, Container recovery becomes interesting. Add a junit test 
> for recovering containers in different states. This should test the recovery 
> with the ContainerScheduler class that was introduced for enabling container 
> queuing on contention of resources. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7343) Add a junit test for ContainerScheduler recovery

2017-11-02 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7343:
---
Attachment: YARN-7343.002.patch

> Add a junit test for ContainerScheduler recovery
> 
>
> Key: YARN-7343
> URL: https://issues.apache.org/jira/browse/YARN-7343
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: kartheek muthyala
>Assignee: Sampada Dehankar
>Priority: Minor
> Attachments: YARN-7343.001.patch, YARN-7343.002.patch
>
>
> With queuing at NM, Container recovery becomes interesting. Add a junit test 
> for recovering containers in different states. This should test the recovery 
> with the ContainerScheduler class that was introduced for enabling container 
> queuing on contention of resources. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7343) Add a junit test for ContainerScheduler recovery

2017-10-25 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16218672#comment-16218672
 ] 

Sampada Dehankar commented on YARN-7343:


Thanks  for the review and suggestions [~asuresh]. Let me try the above patch 
without powermock dependency.

> Add a junit test for ContainerScheduler recovery
> 
>
> Key: YARN-7343
> URL: https://issues.apache.org/jira/browse/YARN-7343
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: kartheek muthyala
>Assignee: Sampada Dehankar
>Priority: Minor
> Attachments: YARN-7343.001.patch
>
>
> With queuing at NM, Container recovery becomes interesting. Add a junit test 
> for recovering containers in different states. This should test the recovery 
> with the ContainerScheduler class that was introduced for enabling container 
> queuing on contention of resources. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Commented] (YARN-7343) Add a junit test for ContainerScheduler recovery

2017-10-23 Thread Sampada Dehankar (JIRA)

[ 
https://issues.apache.org/jira/browse/YARN-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16214750#comment-16214750
 ] 

Sampada Dehankar commented on YARN-7343:


There is 1 unit test failure here 
[hadoop.yarn.server.nodemanager.scheduler.TestDistributedScheduler] which is 
not related to my change. 

[~asuresh] Could you please review this patch?

> Add a junit test for ContainerScheduler recovery
> 
>
> Key: YARN-7343
> URL: https://issues.apache.org/jira/browse/YARN-7343
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: kartheek muthyala
>Assignee: Sampada Dehankar
>Priority: Minor
> Attachments: YARN-7343.001.patch
>
>
> With queuing at NM, Container recovery becomes interesting. Add a junit test 
> for recovering containers in different states. This should test the recovery 
> with the ContainerScheduler class that was introduced for enabling container 
> queuing on contention of resources. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org



[jira] [Updated] (YARN-7343) Add a junit test for ContainerScheduler recovery

2017-10-21 Thread Sampada Dehankar (JIRA)

 [ 
https://issues.apache.org/jira/browse/YARN-7343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sampada Dehankar updated YARN-7343:
---
Attachment: YARN-7343.001.patch

> Add a junit test for ContainerScheduler recovery
> 
>
> Key: YARN-7343
> URL: https://issues.apache.org/jira/browse/YARN-7343
> Project: Hadoop YARN
>  Issue Type: Task
>Reporter: kartheek muthyala
>Assignee: Sampada Dehankar
>Priority: Minor
> Attachments: YARN-7343.001.patch
>
>
> With queuing at NM, Container recovery becomes interesting. Add a junit test 
> for recovering containers in different states. This should test the recovery 
> with the ContainerScheduler class that was introduced for enabling container 
> queuing on contention of resources. 



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: yarn-issues-unsubscr...@hadoop.apache.org
For additional commands, e-mail: yarn-issues-h...@hadoop.apache.org