[ 
https://issues.apache.org/jira/browse/YARN-4091?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Sunil G updated YARN-4091:
--------------------------
    Attachment: SchedulerActivityManager-TestReport.pdf

HI [~ChenGe] and [~leftnoteasy]

I got some time to do test  with this patch. I thought of sharing test results 
here along with few inputs.

I added this comments in the doc as well.

Comments:
# I think Diagnostic message could be improved.  "do not need more resource" => 
“Applications does not need more resource”
# For node activity, "priority": "-1" does not make sense. Could we hide the 
same from node level and show for app (container)?
# timeStamp is not meaningful ("timeStamp": "1469792611186"). Its could be date 
and time or relative to previous activity.
# *finalAllocationState* is one of the entry for application. Could we say 
*finalAppAllocationState*.
# In queue level, is “allocationState” meaningful? I think we can hide in queue 
level, thoughts.?
# As mentioned earlier, priority could be hidden in places where its -1.
# As an improvement, its better to give pending resource requests per app after 
allocation. So we can get some idea and can help a lot.
# when I tested below test case "allocation for an application is done and app 
is running. Second app is awaiting due to AM resource percentage." I could not 
get expected result. Am I missing something.? Test case 6 in the report.
# Could we also print node_label too when container is allocated


I tried some more cases and will try enhancing this report.



> Add REST API to retrieve scheduler activity
> -------------------------------------------
>
>                 Key: YARN-4091
>                 URL: https://issues.apache.org/jira/browse/YARN-4091
>             Project: Hadoop YARN
>          Issue Type: Sub-task
>          Components: capacity scheduler, resourcemanager
>    Affects Versions: 2.7.0
>            Reporter: Sunil G
>            Assignee: Chen Ge
>         Attachments: Improvement on debugdiagnostic information - YARN.pdf, 
> SchedulerActivityManager-TestReport.pdf, YARN-4091-design-doc-v1.pdf, 
> YARN-4091.1.patch, YARN-4091.2.patch, YARN-4091.3.patch, YARN-4091.4.patch, 
> YARN-4091.preliminary.1.patch, app_activities.json, node_activities.json
>
>
> As schedulers are improved with various new capabilities, more configurations 
> which tunes the schedulers starts to take actions such as limit assigning 
> containers to an application, or introduce delay to allocate container etc. 
> There are no clear information passed down from scheduler to outerworld under 
> these various scenarios. This makes debugging very tougher.
> This ticket is an effort to introduce more defined states on various parts in 
> scheduler where it skips/rejects container assignment, activate application 
> etc. Such information will help user to know whats happening in scheduler.
> Attaching a short proposal for initial discussion. We would like to improve 
> on this as we discuss.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to