[ 
https://issues.apache.org/jira/browse/YUNIKORN-42?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17096550#comment-17096550
 ] 

Adam Antal commented on YUNIKORN-42:
------------------------------------

Had a discussion with [~wwei] and [~wilfreds] today, and here are some notes.

First of all, the scope of this issue is huge, so I'll create subtasks to 
handle it more easily. The first milestone should be to have an event cache in 
core which only considers application and queue based events. Later we can 
think about pushing these events to the shim side, adding 
pod(AllocationAsk)/node based events as well and API to expose this information 
to the user - even UI support.

What is also important to have this run in a separate goroutine (similar to 
{{PartitionManager}}) and be independent of the scheduler. We don't want to 
have performance degradation as a side-effect of better trackability of events.

> Better to support POD events for YuniKorn to troubleshoot allocation failures
> -----------------------------------------------------------------------------
>
>                 Key: YUNIKORN-42
>                 URL: https://issues.apache.org/jira/browse/YUNIKORN-42
>             Project: Apache YuniKorn
>          Issue Type: Task
>            Reporter: Wangda Tan
>            Assignee: Adam Antal
>            Priority: Major
>              Labels: pull-request-available
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> Now it is tricky to do troubleshoot for pod allocation, we need better expose 
> this information to POD description.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@yunikorn.apache.org
For additional commands, e-mail: issues-h...@yunikorn.apache.org

Reply via email to