[ https://issues.apache.org/jira/browse/TEZ-4344?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17744187#comment-17744187 ]

okumin edited comment on TEZ-4344 at 7/18/23 11:47 AM:
-------------------------------------------------------

[~ayushtkn] [~abstractdog] Hi, is there a chance we could make this more pluggable?
For background, we inject a similar capability at the same point.
* https://speakerdeck.com/okumin/hive-distributed-profiling-system-in-treasure-data-english-version-number-tdtechtalk?slide=26
* https://api-docs.treasuredata.com/blog/hive-distributed-profiling/

I don't think we can fully replace our patch with TezThreadDumpHelper, 
because we need to attach context specific to us (e.g. a global job ID of our 
platform) and we want to send the output to our DWH. So I wonder whether we can 
generalize the feature into something like a `TaskAttemptHook`.
I'm really surprised and glad to see people who are on the same page!
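
To make the idea concrete, here is a rough sketch of what such a hook could look like. Everything in it is hypothetical and only for illustration: the `TaskAttemptHook` interface shape, the `JobIdTaggingHook` and `HookRunner` classes, and the `example.global.job.id` key are made-up names, not existing Tez APIs or configuration.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;

/** Hypothetical extension point; NOT an existing Tez interface. */
interface TaskAttemptHook {
  /** Called once before the task attempt starts running. */
  void start(String taskAttemptId, Map<String, String> conf);

  /** Called once after the task attempt ends, whether it succeeded or failed. */
  void stop();
}

/** Example plugin: tags each attempt with a platform-specific job id. */
class JobIdTaggingHook implements TaskAttemptHook {
  final List<String> events = new ArrayList<>();
  private String tag;

  @Override
  public void start(String taskAttemptId, Map<String, String> conf) {
    // "example.global.job.id" is an invented key, used only in this sketch.
    tag = conf.getOrDefault("example.global.job.id", "unknown") + "/" + taskAttemptId;
    events.add("start " + tag);
  }

  @Override
  public void stop() {
    events.add("stop " + tag);
  }
}

/** Wraps a task body with every configured hook. */
class HookRunner {
  static void runWithHooks(List<? extends TaskAttemptHook> hooks, String taskAttemptId,
                           Map<String, String> conf, Runnable task) {
    List<TaskAttemptHook> started = new ArrayList<>();
    try {
      for (TaskAttemptHook hook : hooks) {
        hook.start(taskAttemptId, conf);
        started.add(hook);
      }
      task.run();
    } finally {
      // Stop only the hooks that started, in reverse order, even if the task threw.
      for (int i = started.size() - 1; i >= 0; i--) {
        started.get(i).stop();
      }
    }
  }
}
```

With something of this shape, the thread dump collection could be one hook implementation among others, and a site-specific hook could add its own context and ship the output elsewhere without patching Tez.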



> Collect jstack periodically from all containers
> -----------------------------------------------
>
>                 Key: TEZ-4344
>                 URL: https://issues.apache.org/jira/browse/TEZ-4344
>             Project: Apache Tez
>          Issue Type: Sub-task
>            Reporter: László Bodor
>            Assignee: Ayush Saxena
>            Priority: Major
>             Fix For: 0.10.3
>
>          Time Spent: 6h 20m
>  Remaining Estimate: 0h
>
> 1. Set a property for the dump interval in seconds (default: 0 ==> off).
> 2. Attach the jstack files to the app logs (this may be easy: putting the jstack log 
> files next to the app syslog files can get them included by YARN).
> The jstack files should have a name like containername_dagname_timestamp.
> + An option for whether containers should create jstacks even when idle (no task 
> assigned to them); by default they should not.
> I don't want a jstack dependency for this (configuring a path, etc.), so 
> an internal thread dump facility with zero configuration is preferred.
> Also, this doesn't require new endpoints on the AM and task containers (like 
> TEZ-4345), so it can be implemented quite easily.
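
The "internal thread dump facility" described in the issue can be approximated with the JDK's own management API, without calling the external jstack binary. The following is a minimal sketch under stated assumptions, not the actual TezThreadDumpHelper implementation: the `PeriodicThreadDumper` class name and the `Consumer<String>` sink are invented for illustration, and an interval of 0 is treated as "off" to mirror the default in the issue.

```java
import java.lang.management.ManagementFactory;
import java.lang.management.ThreadInfo;
import java.lang.management.ThreadMXBean;
import java.util.concurrent.Executors;
import java.util.concurrent.ScheduledExecutorService;
import java.util.concurrent.TimeUnit;
import java.util.function.Consumer;

/** Hypothetical helper: periodically hands an in-process thread dump to a sink. */
class PeriodicThreadDumper implements AutoCloseable {
  private final ScheduledExecutorService scheduler;

  PeriodicThreadDumper(long intervalSeconds, Consumer<String> sink) {
    if (intervalSeconds <= 0) {
      scheduler = null; // an interval of 0 means the feature is off
      return;
    }
    scheduler = Executors.newSingleThreadScheduledExecutor(r -> {
      Thread t = new Thread(r, "thread-dumper");
      t.setDaemon(true); // never keep the container JVM alive just for dumps
      return t;
    });
    // Note: if the sink throws, the executor cancels all later runs.
    scheduler.scheduleAtFixedRate(() -> sink.accept(dump()), 0, intervalSeconds, TimeUnit.SECONDS);
  }

  /** Builds a jstack-like text dump of all live threads in this JVM. */
  static String dump() {
    ThreadMXBean bean = ManagementFactory.getThreadMXBean();
    StringBuilder sb = new StringBuilder();
    // false, false: skip lock details, which keeps the dump cheap.
    for (ThreadInfo info : bean.dumpAllThreads(false, false)) {
      sb.append('"').append(info.getThreadName()).append("\" ")
        .append(info.getThreadState()).append('\n');
      // ThreadInfo.toString() truncates the stack, so print every frame explicitly.
      for (StackTraceElement frame : info.getStackTrace()) {
        sb.append("\tat ").append(frame).append('\n');
      }
      sb.append('\n');
    }
    return sb.toString();
  }

  @Override
  public void close() {
    if (scheduler != null) {
      scheduler.shutdownNow();
    }
  }
}
```

A sink that writes each dump to a file next to the container's syslog would cover the "attach to app logs" point, while a different sink could forward the same text elsewhere.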



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
