fsamuel-bs opened a new pull request #29977:
URL: https://github.com/apache/spark/pull/29977


   ### What changes were proposed in this pull request?
   This PR proposes a new set of APIs for `ExecutorPlugin` that provide callbacks 
invoked at the start and end of each task of a job. I am not strongly opinionated 
about the shape of the API and have kept it as minimal as possible for now.
   
   ### Why are the changes needed?
   The changes are described in detail in 
[SPARK-33088](https://issues.apache.org/jira/browse/SPARK-33088), but they mostly 
boil down to:
   
   1. This feature was considered when the ExecutorPlugin API was initially 
introduced in #21923, but was never implemented.
   2. The use case that **requires** this feature is propagating tracing 
information from the driver to the executors, so that all calls from the same job 
can be traced together.
     a. Tracing frameworks are usually set up in thread locals, so it is 
important for the setup to happen in the same thread that runs the task.
     b. An executor can run tasks for multiple jobs, so it is not sufficient to 
set tracing information at executor startup time; it needs to happen every 
time a task starts or ends.
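
To illustrate points 2a and 2b, here is a minimal, self-contained sketch in plain Java. It does not use Spark: `TaskHooks`, `onTaskStart`, `onTaskEnd`, `TracingHooks`, and `CURRENT_TRACE` are hypothetical names chosen for illustration and are not necessarily the API proposed in this PR. It only shows why the hooks have to run on the task's own thread, and once per task, when a worker thread is reused across jobs.

```java
import java.util.List;
import java.util.concurrent.CopyOnWriteArrayList;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

class TaskHookSketch {

    // Hypothetical callback interface; the names are illustrative, not Spark's API.
    interface TaskHooks {
        void onTaskStart(String traceId);
        void onTaskEnd();
    }

    // Stand-in for a tracing framework that keeps its state in a thread local.
    static final ThreadLocal<String> CURRENT_TRACE = new ThreadLocal<>();

    static final class TracingHooks implements TaskHooks {
        @Override public void onTaskStart(String traceId) { CURRENT_TRACE.set(traceId); }
        @Override public void onTaskEnd() { CURRENT_TRACE.remove(); }
    }

    // Wraps a task body so that both hooks run on the thread that runs the task.
    static Runnable wrap(TaskHooks hooks, String traceId, Runnable body) {
        return () -> {
            hooks.onTaskStart(traceId);
            try {
                body.run();
            } finally {
                hooks.onTaskEnd(); // always clean up, even if the task fails
            }
        };
    }

    // Runs tasks from two "jobs" on one reused worker thread and records
    // the trace ID each task observes, plus what is left over afterwards.
    static List<String> runDemo() {
        List<String> seen = new CopyOnWriteArrayList<>();
        TaskHooks hooks = new TracingHooks();
        Runnable probe = () -> seen.add(String.valueOf(CURRENT_TRACE.get()));

        ExecutorService pool = Executors.newSingleThreadExecutor();
        pool.submit(wrap(hooks, "job-1", probe));
        pool.submit(wrap(hooks, "job-2", probe));
        pool.submit(probe); // no hooks: the previous task's trace must not leak
        pool.shutdown();
        try {
            pool.awaitTermination(5, TimeUnit.SECONDS);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new IllegalStateException(e);
        }
        return seen;
    }

    public static void main(String[] args) {
        System.out.println(runDemo()); // prints [job-1, job-2, null]
    }
}
```

Setting the trace ID only once, when the worker thread is created, would leave both tasks with the same value, which is the limitation described in 2b.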
   
   ### Does this PR introduce _any_ user-facing change?
   No. This PR introduces new features for future developers to use.
   
   ### How was this patch tested?
   Unit tests in `PluginContainerSuite`.


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


