GitHub user WeichenXu123 opened a pull request:
https://github.com/apache/spark/pull/21163
[SPARK-24097][ML] Instruments improvements - RandomForest and
GradientBoostedTree
## What changes were proposed in this pull request?
Instruments improvements for `RandomForest` and `GradientBoostedTree` in
`ml.tree.impl` package.
**Note:** I add `Instrumentation` and `OptionalInstrumentation` with
`Serializable`. Because tree algos need to ouput some logs on executor side and
the instrumentation object need to be broadcast.
## How was this patch tested?
Manual.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/WeichenXu123/spark instr_rf_gbt
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/21163.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #21163
----
commit cd5ea88687b72335647d74c0aeef375de01724d9
Author: WeichenXu <weichen.xu@...>
Date: 2018-04-26T04:59:18Z
init pr
----
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]