Github user zdh2292390 commented on the issue:
https://github.com/apache/spark/pull/16415
@jkbradley Accually, I see there are quite a few points to improve
MLlib GBDT.
First, we should stop using the RandomForest API and make a private API
for GBDT because there are a lot of redundancy or reduplicative operations in
RandomForest API such as buildMetadata ãfindSplitsBins and
convertToTreeRDD. These operations accually only need once for GBDT and not
necessary to do for every tree.
I'd like to change that but I need some advices to avoid break the
existing architectureã
---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]