[GitHub] spark issue #16415: [SPARK-19007]Speedup and optimize the GradientBoostedTre...

zdh2292390 Thu, 05 Jan 2017 18:17:23 -0800

Github user zdh2292390 commented on the issue:

    https://github.com/apache/spark/pull/16415
  
    @jkbradley   Accually, I see there are quite a few points  to improve  
MLlib GBDT.
    First, we should stop using the RandomForest  API  and make a private API  
for GBDT  because  there are a lot of redundancy or reduplicative operations in 
RandomForest  API   such as  buildMetadata  ãfindSplitsBins and  
convertToTreeRDD. These operations  accually only need once for GBDT  and not 
necessary to do for every tree.
    
    I'd like to change that  but I need some advices  to avoid  break the 
existing architectureã



---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] spark issue #16415: [SPARK-19007]Speedup and optimize the GradientBoostedTre...

Reply via email to