[ 
https://issues.apache.org/jira/browse/MADLIB-976?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Frank McQuillan updated MADLIB-976:
-----------------------------------
    Fix Version/s:     (was: v1.12)
                   v2.0

> Random Forest - extremely long training time
> --------------------------------------------
>
>                 Key: MADLIB-976
>                 URL: https://issues.apache.org/jira/browse/MADLIB-976
>             Project: Apache MADlib
>          Issue Type: Bug
>          Components: Module: Random Forest
>            Reporter: Rashmi Raghu
>             Fix For: v2.0
>
>
> When running Random Forest training function on a modest data set it took a 
> long time - much longer than expected or desired. The training data set has 
> around 8000 rows and 400 features. Several models on similar data all took 
> around 40,000 seconds to run.  Each time we used 100 trees and 14 random 
> features selected. Dataset shared offline.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to