Re: GradientBoostTrees leaks a persisted RDD

Sean Owen Thu, 23 Apr 2015 09:24:12 -0700

Those are different RDDs that DecisionTree persists, though. It's not redundant.


On Thu, Apr 23, 2015 at 11:12 AM, jimfcarroll <[email protected]> wrote:
> Hi Sean and Joe,
>
> I have another question.
>
> GradientBoostedTrees.run iterates over the RDD calling DecisionTree.run on
> each iteration with a new random sample from the input RDD. DecisionTree.run
> calls RandomForest.run. which also calls persist.
>
> One of these seems superfluous.
>
> Should I simply remove the persist call at the GradientBoostedTrees level?
>
> Thanks
> Jim
>
>
>
>
> --
> View this message in context: 
> http://apache-spark-developers-list.1001551.n3.nabble.com/GradientBoostTrees-leaks-a-persisted-RDD-tp11750p11762.html
> Sent from the Apache Spark Developers List mailing list archive at Nabble.com.
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: [email protected]
> For additional commands, e-mail: [email protected]
>

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Re: GradientBoostTrees leaks a persisted RDD

Reply via email to