Hi, for our large ALS runs, we are considering using sc.setCheckpointDir so that the intermediate factors are written to HDFS and the lineage is broken.
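To make the setup concrete, here is a minimal sketch of ALS with checkpointing enabled. It assumes a recent Spark release and the spark.ml ALS API rather than whatever version our jobs actually run on. The checkpoint path, the toy ratings, the column names and the rank / iteration / interval values are all placeholders, not our real configuration.

```scala
import org.apache.spark.ml.recommendation.ALS
import org.apache.spark.sql.SparkSession

object AlsCheckpointSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .appName("als-checkpoint-sketch")
      .getOrCreate()
    import spark.implicits._

    // Hypothetical checkpoint location; on a cluster this should be a
    // reliable shared filesystem such as HDFS.
    spark.sparkContext.setCheckpointDir("hdfs:///tmp/als-checkpoints")

    // Toy ratings (user, item, rating), standing in for the real data set.
    val ratings = Seq(
      (0, 0, 4.0f),
      (0, 1, 2.0f),
      (1, 0, 3.0f),
      (1, 1, 5.0f)
    ).toDF("user", "item", "rating")

    val als = new ALS()
      .setUserCol("user")
      .setItemCol("item")
      .setRatingCol("rating")
      .setRank(10)              // placeholder value
      .setMaxIter(20)           // placeholder value
      // Checkpoint the intermediate factors every 5 iterations, which
      // truncates their lineage. This only takes effect when a checkpoint
      // directory has been set on the SparkContext.
      .setCheckpointInterval(5)

    val model = als.fit(ratings)
    model.userFactors.show()

    spark.stop()
  }
}
```

The checkpoint interval is the knob an experiment would vary: a smaller interval means more writes to HDFS but shorter lineage chains, and a larger one means the opposite.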
Is there a comparison that shows the performance degradation caused by checkpointing? If not, I will be happy to add experiments for it. Thanks. Deb
