[ https://issues.apache.org/jira/browse/SPARK-15447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Pentreath updated SPARK-15447:
-----------------------------------
Description:
We made several changes to ALS in 2.0. It is necessary to run some tests to
avoid performance regression. We should test (synthetic) datasets from 1
million ratings to 1 billion ratings.
cc [~mlnick] [~holdenk] Do you have time to run some large-scale performance
tests?
Links:
[Results spreadsheet|https://docs.google.com/spreadsheets/d/1iX5LisfXcZSTCHp8VPoo5z-eCO85A5VsZDtZ5e475ks/edit?usp=sharing]
was:
We made several changes to ALS in 2.0. It is necessary to run some tests to
avoid performance regression. We should test (synthetic) datasets from 1
million ratings to 1 billion ratings.
cc [~mlnick] [~holdenk] Do you have time to run some large-scale performance
tests?
> Performance test for ALS in Spark 2.0
> -------------------------------------
>
> Key: SPARK-15447
> URL: https://issues.apache.org/jira/browse/SPARK-15447
> Project: Spark
> Issue Type: Task
> Components: ML
> Affects Versions: 2.0.0
> Reporter: Xiangrui Meng
> Assignee: Nick Pentreath
> Priority: Critical
> Labels: QA
>
> We made several changes to ALS in 2.0. It is necessary to run some tests to
> avoid performance regression. We should test (synthetic) datasets from 1
> million ratings to 1 billion ratings.
> cc [~mlnick] [~holdenk] Do you have time to run some large-scale performance
> tests?
> Links:
> [Results spreadsheet|https://docs.google.com/spreadsheets/d/1iX5LisfXcZSTCHp8VPoo5z-eCO85A5VsZDtZ5e475ks/edit?usp=sharing]