[ 
https://issues.apache.org/jira/browse/SYSTEMML-1009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matthias Boehm resolved SYSTEMML-1009.
--------------------------------------
       Resolution: Fixed
         Assignee: Matthias Boehm
    Fix Version/s: SystemML 0.11

> Avoid spark context creation on parfor optimization
> ---------------------------------------------------
>
>                 Key: SYSTEMML-1009
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1009
>             Project: SystemML
>          Issue Type: Improvement
>            Reporter: Matthias Boehm
>            Assignee: Matthias Boehm
>             Fix For: SystemML 0.11
>
>
> Currently, every parfor script triggers the lazy spark context creation, 
> independent of its input data size and script in order to obtain memory 
> budgets and parallelism. On small data the the spark context creation 
> dominates end-to-end execution time. We should improve this to a 
> configuration-only analysis, which would avoid the context creation.
> For example, here are the XS and S performance results for univariate 
> statistics:
> {code}
> UnivariateStatistics on mbperftest/bivar/A_10k/data: 14
> UnivariateStatistics on mbperftest/bivar/A_10k/data: 14
> UnivariateStatistics on mbperftest/bivar/A_10k/data: 17
> UnivariateStatistics on mbperftest/bivar/A_10k/data: 16
> UnivariateStatistics on mbperftest/bivar/A_100k/data: 14
> UnivariateStatistics on mbperftest/bivar/A_100k/data: 15
> UnivariateStatistics on mbperftest/bivar/A_100k/data: 14
> UnivariateStatistics on mbperftest/bivar/A_100k/data: 17
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to