[ 
https://issues.apache.org/jira/browse/SYSTEMML-1274?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Matthias Boehm resolved SYSTEMML-1274.
--------------------------------------
       Resolution: Done
         Assignee: Matthias Boehm
    Fix Version/s: SystemML 0.13

> Unnecessary rdd computation for nnz maintenance on write
> --------------------------------------------------------
>
>                 Key: SYSTEMML-1274
>                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1274
>             Project: SystemML
>          Issue Type: Bug
>          Components: Runtime
>            Reporter: Matthias Boehm
>            Assignee: Matthias Boehm
>             Fix For: SystemML 0.13
>
>
> Our primitive for writing binary block RDDs to HDFS (as used in guarded 
> collect), first computes the number of non-zeros (nnz) and subsequently 
> writes out the data. This leads to redundant RDD computation, which can be 
> expensive for large DAGs of RDD operations. Explicitly computing the nnz is 
> unnecessary as we could simply piggyback this computation onto the write via 
> an accumulator as done in multiple other places in SystemML. 



--
This message was sent by Atlassian JIRA
(v6.3.15#6346)

Reply via email to