Matthias Boehm created SYSTEMML-1767:

             Summary: Performance issues codegen rowwise (column aggregation) 
w/ wide matrices
                 Key: SYSTEMML-1767
             Project: SystemML
          Issue Type: Bug
            Reporter: Matthias Boehm

On scenarios with wide matrices of millions of features, the codegen rowwise 
template shows performance issues due to unnecessary multi-threading which 
requires additional memory per thread for partial aggregation which leads to 
cache thrashing. We should similarly to the mmchain operator establish a 
threshold for maximum temporary results and fall back to sequential operations 
if this threshold is exceeded.

This message was sent by Atlassian JIRA

Reply via email to