Matthias Boehm created SYSTEMML-1968: ----------------------------------------
Summary: Improve cost-based optimizer for distributed operations Key: SYSTEMML-1968 URL: https://issues.apache.org/jira/browse/SYSTEMML-1968 Project: SystemML Issue Type: Sub-task Reporter: Matthias Boehm The current cost-based codegen optimizer has a couple of short-comings that lead to missed opportunities for large-scale computation: * Cost model unaware of distributed operations, which sometimes leads to unnecessary fusion of cell-wise vector operations into distributed row operations, which hurts performance due to additional side inputs that needs to be broadcast to all executors. * The current identification of interesting points ignores common Cell entries. In order to reason about above problematic, we need to consider cell to row switches, i.e., Cell templates that are merged into row templates as well. -- This message was sent by Atlassian JIRA (v6.4.14#64029)