lachezar-n opened a new pull request, #2108:
URL: https://github.com/apache/systemds/pull/2108

   This PR makes significant changes to the existing implementation and logic 
of the Cost Estimator and adds some basic tests to increase the coverage. 
   
   State of the Cost Estimator:
   * Supports more CP instructions
   * Now supports a basic range of Spark instructions
   * Supports inferring variables' dimensions for several instructions (CP 
and Spark)
   * Implements lineage-like logic for estimating the cost of Spark 
instructions 
   * Extends the Cost Estimator framework to include resource metrics that are 
not relevant for compilation but are relevant for the cost (mem/disk/network 
bandwidth, CPU NLFOPS)
   
   About the tests:
   * Not many tests were added, but some general test classes with examples 
were created, which can be extended to increase the coverage
   * Currently supported algorithms: L2SVM, LinReg, PCA, PNMF; Means excluded 
<- includes parser + many unknowns
   * Further tests for the enumerators should follow soon, together with the 
finalised version of the enumerators themselves
   
   *Note: this took longer than discussed, but I also added simplified logic 
to account for reading disk spill-overs in Spark operations, to cover the 
potential scenarios we discussed. 


