Proposal to add 'accuracy test suite' before 1.0 release

2017-02-17 Thread Niketan Pansare
Hi all, We currently test the correctness of individual runtime operators using our integration tests but not the "released" algorithms. To be fair, we do test a subset of "simplified" algorithms on synthetic datasets and compare the accuracy with R. Also, we are testing a subset of released

Re: Operators in HOP DAG

2017-02-17 Thread Matthias Boehm
ad 1: t(-*): ternary minus mult (for patterns like X-s*Y)
ad 2: ua(+RC): unary aggregate with aggregation function + (at runtime level you will see k+ for Kahan plus) and direction RC, i.e., full aggregate over rows and columns.
ad 3: lix: matrix or frame left indexing (for patterns like
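As an illustration of the first two operators described above, the following is a minimal Python sketch of their semantics only. It is not SystemML code: the function names `minus_mult` and `kahan_sum_all` are hypothetical, matrices are assumed to be plain lists of rows, and the Kahan loop is the textbook compensated summation, not necessarily how the runtime's k+ is implemented.

```python
def minus_mult(X, s, Y):
    """Elementwise X - s*Y for two equally sized matrices (lists of rows).

    Mirrors the pattern that t(-*) is described as covering.
    """
    return [[x - s * y for x, y in zip(row_x, row_y)]
            for row_x, row_y in zip(X, Y)]


def kahan_sum_all(X):
    """Sum over all rows and columns (direction RC) with Kahan compensation."""
    total = 0.0
    comp = 0.0  # running compensation for lost low-order bits
    for row in X:
        for v in row:
            y = v - comp
            t = total + y
            comp = (t - total) - y
            total = t
    return total


X = [[1.0, 2.0], [3.0, 4.0]]
Y = [[0.5, 0.5], [1.0, 1.0]]
Z = minus_mult(X, 2.0, Y)    # [[0.0, 1.0], [1.0, 2.0]]
print(Z, kahan_sum_all(Z))   # full aggregate of Z is 4.0
```

The compensation term only matters when many values of differing magnitude are accumulated; for the small example here it stays at zero.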

Re: Proposal to add 'accuracy test suite' before 1.0 release

2017-02-17 Thread Deron Eriksson
+1 for creating tests for the main algorithm scripts. This would be a great addition to the project. Note that the creation of tests (JUnit) typically requires some Java skills (and knowledge of ML algorithms) whereas a new algorithm script typically requires R/Python skills. Therefore, testing

Re: Proposal to add 'accuracy test suite' before 1.0 release

2017-02-17 Thread dusenberrymw
There is also the possibility of writing the correctness tests completely in DML itself, thus allowing an ML researcher / data scientist to easily create the tests. For example, the SystemML-NN library has a full test suite written entirely in DML in the `nn/test/` directory (i.e. no Java

Operators in HOP DAG

2017-02-17 Thread Nantia Makrynioti
Hello, I generated a HOP plan using -explain, but I can't find the meaning of the following operators:
1. t(-*)
2. ua(+RC)
3. lix
4. u(cast_as_scalar)
Thank you in advance, Nantia

Build failed in Jenkins: SystemML-DailyTest #816

2017-02-17 Thread jenkins
See Changes:
[npansar] [MINOR] Code refactoring MatrixIndexingSPInstruction to enable parallel
[Deron Eriksson] [SYSTEMML-1280] Restore and deprecate SQLContext methods
[Deron Eriksson] [SYSTEMML-1279] Decrease numCols