Mike Dusenberry created SYSTEMML-1005:
-----------------------------------------

             Summary: MultiLogReg Test Failure: Invalid input w/ zeros for 
rexpand ignore=false (rlen=1617, nnz=1455).
                 Key: SYSTEMML-1005
                 URL: https://issues.apache.org/jira/browse/SYSTEMML-1005
             Project: SystemML
          Issue Type: Bug
            Reporter: Mike Dusenberry
            Priority: Blocker


Currently, the {{test_mllearn.py -> TestMLLearn.testLogisticSK1}} test is 
failing with the following error:

{code}
Caused by: org.apache.sysml.runtime.DMLRuntimeException: 
org.apache.sysml.runtime.DMLRuntimeException: ERROR: Runtime error in program 
block generated from statement block between lines 151 and 164 -- Error 
evaluating instruction: 
CP°rexpand°cast=true°max=10.0°ignore=false°dir=cols°target=Y_vec°_mVar275·MATRIX·DOUBLE
        at 
org.apache.sysml.runtime.controlprogram.Program.execute(Program.java:152)
        at 
org.apache.sysml.api.mlcontext.ScriptExecutor.executeRuntimeProgram(ScriptExecutor.java:374)
        ... 17 more
Caused by: org.apache.sysml.runtime.DMLRuntimeException: ERROR: Runtime error 
in program block generated from statement block between lines 151 and 164 -- 
Error evaluating instruction: 
CP°rexpand°cast=true°max=10.0°ignore=false°dir=cols°target=Y_vec°_mVar275·MATRIX·DOUBLE
        at 
org.apache.sysml.runtime.controlprogram.ProgramBlock.executeSingleInstruction(ProgramBlock.java:335)
        at 
org.apache.sysml.runtime.controlprogram.ProgramBlock.executeInstructions(ProgramBlock.java:224)
        at 
org.apache.sysml.runtime.controlprogram.ProgramBlock.execute(ProgramBlock.java:168)
        at 
org.apache.sysml.runtime.controlprogram.Program.execute(Program.java:145)
        ... 18 more
Caused by: org.apache.sysml.runtime.DMLRuntimeException: Invalid input w/ zeros 
for rexpand ignore=false (rlen=1617, nnz=1455).
        at 
org.apache.sysml.runtime.matrix.data.LibMatrixReorg.rexpand(LibMatrixReorg.java:721)
        at 
org.apache.sysml.runtime.matrix.data.MatrixBlock.rexpandOperations(MatrixBlock.java:5419)
        at 
org.apache.sysml.runtime.instructions.cp.ParameterizedBuiltinCPInstruction.processInstruction(ParameterizedBuiltinCPInstruction.java:252)
        at 
org.apache.sysml.runtime.controlprogram.ProgramBlock.executeSingleInstruction(ProgramBlock.java:305)
        ... 21 more
{code}

Basically, this test case directly creates {{MatrixBlocks}} and supplies them 
as input the {{LogisticRegression}} Scala wrapper we have, which in turn calls 
{{MultiLogReg.dml}}.  

Within {{MultiLogReg.dml}}, the {{Y_vec}} input is converted from a vector of 
class labels to a matrix of one-hot encoded labels.  During this conversion, 
the {{Y_vec}} vector is first transformed to have class labels <= 0 be 
converted to be the largest labels.  Thus, this updated {{Y_vec}} matrix has no 
zero values.  This updated {{Y_vec}} vector is then passed into the {{table}} 
function to be one-hot encoded.  At this point, it checks if {{Y_vec}} has any 
zero values based on the {{nnz}} of the {{MatrixBlock}}, and in this case fails 
because the {{nnz}} of {{Y_vec}} is still erroneously set to the previous 
{{nnz}} from before the above transformation for class labels <= 0.

Interestingly, if we remove the recent update to {{MultiLogReg.dml}} from 
SYSTEMML-958, 
[https://github.com/apache/incubator-systemml/commit/10dff5c9e3eb737a965846246d8187fcb0b03689],
 the test passes.  Regardless, this is a bug as the {{nnz}} should be updated 
after {{Y_vec}} is transformed to have no 0 values.

cc [~mboehm7]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to