Matthias Boehm created SYSTEMML-2162:
----------------------------------------

             Summary: Deserialization of compressed blocks w/ shared dictionary 
fails in Spark
                 Key: SYSTEMML-2162
                 URL: https://issues.apache.org/jira/browse/SYSTEMML-2162
             Project: SystemML
          Issue Type: Bug
            Reporter: Matthias Boehm


For very large datasets that exceed aggregate cluster memory, we automatically 
apply compression. However, in case blocks with shared DDC1 dictionaries are 
evicted and subsequently deserialized this deserialization fails with 
mismatching number of bytes.

{code}
java.lang.IllegalStateException: unread block data
        at 
java.io.ObjectInputStream$BlockDataInputStream.setBlockDataMode(ObjectInputStream.java:2431)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1383)
        at 
java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2000)
        at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:1924)
        at 
java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:1801)
        at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1351)
        at java.io.ObjectInputStream.readObject(ObjectInputStream.java:371)
{code}




--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to