Nong Li created SPARK-13574:
-------------------------------

             Summary: Improve parquet dictionary decoding for strings
                 Key: SPARK-13574
                 URL: https://issues.apache.org/jira/browse/SPARK-13574
             Project: Spark
          Issue Type: Improvement
            Reporter: Nong Li
            Priority: Minor


Currently, the Parquet reader copies the dictionary value for each data 
value. This is costly for string columns because decoding expands the 
dictionary into one copy per data value. Instead, the data values should 
point into the dictionary's backing memory.
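
The difference between the two decode strategies can be sketched as follows. 
This is a minimal illustration, not Spark's reader code: the class 
DictionaryDecodeSketch, the StringRef view type, and both decode methods are 
hypothetical names, and the dictionary is assumed to be a plain array of 
UTF-8 byte arrays indexed by dictionary id.

```java
import java.nio.charset.StandardCharsets;

public class DictionaryDecodeSketch {

    /** Hypothetical view of a string: shared bytes plus an offset and length. */
    static final class StringRef {
        final byte[] base;
        final int offset;
        final int length;

        StringRef(byte[] base, int offset, int length) {
            this.base = base;
            this.offset = offset;
            this.length = length;
        }

        /** Builds a Java String only when a caller actually needs one. */
        String materialize() {
            return new String(base, offset, length, StandardCharsets.UTF_8);
        }
    }

    /** Copying decode: every data value gets its own copy of the entry's bytes. */
    static byte[][] decodeByCopy(byte[][] dictionary, int[] ids) {
        byte[][] out = new byte[ids.length][];
        for (int i = 0; i < ids.length; i++) {
            out[i] = dictionary[ids[i]].clone();
        }
        return out;
    }

    /** Referencing decode: every data value points at the dictionary entry. */
    static StringRef[] decodeByReference(byte[][] dictionary, int[] ids) {
        StringRef[] out = new StringRef[ids.length];
        for (int i = 0; i < ids.length; i++) {
            byte[] entry = dictionary[ids[i]];
            out[i] = new StringRef(entry, 0, entry.length);
        }
        return out;
    }

    public static void main(String[] args) {
        byte[][] dictionary = {
            "spark".getBytes(StandardCharsets.UTF_8),
            "parquet".getBytes(StandardCharsets.UTF_8)
        };
        int[] ids = {0, 1, 1, 0, 1};

        byte[][] copies = decodeByCopy(dictionary, ids);
        StringRef[] refs = decodeByReference(dictionary, ids);

        // Rows 1 and 2 share a dictionary id: the copies are distinct arrays,
        // while the references share the same backing array.
        System.out.println(copies[1] == copies[2]);
        System.out.println(refs[1].base == refs[2].base);
        System.out.println(refs[4].materialize());
    }
}
```

With the referencing approach, the string bytes are stored once per 
dictionary entry rather than once per data value. The trade-off is that the 
dictionary's memory must stay valid and unmodified for as long as any value 
refers to it.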


