Nong Li created SPARK-13574:
-------------------------------
Summary: Improve parquet dictionary decoding for strings
Key: SPARK-13574
URL: https://issues.apache.org/jira/browse/SPARK-13574
Project: Spark
Issue Type: Improvement
Reporter: Nong Li
Priority: Minor
Currently, the Parquet reader copies the dictionary value for each data
value. This is costly for string columns because the dictionary is
effectively exploded during decode: every row gets its own copy of the
string bytes. Instead, the data values should point to the same backing
memory as the dictionary entries.
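
A minimal sketch of the difference, for illustration only. The class and
method names below (DictionaryDecodeSketch, decodeByCopy, decodeByReference)
are hypothetical and are not Spark's actual reader classes. The sketch
assumes the dictionary is a plain array of UTF-8 byte arrays and that the
data page has already been decoded to integer dictionary ids.

```java
/** Hypothetical illustration; not the actual Spark Parquet reader code. */
final class DictionaryDecodeSketch {

    private DictionaryDecodeSketch() {}

    /**
     * Copying decode: each row gets a fresh copy of its dictionary entry,
     * so memory grows with (number of rows) x (string length).
     */
    static byte[][] decodeByCopy(byte[][] dictionary, int[] ids) {
        byte[][] out = new byte[ids.length][];
        for (int i = 0; i < ids.length; i++) {
            out[i] = dictionary[ids[i]].clone();
        }
        return out;
    }

    /**
     * Sharing decode: each row stores only a reference to the dictionary
     * entry, so rows with the same id point at the same backing array.
     */
    static byte[][] decodeByReference(byte[][] dictionary, int[] ids) {
        byte[][] out = new byte[ids.length][];
        for (int i = 0; i < ids.length; i++) {
            out[i] = dictionary[ids[i]];
        }
        return out;
    }
}
```

With ids {0, 1, 0}, decodeByCopy allocates three separate arrays, while
decodeByReference returns rows 0 and 2 as the same array object. The
sharing variant is only safe if the dictionary memory outlives the decoded
values and is not mutated while they are in use.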