Wes McKinney created ARROW-3928:
-----------------------------------

             Summary: [Python] Add option to deduplicate PyBytes / PyString / PyUnicode objects in Table.to_pandas conversion path
                 Key: ARROW-3928
                 URL: https://issues.apache.org/jira/browse/ARROW-3928
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Python
            Reporter: Wes McKinney
             Fix For: 0.12.0

While hashing carries a performance penalty, the memory savings can be huge.

See also ARROW-3911 -- we should develop some reusable machinery for conversions that yield Python objects.
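
To illustrate the trade-off described above, here is a minimal pure-Python sketch of hash-based deduplication. It is not Arrow's conversion code: the helper name `deduplicate_objects` and the sample data are hypothetical, chosen only to show how a memo table lets equal values share one Python object at the cost of one hash lookup per value.

```python
def deduplicate_objects(values):
    """Return a list in which equal values share a single Python object.

    Hypothetical helper for illustration; not part of pyarrow.
    """
    memo = {}  # maps a value to the first object seen with that value
    # setdefault hashes each value once and returns the canonical object
    return [memo.setdefault(v, v) for v in values]


# Build equal-but-distinct str objects at runtime, roughly what a naive
# conversion produces when it allocates a new object for every cell.
raw = [bytes([97 + i % 3]).decode() * 20 for i in range(9)]
deduped = deduplicate_objects(raw)

print(len({id(s) for s in raw}))      # distinct objects before
print(len({id(s) for s in deduped}))  # distinct objects after
```

With many repeated values, the number of live objects drops from one per cell to one per distinct value, which is where the memory savings would come from. The per-value hash lookup is the performance penalty.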