carols10cents opened a new pull request #8402:
URL: https://github.com/apache/arrow/pull/8402


   This adds more support for:
   
   - When converting Arrow -> Parquet containing an Arrow Dictionary,
   materialize the Dictionary values and send to Parquet to be encoded with
   a dictionary or not according to the Parquet settings (not supported:
   converting an Arrow Dictionary directly to Parquet DictEncoding, also
   only supports Int32 index types in this commit, also removes NULLs)
   - When converting Parquet -> Arrow, noticing that the Arrow schema
   metadata in a Parquet file has a Dictionary type and converting the data
   to an Arrow dictionary (right now this only supports String dictionaries
   
   I'm not sure if this is in a good enough state to merge or not yet, please 
let me know @nevi-me !


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
[email protected]


Reply via email to