bkietz commented on pull request #9130:
URL: https://github.com/apache/arrow/pull/9130#issuecomment-760312991


   @pitrou The dictionaries are [a feature inspired by 
ParquetDataset](https://github.com/apache/arrow/pull/7536#issuecomment-649500017):
 it's useful for each partition expression to contain the dictionary of all 
unique values that field could take. They are only required when parsing paths. 
When constructing a Partitioning from a factory (inferring fields from a vector 
of paths) the dictionaries are assembled automatically. However if the 
Partitioning is being directly constructed then the dictionaries must be 
explicitly specified.
   
   @jorisvandenbossche I'll add a binding for `max_partitions` to python


----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to