amoeba commented on issue #38476: URL: https://github.com/apache/arrow/issues/38476#issuecomment-1783640499
Hi @JonathanDCohen, thanks for the bug report. `f.schema.column` takes a column index ([docs](https://arrow.apache.org/docs/python/generated/pyarrow.parquet.ParquetSchema.html#pyarrow.parquet.ParquetSchema.column)) rather than a name, so you should use it like this:

```python
>>> f.schema.column(0)
<ParquetColumnSchema>
  name: a
  path: a
  max_definition_level: 1
  max_repetition_level: 0
  physical_type: INT64
  logical_type: None
  converted_type (legacy): NONE
```

It would be good to add a check for the type of that argument, or maybe even allow the user to pass a column name in addition to an index. Would you be interested in contributing a change?
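As a rough illustration of the suggestion above, here is a minimal user-side sketch of what a type check plus name lookup could look like. `get_column` is a hypothetical helper, not part of pyarrow. It assumes only that the schema object exposes a `names` list and a `column(i)` method, as `ParquetSchema` does, and that the schema is flat, so a name's position in `names` matches its column index. Nested schemas would need a different lookup.

```python
from typing import Any, Union


def get_column(schema: Any, key: Union[int, str]) -> Any:
    """Return schema.column(i) for an integer index or a column name.

    Hypothetical helper: `schema` is expected to provide `names` (a list of
    str) and `column(i)`. Assumes a flat schema, where the position of a name
    in `names` is also its column index.
    """
    # Reject anything that is not an int or str up front; bool is excluded
    # explicitly because it is a subclass of int.
    if isinstance(key, bool) or not isinstance(key, (int, str)):
        raise TypeError(
            f"column key must be an int index or a str name, "
            f"got {type(key).__name__}"
        )

    if isinstance(key, str):
        # Translate the name into the index that column() expects.
        try:
            key = schema.names.index(key)
        except ValueError:
            raise KeyError(f"no column named {key!r}") from None

    return schema.column(key)
```

With the file from the report, `get_column(f.schema, "a")` would then be expected to return the same column as `f.schema.column(0)`, while passing something like a float would raise a `TypeError` with a readable message.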
