[
https://issues.apache.org/jira/browse/ARROW-18001?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17616421#comment-17616421
]
Alenka Frim commented on ARROW-18001:
-------------------------------------
Oh, sorry for the confusion, and thank you for clearing it up for me!
> [Python] parquet.write_table/parquet.ParquetWriter should accept a subset of
> columns
> ------------------------------------------------------------------------------------
>
> Key: ARROW-18001
> URL: https://issues.apache.org/jira/browse/ARROW-18001
> Project: Apache Arrow
> Issue Type: Improvement
> Components: Python
> Reporter: Alenka Frim
> Priority: Major
>
> This question came up in the GitHub issue
> [https://github.com/apache/arrow/issues/14025], and it would be a good
> improvement to the Parquet part of PyArrow. I haven't found an existing issue,
> so I created a new one.
> h6. Description:
> If a user wants to change the type of a single column when using
> {{parquet.write_table}}/{{parquet.ParquetWriter}}, they currently need
> to specify a schema that includes every column. If a column is not specified
> in the schema, it will not be included in the Parquet file.
> h6. Proposal
> {{parquet.ParquetWriter}} should be able to accept a schema that covers only
> a subset of columns and infer everything else.