AlenkaF commented on code in PR #47663:
URL: https://github.com/apache/arrow/pull/47663#discussion_r2618327372
##########
python/pyarrow/_csv.pyx:
##########
@@ -807,6 +810,59 @@ cdef class ConvertOptions(_Weakrefable):
fast: bool
----
fast: [[true,true,false,false,null]]
+
+ Set a default column type for all columns (disables type inference):
+
+ >>> convert_options = csv.ConvertOptions(default_column_type=pa.string())
+ >>> csv.read_csv(io.BytesIO(s.encode()), convert_options=convert_options)
+ pyarrow.Table
+ animals: string
+ n_legs: string
+ entry: string
+ fast: string
+ ----
+ animals: [["Flamingo","Horse","Brittle stars","Centipede",""]]
+ n_legs: [["2","4","5","100","6"]]
+ entry: [["01/03/2022","02/03/2022","03/03/2022","04/03/2022","05/03/2022"]]
+ fast: [["Yes","Yes","No","No",""]]
+
+ Combine default_column_type with column_types (specific column types
override default):
+
+ >>> convert_options = csv.ConvertOptions(
+ ... column_types={"n_legs": pa.int64(), "fast":
pa.bool_()},
+ ... default_column_type=pa.string(),
+ ... true_values=["Yes"],
+ ... false_values=["No"])
+ >>> csv.read_csv(io.BytesIO(s.encode()), convert_options=convert_options)
+ pyarrow.Table
+ animals: string
+ n_legs: int64
+ entry: string
+ fast: bool
+ ----
+ animals: [["Flamingo","Horse","Brittle stars","Centipede",""]]
+ n_legs: [[2,4,5,100,6]]
+ entry: [["01/03/2022","02/03/2022","03/03/2022","04/03/2022","05/03/2022"]]
+ fast: [[true,true,false,false,null]]
+
+ Use default_column_type with selective column_types for mixed type
conversion:
Review Comment:
I mean the last two examples, both specifying `column_types` and
`default_column_type`. I think one can be omitted.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]