[
https://issues.apache.org/jira/browse/ARROW-12539?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Joris Van den Bossche updated ARROW-12539:
------------------------------------------
Summary: [C++] Unable to read date64 or date32 in specific format from CSV
(was: [C++] Unable to read date64 or date32 in specific format)
> [C++] Unable to read date64 or date32 in specific format from CSV
> -----------------------------------------------------------------
>
> Key: ARROW-12539
> URL: https://issues.apache.org/jira/browse/ARROW-12539
> Project: Apache Arrow
> Issue Type: Bug
> Components: C++, Python
> Affects Versions: 3.0.0
> Reporter: Stephen Bias
> Priority: Major
> Labels: csv, date
>
> when importing csv data with dates in the format {{"%d-%b-%y"}} or
> {{"%d-%b-%Y"}} an error is given in conversion:
> example:
> {code:python}
> import pyarrow as pa
> from pyarrow import csv
> data = b"a,b\n1,15-OCT-15\n2,18-JUN-90\n"
> tp = ["%d-%b-%y"]
> try:
> schema_d64 = pa.schema([pa.field("a", pa.int64()), pa.field("b",
> pa.date64())])
> co_d64 = csv.ConvertOptions(timestamp_parsers=tp, column_types=schema_d64)
> a_d64 = csv.read_csv(pa.py_buffer(data), convert_options=co_d64)
> except Exception as e:
> print(e)
> try:
> schema_d32 = pa.schema([pa.field("a", pa.int64()), pa.field("b",
> pa.date32())])
> co_d32 = csv.ConvertOptions(timestamp_parsers=tp, column_types=schema_d32)
> a_d32 = csv.read_csv(pa.py_buffer(data), convert_options=co_d32)
> except Exception as e:
> print(e){code}
>
--
This message was sent by Atlassian Jira
(v8.3.4#803005)