Martin Nowak created ARROW-12031:
------------------------------------

             Summary: [C++] infer CSV timestamps columns with fractional seconds
                 Key: ARROW-12031
                 URL: https://issues.apache.org/jira/browse/ARROW-12031
             Project: Apache Arrow
          Issue Type: Improvement
          Components: C++
    Affects Versions: 3.0.0
            Reporter: Martin Nowak


After implementing fractional second parsing in and ARROW-9107 and ARROW-10337, 
no support for inference of subsecond timestamps was added (see 
[code]([https://github.com/apache/arrow/blob/master/cpp/src/arrow/csv/inference_internal.h#L115),
 also mentioned in ARROW-3738).

The [InferState logic | 
https://github.com/apache/arrow/blob/81d6724546559fa8d6e85376a1cc339dab0506ad/cpp/src/arrow/csv/inference_internal.h#L63]
 could test for TimestampNS (with nanosecond resolution) after Timestamps with 
second resolution to not change existing inference results.

Wouldn't expect too much interest in trying to infer timestamps with milli- and 
microsecond resolution. The nanosecond resolution [can already parse 
|https://github.com/apache/arrow/blob/81d6724546559fa8d6e85376a1cc339dab0506ad/cpp/src/arrow/util/value_parsing.h#L512-L518]
 the lower resolution ones.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to