MuteBardTison opened a new pull request, #46035:
URL: https://github.com/apache/arrow/pull/46035

   ### Rationale for this change
   Currently, the Arrow C++ CSV converter does not support parsing strings into 
`duration` types. This limits CSV ingestion capabilities when handling datasets 
with time-based intervals represented as strings (e.g., `1s`, `2m`, `3h`). This 
PR adds support for parsing such strings into Arrow's `DurationType`.
   
   ### What changes are included in this PR?
   - Added `DurationValueDecoder` using `StringConverter<DurationType>`
   - Registered support in both standard and dictionary converters
   - Added unit tests covering:
     - Basic parsing across all time units (s, ms, µs, ns)
     - Null and custom null values
     - Whitespace handling and error cases
   
   ### Are these changes tested?
   Yes, conversion logic is fully covered by new tests in `converter_test.cc`.
   
   ### Are there any user-facing changes?
   Yes, users can now convert duration strings in CSV files to Arrow `duration` 
arrays by specifying the appropriate schema type.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: github-unsubscr...@arrow.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to