Jack Howard created ARROW-17130:
-----------------------------------

             Summary: Enable multiple character delimiters in read_csv
                 Key: ARROW-17130
                 URL: https://issues.apache.org/jira/browse/ARROW-17130
             Project: Apache Arrow
          Issue Type: Improvement
          Components: Format
    Affects Versions: 8.0.1
            Reporter: Jack Howard


Read_CSV ParseOptions allows only a single character delimiter.   Single 
character delimiters are highly susceptible to the candidate value existing 
within the data to be loaded, negating the ability to serve as a delimiter.

If a double character delimiter is used, the current limit of a single 
character returns  "only single character unicode strings can be converted to 
Py_UCS4, got length 2"



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to