Maxim Gekk created SPARK-24118:
----------------------------------
Summary: Support lineSep format independent from encoding
Key: SPARK-24118
URL: https://issues.apache.org/jira/browse/SPARK-24118
Project: Spark
Issue Type: Sub-task
Components: SQL
Affects Versions: 2.3.0
Reporter: Maxim Gekk
Currently, the lineSep option of JSON datasource is depend on encoding. It is
impossible to define correct lineSep for JSON files with BOM in UTF-16 and
UTF-32 encoding, for example. Need to propose a format of lineSep which will
represent sequence of octets (bytes) and will be independent from encoding.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]