Bryan Chen created SPARK-21756:
----------------------------------
Summary: Add JSON option to allow unquoted control characters
Key: SPARK-21756
URL: https://issues.apache.org/jira/browse/SPARK-21756
Project: Spark
Issue Type: Improvement
Components: SQL
Affects Versions: 2.1.0
Reporter: Bryan Chen
Hi, we have unquoted control chars in raw json data and the JSON reader just
failed at parsing those and complains they are malformed records.
Can we add an allowUnquotedControlChars in JSONOptions for this scenario? The
default value would be set to false according to the spec though.
{panel:title=From JacksonParser}
/**
* Feature that determines whether parser will allow
* JSON Strings to contain unquoted control characters
* (ASCII characters with value less than 32, including
* tab and line feed characters) or not.
* If feature is set false, an exception is thrown if such a
* character is encountered.
*<p>
* Since JSON specification requires quoting for all control characters,
* this is a non-standard feature, and as such disabled by default.
*/
ALLOW_UNQUOTED_CONTROL_CHARS(false),
{panel}
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]