Github user HyukjinKwon commented on a diff in the pull request:
https://github.com/apache/spark/pull/20849#discussion_r175262708
--- Diff: python/pyspark/sql/readwriter.py ---
@@ -237,6 +237,8 @@ def json(self, path, schema=None,
primitivesAsString=None, prefersDecimal=None,
:param allowUnquotedControlChars: allows JSON Strings to contain
unquoted control
characters (ASCII characters
with value less than 32,
including tab and line feed
characters) or not.
+ :param charset: standard charset name, for example UTF-8, UTF-16
and UTF-32. If None is
+ set, the charset of input json will be detected
automatically.
--- End diff --
Can qe have another test case with an encoding jackson doesn't
automatically detect too?
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]