Lauri Koobas created SPARK-34993:
------------------------------------

             Summary: from_json() acts differently on created and literal strings with backslashes
                 Key: SPARK-34993
                 URL: https://issues.apache.org/jira/browse/SPARK-34993
             Project: Spark
          Issue Type: Bug
          Components: SQL
    Affects Versions: 3.1.1
         Environment: Databricks DBR 8.1
            Reporter: Lauri Koobas

A JSON string whose value contains backslashes is not recovered by `from_json()` when it is passed as a string literal. If the same string is created with `to_json(named_struct())`, it does work.

The following query reproduces the issue. I would expect all of these expressions to return the same (correct) result:

```
select to_json(named_struct('msg', '\\'))
  , schema_of_json(to_json(named_struct('msg', '\\')))
  , from_json(to_json(named_struct('msg', '\\')), schema_of_json(to_json(named_struct('msg', '\\'))))
  , from_json('{"msg":"\\"}', schema_of_json(to_json(named_struct('msg', '\\'))))
  , from_json('{"msg":"\\"}', 'msg string')
```
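
One possible explanation, which this report does not confirm, is that the SQL parser unescapes backslashes in string literals before `from_json()` sees them. The sketch below illustrates that hypothesis in plain Python, without Spark. It assumes the default setting, where `\\` in a SQL literal becomes a single backslash; `sql_unescape` and `try_parse` are hypothetical helpers written for this illustration and are not Spark APIs.

```python
import json


def sql_unescape(literal_body: str) -> str:
    """Hypothetical, simplified model of SQL string-literal unescaping.

    Assumption: a backslash followed by any character yields that character.
    Real parsers also handle sequences such as \\n and \\t, ignored here.
    """
    out = []
    i = 0
    while i < len(literal_body):
        ch = literal_body[i]
        if ch == "\\" and i + 1 < len(literal_body):
            out.append(literal_body[i + 1])  # drop the escaping backslash
            i += 2
        else:
            out.append(ch)
            i += 1
    return "".join(out)


def try_parse(text: str):
    """Return the parsed JSON object, or None if the text is not valid JSON."""
    try:
        return json.loads(text)
    except json.JSONDecodeError:
        return None


# Case 1: the value '\\' is typed in SQL, then serialized to JSON.
value = sql_unescape(r"\\")  # a single backslash after unescaping
generated = json.dumps({"msg": value}, separators=(",", ":"))

# Case 2: the whole JSON document is typed as a SQL literal.
literal = sql_unescape(r'{"msg":"\\"}')

print("generated:", generated, "->", try_parse(generated))
print("literal:  ", literal, "->", try_parse(literal))
```

Under this assumption, the generated string keeps an escaped backslash and parses, while the typed literal loses one backslash and its remaining `\"` swallows the closing quote, so it is no longer valid JSON. If that is what happens, doubling the backslashes in the literal (`'{"msg":"\\\\"}'`) would be the thing to try.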