[GitHub] [spark] AngersZhuuuu commented on a change in pull request #35229: [SPARK-27442][SQL] Remove check field name when reading data

GitBox Tue, 18 Jan 2022 05:42:38 -0800


AngersZhuuuu commented on a change in pull request #35229:
URL: https://github.com/apache/spark/pull/35229#discussion_r786769731




##########
File path: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala
##########
@@ -4243,6 +4243,15 @@ class SQLQuerySuite extends QueryTest with 
SharedSparkSession with AdaptiveSpark
       checkAnswer(df3, df4)
     }
   }
+
+  test("SPARK-27442: Spark support read parquet file with invalid char in 
field name") {
+    withResourceTempPath("test-data/field_with_invalid_char.snappy.parquet") { 
dir =>
+      val df = spark.read.parquet(dir.getAbsolutePath)
+      checkAnswer(df, Row(1, 2, 3, 4, 5) :: Nil)
+      assert(df.schema.names.sameElements(Array("max(t)", "a b", "{", ".", 
"a.b")))

Review comment:
       > can we have both `a` and `a.b` fields, and then filter by `a.b`? just 
make sure filter pushdown still works.
   
   Added 




-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]



---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

[GitHub] [spark] AngersZhuuuu commented on a change in pull request #35229: [SPARK-27442][SQL] Remove check field name when reading data

Reply via email to