AngersZhuuuu commented on a change in pull request #35229:
URL: https://github.com/apache/spark/pull/35229#discussion_r786769731
##########
File path: sql/core/src/test/scala/org/apache/spark/sql/SQLQuerySuite.scala
##########
@@ -4243,6 +4243,15 @@ class SQLQuerySuite extends QueryTest with SharedSparkSession with AdaptiveSpark
checkAnswer(df3, df4)
}
}
+
+ test("SPARK-27442: Spark support read parquet file with invalid char in field name") {
+ withResourceTempPath("test-data/field_with_invalid_char.snappy.parquet") { dir =>
+ val df = spark.read.parquet(dir.getAbsolutePath)
+ checkAnswer(df, Row(1, 2, 3, 4, 5) :: Nil)
+ assert(df.schema.names.sameElements(Array("max(t)", "a b", "{", ".", "a.b")))
Review comment:
> can we have both `a` and `a.b` fields, and then filter by `a.b`? just make sure filter pushdown still works.
Added
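
For illustration only, a check of the kind requested might look like the sketch below. It is not the code added in the PR. It assumes the request means two top-level columns, one named `a` and one literally named `a.b`; the object name, the temp directory and the sample rows are all hypothetical. The dotted name is quoted with backticks so it resolves as a single column rather than as field `b` of `a`.

```scala
import java.nio.file.Files

import org.apache.spark.sql.SparkSession
import org.apache.spark.sql.functions.col

// Hypothetical standalone sketch, not part of SQLQuerySuite.
object DottedColumnFilterSketch {
  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[1]")
      .appName("dotted-column-filter-sketch")
      .getOrCreate()
    import spark.implicits._

    try {
      // Assumed data: top-level columns "a" and "a.b" side by side.
      val path = Files.createTempDirectory("dotted-col").resolve("data").toString
      Seq((1, 2), (3, 4)).toDF("a", "a.b").write.parquet(path)

      // Backticks make "a.b" a single column name instead of a nested field access.
      val filtered = spark.read.parquet(path).filter(col("`a.b`") === 4)

      // Print the physical plan so the pushed-down filters can be inspected by eye.
      filtered.explain()

      val rows = filtered.collect().map(r => (r.getInt(0), r.getInt(1))).toSeq
      assert(rows == Seq((3, 4)), s"unexpected rows: $rows")
    } finally {
      spark.stop()
    }
  }
}
```

The assertion only checks that the filter returns the right row. Whether the predicate was actually pushed down to the Parquet reader has to be read off the plan printed by `explain()`.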
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
To unsubscribe, e-mail: [email protected]
For queries about this service, please contact Infrastructure at:
[email protected]