Github user vanzin commented on a diff in the pull request:
https://github.com/apache/spark/pull/21302#discussion_r188022670
--- Diff:
sql/core/src/test/scala/org/apache/spark/sql/execution/datasources/parquet/ParquetFilterSuite.scala
---
@@ -602,6 +602,16 @@ class ParquetFilterSuite extends QueryTest with
ParquetTest with SharedSQLContex
}
}
}
+
+ test("SPARK-23852: Broken Parquet push-down for partially-written
stats") {
+ // parquet-1217.parquet contains a single column with values -1, 0, 1,
2 and null.
+ // The row-group statistics include null counts, but not min and max
values, which
+ // triggers PARQUET-1217.
+ val df = readResourceParquetFile("test-data/parquet-1217.parquet")
--- End diff --
That should be done in master (and backported to 2.3 if desired).
---
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]