GitHub user maropu opened a pull request:
https://github.com/apache/spark/pull/21964
[SPARK-24788][SQL] RelationalGroupedDataset.toString with unresolved exprs should not fail
## What changes were proposed in this pull request?
In the current master, `toString` throws an `UnresolvedException` when
`RelationalGroupedDataset` has unresolved grouping expressions:
```
scala> spark.range(0, 10).groupBy("id")
res4: org.apache.spark.sql.RelationalGroupedDataset = RelationalGroupedDataset: [grouping expressions: [id: bigint], value: [id: bigint], type: GroupBy]

scala> spark.range(0, 10).groupBy('id)
org.apache.spark.sql.catalyst.analysis.UnresolvedException: Invalid call to dataType on unresolved object, tree: 'id
  at org.apache.spark.sql.catalyst.analysis.UnresolvedAttribute.dataType(unresolved.scala:105)
  at org.apache.spark.sql.RelationalGroupedDataset$$anonfun$12.apply(RelationalGroupedDataset.scala:474)
  at org.apache.spark.sql.RelationalGroupedDataset$$anonfun$12.apply(RelationalGroupedDataset.scala:473)
  at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
  at scala.collection.TraversableLike$$anonfun$map$1.apply(TraversableLike.scala:234)
  at scala.collection.mutable.ResizableArray$class.foreach(ResizableArray.scala:59)
  at scala.collection.mutable.ArrayBuffer.foreach(ArrayBuffer.scala:48)
  at scala.collection.TraversableLike$class.map(TraversableLike.scala:234)
  at scala.collection.AbstractTraversable.map(Traversable.scala:104)
  at org.apache.spark.sql.RelationalGroupedDataset.toString(RelationalGroupedDataset.scala:473)
  at scala.runtime.ScalaRunTime$.scala$runtime$ScalaRunTime$$inner$1(ScalaRunTime.scala:332)
  at scala.runtime.ScalaRunTime$.stringOf(ScalaRunTime.scala:337)
  at scala.runtime.ScalaRunTime$.replStringOf(ScalaRunTime.scala:345)
```
Closes #21752
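
For reviewers, here is a minimal, Spark-free sketch of the failure mode and one way to guard against it. `Expr`, `ResolvedAttr`, `UnresolvedAttr`, and `renderGrouping` are hypothetical stand-ins, not Catalyst classes, and printing an unresolved expression by its bare name is an assumption made for illustration; the actual change in this PR may format such expressions differently.

```scala
// Toy stand-ins for Catalyst expressions; these are NOT Spark classes.
sealed trait Expr {
  def name: String
  def resolved: Boolean
}

// A resolved attribute knows its data type.
final case class ResolvedAttr(name: String, dataType: String) extends Expr {
  val resolved = true
}

// An unresolved attribute has no data type yet. Asking for one throws,
// loosely mirroring UnresolvedAttribute.dataType in the stack trace above.
final case class UnresolvedAttr(name: String) extends Expr {
  val resolved = false
  def dataType: String =
    throw new UnsupportedOperationException(
      s"Invalid call to dataType on unresolved object, tree: '$name")
}

// Render the grouping expressions without ever calling dataType on an
// unresolved one: resolved attributes print "name: type", unresolved
// attributes print only their name (an assumed format).
def renderGrouping(exprs: Seq[Expr]): String = {
  val fields = exprs.map {
    case ResolvedAttr(n, t) => s"$n: $t"
    case UnresolvedAttr(n)  => n
  }
  fields.mkString("[grouping expressions: [", ", ", "]]")
}
```

The point of the sketch is only that the string rendering branches on whether the expression is resolved before touching its type, so `toString` stays total for both `groupBy("id")` and `groupBy('id)` style inputs.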
## How was this patch tested?
Added tests in `DataFrameAggregateSuite`.
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/maropu/spark SPARK-24788
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/spark/pull/21964.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #21964
commit 465e7624073016d01ae6d3c5df501bf9b2c6410b
Author: Chris Horn
Date: 2018-07-11T21:25:26Z
SPARK-24788 failing test case
commit e995b0bf2824593532056ff0048e65e8a33e5aad
Author: Chris Horn
Date: 2018-07-11T21:25:54Z
SPARK-24788 fixed UnresolvedException when toString an unresolved grouping expression
commit 5213635d595f76261a8387e5a5135ebd9bcfa8d9
Author: Chris Horn
Date: 2018-07-13T19:09:35Z
simplify test description; remove whitespace
commit 2e48604ff9aadebc4f7f3f8edeee252722967da9
Author: Chris Horn
Date: 2018-07-13T19:22:07Z
do not use Matchers
commit c4e7490f1762aff5ae5b7126adb9ddd8d987a77d
Author: Takeshi Yamamuro
Date: 2018-08-02T06:20:34Z
Fix