[
https://issues.apache.org/jira/browse/SPARK-7687?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Josh Rosen updated SPARK-7687:
------------------------------
Description: In DataFrame.describe(), the count aggregate produces an
integer, the avg and stdev aggregates produce doubles, and min and max
aggregates can produce varying types depending on what type of column they're
applied to. As a result, we should cast all aggregate results to String so that
describe()'s output types match its declared output schema. (was: In
DataFrame.describe(), the count aggregate produces an integer, the avg and
stdev aggregates produce doubles, and min and max aggregates can produce
varying types depending on what type of column they're applied to. As a result,
we should cast all aggregate results to String so that describe()'s output
types can match its declared output schema.)
> DataFrame.describe() should cast all aggregates to String
> ---------------------------------------------------------
>
> Key: SPARK-7687
> URL: https://issues.apache.org/jira/browse/SPARK-7687
> Project: Spark
> Issue Type: Bug
> Components: SQL
> Affects Versions: 1.3.1
> Reporter: Josh Rosen
> Assignee: Josh Rosen
> Priority: Critical
>
> In DataFrame.describe(), the count aggregate produces an integer, the avg and
> stdev aggregates produce doubles, and min and max aggregates can produce
> varying types depending on what type of column they're applied to. As a
> result, we should cast all aggregate results to String so that describe()'s
> output types match its declared output schema.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]