[jira] [Assigned] (SPARK-21100) describe should give quartiles similar to Pandas
     [ https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-21100:
------------------------------------

    Assignee:     (was: Apache Spark)

> describe should give quartiles similar to Pandas
> ------------------------------------------------
>
>                 Key: SPARK-21100
>                 URL: https://issues.apache.org/jira/browse/SPARK-21100
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.1.1
>            Reporter: Andrew Ray
>            Priority: Minor
>
> The DataFrame describe method should also include quartiles (25th, 50th, and
> 75th percentiles) like Pandas.
> Example pandas output:
> {code}
> In [4]: df.describe()
> Out[4]:
>        Unnamed: 0       displ        year         cyl         cty         hwy
> count      234.00      234.00      234.00      234.00      234.00      234.00
> mean       117.50    3.471795     2003.50        5.89   16.858974   23.440171
> std     67.694165    1.291959    4.509646    1.611534    4.255946    5.954643
> min          1.00        1.60     1999.00        4.00        9.00       12.00
> 25%         59.25        2.40     1999.00        4.00       14.00       18.00
> 50%        117.50        3.30     2003.50        6.00       17.00       24.00
> 75%        175.75        4.60     2008.00        8.00       19.00       27.00
> max        234.00        7.00     2008.00        8.00       35.00       44.00
> {code}

--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
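For reference, the quartile rows in the pandas output quoted above can be reproduced with the Python standard library alone. The sketch below is illustrative, not the proposed Spark implementation. The `describe` helper is a hypothetical name, and it assumes the `Unnamed: 0` column is simply the row index 1..234, which matches the count, min, max and mean shown but is not stated in the issue. It relies on `statistics.quantiles` with `method="inclusive"`, which interpolates linearly between neighbouring sorted values.

```python
import statistics


def describe(values):
    """Return a describe-style summary: count, mean, std, min, quartiles, max."""
    data = sorted(values)
    # n=4 yields the three cut points (25th, 50th, 75th percentiles).
    # method="inclusive" treats the data as including its own min and max and
    # interpolates linearly between adjacent sorted values.
    q25, q50, q75 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "count": len(data),
        "mean": statistics.fmean(data),
        "std": statistics.stdev(data),  # sample standard deviation (n - 1)
        "min": data[0],
        "25%": q25,
        "50%": q50,
        "75%": q75,
        "max": data[-1],
    }


# Assumption: the "Unnamed: 0" column holds the integers 1 through 234.
summary = describe(range(1, 235))
for name, value in summary.items():
    print(f"{name:<6}{value:>12.6f}")
```

Under that assumption the three quartiles come out as 59.25, 117.5 and 175.75, in line with the `25%`, `50%` and `75%` rows of the first column. An implementation inside Spark would more likely use approximate quantiles on large data, so exact agreement with pandas should not be taken for granted.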
[jira] [Assigned] (SPARK-21100) describe should give quartiles similar to Pandas
     [ https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Apache Spark reassigned SPARK-21100:
------------------------------------

    Assignee: Apache Spark

> describe should give quartiles similar to Pandas
> ------------------------------------------------
>
>                 Key: SPARK-21100
>                 URL: https://issues.apache.org/jira/browse/SPARK-21100
>             Project: Spark
>          Issue Type: Improvement
>          Components: SQL
>    Affects Versions: 2.1.1
>            Reporter: Andrew Ray
>            Assignee: Apache Spark
>            Priority: Minor
>
> The DataFrame describe method should also include quartiles (25th, 50th, and
> 75th percentiles) like Pandas.