[jira] [Assigned] (SPARK-21100) describe should give quartiles similar to Pandas

2017-06-14 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-21100:


Assignee: (was: Apache Spark)

> describe should give quartiles similar to Pandas
> 
>
> Key: SPARK-21100
> URL: https://issues.apache.org/jira/browse/SPARK-21100
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 2.1.1
> Reporter: Andrew Ray
> Priority: Minor
>
> The DataFrame describe method should also include quartiles (25th, 50th, and 
> 75th percentiles) like Pandas.
> Example pandas output:
> {code}
> In [4]: df.describe()
> Out[4]:
>        Unnamed: 0       displ        year         cyl         cty         hwy
> count      234.00      234.00      234.00      234.00      234.00      234.00
> mean       117.50    3.471795     2003.50        5.89   16.858974   23.440171
> std     67.694165    1.291959    4.509646    1.611534    4.255946    5.954643
> min          1.00        1.60     1999.00        4.00        9.00       12.00
> 25%         59.25        2.40     1999.00        4.00       14.00       18.00
> 50%        117.50        3.30     2003.50        6.00       17.00       24.00
> 75%        175.75        4.60     2008.00        8.00       19.00       27.00
> max        234.00        7.00     2008.00        8.00       35.00       44.00
> {code}
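
To make the requested rows concrete, here is a minimal sketch (not Spark code, and not the proposed implementation) that computes the same eight summary statistics with only the Python standard library. The helper name `describe_with_quartiles` is hypothetical, and the input assumes that the "Unnamed: 0" column above is simply a row index running from 1 to 234, which is consistent with its min, max, and mean but is not stated in the issue.

```python
import statistics


def describe_with_quartiles(values):
    """Return pandas-style summary statistics for a sequence of numbers."""
    data = sorted(values)
    # method="inclusive" interpolates linearly between neighbouring points,
    # treating the data as including the min and max; this is the same rule
    # as the default "linear" interpolation used by pandas quantiles.
    q25, q50, q75 = statistics.quantiles(data, n=4, method="inclusive")
    return {
        "count": len(data),
        "mean": statistics.mean(data),
        "std": statistics.stdev(data),  # sample standard deviation (n - 1)
        "min": data[0],
        "25%": q25,
        "50%": q50,
        "75%": q75,
        "max": data[-1],
    }


# Assumption: the "Unnamed: 0" column is a row index from 1 to 234.
summary = describe_with_quartiles(range(1, 235))
for name, value in summary.items():
    print(f"{name:>5} {value:12.6f}")
```

Under that assumption, the 25%, 50%, and 75% entries come out as 59.25, 117.5, and 175.75, the values shown in the first column of the pandas output. On a distributed DataFrame, exact quartiles require a global sort, so an implementation may reasonably prefer approximate quantiles instead.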



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Assigned] (SPARK-21100) describe should give quartiles similar to Pandas

2017-06-14 Thread Apache Spark (JIRA)

 [ 
https://issues.apache.org/jira/browse/SPARK-21100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Apache Spark reassigned SPARK-21100:


Assignee: Apache Spark

> describe should give quartiles similar to Pandas
> 
>
> Key: SPARK-21100
> URL: https://issues.apache.org/jira/browse/SPARK-21100
> Project: Spark
> Issue Type: Improvement
> Components: SQL
> Affects Versions: 2.1.1
> Reporter: Andrew Ray
> Assignee: Apache Spark
> Priority: Minor
>
> The DataFrame describe method should also include quartiles (25th, 50th, and 
> 75th percentiles) like Pandas.
> Example pandas output:
> {code}
> In [4]: df.describe()
> Out[4]:
>        Unnamed: 0       displ        year         cyl         cty         hwy
> count      234.00      234.00      234.00      234.00      234.00      234.00
> mean       117.50    3.471795     2003.50        5.89   16.858974   23.440171
> std     67.694165    1.291959    4.509646    1.611534    4.255946    5.954643
> min          1.00        1.60     1999.00        4.00        9.00       12.00
> 25%         59.25        2.40     1999.00        4.00       14.00       18.00
> 50%        117.50        3.30     2003.50        6.00       17.00       24.00
> 75%        175.75        4.60     2008.00        8.00       19.00       27.00
> max        234.00        7.00     2008.00        8.00       35.00       44.00
> {code}


