[ 
https://issues.apache.org/jira/browse/SPARK-11668?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15004228#comment-15004228
 ] 

Xiangrui Meng commented on SPARK-11668:
---------------------------------------

[~shubhanshumis...@gmail.com] [~yanboliang] implemented standard errors, 
t-scores, and p-values in SPARK-11494. We definitely need to add more summary 
statistics, but this JIRA is too broad. I'm going to close it; please create 
separate JIRAs for concrete summary statistics (and group them properly). Thanks!

> R style summary stats in GLM package SparkR
> -------------------------------------------
>
>                 Key: SPARK-11668
>                 URL: https://issues.apache.org/jira/browse/SPARK-11668
>             Project: Spark
>          Issue Type: Improvement
>          Components: SparkR
>    Affects Versions: 1.5.0, 1.5.1
>         Environment: Linux, Windows, Mac
>            Reporter: Shubhanshu Mishra
>            Priority: Minor
>              Labels: GLM, sparkr
>
> In the current SparkR GLM module, the `summary(model)` call returns only the 
> coefficient values. In R's own GLM module, however, the function also returns 
> the standard error, z-score, p-value, and confidence interval for each 
> coefficient, as well as model-level statistics such as R-squared, AIC, and BIC. 
> Another source of inspiration for these metrics is the output format of the 
> Python statsmodels package, described here: 
> http://statsmodels.sourceforge.net/devel/examples/notebooks/generated/formulas.html
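
As a rough illustration of the per-coefficient statistics requested above, here is a minimal Python sketch for a one-predictor linear model. It is not SparkR or statsmodels code: the function name `coefficient_summary`, the sample data, and the use of a normal approximation for p-values and confidence intervals (R's `summary()` uses a t distribution for Gaussian models) are all assumptions made for this example.

```python
import math
from statistics import NormalDist


def coefficient_summary(x, y, level=0.95):
    """Fit y = b0 + b1 * x by least squares and return summary statistics.

    Hypothetical helper for illustration only. P-values and confidence
    intervals use a normal (z) approximation, which is an assumption here.
    """
    n = len(x)
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))

    # Least-squares estimates of slope and intercept.
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean

    # Residual variance with n - 2 degrees of freedom.
    rss = sum((yi - (intercept + slope * xi)) ** 2 for xi, yi in zip(x, y))
    tss = sum((yi - y_mean) ** 2 for yi in y)
    sigma2 = rss / (n - 2)

    # Standard errors of the two coefficients.
    std_errs = {
        "(Intercept)": math.sqrt(sigma2 * (1.0 / n + x_mean ** 2 / sxx)),
        "x": math.sqrt(sigma2 / sxx),
    }
    estimates = {"(Intercept)": intercept, "x": slope}

    normal = NormalDist()
    z_crit = normal.inv_cdf(0.5 + level / 2.0)

    rows = {}
    for name, est in estimates.items():
        se = std_errs[name]
        z = est / se
        rows[name] = {
            "estimate": est,
            "std_err": se,
            "z_score": z,
            # Two-sided p-value under the normal approximation.
            "p_value": 2.0 * (1.0 - normal.cdf(abs(z))),
            "conf_int": (est - z_crit * se, est + z_crit * se),
        }
    r_squared = 1.0 - rss / tss
    return rows, r_squared


# Made-up sample data, roughly following y = 2x.
rows, r_squared = coefficient_summary(
    [1.0, 2.0, 3.0, 4.0, 5.0], [2.1, 3.9, 6.2, 7.8, 10.1]
)
for name, stats in rows.items():
    print(name, stats)
print("R-squared:", r_squared)
```

Each row holds the values that the report asks `summary(model)` to expose for a coefficient. AIC and BIC are left out because they depend on the likelihood of the chosen model family.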



