[
https://issues.apache.org/jira/browse/SPARK-11669?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Shubhanshu Mishra reopened SPARK-11669:
---------------------------------------
What I meant when I said a Python API to GLM was that the GLM module is
something which is implemented in Spark and should be made part of the MLlib
module rather than being only a SparkR feature. This would allow users who come
from a Python statsmodels background to use the GLM module in their Python code
as well.
I know the current GLM module is just built using SparkR, but I feel it should
be a core module with a common API across multiple languages.
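As a rough illustration of the kind of R-style formula interface meant here, the following is a minimal pure-Python sketch. It is not Spark, SparkR, or statsmodels code: the names parse_formula, solve, and glm_gaussian are hypothetical, and it covers only a Gaussian family with an identity link (ordinary least squares) on a small in-memory dict of columns.

```python
# Hypothetical sketch: none of these names exist in Spark, SparkR, or statsmodels.

def parse_formula(formula):
    """Split an R-style formula such as 'y ~ x1 + x2' into (response, predictors)."""
    response, rhs = formula.split("~")
    predictors = [term.strip() for term in rhs.split("+") if term.strip()]
    return response.strip(), predictors


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):  # back substitution
        s = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - s) / m[i][i]
    return x


def glm_gaussian(formula, data):
    """Fit a Gaussian GLM with identity link via the normal equations.

    `data` maps column names to equal-length lists of numbers.
    Returns a dict of coefficient name -> estimate.
    """
    response, predictors = parse_formula(formula)
    y = data[response]
    # Design matrix with a leading intercept column, as R formulas add by default.
    rows = [[1.0] + [data[p][i] for p in predictors] for i in range(len(y))]
    k = len(predictors) + 1
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    coefs = solve(xtx, xty)
    return dict(zip(["Intercept"] + predictors, coefs))


# Toy data generated from y = 1 + 2 * x.
data = {"y": [1.0, 3.0, 5.0, 7.0], "x": [0.0, 1.0, 2.0, 3.0]}
model = glm_gaussian("y ~ x", data)
print(model)
```

A real implementation would run on distributed DataFrames and support other families and link functions. The only point here is that the formula string, and not the host language, carries the model specification, so the same call shape could be exposed from Python, R, and Scala.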
> Python interface to SparkR GLM module
> -------------------------------------
>
> Key: SPARK-11669
> URL: https://issues.apache.org/jira/browse/SPARK-11669
> Project: Spark
> Issue Type: Improvement
> Components: PySpark, SparkR
> Affects Versions: 1.5.0, 1.5.1
> Environment: LINUX, MAC, WINDOWS
> Reporter: Shubhanshu Mishra
> Priority: Minor
> Labels: GLM, pyspark, sparkR, statistics
>
> There should be a Python interface to the SparkR GLM module. Currently the
> only Python library that produces R-style GLM results is statsmodels.
> Inspiration for the API can be taken from the following page.
> http://statsmodels.sourceforge.net/devel/examples/notebooks/generated/formulas.html
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)