[
https://issues.apache.org/jira/browse/SPARK-15799?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15318769#comment-15318769
]
Shivaram Venkataraman commented on SPARK-15799:
-----------------------------------------------
I don't think there are any license issues, and at least before we merged
SparkR into Apache Spark the package passed all the CRAN checks. The only
problem is that we might need to ship the entire Spark assembly JAR (or all the
jars that we have with the new release structure) to make the package work
without additional downloads. Some other minor things might make it challenging
to use SparkR directly from CRAN:
1. Matching the Spark version on the client with the version on the cluster.
This is still a requirement today, but the main difference is that people might
upgrade CRAN packages separately from their Spark clusters.
2. Figuring out where to put scripts like spark-submit that can be used to
submit batch jobs. This isn't something normal R packages offer so I'm not sure
there are existing practices we can follow here.
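
As a rough illustration of the version check in point 1, here is a minimal
sketch in Python (not SparkR code). The function names are hypothetical, and
comparing only the major and minor components is an assumption; a real check
might require an exact match.

```python
def parse_major_minor(version):
    """Return (major, minor) from a version string such as '2.0.1'."""
    parts = version.split(".")
    return int(parts[0]), int(parts[1])


def versions_compatible(client_version, cluster_version):
    """Assumed policy: client and cluster agree on major and minor version."""
    return parse_major_minor(client_version) == parse_major_minor(cluster_version)


if __name__ == "__main__":
    # Hypothetical version strings, for illustration only.
    for client, cluster in [("2.0.0", "2.0.1"), ("1.6.1", "2.0.0")]:
        status = "ok" if versions_compatible(client, cluster) else "mismatch"
        print(client, cluster, status)
```

A package installed from CRAN could run a check along these lines when it
connects, and fail early with a clear message if the versions differ.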
> Release SparkR on CRAN
> ----------------------
>
> Key: SPARK-15799
> URL: https://issues.apache.org/jira/browse/SPARK-15799
> Project: Spark
> Issue Type: New Feature
> Components: SparkR
> Reporter: Xiangrui Meng
>
> Story: "As an R user, I would like to see SparkR released on CRAN, so I can
> use SparkR easily in an existing R environment and have other packages built
> on top of SparkR."
> I made this JIRA with the following questions in mind:
> * Are there known issues that prevent us releasing SparkR on CRAN?
> * Do we want to package Spark jars in the SparkR release?
> * Are there license issues?
> * How does it fit into Spark's release process?