[ https://issues.apache.org/jira/browse/SPARK-6836?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14606993#comment-14606993 ]
Felix Cheung commented on SPARK-6836: ------------------------------------- +1 on randomSplit - probably need that for reproducible ML projects? > RDD APIs missing in SparkR > -------------------------- > > Key: SPARK-6836 > URL: https://issues.apache.org/jira/browse/SPARK-6836 > Project: Spark > Issue Type: New Feature > Components: SparkR > Reporter: Shivaram Venkataraman > Priority: Minor > > There are a few RDD API functions missing in SparkR > def randomSplit(self, weights, seed=None): > def repartitionAndSortWithinPartitions(self, numPartitions=None) > I also think that we should audit the API and make sure we want to support > these in SparkR before implementing them. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org