Shivaram Venkataraman created SPARK-16519:
---------------------------------------------
             Summary: Handle SparkR RDD generics that create warnings in R CMD check
                 Key: SPARK-16519
                 URL: https://issues.apache.org/jira/browse/SPARK-16519
             Project: Spark
          Issue Type: Sub-task
          Components: SparkR
            Reporter: Shivaram Venkataraman

One of the warnings we get from R CMD check is that the RDD implementations of some generics are not documented. These generics are shared between RDDs and DataFrames in SparkR. The list includes:

{quote}
WARNING
Undocumented S4 methods:
  generic 'cache' and siglist 'RDD'
  generic 'collect' and siglist 'RDD'
  generic 'count' and siglist 'RDD'
  generic 'distinct' and siglist 'RDD'
  generic 'first' and siglist 'RDD'
  generic 'join' and siglist 'RDD,RDD'
  generic 'length' and siglist 'RDD'
  generic 'partitionBy' and siglist 'RDD'
  generic 'persist' and siglist 'RDD,character'
  generic 'repartition' and siglist 'RDD'
  generic 'show' and siglist 'RDD'
  generic 'take' and siglist 'RDD,numeric'
  generic 'unpersist' and siglist 'RDD'
{quote}

As described in https://stat.ethz.ch/pipermail/r-devel/2003-September/027490.html, this looks like a limitation of R: exporting a generic from a package also exports all the implementations of that generic.

One way to get around this is to remove the RDD API or rename the methods in Spark 2.1.
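
For illustration, the renaming option could look roughly like the sketch below. It is a minimal, self-contained example and not SparkR code: the class definitions are hypothetical stand-ins, and the name countRDD is an assumption chosen only to show the idea of moving the RDD implementation off the shared generic.

```r
library(methods)

# Hypothetical stand-ins for the SparkR classes; not the real definitions.
setClass("RDD", representation(n = "numeric"))
setClass("SparkDataFrame", representation(n = "numeric"))

# Before: one shared generic carries methods for both classes, so the
# RDD method travels with the generic wherever the generic is exported.
setGeneric("count", function(x) standardGeneric("count"))
setMethod("count", "SparkDataFrame", function(x) x@n)
setMethod("count", "RDD", function(x) x@n)

# After: the RDD implementation moves to its own generic (hypothetical
# name), which can be left out of the package's exports.
removeMethod("count", "RDD")
setGeneric("countRDD", function(x) standardGeneric("countRDD"))
setMethod("countRDD", "RDD", function(x) x@n)

existsMethod("count", "RDD")     # is an RDD method still on the shared generic?
existsMethod("countRDD", "RDD")  # is the RDD method on the renamed generic?
```

With this layout the shared generic only has the DataFrame method, so the check would no longer have an RDD siglist to flag for it. The same pattern would presumably apply to each generic in the list above.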