[
https://issues.apache.org/jira/browse/SPARK-6812?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14530890#comment-14530890
]
Shivaram Venkataraman commented on SPARK-6812:
----------------------------------------------
Regarding the name conflicts occurring between the Scala API, other R packages
etc. there are a couple of points
- IMHO we don't need to mimic the Scala API always as SparkR is more geared
towards R users than towards users who are switching to it from Scala. So we
should still look to override common R functions for DataFrames or Machine
learning so that we can maintain familiarity for R-users
- Also I agree with you that naming conflicts exist even when you load three or
four R related packages (without SparkR), so I think the best we can do is to
load SparkR at the top of default packages list as you mention.
Regarding sortDF, I was actually planning to name it `arrange` after the dply
call as part of https://issues.apache.org/jira/browse/SPARK-7231 -- Let me know
if this sounds good to you.
> filter() on DataFrame does not work as expected
> -----------------------------------------------
>
> Key: SPARK-6812
> URL: https://issues.apache.org/jira/browse/SPARK-6812
> Project: Spark
> Issue Type: Bug
> Components: SparkR
> Reporter: Davies Liu
> Assignee: Sun Rui
> Priority: Blocker
>
> {code}
> > filter(df, df$age > 21)
> Error in filter(df, df$age > 21) :
> no method for coercing this S4 class to a vector
> {code}
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]