[
https://issues.apache.org/jira/browse/SPARK-10828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14908399#comment-14908399
]
Shivaram Venkataraman commented on SPARK-10828:
-----------------------------------------------
I don't think we want to support new ways to read HDFS formats into SparkR
-- IMHO the DataSource API is the right way to solve this problem, as it's well
established now and works across Python, Scala, R, etc.
You can check with the Accumulo project to see if they have plans to add a
DataSource implementation. Also, the DataSource implementation does not need
to live in the Spark source tree (see http://github.com/databricks/spark-avro
for an example), so we don't need a JIRA in Spark to track this.
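
To illustrate the point above, here is a minimal sketch (not taken from the
original comment) of loading data through the DataSource API from Python. The
helper name load_via_datasource and the file path in the usage note are
hypothetical; the only Spark calls it relies on are the DataFrameReader
methods format(), options() and load(), reached through the context's read
attribute. It assumes the package providing the source (for example
spark-avro) is already on the classpath.

```python
from typing import Any, Mapping, Optional


def load_via_datasource(sql_context: Any,
                        source: str,
                        path: str,
                        options: Optional[Mapping[str, str]] = None) -> Any:
    """Load a DataFrame through the DataSource API.

    sql_context: assumed to be a pyspark SQLContext (or SparkSession),
                 i.e. any object exposing a DataFrameReader as `.read`.
    source:      data source name, e.g. "com.databricks.spark.avro".
    path:        location of the data, e.g. a path on HDFS.
    options:     optional source-specific settings passed to the reader.
    """
    # Select the data source implementation by name.
    reader = sql_context.read.format(source)
    # Forward source-specific options only when some were supplied.
    if options:
        reader = reader.options(**options)
    # The data source implementation does the actual reading.
    return reader.load(path)
```

With spark-avro available, a call might look like
load_via_datasource(sqlContext, "com.databricks.spark.avro", "hdfs:///some/file.avro"),
where the path is a placeholder. The rough SparkR 1.5 equivalent would be
read.df(sqlContext, path, source = "com.databricks.spark.avro"). An Accumulo
source, if one were written, would plug in the same way under its own source
name.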
> Can we use the accumulo data RDD created from JAVA in spark, in sparkR? Is
> there any other way to proceed with it to create RRDD from a source RDD other
> than text RDD? Or to use any other format of data stored in HDFS in sparkR?
> -----------------------------------------------------------------------------
>
> Key: SPARK-10828
> URL: https://issues.apache.org/jira/browse/SPARK-10828
> Project: Spark
> Issue Type: Question
> Components: R
> Affects Versions: 1.5.0
> Environment: ubuntu 12.04,8GB RAM,accumulo 1.6.3,hadoop 2.6
> Reporter: madhvi gupta
>