[ 
https://issues.apache.org/jira/browse/SPARK-10828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14908399#comment-14908399
 ] 

Shivaram Venkataraman commented on SPARK-10828:
-----------------------------------------------

I don't think we want to support new ways to read in HDFS formats into SparkR 
-- IMHO The DataSource API is the right way to solve this problem as its well 
established now and works across Python, Scala, R etc. 

You can check with the Accumulo project to see if they have plans to add a 
DataSource implementation. Also the DataSource implementation does not need 
live in the Spark source tree (See http://github.com/databricks/spark-avro for 
an example), so we don't need a JIRA in Spark to track this.

> Can we use the accumulo data RDD created from JAVA in spark, in sparkR?Is 
> there any other way to proceed with it to create RRDD from a source RDD other 
> than text RDD?Or to use any other format of data stored in HDFS in sparkR?
> ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-10828
>                 URL: https://issues.apache.org/jira/browse/SPARK-10828
>             Project: Spark
>          Issue Type: Question
>          Components: R
>    Affects Versions: 1.5.0
>         Environment: ubuntu 12.04,8GB RAM,accumulo 1.6.3,hadoop 2.6
>            Reporter: madhvi gupta
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to