[
https://issues.apache.org/jira/browse/SPARK-10587?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14744643#comment-14744643
]
SemiCoder commented on SPARK-10587:
-----------------------------------
It's not my code; it's the code in the latest released version.
When I create an IndexedRowMatrix, I pass a parameter "rows". The __init__
method checks whether it is an RDD, and if it is, it calls a Java function,
and one parameter of that call is "rows.toDF()". However, toDF() doesn't
exist on RDD. I know it exists in SQLContext. So I think this is an error in
python/pyspark/mllib/linalg/distributed.py. Otherwise, could you tell me how to
create an RDD that has a toDF() function, to avoid this situation?
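
One possible explanation for the behaviour described above: in the Spark 1.x
Python API, toDF() is not defined in the RDD class itself but is attached to
it when a SQLContext is constructed, so an RDD only appears to "have" toDF()
after a SQLContext exists. The sketch below illustrates that pattern in plain
Python, with no Spark required. ToyRDD and ToySQLContext are hypothetical
stand-ins, not the real pyspark classes, and the tuple returned by toDF() is
only a placeholder for a DataFrame.

```python
# Toy stand-ins for illustration only; these are NOT the real pyspark classes.
class ToyRDD:
    def __init__(self, items):
        self.items = list(items)


class ToySQLContext:
    def __init__(self):
        ctx = self

        def toDF(rdd):
            # A real implementation would build a DataFrame; here we only
            # tag the data so the effect of the patch is visible.
            return ("DataFrame", rdd.items, ctx)

        # Attach the method to the class, so every existing and future
        # ToyRDD instance gains toDF() from this point on.
        ToyRDD.toDF = toDF


rows = ToyRDD([(0, [1.0, 2.0]), (1, [3.0, 4.0])])
has_before = hasattr(rows, "toDF")   # checked before any context exists

ToySQLContext()                      # constructing the context patches ToyRDD
has_after = hasattr(rows, "toDF")    # checked on the same, already-created object

kind, data, _ = rows.toDF()
```

If pyspark 1.5.0 behaves the same way, creating a SQLContext from the
SparkContext before constructing the IndexedRowMatrix should make rows.toDF()
available. That is an assumption to verify against the installed version, not
a confirmed fix for this issue.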
> In pyspark, toDF() doesn't exist in RDD object
> ----------------------------------------------
>
> Key: SPARK-10587
> URL: https://issues.apache.org/jira/browse/SPARK-10587
> Project: Spark
> Issue Type: Bug
> Affects Versions: 1.5.0
> Reporter: SemiCoder
>
> I can't find a toDF() function in RDD.
> In pyspark.mllib.linalg.distributed, IndexedRowMatrix.__init__()
> requires that rows be an RDD and executes rows.toDF(), but the
> RDD in pyspark doesn't actually have a toDF() function.