Any updates on this issue? A cursory search shows that others are still
experiencing it. I'm seeing it occur on trivial data sets in PySpark;
however, the same jobs run successfully in Scala.
While this is an acceptable workaround, I would like to know if this item is
on the Spark roadmap o
It's kind of weird. If I try to execute this:
cotizas = sc.textFile("A_ko")
print cotizas.take(10)
it doesn't work, but if I remove just one "A" character from the file,
everything works.
At first I thought it was due to the number of splits or something like
that ... but I downloaded
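For anyone who wants to narrow this down, here is a minimal sketch of how the "one character fewer" observation could be turned into a repeatable check. It is not taken from the thread: the helper names, the file name "A_ok", and the `n_chars` parameter are hypothetical, and the post does not state the size of the failing file, so the size has to be supplied by whoever runs it. Only `sc.textFile(...)` and `take(...)` come from the snippet above.

```python
import os


def write_repeated_char_file(path, n_chars, char="A"):
    """Write n_chars copies of char on a single line; return the file size in bytes."""
    with open(path, "w") as handle:
        handle.write(char * n_chars)
    return os.path.getsize(path)


def make_size_pair(directory, n_chars):
    """Create two files that differ by exactly one character.

    "A_ko" is the file name used in the post; "A_ok" is a hypothetical
    name for the copy that is one character shorter.
    """
    failing = os.path.join(directory, "A_ko")
    passing = os.path.join(directory, "A_ok")
    write_repeated_char_file(failing, n_chars)
    write_repeated_char_file(passing, n_chars - 1)
    return failing, passing


def try_take(sc, path, n=10):
    """Return (True, rows) if take(n) succeeds, otherwise (False, exception)."""
    try:
        return True, sc.textFile(path).take(n)
    except Exception as exc:  # report the failure instead of aborting
        return False, exc


def compare_take(sc, directory, n_chars):
    """Run take() on both files and report which of them succeeded."""
    failing, passing = make_size_pair(directory, n_chars)
    return {path: try_take(sc, path)[0] for path in (failing, passing)}
```

With a live SparkContext `sc`, calling `compare_take(sc, some_directory, n_chars)` for a few candidate sizes would show whether the failure really tracks the file size. The sizes to try are an open question here.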
So it does not work for files on HDFS either? That is really a problem.
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/pyspark-Failed-to-run-first-tp7691p8128.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.
I've got the same problem trying to execute the following scriptlet from my
Eclipse environment:
v = sc.textFile("path_to_my_file")
print v.take(1)

  File "my_script.py", line 18, in
    print v.take(1)
  File "spark-1.0.0-bin-hadoop2\python\pyspark\rdd.py", line 868, in take
    iterator =
I'm starting to develop ADMM for some models using PySpark (Spark version
1.0.0), so I constantly simulated data to test my code. I ran the simulations
in Python, but then I ran into the same kind of problems as mentioned above.
The same meaningless error messages showed up when I tried methods like first,
take