Re: pyspark-Failed to run first

2015-09-29 Thread balajikvijayan
Any updates on this issue? A cursory search shows that others are still experiencing it. I'm seeing this occur on trivial data sets in pyspark; however, the same jobs run successfully in Scala. While this is an acceptable workaround, I would like to know if this item is on the Spark roadmap o

Re: pyspark-Failed to run first

2014-06-24 Thread angel2014
It's ... kind of weird. If I try to execute this:

    cotizas = sc.textFile("A_ko")
    print cotizas.take(10)

it doesn't work, but if I remove only one "A" character from this file ... it's all OK ... At first I thought it was due to the number of splits or something like that ... but I downloaded

Re: pyspark-Failed to run first

2014-06-23 Thread Congrui Yi
So it does not work for files on HDFS either? That is really a problem.

Re: pyspark-Failed to run first

2014-06-23 Thread angel2014
I've got the same problem trying to execute the following scriptlet from my Eclipse environment:

    v = sc.textFile("path_to_my_file")
    print v.take(1)

    File "my_script.py", line 18, in <module>
      print v.take(1)
    File "spark-1.0.0-bin-hadoop2\python\pyspark\rdd.py", line 868, in take
      iterator =

Re: pyspark-Failed to run first

2014-06-19 Thread Congrui Yi
I'm starting to develop ADMM for some models using pyspark (Spark version 1.0.0), so I constantly simulate data to test my code. I did the simulation in Python, but then I ran into the same kind of problems as mentioned above. The same meaningless error messages show up when I try methods like first, take