Hi all,

I'm having an issue running some code that works on a build of spark I made
(and still have) but now rebuilding it again, I get the below traceback. I
built it using the 1.4.0 release, profile hadoop-2.4 but version 2.7 and
I'm using python3. It's not vital to my work (as I can use my other build)
but I'd still like to figure out what's going on.

Best,
shaananc

Traceback (most recent call last):
  File "factor.py", line 73, in <module>
    main()
  File "factor.py", line 53, in main
    poly_filename = polysel.run(sc, parameters)
  File "/home/ubuntu/spark_apps/polysel.py", line 90, in run
    polysel1_bestpolys = run_polysel1(sc, parameters)
  File "/home/ubuntu/spark_apps/polysel.py", line 72, in run_polysel1
    polysel1_bestpolys = [v for _, v in polysel1_polys.takeOrdered(nrkeep,
key=lambda s: s[0])]
  File "/home/ubuntu/spark/python/pyspark/rdd.py", line 1198, in takeOrdered
    return self.mapPartitions(lambda it: [heapq.nsmallest(num, it,
key)]).reduce(merge)
  File "/home/ubuntu/spark/python/pyspark/rdd.py", line 762, in reduce
    vals = self.mapPartitions(func).collect()
  File "/home/ubuntu/spark/python/pyspark/rdd.py", line 736, in collect
    port = self.ctx._jvm.PythonRDD.collectAndServe(self._jrdd.rdd())
  File "/home/ubuntu/spark/python/pyspark/rdd.py", line 2343, in _jrdd
    bvars, self.ctx._javaAccumulator)
  File "/usr/local/lib/python3.4/dist-packages/py4j/java_gateway.py", line
701, in __call__
    self._fqn)
  File "/usr/local/lib/python3.4/dist-packages/py4j/protocol.py", line 304,
in get_return_value
    format(target_id, '.', name, value))
py4j.protocol.Py4JError: An error occurred while calling
None.org.apache.spark.api.python.PythonRDD. Trace:
py4j.Py4JException: Constructor
org.apache.spark.api.python.PythonRDD([class
org.apache.spark.rdd.ParallelCollectionRDD, class [B, class
java.util.HashMap, class java.util.ArrayList, class java.lang.Boolean,
class java.lang.String, class java.util.ArrayList, class
org.apache.spark.Accumulator]) does not exist
at
py4j.reflection.ReflectionEngine.getConstructor(ReflectionEngine.java:184)
at
py4j.reflection.ReflectionEngine.getConstructor(ReflectionEngine.java:202)
at py4j.Gateway.invoke(Gateway.java:213)
at
py4j.commands.ConstructorCommand.invokeConstructor(ConstructorCommand.java:79)
at py4j.commands.ConstructorCommand.execute(ConstructorCommand.java:68)
at py4j.GatewayConnection.run(GatewayConnection.java:207)
at java.lang.Thread.run(Thread.java:745)

Reply via email to