Re: Is it a bug?

2016-05-25 Thread Zheng Wendell
Any update? On Sun, May 8, 2016 at 3:17 PM, Ted Yu wrote: > I don't think so. > RDD is immutable. > > On May 8, 2016, at 2:14 AM, Sisyphuss wrote:

Re: Is it a bug?

2016-05-09 Thread Zheng Wendell
You can move the definition of `t` upward. My example is still valid. On Mon, May 9, 2016 at 1:46 PM, Ted Yu wrote: > Using spark-shell, I was not allowed to define the map() without declaring > t first: > > scala> rdd = rdd.map(x => x*t) > :26: error: not found: value t > rdd =

Re: Python3 does not have Module 'UserString'

2016-02-12 Thread Zheng Wendell
Sorry, I can no longer reproduce the error. After upgrading from Python 3.4.2 to Python 3.4.4, the error disappeared. Spark release: spark-1.6.0-bin-hadoop2.6. Code snippet: ``` lines = sc.parallelize([5,6,2,8,5,2,4,9,2,1,7,3,4,1,5,8,7,6]) pairs = lines.map(lambda x: (x, 1)) counts =

Re: Python3 does not have Module 'UserString'

2016-02-12 Thread Zheng Wendell
I think this may also be due to the fact that I have multiple copies of Python. My driver program was using Python 3.4.2. My local slave nodes are using Python 3.4.4 (the system administrator's version). On Fri, Feb 12, 2016 at 5:51 PM, Zheng Wendell <zhengwend...@gmail.com> wrote: > Sor
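
When several copies of Python are installed, it can help to confirm which interpreter each process is actually running. The following is only a minimal sketch, not something posted in the thread: `interpreter_info` and `same_minor` are hypothetical helper names, and comparing on major.minor is just one plausible criterion, since the thread does not establish whether a patch-level difference alone caused the error.

```python
import sys


def interpreter_info():
    """Return (executable path, 'major.minor.micro') for the running interpreter."""
    return sys.executable, "%d.%d.%d" % sys.version_info[:3]


def same_minor(version_a, version_b):
    """Return True when two 'major.minor.micro' strings agree on major.minor."""
    return version_a.split(".")[:2] == version_b.split(".")[:2]


if __name__ == "__main__":
    # Report the interpreter running this script (the driver, if run as one).
    executable, version = interpreter_info()
    print(executable, version)

    # Assumption: with a live SparkContext `sc` (not created here), the same
    # helper could be mapped over a small RDD to see what the workers run:
    #   sc.parallelize(range(4), 4).map(lambda _: interpreter_info()).distinct().collect()

    # The two versions mentioned in the thread share major.minor.
    print(same_minor("3.4.2", "3.4.4"))
```

If the driver and the workers report different executables, the `PYSPARK_PYTHON` environment variable is the usual way to point PySpark at one specific interpreter.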