Get statistic result from RDD

2015-10-20 Thread Shepherd
xt: http://apache-spark-user-list.1001560.n3.nabble.com/Get-statistic-result-from-RDD-tp25147.html Sent from the Apache Spark User List mailing list archive at Nabble.com. - To unsubscribe, e-mail: user-unsubscr...@spark.apache.o

RE: Get statistic result from RDD

2015-10-20 Thread ChengBo
Thanks, but I still don’t get it. I have used groupBy to group data by userID, and for each ID, I need to get the statistic information. Best Frank From: Ted Yu [mailto:yuzhih...@gmail.com] Sent: Tuesday, October 20, 2015 3:12 PM To: ChengBo Cc: user Subject: Re: Get statistic result from RDD

Re: Get statistic result from RDD

2015-10-20 Thread Ted Yu
each ID, I need to > get the statistic information. > > > > Best > > Frank > > > > *From:* Ted Yu [mailto:yuzhih...@gmail.com] > *Sent:* Tuesday, October 20, 2015 3:12 PM > *To:* ChengBo > *Cc:* user > *Subject:* Re: Get statistic result from RDD > > >

RE: Get statistic result from RDD

2015-10-20 Thread ChengBo
I tried, but it shows: “error: value reduceByKey is not a member of iterable[((Int, Int, String, String), String), Int]” Best Frank From: Ted Yu [mailto:yuzhih...@gmail.com] Sent: Tuesday, October 20, 2015 3:46 PM To: ChengBo Cc: user Subject: Re: Get statistic result from RDD Please take

Re: Get statistic result from RDD

2015-10-20 Thread Ted Yu
in p(0), and how > many 6 to 7 in p(0). > In the final result, each row represents each userID's statistic result. > > Thanks a lot. I really appreciate it. > > > > -- > View this message in context: > http

RE: Get statistic result from RDD

2015-10-20 Thread ChengBo
I tried, but it shows: “error: value reduceByKey is not a member of iterable[((Int, Int, String, String), String), Int]” Best Cheng From: Ted Yu [mailto:yuzhih...@gmail.com] Sent: Tuesday, October 20, 2015 3:46 PM To: ChengBo Cc: user Subject: Re: Get statistic result from RDD Please take