Aah, right, copied from the wrong browser tab i guess. Thanks! TD
On Tue, Jul 15, 2014 at 5:57 PM, Michael Campbell < michael.campb...@gmail.com> wrote: > I think you typo'd the jira id; it should be > https://issues.apache.org/jira/browse/SPARK-2475 "Check whether #cores > > #receivers in local mode" > > > On Mon, Jul 14, 2014 at 3:57 PM, Tathagata Das < > tathagata.das1...@gmail.com> wrote: > >> The problem is not really for local[1] or local. The problem arises when >> there are more input streams than there are cores. >> But I agree, for people who are just beginning to use it by running it >> locally, there should be a check addressing this. >> >> I made a JIRA for this. >> https://issues.apache.org/jira/browse/SPARK-2464 >> >> TD >> >> >> On Sun, Jul 13, 2014 at 4:26 PM, Sean Owen <so...@cloudera.com> wrote: >> >>> How about a PR that rejects a context configured for local or local[1]? >>> As I understand it is not intended to work and has bitten several people. >>> On Jul 14, 2014 12:24 AM, "Michael Campbell" <michael.campb...@gmail.com> >>> wrote: >>> >>>> This almost had me not using Spark; I couldn't get any output. It is >>>> not at all obvious what's going on here to the layman (and to the best of >>>> my knowledge, not documented anywhere), but now you know you'll be able to >>>> answer this question for the numerous people that will also have it. >>>> >>>> >>>> On Sun, Jul 13, 2014 at 5:24 PM, Walrus theCat <walrusthe...@gmail.com> >>>> wrote: >>>> >>>>> Great success! >>>>> >>>>> I was able to get output to the driver console by changing the >>>>> construction of the Streaming Spark Context from: >>>>> >>>>> val ssc = new StreamingContext("local" /**TODO change once a cluster >>>>> is up **/, >>>>> "AppName", Seconds(1)) >>>>> >>>>> >>>>> to: >>>>> >>>>> val ssc = new StreamingContext("local[2]" /**TODO change once a >>>>> cluster is up **/, >>>>> "AppName", Seconds(1)) >>>>> >>>>> >>>>> I found something that tipped me off that this might work by digging >>>>> through this mailing list. >>>>> >>>>> >>>>> On Sun, Jul 13, 2014 at 2:00 PM, Walrus theCat <walrusthe...@gmail.com >>>>> > wrote: >>>>> >>>>>> More strange behavior: >>>>>> >>>>>> lines.foreachRDD(x => println(x.first)) // works >>>>>> lines.foreachRDD(x => println((x.count,x.first))) // no output is >>>>>> printed to driver console >>>>>> >>>>>> >>>>>> >>>>>> >>>>>> On Sun, Jul 13, 2014 at 11:47 AM, Walrus theCat < >>>>>> walrusthe...@gmail.com> wrote: >>>>>> >>>>>>> >>>>>>> Thanks for your interest. >>>>>>> >>>>>>> lines.foreachRDD(x => println(x.count)) >>>>>>> >>>>>>> And I got 0 every once in a while (which I think is strange, >>>>>>> because lines.print prints the input I'm giving it over the socket.) >>>>>>> >>>>>>> >>>>>>> When I tried: >>>>>>> >>>>>>> lines.map(_->1).reduceByKey(_+_).foreachRDD(x => println(x.count)) >>>>>>> >>>>>>> I got no count. >>>>>>> >>>>>>> Thanks >>>>>>> >>>>>>> >>>>>>> On Sun, Jul 13, 2014 at 11:34 AM, Tathagata Das < >>>>>>> tathagata.das1...@gmail.com> wrote: >>>>>>> >>>>>>>> Try doing DStream.foreachRDD and then printing the RDD count and >>>>>>>> further inspecting the RDD. >>>>>>>> On Jul 13, 2014 1:03 AM, "Walrus theCat" <walrusthe...@gmail.com> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Hi, >>>>>>>>> >>>>>>>>> I have a DStream that works just fine when I say: >>>>>>>>> >>>>>>>>> dstream.print >>>>>>>>> >>>>>>>>> If I say: >>>>>>>>> >>>>>>>>> dstream.map(_,1).print >>>>>>>>> >>>>>>>>> that works, too. However, if I do the following: >>>>>>>>> >>>>>>>>> dstream.reduce{case(x,y) => x}.print >>>>>>>>> >>>>>>>>> I don't get anything on my console. What's going on? >>>>>>>>> >>>>>>>>> Thanks >>>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>> >> >