Re: Question about textFileStream

2014-11-12 Thread Rishi Yadav
yes, can you always specify minimum number of partitions and that would force some parallelism ( assuming you have enough cores) On Wed, Nov 12, 2014 at 9:36 AM, Saiph Kappa wrote: > What if the window is of 5 seconds, and the file takes longer than 5 > seconds to be completely scanned? It will

Re: Question about textFileStream

2014-11-12 Thread Saiph Kappa
What if the window is of 5 seconds, and the file takes longer than 5 seconds to be completely scanned? It will still attempt to load the whole file? On Mon, Nov 10, 2014 at 6:24 PM, Soumitra Kumar wrote: > Entire file in a window. > > On Mon, Nov 10, 2014 at 9:20 AM, Saiph Kappa > wrote: > >> H

Re: Question about textFileStream

2014-11-10 Thread Soumitra Kumar
Entire file in a window. On Mon, Nov 10, 2014 at 9:20 AM, Saiph Kappa wrote: > Hi, > > In my application I am doing something like this "new > StreamingContext(sparkConf, Seconds(10)).textFileStream("logs/")", and I > get some unknown exceptions when I copy a file with about 800 MB to that > fol

Question about textFileStream

2014-11-10 Thread Saiph Kappa
Hi, In my application I am doing something like this "new StreamingContext(sparkConf, Seconds(10)).textFileStream("logs/")", and I get some unknown exceptions when I copy a file with about 800 MB to that folder ("logs/"). I have a single worker running with 512 MB of memory. Anyone can tell me if