What do you mean by "control your input"? Are you trying to pace your Spark 
Streaming job by the number of words? If so, that is not supported as of now; 
you can only control the time (the batch interval), and each batch consumes 
all files that arrive within that time period. 
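
If the goal is only to reproduce the 100-byte record size from the paper, one
option is to fix the size when generating the input files, outside of Spark.
Below is a minimal sketch in plain Python, assuming ASCII text, one record per
line, and a newline that counts toward the 100 bytes. The names make_record,
write_batch_file, the staging directory, and the random-word payload are
hypothetical and only for illustration; nothing here is taken from the paper's
actual benchmark code.

```python
import os
import random
import string

RECORD_BYTES = 100  # record size mentioned for the paper's Grep/WordCount tests


def make_record(size=RECORD_BYTES, rng=None):
    """Return one ASCII line of exactly `size` bytes, newline included."""
    rng = rng or random.Random()
    payload_len = size - 1  # reserve one byte for the trailing newline
    text = ""
    # Keep appending random lowercase words until the payload is long enough.
    while len(text) < payload_len:
        word_len = rng.randint(3, 8)
        word = "".join(rng.choice(string.ascii_lowercase) for _ in range(word_len))
        text += word + " "
    # Trim to the exact length; avoid ending the line with a space.
    text = text[:payload_len]
    if text.endswith(" "):
        text = text[:-1] + "x"
    return text + "\n"


def write_batch_file(watched_dir, staging_dir, name, n_records,
                     size=RECORD_BYTES, rng=None):
    """Write n_records fixed-size records, then move the file into watched_dir."""
    rng = rng or random.Random()
    tmp_path = os.path.join(staging_dir, name)
    final_path = os.path.join(watched_dir, name)
    # Binary mode so the newline is always a single byte on every platform.
    with open(tmp_path, "wb") as f:
        for _ in range(n_records):
            f.write(make_record(size, rng).encode("ascii"))
    # Move the finished file in one step so a reader never sees a partial file.
    # Assumes both directories are on the same filesystem.
    os.replace(tmp_path, final_path)
    return final_path
```

Each call to write_batch_file produces a file of n_records * 100 bytes. A
streaming job that watches the directory (for example through textFileStream)
would then pick up whole files per batch interval, so the amount of data per
batch is controlled by how many such files you drop in during each interval,
not by Spark itself.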
-- 
Regards,
Mayur Rustagi
Ph: +1 (760) 203 3257
http://www.sigmoidanalytics.com
@mayur_rustagi

On Tue, Sep 9, 2014 at 2:24 AM, danilopds <danilob...@gmail.com> wrote:

> Hi,
> I was reading the Spark Streaming paper,
> "Discretized Streams: Fault-Tolerant Streaming Computation at Scale".
> I read that the performance evaluation used 100-byte input records in the
> Grep and WordCount tests.
> I don't have much experience, and I'd like to know how I can control this
> value in my records (like the words in an input file).
> Can anyone suggest something to get me started?
> Thanks!
