Re: FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Sachin Goel
This makes me want to dig into the streaming API. About time perhaps. :')

Cheers!
Sachin

-- Sachin Goel
Computer Science, IIT Delhi
m. +91-9871457685
On Aug 6, 2015 2:39 AM, "Theodore Vasiloudis" <
theodoros.vasilou...@gmail.com> wrote:

> Great post Stephan! A small note: the code for Google Dataflow does not display
> correctly for me, I'm getting &lt; and &gt; instead of < and >
>
> On Wed, Aug 5, 2015 at 4:11 PM, Stephan Ewen  wrote:
>
> > Hi all!
> >
> > We just published a blog post about how streaming fault tolerance
> > mechanisms evolved, and what kind of performance Flink gets with its
> > checkpointing mechanism.
> >
> > I think it is a pretty interesting read for people that are interested in
> > Flink or data streaming in general.
> >
> > The blog post talks about:
> >
> >   - Fault tolerance techniques, starting from acknowledgements, over
> > micro batches, to transactional updates and distributed snapshots.
> >
> >   - Performance of Flink, throughput, latency, and tradeoffs.
> >
> >   - A "chaos monkey" experiment where computation continues strongly
> > consistent even when periodically killing workers.
> >
> >
> > Comments welcome!
> >
> > Greetings,
> > Stephan
> >
>


Re: FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Theodore Vasiloudis
Great post Stephan! A small note: the code for Google Dataflow does not display
correctly for me, I'm getting &lt; and &gt; instead of < and >

On Wed, Aug 5, 2015 at 4:11 PM, Stephan Ewen  wrote:

> Hi all!
>
> We just published a blog post about how streaming fault tolerance
> mechanisms evolved, and what kind of performance Flink gets with its
> checkpointing mechanism.
>
> I think it is a pretty interesting read for people that are interested in
> Flink or data streaming in general.
>
> The blog post talks about:
>
>   - Fault tolerance techniques, starting from acknowledgements, over micro
> batches, to transactional updates and distributed snapshots.
>
>   - Performance of Flink, throughput, latency, and tradeoffs.
>
>   - A "chaos monkey" experiment where computation continues strongly
> consistent even when periodically killing workers.
>
>
> Comments welcome!
>
> Greetings,
> Stephan
>


[jira] [Created] (FLINK-2489) Improve the streaming window join

2015-08-05 Thread Stephan Ewen (JIRA)
Stephan Ewen created FLINK-2489:
---

         Summary: Improve the streaming window join
             Key: FLINK-2489
             URL: https://issues.apache.org/jira/browse/FLINK-2489
         Project: Flink
      Issue Type: Improvement
      Components: Streaming
Affects Versions: 0.10
        Reporter: Stephan Ewen


This issue tracks the ideas and discussion to improve the current window join.

Here is an initial list of desiderata for an improved window join:

  - Joins should initially run only on time windows
  - Support for operator time and event time
  - Event time version must support multiple windows being in progress (or
    buffered) while waiting for watermarks

  - Support for simple Java heap and Flink-managed memory
  - Flink-managed memory variant should support out-of-core operation

  - Join buffers or hashtables need to be checkpointed
  - Join buffers or hashtables need to support incremental checkpointing
  - Join buffers or hashtables should support asynchronous checkpointing

  - For sliding time windows, we should consider variants to reuse the join
    candidates across multiple windows they are contained in
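
To make the event-time requirement above a bit more concrete, here is a rough sketch of a join over tumbling time windows that keeps several windows buffered until a watermark closes them. It is plain Python, not Flink code. The class name, the method names, and the rule that a window is complete once its end is at or before the watermark are all assumptions made for illustration. Buffers live on the heap only, and checkpointing is left out.

```python
from collections import defaultdict


class TumblingWindowJoin:
    """Toy event-time join over tumbling windows (illustration only, not Flink's design)."""

    def __init__(self, window_size_ms):
        self.window_size_ms = window_size_ms
        # window start -> key -> (left values, right values)
        self.buffers = defaultdict(lambda: defaultdict(lambda: ([], [])))

    def _window_start(self, timestamp_ms):
        # Align the timestamp to the start of its tumbling window.
        return timestamp_ms - (timestamp_ms % self.window_size_ms)

    def add(self, side, key, value, timestamp_ms):
        """Buffer one element; side is 0 for the left input, 1 for the right."""
        start = self._window_start(timestamp_ms)
        self.buffers[start][key][side].append(value)

    def on_watermark(self, watermark_ms):
        """Join and drop every window whose end is at or before the watermark."""
        results = []
        for start in sorted(self.buffers):
            if start + self.window_size_ms > watermark_ms:
                break  # this window and all later ones are still in progress
            for key, (left, right) in self.buffers.pop(start).items():
                results.extend((start, key, l, r) for l in left for r in right)
        return results


join = TumblingWindowJoin(window_size_ms=1000)
join.add(0, "a", "L1", 100)
join.add(1, "a", "R1", 900)
join.add(0, "a", "L2", 1200)   # second window is buffered alongside the first
join.add(1, "a", "R2", 1500)
print(join.on_watermark(1000))  # only the first window is complete
```

Elements for a later window can arrive before an earlier window has been closed, which is why the buffers are keyed by window start rather than held as a single "current" window.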



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


Re: FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Ankur Chauhan
Pretty awesome piece. 


> On Aug 5, 2015, at 10:10, Hawin Jiang  wrote:
> 
> Great job, Guys
> 
> Let me read it carefully. 
> 
>> On Wed, Aug 5, 2015 at 7:25 AM, Stephan Ewen  wrote:
>> I forgot the link ;-)
>> 
>> http://data-artisans.com/high-throughput-low-latency-and-exactly-once-stream-processing-with-apache-flink/
>> 
>>> On Wed, Aug 5, 2015 at 4:11 PM, Stephan Ewen  wrote:
>>> Hi all!
>>> 
>>> We just published a blog post about how streaming fault tolerance 
>>> mechanisms evolved, and what kind of performance Flink gets with its 
>>> checkpointing mechanism.
>>> 
>>> I think it is a pretty interesting read for people that are interested in 
>>> Flink or data streaming in general.
>>> 
>>> The blog post talks about:
>>> 
>>>   - Fault tolerance techniques, starting from acknowledgements, over micro 
>>> batches, to transactional updates and distributed snapshots.
>>> 
>>>   - Performance of Flink, throughput, latency, and tradeoffs.
>>> 
>>>   - A "chaos monkey" experiment where computation continues strongly 
>>> consistent even when periodically killing workers.
>>> 
>>> 
>>> Comments welcome!
>>> 
>>> Greetings,
>>> Stephan
> 


Re: FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Hawin Jiang
Great job, Guys

Let me read it carefully.







On Wed, Aug 5, 2015 at 7:25 AM, Stephan Ewen  wrote:

> I forgot the link ;-)
>
>
> http://data-artisans.com/high-throughput-low-latency-and-exactly-once-stream-processing-with-apache-flink/
>
> On Wed, Aug 5, 2015 at 4:11 PM, Stephan Ewen  wrote:
>
>> Hi all!
>>
>> We just published a blog post about how streaming fault tolerance
>> mechanisms evolved, and what kind of performance Flink gets with its
>> checkpointing mechanism.
>>
>> I think it is a pretty interesting read for people that are interested in
>> Flink or data streaming in general.
>>
>> The blog post talks about:
>>
>>   - Fault tolerance techniques, starting from acknowledgements, over
>> micro batches, to transactional updates and distributed snapshots.
>>
>>   - Performance of Flink, throughput, latency, and tradeoffs.
>>
>>   - A "chaos monkey" experiment where computation continues strongly
>> consistent even when periodically killing workers.
>>
>>
>> Comments welcome!
>>
>> Greetings,
>> Stephan
>>
>>
>>
>


Re: FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Stephan Ewen
I forgot the link ;-)

http://data-artisans.com/high-throughput-low-latency-and-exactly-once-stream-processing-with-apache-flink/

On Wed, Aug 5, 2015 at 4:11 PM, Stephan Ewen  wrote:

> Hi all!
>
> We just published a blog post about how streaming fault tolerance
> mechanisms evolved, and what kind of performance Flink gets with its
> checkpointing mechanism.
>
> I think it is a pretty interesting read for people that are interested in
> Flink or data streaming in general.
>
> The blog post talks about:
>
>   - Fault tolerance techniques, starting from acknowledgements, over micro
> batches, to transactional updates and distributed snapshots.
>
>   - Performance of Flink, throughput, latency, and tradeoffs.
>
>   - A "chaos monkey" experiment where computation continues strongly
> consistent even when periodically killing workers.
>
>
> Comments welcome!
>
> Greetings,
> Stephan
>
>
>


FYI: Blog Post on Flink's Streaming Performance and Fault Tolerance

2015-08-05 Thread Stephan Ewen
Hi all!

We just published a blog post about how streaming fault tolerance
mechanisms evolved, and what kind of performance Flink gets with its
checkpointing mechanism.

I think it is a pretty interesting read for people that are interested in
Flink or data streaming in general.

The blog post talks about:

  - Fault tolerance techniques, starting from acknowledgements, over micro
batches, to transactional updates and distributed snapshots.

  - Performance of Flink, throughput, latency, and tradeoffs.

  - A "chaos monkey" experiment where computation continues strongly
consistent even when periodically killing workers.


Comments welcome!

Greetings,
Stephan


[jira] [Created] (FLINK-2488) Expose attemptNumber in RuntimeContext

2015-08-05 Thread Robert Metzger (JIRA)
Robert Metzger created FLINK-2488:
-

         Summary: Expose attemptNumber in RuntimeContext
             Key: FLINK-2488
             URL: https://issues.apache.org/jira/browse/FLINK-2488
         Project: Flink
      Issue Type: Improvement
      Components: JobManager, TaskManager
Affects Versions: 0.10
        Reporter: Robert Metzger
        Priority: Minor


It would be nice to expose the attemptNumber of a task in the 
{{RuntimeContext}}. 
This would allow user code to behave differently in restart scenarios.
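
As a rough illustration of "behave differently in restart scenarios", the sketch below shows user code branching on an attempt number. It is plain Python with a made-up context class, not the Flink API. The field names, the convention that the first run is attempt 0, and the output file naming are all assumptions.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RuntimeContext:
    """Hypothetical stand-in for the context handed to user code."""
    task_name: str
    subtask_index: int
    attempt_number: int  # assumed: 0 on the first run, incremented per restart


def open_task(ctx, existing_outputs):
    """Decide how to open the task's output file based on the attempt number."""
    path = f"{ctx.task_name}-{ctx.subtask_index}.out"
    if ctx.attempt_number > 0 and path in existing_outputs:
        # An earlier attempt may have left partial output behind.
        return ("truncate", path)
    return ("create", path)


print(open_task(RuntimeContext("sink", 2, 0), set()))
print(open_task(RuntimeContext("sink", 2, 1), {"sink-2.out"}))
```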





Re: Reply: Need help with CI problem

2015-08-05 Thread Stephan Ewen
That is interesting, never seen that before.

Travis support would be best, I guess.

On Wed, Aug 5, 2015 at 10:04 AM, huangwei (G) 
wrote:

> Thank you!
> I'll try supp...@travis-ci.com.
> It's just a line printing "Do you have a question?" and nothing more when
> I click the link.
>
> -Original Message-
> From: Robert Metzger [mailto:rmetz...@apache.org]
> Sent: August 5, 2015 15:56
> To: dev@flink.apache.org
> Subject: Re: Need help with CI problem
>
> Hi,
> I think the Apache Mailing lists don't support sending image attachments.
> But I guess you are asking because of a message like this:
> https://github.com/apache/flink/pull/977
>
> What happens when you click the link?
> Can you write a message to supp...@travis-ci.com? Maybe they can help
> you.
>
>
> On Wed, Aug 5, 2015 at 8:36 AM, huangwei (G) 
> wrote:
>
> > Hi, I cannot see the CI details when I click the link.
> >
> > It looked like this:
> >
> > [image: cid:image001.png@01D0CF8B.60774F60]
> >
> >
> >
> > Can anyone help me?
> >
> >
> >
>


Reply: Need help with CI problem

2015-08-05 Thread huangwei (G)
Thank you!
I'll try supp...@travis-ci.com.
It's just a line printing "Do you have a question?" and nothing more when I
click the link.

-Original Message-
From: Robert Metzger [mailto:rmetz...@apache.org]
Sent: August 5, 2015 15:56
To: dev@flink.apache.org
Subject: Re: Need help with CI problem

Hi,
I think the Apache Mailing lists don't support sending image attachments.
But I guess you are asking because of a message like this:
https://github.com/apache/flink/pull/977

What happens when you click the link?
Can you write a message to supp...@travis-ci.com? Maybe they can help you.


On Wed, Aug 5, 2015 at 8:36 AM, huangwei (G)  wrote:

> Hi, I cannot see the CI details when I click the link.
>
> It looked like this:
>
> [image: cid:image001.png@01D0CF8B.60774F60]
>
>
>
> Can anyone help me?
>
>
>


Re: Need help with CI problem

2015-08-05 Thread Robert Metzger
Hi,
I think the Apache Mailing lists don't support sending image attachments.
But I guess you are asking because of a message like this:
https://github.com/apache/flink/pull/977

What happens when you click the link?
Can you write a message to supp...@travis-ci.com? Maybe they can help
you.


On Wed, Aug 5, 2015 at 8:36 AM, huangwei (G)  wrote:

> Hi, I cannot see the CI details when I click the link.
>
> It looked like this:
>
> [image: cid:image001.png@01D0CF8B.60774F60]
>
>
>
> Can anyone help me?
>
>
>


[jira] [Created] (FLINK-2487) the array has out of bounds

2015-08-05 Thread zhangrucong (JIRA)
zhangrucong created FLINK-2487:
--

         Summary: the array has out of bounds
             Key: FLINK-2487
             URL: https://issues.apache.org/jira/browse/FLINK-2487
         Project: Flink
      Issue Type: Bug
      Components: Streaming
Affects Versions: 0.8.1
        Reporter: zhangrucong
        Priority: Minor





