Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-21 Thread Congxian Qiu
Hi Pedro From the previous given log, I found that checkpoint 65912 has been expired then, the raise the IOException. When some checkpoint expired, the checkpoint dir will be deleted(CheckpointCoordinator#549 on release-1.6 branch), and the unfinished task will still write to the previous

Re: 回复:Memory Allocate/Deallocate related Thread Deadlock encountered when running a large job > 10k tasks

2019-05-21 Thread zhijiang
If it is still the case of multiple slots in one TaskManager, it is the same as before. But you said you already used the single slot per TaskManager, right? If it is the case of single slot in TaskManager, you could attach the jstack when occurs next time, otherwise it is not needed. Best,

https://github.com/google/zetasql

2019-05-21 Thread kant kodali
https://github.com/google/zetasql

Connectors (specifically Kinesis Connector)

2019-05-21 Thread Steven Nelson
Hello! We keep having difficulties with the Kinesis connector. We have to publish our own version, and we understand why. What I am curious about is the plan to make this better in the future. Is there an issue/FLIP that I can reference when talking internally about this? -Steve

Re: Checkpoints periodically fail with hdfs as the state backend - Could not flush and close the file system output stream

2019-05-21 Thread PedroMrChaves
The issue happened again. /AsynchronousException{java.lang.Exception: Could not materialize checkpoint 47400 for operator ENRICH (1/4).} at org.apache.flink.streaming.runtime.tasks.StreamTask$AsyncCheckpointExceptionHandler.tryHandleCheckpointException(StreamTask.java:1153) at

Flink on mesos - One cluster per job - Version 1.6

2019-05-21 Thread Juan Gentile
Hello! We are trying to run in Mesos a job which will launch its own cluster (as opposed to launching the cluster and then submitting jobs to it) We have a couple of questions/issues: 1. Is there any easier way to achieve this without having to generate a graph file before submitting the

Question regarding date conditions/row expirations on Dynamic Tables

2019-05-21 Thread Wayne Heaney
I'm trying to build a Dynamic table that will be updated when records haven't been touched for a certain period of time. This is a unique case in that no event is touching the record, yet I want it to trigger an event based on when a date comparison result changes based on my table's continuous

Re: Flink vs KStreams

2019-05-21 Thread Elias Levy
My 2c: KStreams: Pros: * Streaming as a library: No need to submit your job to a cluster. Easy to scale up/down the job by adding or removing workers. * Streaming durability: State is durably stored in Kafka topics in a streaming fashion. Durability is amortized across the job's lifetime. * No

RE: 回复:Memory Allocate/Deallocate related Thread Deadlock encountered when running a large job > 10k tasks

2019-05-21 Thread Narayanaswamy, Krishna
Hi Zhijiang, I couldn’t get the jstack due to some constraints this time around. Will try and get them when it occurs next. But from the looks of it from the console/logs it appears to be the same as the 2 slot cases. DataSource finishing up and CoGroup looking to move from DEPLOYING to

Re: 回复:Memory Allocate/Deallocate related Thread Deadlock encountered when running a large job > 10k tasks

2019-05-21 Thread zhijiang
Hi Krishna, Could you show me or attach the jstack for the single slot case? Or is it the same jstack as before? Best, Zhijiang -- From:Narayanaswamy, Krishna Send Time:2019年5月21日(星期二) 19:50 To:zhijiang ; Aljoscha Krettek ;

Re: Queryable State race condition or serialization errors?

2019-05-21 Thread burgesschen
Hi Gary. Thanks for the reply. I am using RocksDBStateBackend though. Best, Chen -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Possilby very slow keyBy in program with no parallelism

2019-05-21 Thread Piotr Nowojski
Hi Theo, Regarding the performance issue. > None of my machine resources is fully utilized, i.e. none of the cluster CPU > runs at 100% utilization (according to htop). And the memory is virtually > available, but the RES column in htop states the processes uses 5499MB. By nature of stream

How to generate a sequential watermark which increases by one unit each time

2019-05-21 Thread Averell
Hi everyone, I have a stream of files, each file has multiple records. A record has one Long field named `ts`, which holds the file creation time (so it increases over time, and one same value for all records in each file). However, I need to have a watermark that increases by exactly one unit

RE: 回复:Memory Allocate/Deallocate related Thread Deadlock encountered when running a large job > 10k tasks

2019-05-21 Thread Narayanaswamy, Krishna
We started to run jobs using the single slotted task managers which seemed to be ok for the past couple of days, but today morning we seem to be seeing these deadlocks even with 1 slot. Is there something else we could try out? Thanks, Krishna. From: Narayanaswamy, Krishna [Tech] Sent: Friday,

Re:Re: flink消费kafka问题

2019-05-21 Thread Michael Ran
保存成功了,才做check 不行吗? 在 2019-05-21 10:37:50,"Biao Liu" 写道: >Hi, 你需要正确处理“落库失败”的数据,例如可以直接抛异常出来,这样 job 会不停 failover,直到不再落库失败 > > >方伟 于2019年5月20日周一 下午6:02写道: > >> Hi 你好~: >> >>

Re: Queryable State race condition or serialization errors?

2019-05-21 Thread Gary Yao
Hi Burgess Chen, If you are using MemoryStateBackend or FsStateBackend, you can observe race conditions on the state objects. However, the RocksDBStateBackend should be safe from these issues [1]. Best, Gary [1]