Re: Deploying Jobmanager on k8s as a Deployment

2022-09-07 Thread Yang Wang
For native K8s integration, the Flink ResourceManager will delete the JobManager K8s deployment as well as the HA data once the job reached a globally terminal state. However, it is indeed a problem for standalone mode since the JobManager will be restarted again even the job has finished. I

Re: [Flink 1.15.1 - Application mode native k8s Exception] - Exception occurred while acquiring lock 'ConfigMapLock

2022-09-07 Thread Yang Wang
Given that you are running multiple JobManagers, it does not matter for the "already exists" exception during leader election. BTW, I think running multiple JobManagers does not take enough advantages when deploying Flink on Kubernetes. Because a new JobManager will be started immediately once

Re: Where will the state be stored in the taskmanager when using rocksdbstatebend?

2022-09-07 Thread Yuan Mei
Hey Hjw, Under the current Flink architecture (i.e., task states are stored locally and periodically uploaded to remote durable storage during checkpointing), there is no other way rather than scaling out your application to solve the problem. This is equivalent to making the state size in each

Re: 关于 UDAF 里面 ListView 的疑问

2022-09-07 Thread Zhiwen Sun
hi, 感谢你的回复。 报错是在 getValue 的时候。 at GroupAggsHandler$439.getValue(Unknown Source) at org.apache.flink.table.runtime.operators.aggregate.GroupAggFunction.processElement(GroupAggFunction.java:146) 我的疑问是使用一个 Class 包装下 ListView 就能正常工作,而直接使用 ListView 是会报错。 比如使用 AggregateFunction

flink 1.13 发现下游busy是0 ,上游backpressure 是100%

2022-09-07 Thread 周虓岗
通过metrics观察,下游的input queue都是0, 上游queue是满的 观察监控,发现当出现速率跌零的时候,下游的input queue是0 max by (exported_task_name)(flink_taskmanager_job_task_Network_Input_0_maxQueueLen{job_name=~'$job_name'}) 此时上游的队列还是满的 (flink_taskmanager_job_task_Network_Output_0_maxQueueLen{job_name=~'$job_name'})

关于flink table store的疑问

2022-09-07 Thread Kyle Zhang
Hi all, 看table store的介绍也是关于数据湖存储以及用于实时流式读取的,那在定位上与iceberg、hudi等项目有什么不一样么,为什么要再开发一个项目? Best.

Re: Deploying Jobmanager on k8s as a Deployment

2022-09-07 Thread Austin Cawley-Edwards
Hey Gil, I'm referring to when a pod exits on its own, not when being deleted. Deployments only support the "Always" restart policy [1]. In my understanding, the JM only cleans up HA data when it is shutdown[2], after which the process will exit which leads to the problem with k8s Deployment

Re: Deploying Jobmanager on k8s as a Deployment

2022-09-07 Thread Gil De Grove
Hello Austin, I'm not aware of any limitations of deployement not letting pod exit (correctly or incorrectly). What do you mean by that exactly? Would it be possible for you to point out to piece of documentation that make you think that ? A pod, if correctly setup will be exited when receiving

Re: Mixed up session aggregations for same key

2022-09-07 Thread David Anderson
The way that Flink handles session windows is that every new event is initially assigned to its own session window, and then overlapping sessions are merged. I imagine this is why you are seeing so many calls to createAccumulator. This implementation choice is deeply embedded in the code; I don't

Re: Deploying Jobmanager on k8s as a Deployment

2022-09-07 Thread Austin Cawley-Edwards
Cool, thanks! How does it clean up the HA data, if the cluster is never able to shut down (due to the k8s Deployment restriction)? Best, Austin On Mon, Sep 5, 2022 at 6:51 PM Gyula Fóra wrote: > Hi! > > The operator supports both Flink native and standalone deployment modes > and in both cases

Re:关于 UDAF 里面 ListView 的疑问

2022-09-07 Thread Xuyang
Hi, 理论上来说,在你的case中,会先通过createAccumulator方法创建一个ListView作为acc,然后,每一个输入的row都会触发accumulate方法,将数据更新到刚才的acc中,最终通过getValue方法拿到当前acc的值。 实际测试中的NPE发生在更新acc的时候还是getValue的时候呢?可以通过在这三个阶段设一下断点,分别看一下当前持有的acc是不是同一个对象 -- Best! Xuyang 在 2022-09-07 16:23:25,"Zhiwen Sun" 写道: Hi,

Re: New licensing for Akka

2022-09-07 Thread Robin Cassan via user
Thanks a lot for your answers, this is reassuring! Cheers Le mer. 7 sept. 2022 à 13:12, Chesnay Schepler a écrit : > Just to squash concerns, we will make sure this license change will not > affect Flink users in any way. > > On 07/09/2022 11:14, Robin Cassan via user wrote: > > Hi all! > > It

Re: Slow Tests in Flink 1.15

2022-09-07 Thread Alexey Trenikhun
The class contains single test method, which runs single job (the job is quite complex), then waits for job being running after that waits for data being populated in output topic, and this doesn't happen during 5 minutes (test timeout). Tried under debugger, set breakpoint in Kafka record

Re: Cassandra sink with Flink 1.15

2022-09-07 Thread Chesnay Schepler
Are you running into this in the IDE, or when submitting the job to a Flink cluster? If it is the first, then you're probably affected by the Scala-free Flink efforts. Either add an explicit dependency on flink-streaming-scala or migrate to Flink tuples. On 07/09/2022 14:17, Lars Skjærven

Re: Slow Tests in Flink 1.15

2022-09-07 Thread Chesnay Schepler
The test that gotten slow; how many test cases does it actually contain / how many jobs does it actually run? Are these tests using the table/sql API? On 07/09/2022 14:15, Alexey Trenikhun wrote: We are also observing extreme slow down (5+ minutes vs 15 seconds) in 1 of 2 integration tests .

Cassandra sink with Flink 1.15

2022-09-07 Thread Lars Skjærven
Hello, When upgrading from 1.14 to 1.15 we bumped into a type issue when attempting to sink to Cassandra (scala 2.12.13). This was working nicely in 1.14. Any tip is highly appreciated. Using a MapFunction() to generate the stream of tuples: CassandraSink .addSink( mystream.map(new

Re: Slow Tests in Flink 1.15

2022-09-07 Thread Alexey Trenikhun
We are also observing extreme slow down (5+ minutes vs 15 seconds) in 1 of 2 integration tests . Both tests use Kafka. The slow test uses org.apache.flink.runtime.minicluster.TestingMiniCluster, this test tests complete job, which consumes and produces Kafka messages. Not affected test extends

Re: New licensing for Akka

2022-09-07 Thread Chesnay Schepler
Just to squash concerns, we will make sure this license change will not affect Flink users in any way. On 07/09/2022 11:14, Robin Cassan via user wrote: Hi all! It seems Akka have announced a licensing change https://www.lightbend.com/blog/why-we-are-changing-the-license-for-akka If I

Re: Flink upgrade path

2022-09-07 Thread Jing Ge
Hi, I would recommend you to check the release notes of 1.14[1] and 1.15[2]. If your Flink jobs are using Flink features that have big improvements in these two releases, it would be better to upgrade step by step without skipping 1.14.x. In general, depending on how complicated your jobs are,

Re: New licensing for Akka

2022-09-07 Thread Matthias Pohl via user
There is some more discussion going on in the related PR [1]. Based on the current state of the discussion, akka 2.6.20 will be the last version under Apache 2.0 license. But, I guess, we'll have to see where this discussion is heading considering that it's kind of fresh. [1]

Re: New licensing for Akka

2022-09-07 Thread Chesnay Schepler
We'll have to look into it. The license would apply to usages of Flink. That said, I'm not sure if we'd even be allowed to use Akka under that license since it puts significant restrictions on the use of the software. If that is the case, then it's either use a fork created by another party or

Re: flink作业生成保存点失败

2022-09-07 Thread Congxian Qiu
Hi 有 savepoint/checkpoint 失败时的具体 jobmanager log 以及失败 task 对应的 taskmanager log 的话可以发一下,大家帮助看一下 Best, Congxian Xuyang 于2022年8月30日周二 23:18写道: > > Hi,看起来这个报错是用于输出信息的文件找不到了,可以尝试加一下这个配置再试一下“taskmanager.log.path”,找一下导致tasks超时的根本原因。 > 还可以试一下用火焰图或jstack查看一下那几个tasks超时的时候是卡在哪个方法上。 > > > > > > > > > > >

New licensing for Akka

2022-09-07 Thread Robin Cassan via user
Hi all! It seems Akka have announced a licensing change https://www.lightbend.com/blog/why-we-are-changing-the-license-for-akka If I understand correctly, this could end-up increasing cost a lot for companies using Flink in production. Do you know if the Flink developers have any initial reaction

Re: Flink upgrade path

2022-09-07 Thread Congxian Qiu
In addition to the state compatibility mentioned above, the interfaces provided by Flink are stable if they have public annotation[1] [1] https://github.com/apache/flink/blob/master/flink-annotations/src/main/java/org/apache/flink/annotation/Public.java Best, Congxian Hangxiang Yu

关于 UDAF 里面 ListView 的疑问

2022-09-07 Thread Zhiwen Sun
Hello all, 我看 ListView 使用的时候,有以下示例 public class MyAccumulator { public ListView list = new ListView<>(); // or explicit: // {@literal @}DataTypeHint("ARRAY") // public ListView list = new ListView<>(); public long count = 0L; } public class MyAggregateFunction extends