Failure to recover from failed checkpoint when using S3 streaming file sink

2022-09-10 Thread Oran Shuster via user
This issue has been going on for a while but i couldn't figure it out no matter what I tried Some general info Flink 1.14.5 with checkpoint/HA storage in S3 we have 3 jobs which are identical code the only difference is which kafka topic is read and what prefix is used in the S3 sink this means tha

RE: Flink logging

2022-01-25 Thread Oran Shuster
It's hard for me to see the issue from what you posted, However i can post how i added that jar to my flink pods and you can compare Instead of creating a custom image i loaded the JAR as a config map You can create a configMap easily from a file: 1.Download the jar you want 2.Create the configMap

Example for Jackson JsonNode Kafka serialization schema

2022-01-25 Thread Oran Shuster
In the documentation we have an example on how to implement deserialization from bytes to Jackson ObjectNode objects - JSONKeyValueDeserializationSchema https://nightlies.apache.org/flink/flink-docs-release-1.13/docs/connectors/datastream/kafka/ However, there is no example on the other direction:

Re: I/O reactor status: STOPPED after moving to elasticsearch7 connector

2021-10-15 Thread Oran Shuster
07:43:38, Itamar Syn-Hershko wrote: > Hi Oran, can you check your ES logs / metrics? > > Most issues we see with the ES sink are around incorrect batching and/or > overloaded clusters. Could it be your ES write queue is building up? > > On Wed, Oct 13, 2021 at 1:06 AM Oran Shust

Re: Apache/flink 1.13.1 images re-pushed and in arm64 arch?

2021-08-11 Thread Oran Shuster
Chesnay Schepler wrote: > > > > Ah, yes I was looking at the wrong one. I'll figure out what happened > > and get the situation fixed. > > > > Overall though, I'd recommend to rely on https://hub.docker.com/_/flink. > > > > On 10/08/2021 16:04, Oran Sh

Re: Apache/flink 1.13.1 images re-pushed and in arm64 arch?

2021-08-10 Thread Oran Shuster
listed as > amd64, with the last push being 5 days ago. > Perhaps some error happened during the rebuild, which has since been > corrected? > > On 10/08/2021 00:43, Oran Shuster wrote: > > Our 1.13.1 k8s deployments started failing with > > tandard_init_linux.go:228: exec

Apache/flink 1.13.1 images re-pushed and in arm64 arch?

2021-08-09 Thread Oran Shuster
Our 1.13.1 k8s deployments started failing with tandard_init_linux.go:228: exec user process caused: exec format error Not understanding how an old image with an old depoyment starts failing like that i looked at the apache/flink dockerhub page. Although 1.13.1 was released more than a month ago,

Flink running in k8s pod - pod is able to access S3 bucket, flink does not

2021-01-27 Thread Oran Shuster
So i'm really stumped on this for a couple of days now Some general info - Flink version 1.12.1, using k8s HA service. The k8s is self managed on AWS our checkpoints and savepoints are on s3, i created a new bucket just for it and set the proper permissions to the k8s node The job manager is wor