Flink 1.10 -> Savepoints referencing to checkpoints or not

2020-11-16 Thread Bajaj, Abhinav
Hi, I am trying to understand the Flink 1.10 savepoints related documentation that mentions - “When triggering a savepoint, a new savepoint directory is created where the data as well

Re: Flink CPU load metrics in K8s

2020-08-14 Thread Bajaj, Abhinav
Awesome. This is exactly what I was going to look for. Thanks much. ~ Abhinav From: Arvid Heise Date: Thursday, August 13, 2020 at 12:33 AM To: "Bajaj, Abhinav" Cc: Xintong Song , "user@flink.apache.org" , Roman Grebennikov Subject: Re: Flink CPU load metrics in K8s Hi

Re: Flink CPU load metrics in K8s

2020-08-12 Thread Bajaj, Abhinav
the expected CPU usage for the containers. I was hoping that someone in the community may have already run into this behavior on K8s and can share their specific experience. Thanks much. ~ Abhinav Bajaj From: Xintong Song Date: Wednesday, August 12, 2020 at 3:56 AM To: "Bajaj, Abhinav"

Re: Flink CPU load metrics in K8s

2020-08-11 Thread Bajaj, Abhinav
Hi, Reaching out to folks running Flink on K8s. ~ Abhinav Bajaj From: "Bajaj, Abhinav" Date: Wednesday, August 5, 2020 at 1:46 PM To: Roman Grebennikov , "user@flink.apache.org" Subject: Re: Flink CPU load metrics in K8s Thanks Roman for providing the details. I also ma

Re: Flink CPU load metrics in K8s

2020-08-05 Thread Bajaj, Abhinav
the metrics. As for me personally, JVM.CPU.Time is a clearer indicator of CPU usage: it is an always-increasing count of the milliseconds the CPU has spent executing your code. It will also catch CPU usage spikes. Roman Grebennikov | g...@dfdx.me On Mon, Aug 3, 2020, at 23:34, Bajaj, Abhinav

Flink CPU load metrics in K8s

2020-08-03 Thread Bajaj, Abhinav
Hi, I am trying to understand the CPU Load metrics reported by Flink 1.7.1 running with openjdk 1.8.0_212 on K8s. After deploying the Flink Job on K8s, I tried to get CPU Load metrics following this

Re: RocksDB default logging configuration

2020-04-28 Thread Bajaj, Abhinav
. Thanks much, ~ Abhinav Bajaj From: Yun Tang Date: Monday, April 27, 2020 at 8:17 PM To: "Bajaj, Abhinav" , "user@flink.apache.org" Cc: Chesnay Schepler Subject: Re: RocksDB default logging configuration

Re: RocksDB default logging configuration

2020-04-27 Thread Bajaj, Abhinav
, if possible. I am using Flink 1.7.1. Thanks Chesnay for your response below. ~ Abhinav Bajaj From: Chesnay Schepler Date: Wednesday, April 22, 2020 at 11:17 PM To: "Bajaj, Abhinav" , "user@flink.apache.org" Subject: Re: RocksDB default logging configuration

Re: RocksDB default logging configuration

2020-04-22 Thread Bajaj, Abhinav
Bumping this one again to catch some attention. From: "Bajaj, Abhinav" Date: Monday, April 20, 2020 at 3:23 PM To: "user@flink.apache.org" Subject: RocksDB default logging configuration Hi, Some of our teams ran into the disk space issues because of RocksDB default l

RocksDB default logging configuration

2020-04-20 Thread Bajaj, Abhinav
Hi, Some of our teams ran into disk space issues because of the RocksDB default logging configuration (FLINK-15068). It seems the suggested workaround uses the OptionsFactory to set some of the parameters from inside the job. Since we

Re: How to test flink job recover from checkpoint

2020-03-05 Thread Bajaj, Abhinav
I implemented a custom function that throws a runtime exception. You can extend the simpler MapFunction or the more complicated RichParallelSourceFunction, depending on your use case, and add logic to throw a runtime exception on a certain condition in the map or run method. .

Re: How to test flink job recover from checkpoint

2020-03-04 Thread Bajaj, Abhinav
I implemented a custom function that throws a runtime exception. You can extend the simpler MapFunction or the more complicated RichParallelSourceFunction, depending on your use case, and add logic to throw a runtime exception on a certain condition in the map or run method. .
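The failure-injection idea in this thread can be sketched without any Flink dependency. The snippet below is a minimal, framework-agnostic illustration in Python, not Flink API: `FailOnceMapper` and `run_with_restart` are hypothetical names, and the `position` counter merely stands in for checkpointed progress. In a real Flink job the function instance may be recreated on restart, so the "already failed" flag would probably need to live somewhere that survives the restart; treat that as an assumption to verify for your setup.

```python
class FailOnceMapper:
    """Hypothetical stand-in for a map function; this is not a Flink class."""

    def __init__(self, fail_on):
        self.fail_on = fail_on      # value that triggers the injected failure
        self.has_failed = False     # ensures the failure happens only once

    def map(self, value):
        # Throw exactly once when the trigger value is seen, so that the
        # retry after "recovery" can make progress.
        if value == self.fail_on and not self.has_failed:
            self.has_failed = True
            raise RuntimeError(f"Injected failure on value {value!r}")
        return value * 2


def run_with_restart(mapper, values, max_restarts=1):
    """Toy driver: after a failure, resume from the last recorded position."""
    results = []
    position = 0   # index of the next unprocessed element (stands in for checkpointed state)
    restarts = 0
    while position < len(values):
        try:
            results.append(mapper.map(values[position]))
            position += 1
        except RuntimeError:
            if restarts >= max_restarts:
                raise               # give up once the restart budget is used
            restarts += 1           # "restart" and retry from the saved position
    return results, restarts


if __name__ == "__main__":
    output, restart_count = run_with_restart(FailOnceMapper(fail_on=3), [1, 2, 3, 4])
    print(output, restart_count)
```

The point of failing only once is that the test can assert both that a restart happened and that no element was lost or duplicated afterwards.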

Re: JobMaster does not register with ResourceManager in high availability setup

2020-03-04 Thread Bajaj, Abhinav
complaining for not being able to connect to zookeeper after that. ~ Abhinav Bajaj From: "Bajaj, Abhinav" Date: Wednesday, March 4, 2020 at 12:01 PM To: Xintong Song Cc: "user@flink.apache.org" Subject: Re: JobMaster does not register with ResourceManager in high availa

Re: JobMaster does not register with ResourceManager in high availability setup

2020-03-04 Thread Bajaj, Abhinav
Thanks Xintong for pointing that out. I will dig deeper and get back with my findings. ~ Abhinav Bajaj From: Xintong Song Date: Tuesday, March 3, 2020 at 7:36 PM To: "Bajaj, Abhinav" Cc: "user@flink.apache.org" Subject: Re: JobMaster does not register with Res

JobMaster does not register with ResourceManager in high availability setup

2020-03-03 Thread Bajaj, Abhinav
Hi, We recently came across an issue where the JobMaster does not register with the ResourceManager in a Flink high availability setup. Let me share the details below. Setup * Flink 1.7.1 * K8s * High availability mode with a single JobManager and 3 ZooKeeper nodes in quorum. Scenario *

Re: Flink logging issue with logback

2020-01-08 Thread Bajaj, Abhinav
> well. >> >> It is so weird and I have no idea how to debug it. >> Best, >> Yang >> >> Bajaj, Abhinav <abhinav.ba...@here.com> wrote on Tue, Jan 7, 2020 at 4:28 AM: >> >> Hi, >> >> Thanks much for

Re: Flink logging issue with logback

2020-01-06 Thread Bajaj, Abhinav
Flink. Is this expected or a known difference? Thanks again, Abhinav Bajaj PS: Ahh. I see how my email was confusing the first time. Hopefully this one is better :P From: Dawid Wysakowicz Date: Monday, January 6, 2020 at 5:13 AM Cc: "Bajaj, Abhinav" , "user@flink.apache.

Flink logging issue with logback

2020-01-03 Thread Bajaj, Abhinav
Hi, I am investigating a logging issue with Flink. Setup * Using Flink 1.7.1 with logback as suggested in Flink documentation here. * Submitting the Flink job

Re: Savepoint process recovery in Jobmanager HA setup

2019-07-29 Thread Bajaj, Abhinav
Thanks much for your response. I was also suspecting the same and just wanted to confirm. I guess the best way forward for now is to request savepoint again. ~ Abhi From: Yun Tang Date: Saturday, July 27, 2019 at 7:35 AM To: "Bajaj, Abhinav" , "user@flink.apache.org" Su

Savepoint process recovery in Jobmanager HA setup

2019-07-26 Thread Bajaj, Abhinav
Hi, I am trying to test a scenario that triggers a savepoint on a Flink 1.7.1 Job deployed with jobmanager HA mode. The purpose is to check if savepoint process recovers if the leader jobmanager fails during the savepoint. During my testing, I found that the new leader jobmanager returns the

Re: Flink 1.2.1 - Job initial start time

2018-05-31 Thread Bajaj, Abhinav
Adding my team mate. ~ Abhinav Bajaj From: "Bajaj, Abhinav" Date: Thursday, May 31, 2018 at 9:30 AM To: "user@flink.apache.org" Subject: Flink 1.2.1 - Job initial start time Hi, Is there a way to know the initial start time of a job? I am looking for the first

Flink 1.2.1 - Job initial start time

2018-05-31 Thread Bajaj, Abhinav
Hi, Is there a way to know the initial start time of a job? I am looking for the first time the job was started and not the time of the restart. It seems the “start-time” reported by monitoring API under

Re: Akka heartbeat configurations

2018-05-16 Thread Bajaj, Abhinav
this time low in order to keep the time to catch up low. Fault tolerance guarantees should not be affected. I hope this helps. Regards, Timo On 15.05.18 at 01:42, Bajaj, Abhinav wrote: Hi, We are running into issues where GC pauses result in TaskManagers being marked dead incorrectly. Flin

Akka heartbeat configurations

2018-05-14 Thread Bajaj, Abhinav
Hi, We are running into issues where GC pauses result in TaskManagers being marked dead incorrectly. The Flink documentation describes some Akka configuration knobs to play

Re: Unable to load AWS credentials: Flink 1.2.1 + S3 + Kubernetes

2018-04-05 Thread Bajaj, Abhinav
e.org> Cc: "dyana.rose" <dyana.r...@salecycle.com>, "Bajaj, Abhinav" <abhinav.ba...@here.com> Subject: Re: Unable to load AWS credentials: Flink 1.2.1 + S3 + Kubernetes Hi! This is pretty much all in Hadoop's magic, from Flink's view, once this h

Re: Unable to load AWS credentials: Flink 1.2.1 + S3 + Kubernetes

2018-04-02 Thread Bajaj, Abhinav
<https://issues.apache.org/jira/browse/FLINK-8439> Dyana On 2018/03/28 02:15:06, "Bajaj, Abhinav"

Unable to load AWS credentials: Flink 1.2.1 + S3 + Kubernetes

2018-03-27 Thread Bajaj, Abhinav
Hi, I am trying to use Flink 1.2.1 with RocksDB as the state backend and S3 for checkpoints. I am using Flink 1.2.1 docker images and running them in a Kubernetes cluster. I have followed the steps documented in the Flink documentation -

Re: Alternatives to FLINK_CONF_DIR, HADOOP_CONF_DIR, YARN_CONF_DIR

2017-03-23 Thread Bajaj, Abhinav
I think the FLINK_CONF_DIR points to the conf directory. This is the place where the Flink CLI looks for the flink-conf.yaml file. I think there is an alternate option for HADOOP_CONF_DIR, YARN_CONF_DIR but I am not sure. Check this

Re: Job fails to start with S3 savepoint

2017-03-20 Thread Bajaj, Abhinav
Abhinav Bajaj Lead Engineer HERE Predictive Analytics Office: +12062092767 Mobile: +17083299516 HERE Seattle 701 Pike Street, #2000, Seattle, WA 98101, USA 47° 36' 41" N. 122° 19' 57" W HERE Maps From: "Bajaj, Abhinav" <abhinav.ba...@here.com> Date: Monday,

Re: Job fails to start with S3 savepoint

2017-03-20 Thread Bajaj, Abhinav
Regards, Timo On 18/03/17 at 01:33, Bajaj, Abhinav wrote: Hi, I am trying to explore using S3 for storing checkpoints and savepoints. I can get Flink to store the checkpoints and savepoints in S3. However, when I try to submit the s

Job fails to start with S3 savepoint

2017-03-17 Thread Bajaj, Abhinav
Hi, I am trying to explore using S3 for storing checkpoints and savepoints. I can get Flink to store the checkpoints and savepoints in S3. However, when I try to submit the same job using the stored savepoint, it fails with the exception below. I am using Flink 1.2 and submitted the job from the UI

Re: Documentation for translation of Job graph to Execution graph

2016-06-29 Thread Bajaj, Abhinav
job name to the logs, so it's indeed not very easy to separate the log entries generated by different jobs. In general, we recommend running one JobManager per job (multiple jobs is of course also supported). On Sat, Jun 18, 2016 at 1:41 AM, Bajaj, Abhinav <abhinav.ba...@here.com>

Re: Documentation for translation of Job graph to Execution graph

2016-06-21 Thread Bajaj, Abhinav
y different jobs. In general, we recommend running one JobManager per job (multiple jobs is of course also supported). On Sat, Jun 18, 2016 at 1:41 AM, Bajaj, Abhinav <abhinav.ba...@here.com> wrote: Hi, Thanks for sharing this link. I have not see

Re: Documentation for translation of Job graph to Execution graph

2016-06-17 Thread Bajaj, Abhinav
ts/flink/flink-docs-release-1.0/concepts/concepts.html. With what parts are you having trouble? Maybe I can help. Cheers, Aljoscha On Thu, 16 Jun 2016 at 19:31 Bajaj, Abhinav <abhinav.ba...@here.com> wrote: Hi, When troubleshooting a flink job, it is tricky

Documentation for translation of Job graph to Execution graph

2016-06-16 Thread Bajaj, Abhinav
Hi, When troubleshooting a flink job, it is tricky to map the Job graph (application code) to the logs & monitoring REST APIs. So, I am trying to find documentation on how a Job graph is translated to Execution graph. I found this -

Re: Submit Flink Jobs to YARN running on AWS

2016-06-06 Thread Bajaj, Abhinav
apache.org>" <user@flink.apache.org> Subject: Re: Submit Flink Jobs to YARN running on AWS Hi Abhi, I'm also looking to deploy Flink jobs remotely to YARN, and eventually automate it - just wondering if you found a way to do it? Thanks, Josh On We

Re: Submitting job via Flink web interface giving Null pointer exception

2016-06-02 Thread Bajaj, Abhinav
ng into Null pointer exception on an internal codebase. Weird thing is that it does not happen when submitting the job via commandline On Thu, Jun 2, 2016 at 2:08 AM, Bajaj, Abhinav <abhinav.ba...@here.com> wrote: Hi Ritesh, We are using Flink 1.0.0 and don’

Re: Submitting job via Flink web interface giving Null pointer exception

2016-06-01 Thread Bajaj, Abhinav
Hi Ritesh, We are using Flink 1.0.0 and don’t have issues with using the command line or the web interface. Can you please share some details on the command you are using and probably a screen shot of the web interface with the values used for submission. You are running Flink standalone or

Re: Submit Flink Jobs to YARN running on AWS

2016-05-24 Thread Bajaj, Abhinav
Hi, Has anyone tried to submit a Flink job remotely to YARN running in AWS? The case I am stuck with is where the Flink client is on my laptop and YARN is running on AWS. @Robert, Did you get a chance to try this out? Regards, Abhi From: "Bajaj, Abhinav" <abhinav.b

Re: Import Configuration File in Flink Cluster

2016-05-23 Thread Bajaj, Abhinav
I was gonna post the exact question and noticed this thread. It will be great if we can have a method in parameter tool to load from resources. Thanks Simon :) Abhinav Bajaj Senior Engineer HERE Predictive Analytics Office: +12062092767 Mobile: +17083299516 HERE Seattle 701 Pike Street,

Re: Submit Flink Jobs to YARN running on AWS

2016-04-29 Thread Bajaj, Abhinav
ave very similar architectures and will face the same issues. I guess they have some recipes for such situations. Regards, Robert On Tue, Apr 26, 2016 at 10:47 AM, Robert Metzger <rmetz...@apache.org> wrote: Hi Abhi, I'll try to reproduce the issue and com

Re: Submit Flink Jobs to YARN running on AWS

2016-04-25 Thread Bajaj, Abhinav
] https://ci.apache.org/projects/flink/flink-docs-release-1.0/setup/yarn_setup.html [2] https://ci.apache.org/projects/flink/flink-docs-release-1.0/apis/cli.html 2016-03-08 0:25 GMT+01:00 Bajaj, Abhinav <abhinav.ba...@here.com>: Hi, I am a newbie

Re: Submit Flink Jobs to YARN running on AWS

2016-03-09 Thread Bajaj, Abhinav
it correctly: Flink binds its ports to the internal IP addresses, so you cannot send a message through the external IP addresses. Can you see if you can configure explicitly the external IP address as the JobManager hostname, so the JobManager will bind to that specific network interface? St

Submit Flink Jobs to YARN running on AWS

2016-03-07 Thread Bajaj, Abhinav
Hi, I am a newbie to Flink and trying to use it in AWS. I have created a YARN cluster on AWS EC2 machines. I am trying to submit a Flink job to the remote YARN cluster using the Flink client running on my local machine. The JobManager starts successfully on the YARN container but the client is not