date:20180629

Re: help understand/debug high memory footprint on jobmanager

2018-06-29 Thread Steven Wu

Thanks everyone for jumping in. BTW, we are using flink-1.4.1. deployment is stand-alone mode. here is the JIRA: https://issues.apache.org/jira/browse/FLINK-9693 On Fri, Jun 29, 2018 at 12:09 PM, Stephan Ewen wrote: > Just saw Stefan's response, it is basically the same. > > We either null out

error: object connectors is not a member of package org.apache.flink.streaming

2018-06-29 Thread Mich Talebzadeh

I am following this Flink Kafka example https://stackoverflow.com/questions/31446374/can-anyone-share-a-flink-kafka-example-in-scala This is my edited program. I am using Flink 1.5 in flink-scala shell import org.apache.flink.streaming.api.scala._ import

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

2018-06-29 Thread Mich Talebzadeh

As it turned out in the application log, it could mot find yarn configuraration! not anything to do with port Caused by: java.lang.ClassNotFoundException: org.apache.hadoop.yarn.conf.YarnConfiguration. I had installed flink *w**ithout bundled Hadoop* and my version of Hadoop is 3.1 I went back

Re: help understand/debug high memory footprint on jobmanager

2018-06-29 Thread Stephan Ewen

Just saw Stefan's response, it is basically the same. We either null out the field on deploy or archival. On deploy would be even more memory friendly. @Steven - can you open a JIRA ticket for this? On Fri, Jun 29, 2018 at 9:08 PM, Stephan Ewen wrote: > The problem seems to be that the

Re: help understand/debug high memory footprint on jobmanager

2018-06-29 Thread Stephan Ewen

The problem seems to be that the Executions that are kept for history (mainly metrics / web UI) still hold a reference to their TaskStateSnapshot. Upon archival, that field needs to be cleared for GC. This is quite clearly a bug... On Fri, Jun 29, 2018 at 11:29 AM, Stefan Richter <

Re: Over Window Not Processing Messages

2018-06-29 Thread Gregory Fee

Thanks! I'm working on a way to deliver the data in order (or closer to in order) and deliver watermarks more often. I'll let you know my results. On Thu, Jun 28, 2018 at 5:36 AM, Fabian Hueske wrote: > In a nutshell the Over operator works as follows: > - When a row arrives it is put into a

Re: Flink job hangs/deadlocks (possibly related to out of memory)

2018-06-29 Thread gerardg

(fixed formatting) Hello, We have experienced some problems where a task just hangs without showing any kind of log error while other tasks running in the same task manager continue without problems. When these tasks are restarted the task manager gets killed and shows several errors similar to

Re: How to deploy Flink in a geo-distributed environment

2018-06-29 Thread Stephen

Thanks! On Thu, Jun 28, 2018 at 7:32 PM Tzu-Li (Gordon) Tai wrote: > Hi, > > It should be possible to deploy a single Flink cluster across > geo-distributed nodes, but Flink currently offers no optimization for such > a specific use case. > AFAIK, the general pattern for dealing with

Flink job hangs/deadlocks (possibly related to out of memory)

2018-06-29 Thread gerardg

Hello,We have experienced some problems where a task just hangs without showing any kind of log error while other tasks running in the same task manager continue without problems. When these tasks are restarted the task manager gets killed and shows several errors similar to these

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

2018-06-29 Thread Hequn Cheng

port should be consistent. 1> nc -l 2219 2>./bin/flink run examples/streaming/SocketWindowWordCount.jar --port 2219 On Fri, Jun 29, 2018 at 9:21 PM, Mich Talebzadeh wrote: > thanks Hequn. > > This the port I started with > > hduser@rhes75: /data6/hduser/flink-1.5.0> nc -l 2219 > hello > > and

Re: Using Google Cloud Storage for checkpointing

2018-06-29 Thread Till Rohrmann

Hi Rohil, this sounds a little bit strange. If the GoogleHadoopFileSystem jar is on the classpath and the implementation is specified in core-site.xml, then the Hadoop Filesystem should be able to load the GCS filesystem. I just tried it out locally (without K8s though) and it seemed to work.

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

2018-06-29 Thread Mich Talebzadeh

thanks Hequn. This the port I started with hduser@rhes75: /data6/hduser/flink-1.5.0> nc -l 2219 hello and as I expected I should collect from port 2219? However, I did what you suggested ./bin/flink run examples/streaming/SocketWindowWordCount.jar --port 2199 Starting execution of program

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

2018-06-29 Thread Hequn Cheng

Hi Mich, You port is not matching. Start netcat with "nc -l 2219 &", but run flink job with "--port 2199". On Fri, Jun 29, 2018 at 8:40 PM, Mich Talebzadeh wrote: > Hi, > > I have installed flink 1.5 in standalone mode and trying a basic run as > per this example > >

First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

2018-06-29 Thread Mich Talebzadeh

Hi, I have installed flink 1.5 in standalone mode and trying a basic run as per this example https://ci.apache.org/projects/flink/flink-docs-release-1.2/quickstart/setup_quickstart.html started netcat on port 2199 nc -l 2219 & Run the example ./bin/flink run

Re: Data Type of timestamp in Streaming SQL Result? Long instead of timestamp?

2018-06-29 Thread chrisr123

Thank you Hequn, I got it working. Here is the tumbling window query, in both SQL and Table API. I'm getting same results with these: SQL API tableEnvironment.registerDataStream("pageViews", eventStream, "pageViewTime.rowtime, username, url"); String continuousQuery = "SELECT

Re: help understand/debug high memory footprint on jobmanager

2018-06-29 Thread Stefan Richter

Hi Steven, from your analysis, I would conclude the following problem. ExecutionVertexes hold executions, which are bootstrapped with the state (in form of the map of state handles) when the job is initialized from a checkpoint/savepoint. It holds a reference on this state, even when the task

Re: Let BucketingSink roll file on each checkpoint

2018-06-29 Thread zhangminglei

By the way, I do not think below is a correct way. As @ Fabian said. The BucketingSink closes files once they reached a certain size (BatchSize) or have not been written to for a certain amount of time (InactiveBucketThreshold). > . If we can close > file during checkpoint, then the result is

Re: Let BucketingSink roll file on each checkpoint

2018-06-29 Thread zhangminglei

Hi Xilang I think you are doing a together work with the offline team. Also what you said ETL, ETL team want to use the data in HDFS. I would like to confirm one question from you. What is their scheduling time for every job ? 5mins or 10 mins ? > My user case is we read data from message

Re: Web history limit in flink 1.5

2018-06-29 Thread Chesnay Schepler

https://ci.apache.org/projects/flink/flink-docs-release-1.5/ops/config.html#jobstore-expiration-time On 28.06.2018 11:15, eSKa wrote: Hello, we were playing around with flink 1.5 - so far so good. Only thing that we are missing is web history setup. In flink 1.4 and before we were using

Re: help understand/debug high memory footprint on jobmanager

error: object connectors is not a member of package org.apache.flink.streaming

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

Re: help understand/debug high memory footprint on jobmanager

Re: help understand/debug high memory footprint on jobmanager

Re: Over Window Not Processing Messages

Re: Flink job hangs/deadlocks (possibly related to out of memory)

Re: How to deploy Flink in a geo-distributed environment

Flink job hangs/deadlocks (possibly related to out of memory)

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

Re: Using Google Cloud Storage for checkpointing

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

Re: First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

First try running I am getting org.apache.flink.client.program.ProgramInvocationException: Could not retrieve the execution result.

Re: Data Type of timestamp in Streaming SQL Result? Long instead of timestamp?

Re: help understand/debug high memory footprint on jobmanager

Re: Let BucketingSink roll file on each checkpoint

Re: Let BucketingSink roll file on each checkpoint

Re: Web history limit in flink 1.5

19 matches

Site Navigation

Mail list logo

Footer information