Re: Metrics 指标不连续

2019-02-25 Thread Yaoting Gong
大家好: 我发下问题了, 在此回复下,防止有遇到同样问题的同学。 通过对比数据和看源码。发现flink 在push metrics 到 promgateway 的时候, 没有按照规范,给出“instance”值,可能导致 Prometheus 存储出现问题。 我调整了代码,每个实例,给了Java UUID 字符串后,发现数据恢复正常。 PR: https://github.com/apache/flink/pull/7820 On Wed, Feb 20, 2019 at 6:27 PM Yaoting Gong wrote: > 大家好: > > 我这边采用Flink

Re: left join failing with FlinkLogicalJoinConverter NPE

2019-02-25 Thread Xingcan Cui
Yes. Please check that. If it's the nested type's problem, this might be a bug. On Mon, Feb 25, 2019, 21:50 Karl Jin wrote: > Do you think something funky might be happening with Map/Multiset types? > If so how do I deal with it (I think I can verify by removing those columns > and retry?)? > >

Re: [Blink]使用 CsvTableSource 做维度表,在 csv 文件修改后无法更新

2019-02-25 Thread Jark Wu
Hi, Csv 维表主要拿来测试用的,不支持动态更新。blink中的hbase也支持维表,可以试下。 Best, Jark On Tue, 26 Feb 2019 at 09:21, Dian Fu wrote: > CSVTableSource不支持动态更新。自定义维表可以实现LookupableTableSource接口 > > > > 在 2019年2月25日,下午11:51,龙逸尘 写道: > > > > Hi all, > >我正在测试 Blink 的维度表 join 功能,参考 blink doc 上的案例,使用 CsvTableSource

user@flink.apache.org

2019-02-25 Thread Chua, Jace D.
jaym...@amazon.com

Re: Flink window triggering and timing on connected streams

2019-02-25 Thread Andrew Roberts
I’m not sure that approach will work for me, as I have many sessions going at the same time which can overlap. Also, I need to be able to have sessions time out if they never receive an end event. Do you know directly if setting a timer triggers when any timestamp passes that time, or when the

Re: left join failing with FlinkLogicalJoinConverter NPE

2019-02-25 Thread Karl Jin
Do you think something funky might be happening with Map/Multiset types? If so how do I deal with it (I think I can verify by removing those columns and retry?)? On Mon, Feb 25, 2019 at 6:28 PM Karl Jin wrote: > Thanks for checking in quickly, > > Below is what I got on printSchema on the two

Re: [Blink]sql client kafka sink 失败

2019-02-25 Thread 张洪涛
如果把kafka connector shade jar放在blink lib 下面 然后启动是没有问题的 但是放在sql client --jar 参数就有问题 我又多测试了几遍 发现class not found的类 是随机的 有什么建议么? 2019-02-26 10:36:10,343 ERROR org.apache.flink.kafka011.shaded.org.apache.kafka.common.utils.KafkaThread - Uncaught exception in kafka-producer-network-thread |

Re: cast from bigdecimal to bigint throws ArithmeticException

2019-02-25 Thread Timothy Victor
This is more a Java question than Flink per se. But I believe you need to specify the rounding mode because it is calling longValueExact. If it just called longValue it would have worked without throwing an exceptionbut you risk overflowing 64 bits and getting a totally erroneous answer.

cast from bigdecimal to bigint throws ArithmeticException

2019-02-25 Thread 徐涛
Hi Experts, There is a Flink table which has a column typed as java.math.BigDecimal, then in SQL I try to cast it to type long, cast(duration as bigint) however it throws the following exception: java.lang.ArithmeticException: Rounding necessary at

Re: Flink window triggering and timing on connected streams

2019-02-25 Thread Hequn Cheng
Hi Andrew, > I have an “end session” event that I want to cause the window to fire and purge. Do you want to fire the window only by the 'end session' event? I see one option to solve the problem. You can use a tumbling window(say 5s) and set your timestamp to t‘+5s each time receiving an 'end

cast from bigdecimal to bigint throws ArithmeticException

2019-02-25 Thread 徐涛
Hi Experts, There is a Flink table which has a column typed as java.math.BigDecimal, then in SQL I try to cast it to type long, cast(duration as bigint) however it throws the following exception: java.lang.ArithmeticException: Rounding necessary at

Collapsing watermarks after keyby

2019-02-25 Thread Padarn Wilson
Hi Flink Mailing List, Long story short - I want to somehow collapse watermarks at an operator across keys, so that keys with dragging watermarks do not drag behind. Details below: --- I have an application in which I want to perform the follow sequence of steps: Assume my data is made up of

What are blobstore files and why do they keep filling up /tmp directory?

2019-02-25 Thread Kumar Bolar, Harshith
Hi all, We're running Flink on a standalone five node cluster. The /tmp/ directory keeps filling with directories starting with blobstore--*. These directories are very large (approx 1 GB) and fill up the space very quickly and the jobs fail with a No space left of device error. The files in

Re: [Blink]使用 CsvTableSource 做维度表,在 csv 文件修改后无法更新

2019-02-25 Thread Dian Fu
CSVTableSource不支持动态更新。自定义维表可以实现LookupableTableSource接口 > 在 2019年2月25日,下午11:51,龙逸尘 写道: > > Hi all, >我正在测试 Blink 的维度表 join 功能,参考 blink doc 上的案例,使用 CsvTableSource 做维度表,指定了 > csv 文件路径。启动 flink 程序后,修改指定的 csv 文件,但是更新无法同步到已注册的维度表中。请问有什么解决办法吗? >另外想咨询下,如果想自己定义维度表的数据源,应该实现那些方法,有哪些注意事项?感谢!

Re: left join failing with FlinkLogicalJoinConverter NPE

2019-02-25 Thread Karl Jin
Thanks for checking in quickly, Below is what I got on printSchema on the two tables (left joining the second one to the first one on uc_pk = i_uc_pk). rowtime in both are extracted from the string field uc_update_ts root |-- uc_pk: String |-- uc_update_ts: String |-- rowtime:

StreamingFileSink on EMR

2019-02-25 Thread Bohinski, Kevin (Contractor)
When running Flink 1.7 on EMR 5.21 using StreamingFileSink we see java.lang.UnsupportedOperationException: Recoverable writers on Hadoop are only supported for HDFS and for Hadoop version 2.7 or newer. EMR is showing Hadoop version 2.8.5. Is anyone else seeing this issue?

Flink window triggering and timing on connected streams

2019-02-25 Thread Andrew Roberts
Hello, I’m trying to implement session windows over a set of connected streams (event time), with some custom triggering behavior. Essentially, I allow very long session gaps, but I have an “end session” event that I want to cause the window to fire and purge. I’m assigning timestamps and

Re: left join failing with FlinkLogicalJoinConverter NPE

2019-02-25 Thread Xingcan Cui
Hi Karl, It seems that some field types of your inputs were not properly extracted. Could you share the result of `printSchema()` for your input tables? Best, Xingcan > On Feb 25, 2019, at 4:35 PM, Karl Jin wrote: > > Hello, > > First time posting, so please let me know if the formatting

left join failing with FlinkLogicalJoinConverter NPE

2019-02-25 Thread Karl Jin
Hello, First time posting, so please let me know if the formatting isn't correct, etc. I'm trying to left join two Kafka sources, running 1.7.2 locally, but getting the below exception. Looks like some sort of query optimization process but I'm not sure where to start investigating/debugging. I

Re: Metrics for number of "open windows"?

2019-02-25 Thread Konstantin Knauf
Hi Andrew, when you implement your own Trigger or customize an existing Trigger you get access to the TriggerContext in all callbacks of the Trigger interface. The TriggerContext allows you to register custom metrics via TriggerContext:getMetricGroup() and you can use partitioned state scoped to

Flink 1.6.4 signing key file in docker-flink repo?

2019-02-25 Thread William Saar
Trying to build a new Docker image by replacing 1.6.3 with 1.6.4 in the Dockerfile found here ( https://github.com/docker-flink/docker-flink ), but seems to require a new signing key, Is it available somewhere? Getting + wget -nv -O flink.tgz.asc

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-02-25 Thread Andrey Zagrebin
Hi Sohi, There seems to be no avro implementations of Encoder interface used in StreamingFileSink but maybe it could be implemented based on AvroKeyValueWriter with not such a big effort. There is also a DefaultRollingPolicy which is based on time and number of records. It might create a

[Blink]使用 CsvTableSource 做维度表,在 csv 文件修改后无法更新

2019-02-25 Thread 龙逸尘
Hi all, 我正在测试 Blink 的维度表 join 功能,参考 blink doc 上的案例,使用 CsvTableSource 做维度表,指定了 csv 文件路径。启动 flink 程序后,修改指定的 csv 文件,但是更新无法同步到已注册的维度表中。请问有什么解决办法吗? 另外想咨询下,如果想自己定义维度表的数据源,应该实现那些方法,有哪些注意事项?感谢!

Re: Standalone HA cluster: Fatal error occurred in the cluster entrypoint.

2019-02-25 Thread Alexander Smirnov
Hi all, I am getting similar exception while upgrading from Flink 1.4 to 1.6: ``` 06 Feb 2019 14:37:34,080 ERROR org.apache.flink.runtime.entrypoint.ClusterEntrypoint - Fatal error occurred in the cluster entrypoint. java.lang.RuntimeException: org.apache.flink.util.FlinkException: Could not

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-02-25 Thread sohimankotia
Hi Erik, I am still not able to understand reason behind this exception. Is this exception causing failure and restart of job ? or This is occurring after failure/restart is triggered . Thanks Sohi -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-02-25 Thread sohimankotia
Hi Andrey, I am using AvroSinkWriter (with Bucketing Sink) with compression enabled . Looks like StreamingFileSink does not have direct support for AvroSinkWriter. Sequence File Format is there for StreamingFileSink , but looks like it roll files on every checkpoint (OnCheckpointRollingPolicy)

Share broadcast state between multiple operators

2019-02-25 Thread Richard Deurwaarder
Hi All, Due to the way our code is structured, we would like to use the broadcast state at multiple points of our pipeline. So not only share it between multiple instances of the same operator but also between multiple operators. See the image below for a simplified example. Flink does not seem

Re: Metrics for number of "open windows"?

2019-02-25 Thread Andrey Zagrebin
Hi Andrew, Just to add the Rong's answer, if you use RocksDB state backend, you can activate state metrics forwarded from RocksDB [1]. Best, Andrey [1] https://ci.apache.org/projects/flink/flink-docs-stable/monitoring/metrics.html#rocksdb On Thu, Feb 21, 2019 at 11:22 PM Rong Rong wrote: >

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-02-25 Thread sohimankotia
Thanks Andrey . Yeah will upgrade and see if same gets reproduced . -Sohi -- Sent from: http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/

Re: Flink Streaming Job Task is getting cancelled silently and causing job to restart

2019-02-25 Thread Andrey Zagrebin
Hi Sohi, I would also recommend trying the newer StreamingFileSink which is available in Flink 1.7.x [1]. Best, Andrey [1] https://ci.apache.org/projects/flink/flink-docs-release-1.7/dev/connectors/streamfile_sink.html On Sun, Feb 24, 2019 at 4:14 AM sohimankotia wrote: > Hi Erik, > > Are