Re: Performance Issues in Source Operator while migrating to Flink-1.14 from 1.9

2022-02-16 Thread Arujit Pradhan
t; https://twitter.com/MartijnVisser82 > > > On Tue, 15 Feb 2022 at 13:07, Arujit Pradhan > wrote: > >> Hey team, >> >> We are migrating our Flink codebase and a bunch of jobs from Flink-1.9 to >> Flink-1.14. To ensure uniformity in performance we ran a bunch

Performance Issues in Source Operator while migrating to Flink-1.14 from 1.9

2022-02-15 Thread Arujit Pradhan
Hey team, We are migrating our Flink codebase and a bunch of jobs from Flink-1.9 to Flink-1.14. To ensure uniformity in performance we ran a bunch of jobs for a week both in 1.9 and 1.14 simultaneously with the same resources and configurations and monitored them. Though most of the jobs are runn

Provide DataTypeHint for ScalarUDF where the return type is Object[]

2022-01-03 Thread Arujit Pradhan
Hey team, We are migrating our Flink codes from Flink-1.9 to Flink-1.14 and as a part of this, we are updating a bunch of UDFs. Wanted to understand, how to provide *data type hints for the UDFs which return Object[]*. For example, if the return type is simply Object something like this works. *

Re: Alternate to PreserveWatermark() in recent Flink versions

2021-10-25 Thread Arujit Pradhan
:17 PM JING ZHANG wrote: > Hi, > I'm not sure I understand your requirement. > However, are you looking for `PreserveWatermarks` in package > `org.apache.flink.table.sources.wmstrategies`? > > Best, > JING ZHANG > > > Arujit Pradhan 于2021年10月25日周一 下午4:02写道: &

Alternate to PreserveWatermark() in recent Flink versions

2021-10-25 Thread Arujit Pradhan
Hi all, We maintain an Open-sourced project for protobuf data processing using Flink dagger . But we are currently on Flink-1.9 and want to migrate to the latest stable 1.14. In the older version, we use `*StreamTableSource` *and ` *DefinedRowtimeAttributes` *APIs

Tests in FileUtilsTest while building Flink in local

2020-02-19 Thread Arujit Pradhan
Hi all, I was trying to build Flink in my local machine and these two unit tests are failing. *[ERROR] Errors:[ERROR] FileUtilsTest.testCompressionOnRelativePath:261->verifyDirectoryCompression:440 » NoSuchFile[ERROR] FileUtilsTest.testDeleteDirectoryConcurrently » FileSystem /var/folders/x9/

Open Method is not being called in case of AggregateFunction UDFs

2019-12-11 Thread Arujit Pradhan
Hi all, So we are creating some User Defined Functions of type AggregateFunction. And we want to send some static metrics from the *open()* method of the UDFs as we can get *MetricGroup *by *FunctionContext *which is only exposed in the open method. Our code looks something like this(Which is an i

Re: Compound Time interval in SQL queries

2019-11-21 Thread Arujit Pradhan
LE ( rowtime, INTERVAL '7' HOUR + INTERVAL '30' MINUTE) > > thanks, > godfrey > > -- > 发件人:Arujit Pradhan > 发送时间:2019年11月21日(星期四) 16:23 > 收件人:user > 主 题:Compound Time interval in SQL quer

Compound Time interval in SQL queries

2019-11-21 Thread Arujit Pradhan
Hi all, Is there a way to define a compound time interval(that can consist of both HOUR and MINUTE) in windows in a Flink SQL query. For example, we want to do something like this: SELECT count(1) AS event_count , TUMBLE_END(rowtime, INTERVAL '7' HOUR AND '30' MINUTE) AS window_timestamp FROM `da