[jira] [Created] (FLINK-22912) Support state ttl in Python DataStream API

2021-06-07 Thread Dian Fu (Jira)
Dian Fu created FLINK-22912: --- Summary: Support state ttl in Python DataStream API Key: FLINK-22912 URL: https://issues.apache.org/jira/browse/FLINK-22912 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-22911) Align FLIP-136 (Improve interoperability between DataStream and Table API) in PyFlink Table API

2021-06-07 Thread Dian Fu (Jira)
Dian Fu created FLINK-22911: --- Summary: Align FLIP-136 (Improve interoperability between DataStream and Table API) in PyFlink Table API Key: FLINK-22911 URL: https://issues.apache.org/jira/browse/FLINK-22911

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread Xintong Song
I think being able to specify fine grained resource requirements without having to change the codes and recompile the job is indeed a good idea. It definitely improves the usability. However, this requires more careful designs, which probably deserves a separate thread. I'd be good to have that

Re: Re: Add control mode for flink

2021-06-07 Thread Yun Gao
Very thanks Jiangang for bringing this up and very thanks for the discussion! I also agree with the summarization by Xintong and Jing that control flow seems to be a common buidling block for many functionalities and dynamic configuration framework is a representative application that

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread Yangze Guo
@Wenlong After another consideration, the config option approach I mentioned above might not be appropriate. The resource requirements for SSG should be a job level configuration and should no be set in the flink-conf. I think we can define a JSON format, which would be the ResourceSpecs mapped

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread Yangze Guo
Thanks for the feedbacks, Xintong and Wenlong! @Wenlong I think that is a good idea, adjust the resource without re-compiling the job will facilitate the tuning process. We can define a pattern "slot-sharing-group.resource.{ssg name}" (welcome any proposal for the prefix naming) for the resource

[jira] [Created] (FLINK-22910) ShuffleMaster enhancement for pluggable shuffle service framework

2021-06-07 Thread Yingjie Cao (Jira)
Yingjie Cao created FLINK-22910: --- Summary: ShuffleMaster enhancement for pluggable shuffle service framework Key: FLINK-22910 URL: https://issues.apache.org/jira/browse/FLINK-22910 Project: Flink

Re: Add control mode for flink

2021-06-07 Thread kai wang
I'm big +1 for this feature. 1. Limit the input qps. 2. Change log level for debug. in my team, the two examples above are needed JING ZHANG 于2021年6月8日周二 上午11:18写道: > Thanks Jiangang for bringing this up. > As mentioned in Jiangang's email, `dynamic configuration framework` > provides

Apply for permission to edit wiki

2021-06-07 Thread Senhong Liu
Hello, My username is Senhong Liu (senhong...@gmail.com) and I want to apply for permission to propose a FLIP. Anyone who can help me? THX! Best, Senhong

Re: Add control mode for flink

2021-06-07 Thread JING ZHANG
Thanks Jiangang for bringing this up. As mentioned in Jiangang's email, `dynamic configuration framework` provides many useful functions in Kuaishou, because it could update job behavior without relaunching the job. The functions are very popular in Kuaishou, we also see similar demands in

[jira] [Created] (FLINK-22909) Supports change log inputs for event time operators

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22909: Summary: Supports change log inputs for event time operators Key: FLINK-22909 URL: https://issues.apache.org/jira/browse/FLINK-22909 Project: Flink Issue

Re: [DISCUSS] FLIP-169: DataStream API for Fine-Grained Resource Requirements

2021-06-07 Thread wenlong.lwl
Thanks Yangze for the flip, it is great for users to be able to declare the fine-grained resource requirements for the job. I have one minor suggestion: can we support setting resource requirements by configuration? Currently most of the config options in execution config can be configured by

[jira] [Created] (FLINK-22908) FileExecutionGraphInfoStoreTest.testPutSuspendedJobOnClusterShutdown fails on azure

2021-06-07 Thread Xintong Song (Jira)
Xintong Song created FLINK-22908: Summary: FileExecutionGraphInfoStoreTest.testPutSuspendedJobOnClusterShutdown fails on azure Key: FLINK-22908 URL: https://issues.apache.org/jira/browse/FLINK-22908

Re: Add control mode for flink

2021-06-07 Thread 刘建刚
Thanks Xintong Song for the detailed supplement. Since flink is long-running, it is similar to many services. So interacting with it or controlling it is a common desire. This was our initial thought when implementing the feature. In our inner flink, many configs used in yaml can be adjusted by

[DISCUSS] Limit size of already processed files in File Source SplitEnumerator

2021-06-07 Thread Tianxin Zhao
Hi! Currently Flink File Source relies on a Set pathsAlreadyProcessed in SplitEnumerator to decide which file has been processed and avoids reprocessing files if a file is already in this set. However this set could be ever growing and ultimately exceed memory size if there are new files

[jira] [Created] (FLINK-22907) SQL Client queries fails on select statement

2021-06-07 Thread Ryan Darling (Jira)
Ryan Darling created FLINK-22907: Summary: SQL Client queries fails on select statement Key: FLINK-22907 URL: https://issues.apache.org/jira/browse/FLINK-22907 Project: Flink Issue Type: Bug

Re: [DISCUSS] Watermark propagation with Sink API

2021-06-07 Thread Eron Wright
Piotr, David, and Arvid, we've had an expansive discussion but ultimately the proposal is narrow. It is: 1. When a watermark arrives at the sink operator, tell the sink function. 2. When the sink operator is idled, tell the sink function. With these enhancements, we will significantly improve

Re: [DISCUSS]FLIP-150: Introduce Hybrid Source

2021-06-07 Thread Arvid Heise
Sorry for joining the party so late, but it's such an interesting FLIP with a huge impact that I wanted to add my 2 cents. [1] I'm mirroring some basic question from the PR review to this thread because it's about the name: We could rename the thing to ConcatenatedSource(s), SourceSequence, or

[jira] [Created] (FLINK-22906) Add build time to Flink documentation

2021-06-07 Thread Seth Wiesman (Jira)
Seth Wiesman created FLINK-22906: Summary: Add build time to Flink documentation Key: FLINK-22906 URL: https://issues.apache.org/jira/browse/FLINK-22906 Project: Flink Issue Type:

[jira] [Created] (FLINK-22905) Versioned Table's SQL Script was missing a "," at Line 7 which yields Could not execute SQL statement ERROR

2021-06-07 Thread liuyan (Jira)
liuyan created FLINK-22905: -- Summary: Versioned Table's SQL Script was missing a "," at Line 7 which yields Could not execute SQL statement ERROR Key: FLINK-22905 URL: https://issues.apache.org/jira/browse/FLINK-22905

[jira] [Created] (FLINK-22904) Performance regression on 25.05.2020 in mapRebalanceMapSink

2021-06-07 Thread Piotr Nowojski (Jira)
Piotr Nowojski created FLINK-22904: -- Summary: Performance regression on 25.05.2020 in mapRebalanceMapSink Key: FLINK-22904 URL: https://issues.apache.org/jira/browse/FLINK-22904 Project: Flink

[jira] [Created] (FLINK-22903) Code of method xxx of class "StreamExecCalc$1248" grows beyond 64 KB

2021-06-07 Thread smith jayden (Jira)
smith jayden created FLINK-22903: Summary: Code of method xxx of class "StreamExecCalc$1248" grows beyond 64 KB Key: FLINK-22903 URL: https://issues.apache.org/jira/browse/FLINK-22903 Project: Flink

Re: recover from svaepoint

2021-06-07 Thread Piotr Nowojski
Hi, Thanks Tianxin and 周瑞' for reporting and tracking down the problem. Indeed that could be the reason behind it. Have either of you already created a JIRA ticket for this bug? > Concerning the required changing of the UID of an operator Piotr, is this a known issue and documented somewhere? I

[jira] [Created] (FLINK-22902) Port KafkaSink to FLIP-143

2021-06-07 Thread Arvid Heise (Jira)
Arvid Heise created FLINK-22902: --- Summary: Port KafkaSink to FLIP-143 Key: FLINK-22902 URL: https://issues.apache.org/jira/browse/FLINK-22902 Project: Flink Issue Type: Improvement

[jira] [Created] (FLINK-22901) Introduce getChangeLogUpsertKeys in FlinkRelMetadataQuery

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22901: Summary: Introduce getChangeLogUpsertKeys in FlinkRelMetadataQuery Key: FLINK-22901 URL: https://issues.apache.org/jira/browse/FLINK-22901 Project: Flink

Re: Add control mode for flink

2021-06-07 Thread Jark Wu
Thanks Xintong for the summary, I'm big +1 for this feature. Xintong's summary for Table/SQL's needs is correct. The "custom (broadcast) event" feature is important to us and even blocks further awesome features and optimizations in Table/SQL. I also discussed offline with @Yun Gao several

Re: [DISCUSS] Feedback Collection Jira Bot

2021-06-07 Thread Arvid Heise
One more idea for the bot. Could we have a label to exclude certain tickets from the life-cycle? I'm thinking about long-term tickets such as improving DataStream to eventually replace DataSet. We would collect ideas over the next couple of weeks without any visible progress on the

Re: [DISCUSS] Watermark propagation with Sink API

2021-06-07 Thread Arvid Heise
Hi Eron, you either have very specific use cases in mind or have a misconception about idleness in Flink with the new sources. The basic idea is that you have watermark generators only at the sources and the user supplies them. As a source author, you have no option to limit that. Here a bit of

[jira] [Created] (FLINK-22900) flink 1.11.2 fileSystem source table read fileSystem sink table path multi-partition error

2021-06-07 Thread bigdataf (Jira)
bigdataf created FLINK-22900: Summary: flink 1.11.2 fileSystem source table read fileSystem sink table path multi-partition error Key: FLINK-22900 URL: https://issues.apache.org/jira/browse/FLINK-22900

Re: [DISCUSS] Support specifying custom transactional.id prefix in FlinkKafkaProducer

2021-06-07 Thread Till Rohrmann
Thanks for starting this discussion Wenhao. I've given you permission to create a FLIP. Cheers, Till On Sat, Jun 5, 2021 at 9:48 AM Wenhao Ji wrote: > Hi everyone, > > Currently, the "transactional.id"s of the Kafka producers in > FlinkKafkaProducer are generated based on the task name. This

[jira] [Created] (FLINK-22899) ValuesUpsertSinkFunction needs to use global upsert

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22899: Summary: ValuesUpsertSinkFunction needs to use global upsert Key: FLINK-22899 URL: https://issues.apache.org/jira/browse/FLINK-22899 Project: Flink Issue

[jira] [Created] (FLINK-22898) HiveParallelismInference limit return wrong parallelism

2021-06-07 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-22898: Summary: HiveParallelismInference limit return wrong parallelism Key: FLINK-22898 URL: https://issues.apache.org/jira/browse/FLINK-22898 Project: Flink

[jira] [Created] (FLINK-22897) FlinkSQL1.12 Sink to Hive with diffrent parallelism will due to produce many small files

2021-06-07 Thread zhengjiewen (Jira)
zhengjiewen created FLINK-22897: --- Summary: FlinkSQL1.12 Sink to Hive with diffrent parallelism will due to produce many small files Key: FLINK-22897 URL: https://issues.apache.org/jira/browse/FLINK-22897

Re: Add control mode for flink

2021-06-07 Thread Xintong Song
Thanks Jiangang for bringing this up, and Steven & Peter for the feedback. I was part of the preliminary offline discussions before this proposal went public. So maybe I can help clarify things a bit. In short, despite the phrase "control mode" might be a bit misleading, what we truly want to do

[jira] [Created] (FLINK-22896) Flink sql supports parallelism at the operator level

2021-06-07 Thread Fangliang Liu (Jira)
Fangliang Liu created FLINK-22896: - Summary: Flink sql supports parallelism at the operator level Key: FLINK-22896 URL: https://issues.apache.org/jira/browse/FLINK-22896 Project: Flink Issue