Hi Felipe,

>> But you said, "before it hits the Source".

I did not say this. Vijay said it. About this question, he may not think
about customizing the source connector.

If he does not try to find a solution in the Flink domain. Why he asked
Flink questions and pasted Flink program?

IMO, It's just a matter of expression. WDYT?

Best,
Vino

Felipe Gutierrez <felipe.o.gutier...@gmail.com> 于2019年11月26日周二 下午5:16写道:

> Hi Vino,
>
> yes, in the source function it is possible. But you said, "before it hits
> the Source". So, IMO I think it is outside of the flink workflow.
> Best,
> Felipe
> *--*
> *-- Felipe Gutierrez*
>
> *-- skype: felipe.o.gutierrez*
> *--* *https://felipeogutierrez.blogspot.com
> <https://felipeogutierrez.blogspot.com>*
>
>
> On Tue, Nov 26, 2019 at 10:09 AM vino yang <yanghua1...@gmail.com> wrote:
>
>> Hi Felipe,
>>
>> Why do you think it's not possible.
>>
>> My thought is we can do the data pre-procession in the source function.
>> If so, source function would contain consume upstream events then do
>> pre-processing then emits to the downstream.
>>
>> Best,
>> Vino
>>
>>
>> Felipe Gutierrez <felipe.o.gutier...@gmail.com> 于2019年11月26日周二 下午4:56写道:
>>
>>> I am afraid that this is not possible in FLink, since the entry point of
>>> all transformation is the source function. Everything that we can
>>> pre-process is in the source function or on the downstream operators.
>>> If you want to pre-process something before the data hits the source you
>>> will have to rely on the broker/storage/queue that the source consumes your
>>> data, not in FLink.
>>>
>>> Best,
>>> Felipe
>>> *--*
>>> *-- Felipe Gutierrez*
>>>
>>> *-- skype: felipe.o.gutierrez*
>>> *--* *https://felipeogutierrez.blogspot.com
>>> <https://felipeogutierrez.blogspot.com>*
>>>
>>>
>>> On Tue, Nov 26, 2019 at 2:57 AM vino yang <yanghua1...@gmail.com> wrote:
>>>
>>>> Hi Vijay,
>>>>
>>>> IMO, the semantics of the source is not changeless. It can contain
>>>> integrate with third-party systems and consume events. However, it can also
>>>> contain more business logic about your data pre-process after consuming
>>>> events.
>>>>
>>>> Maybe it needs some customization. WDYT?
>>>>
>>>> Best,
>>>> Vino
>>>>
>>>> Vijay Balakrishnan <bvija...@gmail.com> 于2019年11月26日周二 上午6:45写道:
>>>>
>>>>> Hi,
>>>>> Need to pre-process data(transform incoming data to a different
>>>>> format) before it hits the Source I have defined. How can I do that ?
>>>>>
>>>>> I tried to use a .map on the DataStream but that is too late as the
>>>>> data has already hit the Source I defined.
>>>>> FlinkKinesisConsumer<Map<String, Object>> kinesisConsumer =
>>>>> getMonitoringFlinkKinesisConsumer(local, localKinesis, kinesisTopicRead,
>>>>> region, getRecsMax, getRecsIntervalMs, connectionTimeout, maxConnections,
>>>>> socketTimeout);
>>>>> DataStreamSource<Map<String, Object>> monitoringDataStreamSource =
>>>>> env.addSource(kinesisConsumer);
>>>>>
>>>>> DataStream<Map<String, Object>> kinesisStream1 = kinesisStream.map(new
>>>>> TransformFunction(...));//too late here
>>>>>
>>>>> TIA,
>>>>>
>>>>

Reply via email to