Hi Averell,

Did you know Flink allow you to customize a partitioner?

Some resource :

official documentation :
https://ci.apache.org/projects/flink/flink-docs-release-1.5/dev/stream/operators/#physical-partitioning
discussing in mailing list :
http://apache-flink-user-mailing-list-archive.2336050.n4.nabble.com/keyBy-using-custom-partitioner-td5379.html
a code example : https://gist.github.com/chiwanpark/e71d27cc8edae8bc7298

Thanks, vino.


2018-07-30 18:57 GMT+08:00 Averell <lvhu...@gmail.com>:

> Hi everyone,
>
> We are collecting log files from tens of thousands of network nodes, and we
> need to do some data insights using that. The files are coming with the
> corresponding node ID in the file name, and I want to do custom
> partitioning
> using that Node ID.
> Right now (with Flink 1.5) I think that is not supported. I have been
> trying
> to look into the code, but it would take some time for me to understand.
> From the GUI, it looks like the first step of file source (directory
> monitoring) is rebalancing the stream to the 2nd step (file reader). And as
> per Flink document, rebalancing means round-robin. However, I could not
> find
> the call of "rebalancing" method, but "transform" is called. Not much
> information about that "transform" method though.
>
> Would it possible for me to ask for some guideline on this?
>
> Thanks for your help.
> Averell
>
>
>
> --
> Sent from: http://apache-flink-user-mailing-list-archive.2336050.
> n4.nabble.com/
>

Reply via email to