Hi,

we recently added the "flink-contrib" module for user contributed tools etc.

On one of the last weekends, I've created a distributed tpch generator,
based on this libary: https://github.com/airlift/tpch (which is from a
PrestoDB developer and available on Maven central).

You can find my code here:
https://github.com/rmetzger/scratch/tree/distributed-tpch-generator

It contains two examples:
a) a full TPC data generator (as a flink program):
https://github.com/rmetzger/scratch/blob/distributed-tpch-generator/src/main/java/flink/generators/programs/TPCHGenerator.java

b) an example which generates two TPC-H tables on-the-fly to join them:
https://github.com/rmetzger/scratch/blob/distributed-tpch-generator/src/main/java/flink/generators/programs/TPCHGeneratorExample.java


Before I spend time on integrating it into the "flink-contrib" package, I
was wondering if the community is willing this contribution to Flink.


Best,
Robert

Reply via email to