> https://github.com/hortonworks/hive-testbench > > The official procedure to generate and upload the data has never worked >for me (and it looks like it's not a supported software), so it could be >a bit tricky to do it manually and on a single host.
I wrote the MapReduce jobs for that (tpcds-gen/tpch-gen) after waiting a whole weekend for 1Tb of data to be generated on a single machine. If you or anyone else has issues with it, I can take a look at it. Cheers, Gopal