Hi,
i want to know how costly below case:
1 creat a temp tabl/cache dynamically. Table only has 3 or 4 columns.
2 insert millions rows data by mapreduce.
Each row data is small. Like each column less than 20 bytes.
3 Then do some simple aggregation query.
4 drop the cache.
Can above operation finish in less one second? Which steps is most costly?
Suppose run on a one-node cluster,8 cores,memory is enough.
Thanks.
