Hello, Tez experts:
I understand that Tez is designed for DAG-style jobs.
Because it can avoid writing intermediate results to disk
and supports container reuse, it is more efficient than MR when processing
small amounts of data. So I would expect Hive on Tez to be better than Hive on MR
for processing small amounts of data. Am I right?
Well, now, my questions are:
(1) Even though the main design themes are listed at https://tez.apache.org/ , I am
still not very clear about its application scenarios. If you can point to some real,
major enterprise applications, so much the better.
(2) I am still not very clear about what problem it is mainly meant to solve.
(3) Why is it used for Hive and Pig? How is it better than Spark or MR?
(4) I looked at your official slides and the paper "Apache Tez: A Unifying Framework
for Modeling and Building Data Processing Applications", but it is still not very
clear to me.
How should I understand this: "Don't solve problems that have already been solved.
Or else you will have to solve them again!"? Is there a real example?
Apache Tez is a great product, and I hope to learn more about it.
Any reply is much appreciated.
Thank you & Best Regards.
---LLBian