Hi Tianyi, thanks for contacting us!
Could you elaborate the biggest problems you are facing with this design? As we are moving to ASF, you can ask questions here [email protected]. I think for questions regarding design decisions and future improvement, +Dimitris and +Henry knows better. Huaisi From: 何天一 <[email protected]> Date: Monday, July 11, 2016 at 10:42 PM To: Huaisi Xu <[email protected]> Subject: Looking for OLAP suggestion Hi, Huaisi. We communicated before in cloudera JIRA (IMPALA-3499). I am currently working on distributed storage and computing, including OLAP engines, for 今日头条. I am looking for technical suggestions and hope you could help. I see that Impala Catalogd caches metadata from Hive Metastore and HDFS (or other storage). IMHO This can be considered as a good optimization for performance. However, in our production environment, this mechanism tend to cause problem. Could you help to explain the design choice behind this? Why did Impala cache meta in the first place? And, is there any optimization in progress to make the mechanism better? Thanks. -- Cheers, Tianyi HE (+86) 185 0042 4096
