Hi, I have a Presto (322) + Hive (2.3.4) + S3 setup that works fine. A simple "select count(*) from mydb.mytable" (underneath data is a single 52KB parquet file on S3) takes 200ms to complete.
I needed fix for https://issues.apache.org/jira/browse/HIVE-21631 so I dropped in a locally built (from HEAD of master) Hive 4.0. No change to config under /etc/hive. Now the same query took 20s to complete. Attached is log from hive-metastore.log during the execution of the query. Any suggestions on direction of investigation? -Bowen
@^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.010689] 2020-03-09T19:35:32,010 INFO [pool-5-thread-1] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=source:10.133.53.23 get_table : tbl=hive.mydb.mytable @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.011562] 2020-03-09T19:35:32,011 INFO [pool-5-thread-1] metastore.HiveMetaStore: 1: Opening raw store with implementation class:org.apache.hadoop.hive.metastore.ObjectStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.011743] 2020-03-09T19:35:32,011 INFO [pool-5-thread-1] metastore.PersistenceManagerProvider: Configuration datanucleus.autoStartMechanismMode is not set. Defaulting to 'ignored' @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.013398] 2020-03-09T19:35:32,013 INFO [pool-5-thread-1] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@20feeaf1, with PersistenceManager: null will be shutdown @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.013703] 2020-03-09T19:35:32,013 INFO [pool-5-thread-1] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@20feeaf1, with PersistenceManager: org.datanucleus.api.jdo.JDOPersistenceManager@5312ffac created in the thread with id: 54 @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.029448] 2020-03-09T19:35:32,029 INFO [pool-5-thread-1] metastore.HiveMetaStore: Created RawStore: org.apache.hadoop.hive.metastore.ObjectStore@20feeaf1 from thread id: 54 @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.172348] 2020-03-09T19:35:32,172 INFO [pool-5-thread-1] metastore.HiveMetaStore: Skipping translation for processor with null @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.178310] 2020-03-09T19:35:32,178 INFO [pool-5-thread-1] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Cleaning up thread local RawStore... @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.178356] 2020-03-09T19:35:32,178 INFO [pool-5-thread-1] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@20feeaf1, with PersistenceManager: org.datanucleus.api.jdo.JDOPersistenceManager@5312ffac will be shutdown @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.178678] 2020-03-09T19:35:32,178 INFO [pool-5-thread-1] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.192622] 2020-03-09T19:35:32,192 INFO [pool-5-thread-2] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=source:10.133.53.23 get_table : tbl=hive.mydb.mytable @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.193197] 2020-03-09T19:35:32,193 INFO [pool-5-thread-2] metastore.HiveMetaStore: 2: Opening raw store with implementation class:org.apache.hadoop.hive.metastore.ObjectStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.193348] 2020-03-09T19:35:32,193 INFO [pool-5-thread-2] metastore.PersistenceManagerProvider: Configuration datanucleus.autoStartMechanismMode is not set. Defaulting to 'ignored' @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.194672] 2020-03-09T19:35:32,194 INFO [pool-5-thread-2] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@6e841ca1, with PersistenceManager: null will be shutdown @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.194948] 2020-03-09T19:35:32,194 INFO [pool-5-thread-2] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@6e841ca1, with PersistenceManager: org.datanucleus.api.jdo.JDOPersistenceManager@384a5d60 created in the thread with id: 67 @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.200707] 2020-03-09T19:35:32,200 INFO [pool-5-thread-2] metastore.HiveMetaStore: Created RawStore: org.apache.hadoop.hive.metastore.ObjectStore@6e841ca1 from thread id: 67 @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.223618] 2020-03-09T19:35:32,223 INFO [pool-5-thread-2] metastore.HiveMetaStore: Skipping translation for processor with null @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.224252] 2020-03-09T19:35:32,224 INFO [pool-5-thread-2] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Cleaning up thread local RawStore... @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.224292] 2020-03-09T19:35:32,224 INFO [pool-5-thread-2] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@6e841ca1, with PersistenceManager: org.datanucleus.api.jdo.JDOPersistenceManager@384a5d60 will be shutdown @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.224461] 2020-03-09T19:35:32,224 INFO [pool-5-thread-2] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:32.228660] 2020-03-09T19:35:32,228 INFO [pool-5-thread-3] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:33.292074] 2020-03-09T19:35:33,291 INFO [pool-5-thread-4] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:34.312581] 2020-03-09T19:35:34,312 INFO [pool-5-thread-5] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:35.405902] 2020-03-09T19:35:35,405 INFO [pool-5-thread-6] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:36.451363] 2020-03-09T19:35:36,451 INFO [pool-5-thread-7] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:37.550814] 2020-03-09T19:35:37,550 INFO [pool-5-thread-8] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:38.566096] 2020-03-09T19:35:38,565 INFO [pool-5-thread-9] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:39.619569] 2020-03-09T19:35:39,619 INFO [pool-5-thread-10] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:40.681764] 2020-03-09T19:35:40,681 INFO [pool-5-thread-11] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.748134] 2020-03-09T19:35:41,747 INFO [pool-5-thread-12] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.750212] 2020-03-09T19:35:41,750 INFO [pool-5-thread-13] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=source:10.133.53.23 get_table : tbl=hive.mydb.mytable @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.751247] 2020-03-09T19:35:41,751 INFO [pool-5-thread-13] metastore.HiveMetaStore: 13: Opening raw store with implementation class:org.apache.hadoop.hive.metastore.ObjectStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.751426] 2020-03-09T19:35:41,751 INFO [pool-5-thread-13] metastore.PersistenceManagerProvider: Configuration datanucleus.autoStartMechanismMode is not set. Defaulting to 'ignored' @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.753243] 2020-03-09T19:35:41,753 INFO [pool-5-thread-13] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@711b9f47, with PersistenceManager: null will be shutdown @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.753654] 2020-03-09T19:35:41,753 INFO [pool-5-thread-13] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@711b9f47, with PersistenceManager: org.datanucleus.api.jdo.JDOPersistenceManager@44a96a48 created in the thread with id: 78 @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.770129] 2020-03-09T19:35:41,769 INFO [pool-5-thread-13] metastore.HiveMetaStore: Created RawStore: org.apache.hadoop.hive.metastore.ObjectStore@711b9f47 from thread id: 78 @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.797403] 2020-03-09T19:35:41,797 INFO [pool-5-thread-13] metastore.HiveMetaStore: Skipping translation for processor with null @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.798270] 2020-03-09T19:35:41,798 INFO [pool-5-thread-13] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Cleaning up thread local RawStore... @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.798311] 2020-03-09T19:35:41,798 INFO [pool-5-thread-13] metastore.ObjectStore: RawStore: org.apache.hadoop.hive.metastore.ObjectStore@711b9f47, with PersistenceManager: org.datanucleus.api.jdo.JDOPersistenceManager@44a96a48 will be shutdown @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.798442] 2020-03-09T19:35:41,798 INFO [pool-5-thread-13] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:41.799294] 2020-03-09T19:35:41,799 INFO [pool-5-thread-14] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:42.803847] 2020-03-09T19:35:42,803 INFO [pool-5-thread-15] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:43.876133] 2020-03-09T19:35:43,876 INFO [pool-5-thread-16] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:44.943427] 2020-03-09T19:35:44,943 INFO [pool-5-thread-17] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:45.947814] 2020-03-09T19:35:45,947 INFO [pool-5-thread-18] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:47.001007] 2020-03-09T19:35:47,000 INFO [pool-5-thread-19] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:48.012187] 2020-03-09T19:35:48,012 INFO [pool-5-thread-20] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:49.097519] 2020-03-09T19:35:49,097 INFO [pool-5-thread-21] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:50.135673] 2020-03-09T19:35:50,135 INFO [pool-5-thread-22] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore @^ r=reporting-data-platform a=unilog-host $@[2020-03-09 19:35:51.194302] 2020-03-09T19:35:51,194 INFO [pool-5-thread-23] HiveMetaStore.audit: ugi=root ip=10.133.53.23 cmd=Done cleaning up thread local RawStore