[ https://issues.apache.org/jira/browse/SPARK-6241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Michael Armbrust resolved SPARK-6241.
-------------------------------------
    Resolution: Won't Fix

Datasource tables have their own mechanism for reporting statistics that does not rely on "ANALYZE". Please reopen if that is not working for you.

> hiveql ANALYZE TABLE doesn't work for external tables
> -----------------------------------------------------
>
>                 Key: SPARK-6241
>                 URL: https://issues.apache.org/jira/browse/SPARK-6241
>             Project: Spark
>          Issue Type: Bug
>          Components: SQL
>    Affects Versions: 1.3.0
>            Reporter: Kai Zeng
>            Priority: Critical
>
> "ANALYZE TABLE" does not collect statistics for external tables, but works well for tables created by "CREATE AS SELECT".
> Also tried to use "refresh table" to refresh metadata cache, but got NullPointer error:
> java.util.concurrent.ExecutionException: java.lang.NullPointerException
>         at com.google.common.util.concurrent.AbstractFuture$Sync.getValue(AbstractFuture.java:299)
>         at com.google.common.util.concurrent.AbstractFuture$Sync.get(AbstractFuture.java:286)
>         at com.google.common.util.concurrent.AbstractFuture.get(AbstractFuture.java:116)
>         at com.google.common.util.concurrent.Uninterruptibles.getUninterruptibly(Uninterruptibles.java:135)
>         at com.google.common.cache.LocalCache$Segment.getAndRecordStats(LocalCache.java:2344)
>         at com.google.common.cache.LocalCache$Segment$1.run(LocalCache.java:2327)
>         at com.google.common.util.concurrent.MoreExecutors$SameThreadExecutorService.execute(MoreExecutors.java:297)
>         at com.google.common.util.concurrent.ExecutionList.executeListener(ExecutionList.java:156)
>         at com.google.common.util.concurrent.ExecutionList.add(ExecutionList.java:101)
>         at com.google.common.util.concurrent.AbstractFuture.addListener(AbstractFuture.java:170)
>         at com.google.common.cache.LocalCache$Segment.loadAsync(LocalCache.java:2322)
>         at com.google.common.cache.LocalCache$Segment.refresh(LocalCache.java:2385)
>         at com.google.common.cache.LocalCache.refresh(LocalCache.java:4085)
>         at com.google.common.cache.LocalCache$LocalLoadingCache.refresh(LocalCache.java:4825)
>         at org.apache.spark.sql.hive.HiveMetastoreCatalog.refreshTable(HiveMetastoreCatalog.scala:108)
>         at org.apache.spark.sql.sources.RefreshTable.run(ddl.scala:404)
>         at org.apache.spark.sql.execution.ExecutedCommand.sideEffectResult$lzycompute(commands.scala:55)
>         at org.apache.spark.sql.execution.ExecutedCommand.sideEffectResult(commands.scala:55)
>         at org.apache.spark.sql.execution.ExecutedCommand.execute(commands.scala:65)
>         at org.apache.spark.sql.SQLContext$QueryExecution.toRdd$lzycompute(SQLContext.scala:1092)
>         at org.apache.spark.sql.SQLContext$QueryExecution.toRdd(SQLContext.scala:1092)
>         at org.apache.spark.sql.DataFrame.<init>(DataFrame.scala:134)
>         at org.apache.spark.sql.DataFrame.<init>(DataFrame.scala:117)
>         at org.apache.spark.sql.DataFrame$.apply(DataFrame.scala:51)
>         at org.apache.spark.sql.hive.HiveContext.sql(HiveContext.scala:92)