[
https://issues.apache.org/jira/browse/HIVE-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426345#comment-13426345
]
rohithsharma commented on HIVE-3324:
------------------------------------
There are 2 problems for analyze command.
1. JDBCStastPublisher fails to connect to database since derby jar is not in
classpath.In YarnChild logs we get below exception.
{noformat}
2012-08-01 10:25:17,954 ERROR [main]
org.apache.hadoop.hive.ql.stats.jdbc.JDBCStatsPublisher: Error during
instantiating JDBC driver org.apache.derby.jdbc.EmbeddedDriver.
java.lang.ClassNotFoundException: org.apache.derby.jdbc.EmbeddedDriver
at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
at java.security.AccessController.doPrivileged(Native Method)
at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
at java.lang.Class.forName0(Native Method)
at java.lang.Class.forName(Class.java:169)
at
org.apache.hadoop.hive.ql.stats.jdbc.JDBCStatsPublisher.connect(JDBCStatsPublisher.java:69)
at
org.apache.hadoop.hive.ql.exec.TableScanOperator.publishStats(TableScanOperator.java:236)
{noformat}
>>> Above problem can be resolved by setting "hive.aux.jars.path" for derby jar.
2. JVM of Hive and YarnChild are different.So dbconnectionstring should be
common location.If it is set to common location , then there is problme if the
NodeManager and Hive are running in differetn machine.
> analyze command is not gathering "num_rows" present in the table.
> -----------------------------------------------------------------
>
> Key: HIVE-3324
> URL: https://issues.apache.org/jira/browse/HIVE-3324
> Project: Hive
> Issue Type: Bug
> Components: Statistics
> Affects Versions: 0.10.0, 0.9.1
> Reporter: rohithsharma
>
> When analyze command is executed, "collectableStats" i.e num_rows and
> raw_data_size is always zero even though table contains data.
> bq. [num_partitions: 0, num_files: 1, num_rows: 0, total_size: 5812,
> raw_data_size: 0]
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira