Various issues when using MS-SQL2005 as the Hive Metastore
----------------------------------------------------------

                 Key: HIVE-1391
                 URL: https://issues.apache.org/jira/browse/HIVE-1391
             Project: Hadoop Hive
          Issue Type: Bug
          Components: Metastore
    Affects Versions: 0.5.0
            Reporter: Alex Rovner
         Attachments: hive-trace.txt

When I tried to use MS-SQL 2005 as the Hive metastore, I ran into numerous issues.

My configuration:

<property>
  <name>javax.jdo.option.ConnectionURL</name>
  <value>jdbc:sqlserver://cwdbint05:1445;DatabaseName=HiveMetastore;</value>
  <description>JDBC connect string for a JDBC metastore</description>
</property>

<property>
  <name>javax.jdo.option.ConnectionDriverName</name>
  <value>com.microsoft.sqlserver.jdbc.SQLServerDriver</value>
  <description>Driver class name for a JDBC metastore</description>
</property>

<property>
  <name>javax.jdo.option.ConnectionUserName</name>
  <value>HiveUser</value>
  <description>username to use against metastore database</description>
</property>

<property>
  <name>javax.jdo.option.ConnectionPassword</name>
  <value>XXXXXXXXXXXXXX</value>
  <description>password to use against metastore database</description>
</property>

<property>
  <name>datanucleus.autoCreateSchema</name>
  <value>true</value>
  <description>creates necessary schema on a startup if one doesn't exist. set this to false, after creating it once</description>
</property>


Hive user has full rights to the HiveMetastore DB.
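(For reference, this is roughly how the grants can be double-checked from the SQL Server side while connected as HiveUser; just a sketch using the built-in fn_my_permissions function:)

USE HiveMetastore;
-- list every database-level permission the current login effectively holds
SELECT permission_name
FROM fn_my_permissions(NULL, 'DATABASE')
ORDER BY permission_name;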
---------------------------------------------------------------------

When launching Hive on the command line and executing "show tables;" I got the following:

FAILED: Error in metadata: javax.jdo.JDOFatalInternalException: Error creating transactional connection factory
NestedThrowables:
java.lang.reflect.InvocationTargetException
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask


When launching Hive through the Java API (org.apache.commons.cli.CommandLine), the auto-create kicked in but failed with the following (full stack trace attached to the ticket):

[2010-06-04 09:22:11,817] ERROR (Log4JLogger.java:115) - Error thrown executing ALTER TABLE COLUMNS ADD TYPE_NAME varchar(128) NOT NULL : Cannot find the object "COLUMNS" because it does not exist or you do not have permissions.
com.microsoft.sqlserver.jdbc.SQLServerException: Cannot find the object "COLUMNS" because it does not exist or you do not have permissions.
        at com.microsoft.sqlserver.jdbc.SQLServerException.makeFromDatabaseError(SQLServerException.java:196)
        at com.microsoft.sqlserver.jdbc.SQLServerStatement.getNextResult(SQLServerStatement.java:1454)
        at com.microsoft.sqlserver.jdbc.SQLServerStatement.doExecuteStatement(SQLServerStatement.java:786)
        at com.microsoft.sqlserver.jdbc.SQLServerStatement$StmtExecCmd.doExecute(SQLServerStatement.java:685)
        at com.microsoft.sqlserver.jdbc.TDSCommand.execute(IOBuffer.java:4026)
        at com.microsoft.sqlserver.jdbc.SQLServerConnection.executeCommand(SQLServerConnection.java:1416)
        at com.microsoft.sqlserver.jdbc.SQLServerStatement.executeCommand(SQLServerStatement.java:185)
        at com.microsoft.sqlserver.jdbc.SQLServerStatement.executeStatement(SQLServerStatement.java:160)
        at com.microsoft.sqlserver.jdbc.SQLServerStatement.execute(SQLServerStatement.java:658)
        at org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatement(AbstractTable.java:730)
        at org.datanucleus.store.rdbms.table.AbstractTable.executeDdlStatementList(AbstractTable.java:681)
        at org.datanucleus.store.rdbms.table.TableImpl.validateColumns(TableImpl.java:261)
        at org.datanucleus.store.rdbms.RDBMSManager$ClassAdder.performTablesValidation(RDBMSManager.java:2794)
        at org.datanucleus.store.rdbms.RDBMSManager$ClassAdder.addClassTablesAndValidate(RDBMSManager.java:2595)
        at org.datanucleus.store.rdbms.RDBMSManager$ClassAdder.run(RDBMSManager.java:2241)
        at org.datanucleus.store.rdbms.AbstractSchemaTransaction.execute(AbstractSchemaTransaction.java:113)
        at org.datanucleus.store.rdbms.RDBMSManager.addClasses(RDBMSManager.java:994)
        at org.datanucleus.store.rdbms.RDBMSManager.addClasses(RDBMSManager.java:960)
        at org.datanucleus.store.AbstractStoreManager.addClass(AbstractStoreManager.java:691)
        at org.datanucleus.store.mapped.MappedStoreManager.getDatastoreClass(MappedStoreManager.java:358)
        at org.datanucleus.store.rdbms.RDBMSManager.getExtent(RDBMSManager.java:1344)
        at org.datanucleus.ObjectManagerImpl.getExtent(ObjectManagerImpl.java:3736)
        at org.datanucleus.store.rdbms.query.JDOQLQueryCompiler.compileCandidates(JDOQLQueryCompiler.java:411)
        at org.datanucleus.store.rdbms.query.QueryCompiler.executionCompile(QueryCompiler.java:312)
        at org.datanucleus.store.rdbms.query.JDOQLQueryCompiler.compile(JDOQLQueryCompiler.java:225)
        at org.datanucleus.store.rdbms.query.JDOQLQuery.compileInternal(JDOQLQuery.java:174)
        at org.datanucleus.store.query.Query.executeQuery(Query.java:1443)
        at org.datanucleus.store.rdbms.query.JDOQLQuery.executeQuery(JDOQLQuery.java:244)
        at org.datanucleus.store.query.Query.executeWithArray(Query.java:1357)
        at org.datanucleus.jdo.JDOQuery.execute(JDOQuery.java:265)
        at org.apache.hadoop.hive.metastore.ObjectStore.getMTable(ObjectStore.java:551)
        at org.apache.hadoop.hive.metastore.ObjectStore.getTable(ObjectStore.java:494)
        at org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.get_table(HiveMetaStore.java:397)
        at org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.drop_table(HiveMetaStore.java:353)
        at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.dropTable(HiveMetaStoreClient.java:340)
        at org.apache.hadoop.hive.ql.metadata.Hive.dropTable(Hive.java:308)
        at org.apache.hadoop.hive.ql.metadata.Hive.dropTable(Hive.java:293)
        at com.contextweb.hive.lookup.LookupDumper.createTable(LookupDumper.java:179)
        at com.contextweb.hive.lookup.LookupDumper.execute(LookupDumper.java:122)
        at com.contextweb.hive.cli.QueryCommand.runImpl(QueryCommand.java:142)
        at com.contextweb.hive.cli.HiveSessionAwareCommand.run(HiveSessionAwareCommand.java:48)
        at com.contextweb.hive.cli.Launcher.exec(Launcher.java:78)
        at com.contextweb.hive.cli.Launcher.main(Launcher.java:46)


At this point I looked at what Hive had done to my DB and saw that it had created the following tables:
DBS
NUCLEUS_TABLES
SEQUENCE_TABLE

The table COLUMNS does not exist, so the ALTER statement fails (which makes sense).
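(For anyone reproducing this, the tables DataNucleus actually created can be listed with a standard catalog query; a sketch, assuming everything lives in the default dbo schema:)

USE HiveMetastore;
-- show which metastore tables exist so far
SELECT TABLE_NAME
FROM INFORMATION_SCHEMA.TABLES
WHERE TABLE_TYPE = 'BASE TABLE'
ORDER BY TABLE_NAME;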

So I went ahead and created the table with the needed column:
CREATE TABLE COLUMNS (TYPE_NAME varchar(128) NOT NULL)

When I ran Hive with the CLI, the auto-create managed to complete creation this time, but during the run it failed with the following:

[2010-06-04 09:54:38,787] INFO  (SemanticAnalyzer.java:5399) - Creating tablelookup_CampaignId positin=22
FAILED: Error in metadata: javax.jdo.JDODataStoreException: Add request failed : INSERT INTO COLUMNS (SD_ID,COMMENT,"COLUMN_NAME",TYPE_NAME,INTEGER_IDX) VALUES (?,?,?,?,?)
NestedThrowables:
java.sql.BatchUpdateException: Invalid column name 'COLUMN_NAME'.
[2010-06-04 09:54:39,158] ERROR (SessionState.java:248) - FAILED: Error in metadata: javax.jdo.JDODataStoreException: Add request failed : INSERT INTO COLUMNS (SD_ID,COMMENT,"COLUMN_NAME",TYPE_NAME,INTEGER_IDX) VALUES (?,?,?,?,?)
NestedThrowables:
java.sql.BatchUpdateException: Invalid column name 'COLUMN_NAME'.
org.apache.hadoop.hive.ql.metadata.HiveException: javax.jdo.JDODataStoreException: Add request failed : INSERT INTO COLUMNS (SD_ID,COMMENT,"COLUMN_NAME",TYPE_NAME,INTEGER_IDX) VALUES (?,?,?,?,?)
NestedThrowables:
java.sql.BatchUpdateException: Invalid column name 'COLUMN_NAME'.
        at org.apache.hadoop.hive.ql.metadata.Hive.createTable(Hive.java:281)
        at org.apache.hadoop.hive.ql.exec.DDLTask.createTable(DDLTask.java:1281)
        at org.apache.hadoop.hive.ql.exec.DDLTask.execute(DDLTask.java:119)
        at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:99)
        at com.contextweb.hive.session.HiveQuery.exec(HiveQuery.java:112)
        at com.contextweb.hive.lookup.LookupDumper.createTable(LookupDumper.java:201)
        at com.contextweb.hive.lookup.LookupDumper.execute(LookupDumper.java:122)
        at com.contextweb.hive.cli.QueryCommand.runImpl(QueryCommand.java:142)
        at com.contextweb.hive.cli.HiveSessionAwareCommand.run(HiveSessionAwareCommand.java:48)
        at com.contextweb.hive.cli.Launcher.exec(Launcher.java:78)
        at com.contextweb.hive.cli.Launcher.main(Launcher.java:46)
Caused by: javax.jdo.JDODataStoreException: Add request failed : INSERT INTO COLUMNS (SD_ID,COMMENT,"COLUMN_NAME",TYPE_NAME,INTEGER_IDX) VALUES (?,?,?,?,?)
NestedThrowables:
java.sql.BatchUpdateException: Invalid column name 'COLUMN_NAME'.
        at org.datanucleus.jdo.NucleusJDOHelper.getJDOExceptionForNucleusException(NucleusJDOHelper.java:289)
        at org.datanucleus.jdo.JDOPersistenceManager.jdoMakePersistent(JDOPersistenceManager.java:673)
        at org.datanucleus.jdo.JDOPersistenceManager.makePersistent(JDOPersistenceManager.java:693)
        at org.apache.hadoop.hive.metastore.ObjectStore.createTable(ObjectStore.java:458)
        at org.apache.hadoop.hive.metastore.HiveMetaStore$HMSHandler.create_table(HiveMetaStore.java:321)
        at org.apache.hadoop.hive.metastore.HiveMetaStoreClient.createTable(HiveMetaStoreClient.java:254)
        at org.apache.hadoop.hive.ql.metadata.Hive.createTable(Hive.java:275)
        ... 10 more
Caused by: java.sql.BatchUpdateException: Invalid column name 'COLUMN_NAME'.
        at com.microsoft.sqlserver.jdbc.SQLServerPreparedStatement.executeBatch(SQLServerPreparedStatement.java:1132)
        at org.datanucleus.store.rdbms.SQLController.processConnectionStatement(SQLController.java:573)
        at org.datanucleus.store.rdbms.SQLController.executeStatementUpdate(SQLController.java:366)
        at org.datanucleus.store.rdbms.scostore.RDBMSJoinListStoreSpecialization.internalAdd(RDBMSJoinListStoreSpecialization.java:425)
        at org.datanucleus.store.mapped.scostore.JoinListStore.internalAdd(JoinListStore.java:239)
        at org.datanucleus.store.mapped.scostore.AbstractListStore.addAll(AbstractListStore.java:128)
        at org.datanucleus.store.mapped.mapping.CollectionMapping.postInsert(CollectionMapping.java:157)
        at org.datanucleus.store.rdbms.request.InsertRequest.execute(InsertRequest.java:515)
        at org.datanucleus.store.rdbms.RDBMSPersistenceHandler.insertTable(RDBMSPersistenceHandler.java:200)
        at org.datanucleus.store.rdbms.RDBMSPersistenceHandler.insertObject(RDBMSPersistenceHandler.java:179)
        at org.datanucleus.state.JDOStateManagerImpl.internalMakePersistent(JDOStateManagerImpl.java:3097)
        at org.datanucleus.state.JDOStateManagerImpl.makePersistent(JDOStateManagerImpl.java:3073)
        at org.datanucleus.ObjectManagerImpl.persistObjectInternal(ObjectManagerImpl.java:1280)
        at org.datanucleus.store.mapped.mapping.PersistenceCapableMapping.setObjectAsValue(PersistenceCapableMapping.java:604)
        at org.datanucleus.store.mapped.mapping.PersistenceCapableMapping.setObject(PersistenceCapableMapping.java:364)
        at org.datanucleus.store.rdbms.fieldmanager.ParameterSetter.storeObjectField(ParameterSetter.java:197)
        at org.datanucleus.state.AbstractStateManager.providedObjectField(AbstractStateManager.java:1011)
        at org.apache.hadoop.hive.metastore.model.MTable.jdoProvideField(MTable.java)
        at org.apache.hadoop.hive.metastore.model.MTable.jdoProvideFields(MTable.java)
        at org.datanucleus.state.JDOStateManagerImpl.provideFields(JDOStateManagerImpl.java:2627)
        at org.datanucleus.store.rdbms.request.InsertRequest.execute(InsertRequest.java:294)
        at org.datanucleus.store.rdbms.RDBMSPersistenceHandler.insertTable(RDBMSPersistenceHandler.java:200)
        at org.datanucleus.store.rdbms.RDBMSPersistenceHandler.insertObject(RDBMSPersistenceHandler.java:179)
        at org.datanucleus.state.JDOStateManagerImpl.internalMakePersistent(JDOStateManagerImpl.java:3097)
        at org.datanucleus.state.JDOStateManagerImpl.makePersistent(JDOStateManagerImpl.java:3073)
        at org.datanucleus.ObjectManagerImpl.persistObjectInternal(ObjectManagerImpl.java:1280)
        at org.datanucleus.ObjectManagerImpl.persistObject(ObjectManagerImpl.java:1157)
        at org.datanucleus.jdo.JDOPersistenceManager.jdoMakePersistent(JDOPersistenceManager.java:668)
        ... 15 more


It seems the auto-create forgot to create the column "COLUMN_NAME".

I again ran the needed command manually in my DB:
ALTER TABLE COLUMNS ADD COLUMN_NAME varchar(256) NOT NULL
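(A quick way to confirm the column is now in place is a standard catalog query; again just a sketch:)

USE HiveMetastore;
-- list the columns that now exist on the metastore COLUMNS table
SELECT COLUMN_NAME, DATA_TYPE, CHARACTER_MAXIMUM_LENGTH, IS_NULLABLE
FROM INFORMATION_SCHEMA.COLUMNS
WHERE TABLE_NAME = 'COLUMNS'
ORDER BY ORDINAL_POSITION;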

At this point I was able to run Hive through the CLI successfully, but running "show tables;" from the command line still gives me:
FAILED: Error in metadata: javax.jdo.JDOFatalInternalException: Error creating transactional connection factory
NestedThrowables:
java.lang.reflect.InvocationTargetException
FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.DDLTask

Please contact me if you need further information.
