[jira] [Commented] (HIVE-7115) Support a mechanism for running hive locally that doesnt require having a hadoop executable.

2014-06-09 Thread Sushanth Sowmyan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14025735#comment-14025735
 ] 

Sushanth Sowmyan commented on HIVE-7115:


Hi Jay,

There are two things I can think of that might help development of such a tool:

a) One possible place to look into for this is to look at the test framework 
section. For example, look at our TestCli framework over at 
ql/src/test/templates/TestCliDriver.vm and 
./ql/src/test/templates/TestNegativeCliDriver.vm which manage our .q and .q.out 
tests.
b) The other route I'd suggest is by directly launching a Hive Driver, as done 
in tests like 
hcatalog-pig-adapter/src/test/java/org/apache/hive/hcatalog/pig/TestE2EScenarios.java

 Support a mechanism for running hive locally that doesnt require having a 
 hadoop executable.
 

 Key: HIVE-7115
 URL: https://issues.apache.org/jira/browse/HIVE-7115
 Project: Hive
  Issue Type: Improvement
  Components: Testing Infrastructure, Tests
Reporter: jay vyas

 Mapreduce has a local mode by default, and likewise, tools such as pig and 
 SOLR do as well, maybe we can have a first class local mode for hive 
 also. 
 For local integration testing of a hadoop app, it would be nice if we could 
 fire up a local hive instance which didnt require bin/hadoop for running 
 local jobs.  This would allow us to maintain polyglot hadoop applications 
 much easier by incorporating hive into the integration tests.  For example:
 {noformat}
 LocalHiveInstance hive = new LocalHiveInstance();
 hive.set(course,crochet)l
 hive.runScript(hive_flow.ql)l
 {noformat} 
 Would essentially run a local hive query which mirrors
 {noformat}
 hive -f hive_flow.ql -hiveconf course=crochet
 {noformat{ 
 It seems like thee might be a simple way to do this, at least for small data 
 sets, by putting some kind of alternative (i.e. in memory) execution 
 environment under hive, if one is not already underway ?  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7115) Support a mechanism for running hive locally that doesnt require having a hadoop executable.

2014-06-09 Thread Sushanth Sowmyan (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14025745#comment-14025745
 ] 

Sushanth Sowmyan commented on HIVE-7115:


Oh, and one more thing : there is a miniHS2 implementation in the tests : 
hive-unit/src/main/java/org/apache/hive/jdbc/miniHS2/MiniHS2.java

I would recommend experimenting with this more than the other approaches, 
because it's cleaner from an integration standpoint. You can then use jdbc or 
beeline to connect to it.

 Support a mechanism for running hive locally that doesnt require having a 
 hadoop executable.
 

 Key: HIVE-7115
 URL: https://issues.apache.org/jira/browse/HIVE-7115
 Project: Hive
  Issue Type: Improvement
  Components: Testing Infrastructure, Tests
Reporter: jay vyas

 Mapreduce has a local mode by default, and likewise, tools such as pig and 
 SOLR do as well, maybe we can have a first class local mode for hive 
 also. 
 For local integration testing of a hadoop app, it would be nice if we could 
 fire up a local hive instance which didnt require bin/hadoop for running 
 local jobs.  This would allow us to maintain polyglot hadoop applications 
 much easier by incorporating hive into the integration tests.  For example:
 {noformat}
 LocalHiveInstance hive = new LocalHiveInstance();
 hive.set(course,crochet)l
 hive.runScript(hive_flow.ql)l
 {noformat} 
 Would essentially run a local hive query which mirrors
 {noformat}
 hive -f hive_flow.ql -hiveconf course=crochet
 {noformat{ 
 It seems like thee might be a simple way to do this, at least for small data 
 sets, by putting some kind of alternative (i.e. in memory) execution 
 environment under hive, if one is not already underway ?  



--
This message was sent by Atlassian JIRA
(v6.2#6252)


[jira] [Commented] (HIVE-7115) Support a mechanism for running hive locally that doesnt require having a hadoop executable.

2014-06-09 Thread Edward Capriolo (JIRA)

[ 
https://issues.apache.org/jira/browse/HIVE-7115?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=14025767#comment-14025767
 ] 

Edward Capriolo commented on HIVE-7115:
---

That would be really nice especially if it could be extended to dependent 
projects 
https://github.com/edwardcapriolo/hive_test requires lots of trickery to launch 
a hive process.

 Support a mechanism for running hive locally that doesnt require having a 
 hadoop executable.
 

 Key: HIVE-7115
 URL: https://issues.apache.org/jira/browse/HIVE-7115
 Project: Hive
  Issue Type: Improvement
  Components: Testing Infrastructure, Tests
Reporter: jay vyas

 Mapreduce has a local mode by default, and likewise, tools such as pig and 
 SOLR do as well, maybe we can have a first class local mode for hive 
 also. 
 For local integration testing of a hadoop app, it would be nice if we could 
 fire up a local hive instance which didnt require bin/hadoop for running 
 local jobs.  This would allow us to maintain polyglot hadoop applications 
 much easier by incorporating hive into the integration tests.  For example:
 {noformat}
 LocalHiveInstance hive = new LocalHiveInstance();
 hive.set(course,crochet)l
 hive.runScript(hive_flow.ql)l
 {noformat} 
 Would essentially run a local hive query which mirrors
 {noformat}
 hive -f hive_flow.ql -hiveconf course=crochet
 {noformat{ 
 It seems like thee might be a simple way to do this, at least for small data 
 sets, by putting some kind of alternative (i.e. in memory) execution 
 environment under hive, if one is not already underway ?  



--
This message was sent by Atlassian JIRA
(v6.2#6252)