We can write a small example program to get files for a table/partition. To open a table using deserializer and get rows from it etc.
This would help people write java map-reduce on hive tables. ________________________________ From: Zheng Shao [mailto:[email protected]] Sent: Tuesday, February 24, 2009 5:38 PM To: [email protected] Subject: Re: How to simplify our development flow under the means of using Hive? Please take a look at: http://wiki.apache.org/hadoop/Hive/LanguageManual/Transform You will have to create a string command and pass it to Hive. There is no way of doing that directly (without creating a string) using Java API. Zheng On Tue, Feb 24, 2009 at 5:12 PM, Min Zhou <[email protected]<mailto:[email protected]>> wrote: My bad. I meant that use Hive's java API to do raw mapreduce things, not drive a sql. Sorry! On Wed, Feb 25, 2009 at 12:05 AM, Edward Capriolo <[email protected]<mailto:[email protected]>> wrote: One way to interact with hive is by starting hives thrift server. If you want to use the raw java API, I stole some code from the Command Line Interface and made a simple driver program. (I attached it). You could also take a look at the source hwi folder. That is how we have multiple hive clients started inside a web application. I would suggest the thrift service as that should present yoru client program with a stable API. If you just make a work-alike program like TestHive upstream changes may be an issue down the line. -- My research interests are distributed systems, parallel computing and bytecode based virtual machine. http://coderplay.javaeye.com -- Yours, Zheng
