Re: Pig | Yahoo! Research

Andrew Hitchcock Thu, 26 Apr 2007 16:53:25 -0700

My understanding is that both Pig and Sawzall are intended for batch
data processing and provide an abstraction layer above MapReduce.


BigTable, on the other hand, is more of a long-term random-access database.

On 4/26/07, Ian Holsman <[EMAIL PROTECTED]> wrote:

Jim Kellerman wrote:
> Can someone comment on how Pig compares with Bigtable?
>
> On Thu, 2007-04-26 at 13:10 -0700, Doug Cutting wrote:
>
>> FYI
>>
>> http://research.yahoo.com/project/pig
>>
>> Doug
>>
my understanding is

bigtable/hbase stores the data
mapreduce/hadoop manipulates/creates the data to be stored in bigtable
via functions, and controls the distribution
sawzall/pig is a query language to extract information from it. I think
it would use create functions for mapreduce/hadoop to run.

regards
Ian

Re: Pig | Yahoo! Research

Reply via email to