Re: Pig Performance Benchmarks

2009-02-17 Thread Alan Gates
That's correct. The 10m in the names weren't really meant to be hardcoded into the patch, as the idea is that the tables could be created at different sizes depending on your cluster size. Sorry for the incomplete state of things, obviously that patch needs some work before I commit it.

Re: Pig performance

2008-12-31 Thread Alan Gates
computations across multiple stores. Olga -Original Message- From: Ted Dunning [mailto:ted.dunn...@gmail.com] Sent: Saturday, December 20, 2008 10:33 AM To: pig-dev@hadoop.apache.org Cc: pig-dev@hadoop.apache.org Subject: Re: Pig performance I think the key points that Alan brought up

Re: Pig performance

2008-12-30 Thread Kevin Weil
combine computations across > multiple stores. > > Olga > > > -Original Message- > > From: Ted Dunning [mailto:ted.dunn...@gmail.com] > > Sent: Saturday, December 20, 2008 10:33 AM > > To: pig-dev@hadoop.apache.org > > Cc: pig-dev@hadoop.apache.

RE: Pig performance

2008-12-22 Thread Olga Natkovich
> Sent: Saturday, December 20, 2008 10:33 AM > To: pig-dev@hadoop.apache.org > Cc: pig-dev@hadoop.apache.org > Subject: Re: Pig performance > > > I think the key points that Alan brought up in his blog > comment were that trunk pig is paradoxically not the most > curr

Re: Pig performance

2008-12-20 Thread Ted Dunning
I think the key points that Alan brought up in his blog comment were that trunk pig is paradoxically not the most current and that storing intermediate results can decrease the scope of optimizations. On Dec 20, 2008, at 10:16, Alan Gates wrote: I left a comment on the blog addressing som

Re: Pig performance

2008-12-20 Thread Alan Gates
I left a comment on the blog addressing some of the issues he brought up. Alan. On Dec 20, 2008, at 1:00 AM, Jeff Hammerbacher wrote: Hey Pig team, Did anyone check out the recent claims about Pig's poor performance versus Cascading? Though I haven't worked extensively with either system,