Re: Strange CROSS behavior

2014-05-02 Thread Rohini Palaniswamy
This looks like a bug. Can you please file a jira with steps to reproduce? On Fri, Apr 18, 2014 at 2:45 PM, Alex Rasmussen alex...@trifacta.com wrote: I'm using PigStorage(',') for all stores. I agree about the expensiveness of CROSS, but I'm still kind of confused as to why it would lose

Strange CROSS behavior

2014-04-18 Thread Alex Rasmussen
I'm noticing some really strange behavior with a CROSS operation in one of my scripts. I'm CROSSing a table T1 with another table T2 to produce T3. T1 has one row, and T2 has 2,982,035 rows. If I STORE both T1 and T2 before CROSSing them together to get T3, like so: -- ... Long script that,
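
For reference, the row count a CROSS is expected to produce in the case described above (T1 with one row, T2 with 2,982,035 rows) can be sketched outside Pig. The following is a minimal Python illustration, not the poster's script: the helper names `cross` and `expected_cross_rows` and the small stand-in relations are hypothetical, and it only models the Cartesian-product semantics of CROSS, not Pig's execution.

```python
from itertools import product


def cross(left, right):
    """Pair every row of `left` with every row of `right` (Cartesian product).

    Rows are tuples; each output row is the left row followed by the right row.
    """
    return [l + r for l, r in product(left, right)]


def expected_cross_rows(left_count, right_count):
    """Row count a full cross product must have: |left| * |right|."""
    return left_count * right_count


# Hypothetical stand-in relations: one row on the left, five on the right.
t1 = [("only_row",)]
t2 = [(i,) for i in range(5)]
t3 = cross(t1, t2)

# With a single-row left side, the output must have exactly as many rows
# as the right side; fewer rows means records were lost.
assert len(t3) == expected_cross_rows(len(t1), len(t2))

# Sizes reported in the thread: 1 row crossed with 2,982,035 rows.
print(expected_cross_rows(1, 2_982_035))
```

Comparing this expected count against the number of records actually stored for T3 is one way to quantify how many rows went missing when filing the bug report.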

Re: Strange CROSS behavior

2014-04-18 Thread Pradeep Gollakota
What is the storage func you're using? My guess is that there is some shared state in the storage func. Take a look at this Stack Overflow answer, which deals with shared state in Stores: http://stackoverflow.com/questions/20225842/apache-pig-append-one-dataset-to-another-one/20235592#20235592. The reason why

Re: Strange CROSS behavior

2014-04-18 Thread Alex Rasmussen
I'm using PigStorage(',') for all stores. I agree about the expensiveness of CROSS, but I'm still kind of confused as to why it would lose records in this case. --Alex On Fri, Apr 18, 2014 at 2:28 PM, Pradeep Gollakota pradeep...@gmail.com wrote: What is the storage func you're using? My

Re: Strange CROSS behavior

2014-04-18 Thread Russell Jurney
STORing and LOADing relations is often a workaround for these kinds of bugs. On Friday, April 18, 2014, Alex Rasmussen alex...@trifacta.com wrote: I'm using PigStorage(',') for all stores. I agree about the expensiveness of CROSS, but I'm still kind of confused as to why it would lose