Re: PIG + Junit

2010-07-21 Thread Corbin Hoenes
okay no attachments...try this gist: http://gist.github.com/484135 On Jul 21, 2010, at 12:02 AM, Corbin Hoenes wrote: Trying to attach the PigRunner class in case that helps give you a start using register script. On Jul 20, 2010, at 11:56 PM, Corbin Hoenes wrote: Hey Todd we run

Re: PIG + Junit

2010-07-21 Thread Corbin Hoenes
Dimitry, Nope that is new for me thanks for pointing it out, been using this home grown class since pig 0.5--really like the idea of unit testing moving into pig as a first class citizen. On Jul 21, 2010, at 2:11 AM, Dmitriy Ryaboy wrote: Corbin, Have you looked at PigUnit?

Using Pig with HBase

2010-07-21 Thread Dmitry Demeshchuk
Greetings. I'm trying to query HBase using Pig but do something wrong and cannot figure out what exactly. 1. First, I create a table in HBase: hbase(main):001:0 create 'test_table', 'test_family' and add values to it: hbase(main):002:0 put 'test_table', '1', 'test_family:body', 'body1'

Thread safety

2010-07-21 Thread Wouter de Bie
Hi all, We're building an application that starts multiple pig jobs in parallel by using PigServer. However, Pig doesn't seem to be thread-safe. And since we're running a Java application, I'm not sure how to solve this. Does anyone have an idea? // Wouter

Re: Using Pig with HBase

2010-07-21 Thread Dmitriy Ryaboy
Which version of Pig are you using? If 0.6, have you tried the elephant bird HBase loader? Is there a more detailed stack trace in the pig log? -Dmitriy On Wed, Jul 21, 2010 at 4:10 AM, Dmitry Demeshchuk demeshc...@gmail.comwrote: Greetings. I'm trying to query HBase using Pig but do

Re: how to set the conf in pig to use a cached file.

2010-07-21 Thread Scott Carey
I believe you set these by passing them in on the command line, not from within the script. For example add -Dmapred.create.symlink=yes to your command line when you call pig. The language itself does not have any features (yet) for passing in properties to the execution engine. On Jul 20,

Sorting a tuple's content

2010-07-21 Thread Renato Marroquín Mogrovejo
Hey everybody, Does any body know how I can sort a tuple's content? For example, I have (770001,880001,990001,770001) and I would like to obtain (770001,770001,880001,990001). I tried doing a group by the first field but the thing is that I still get the whole tuple as a resultant bag. Thanks in

Re: Sorting a tuple's content

2010-07-21 Thread Dmitriy Ryaboy
that has to be a UDF, there is nothing built in for this. On Wed, Jul 21, 2010 at 6:33 PM, Renato Marroquín Mogrovejo renatoj.marroq...@gmail.com wrote: Hey everybody, Does any body know how I can sort a tuple's content? For example, I have (770001,880001,990001,770001) and I would like to