RANDOM takes a seed. You can do a filter with RANDOM:

define rand100 RANDOM(β€˜100');

table1 = load 'school' using org.apache.hcatalog.pig.HCatLoader();
table2 = filter table1 by rand100()<0.01;



Daniel
On 5/15/15, 2:27 AM, "ζŽθΏη”°" <[email protected]> wrote:

>I USE 
> table1 = load 'school' using org.apache.hcatalog.pig.HCatLoader();
>table2 = sample table 0.01;
>every time I dump table2 ,I get different result, Is there one sample
>function with seed? so  the result is not changed every time.
>thank you.

Reply via email to