RANDOM takes a seed. You can do a filter with RANDOM:
define rand100 RANDOM(β100'); table1 = load 'school' using org.apache.hcatalog.pig.HCatLoader(); table2 = filter table1 by rand100()<0.01; Daniel On 5/15/15, 2:27 AM, "ζθΏη°" <[email protected]> wrote: >I USE > table1 = load 'school' using org.apache.hcatalog.pig.HCatLoader(); >table2 = sample table 0.01; >every time I dump table2 ,I get different result, Is there one sample >function with seed? so the result is not changed every time. >thank you.
