Praveen Rachabattuni created PIG-4217:
-----------------------------------------
Summary: Fix documentation in BuildBloom
Key: PIG-4217
URL: https://issues.apache.org/jira/browse/PIG-4217
Project: Pig
Issue Type: Bug
Reporter: Praveen Rachabattuni
Assignee: Praveen Rachabattuni
/**
* Build a bloom filter for use later in Bloom. This UDF is intended to run
* in a group all job. For example:
* define bb BuildBloom('jenkins', '100', '0.1');
* A = load 'foo' as (x, y);
* B = group A all;
* C = foreach B generate BuildBloom(A.x);
* store C into 'mybloom';
* The bloom filter can be on multiple keys by passing more than one field
* (or the entire bag) to BuildBloom.
* The resulting file can then be used in a Bloom filter as:
* define bloom Bloom(mybloom);
* A = load 'foo' as (x, y);
* B = load 'bar' as (z);
* C = filter B by Bloom(z);
* D = join C by z, A by x;
* It uses {@link org.apache.hadoop.util.bloom.BloomFilter}.
*/
The Pig script inside the doc comment above does not work as written.
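A possible corrected version of the example is sketched below. It rests on three observations about the quoted Javadoc, and it has not been checked against a specific Pig release. First, the build script defines the alias bb but then calls BuildBloom(A.x) directly, so the constructor arguments would never be applied. Second, the file name passed to Bloom is not quoted. Third, the filter calls Bloom(z) rather than the defined alias bloom. The constructor arguments ('jenkins', '100', '0.1') and the paths 'foo', 'bar' and 'mybloom' are taken unchanged from the original example.

```pig
-- Build step: run as a group-all job and store the resulting filter.
define bb BuildBloom('jenkins', '100', '0.1');
A = load 'foo' as (x, y);
B = group A all;
C = foreach B generate bb(A.x);   -- call the alias so the constructor arguments apply
store C into 'mybloom';

-- Filter step: load the stored filter and apply it before the join.
define bloom Bloom('mybloom');    -- file name passed as a quoted string
A = load 'foo' as (x, y);
B = load 'bar' as (z);
C = filter B by bloom(z);         -- call the alias defined on the line above
D = join C by z, A by x;
```

The surrounding Javadoc text would stay the same; only the three lines marked with comments differ from the quoted example.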