[ 
https://issues.apache.org/jira/browse/PIG-4217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Praveen Rachabattuni updated PIG-4217:
--------------------------------------
    Status: Patch Available  (was: Open)

> Fix documentation in BuildBloom
> -------------------------------
>
>                 Key: PIG-4217
>                 URL: https://issues.apache.org/jira/browse/PIG-4217
>             Project: Pig
>          Issue Type: Bug
>            Reporter: Praveen Rachabattuni
>            Assignee: Praveen Rachabattuni
>         Attachments: PIG-4217-1.patch
>
>
> /**
>  * Build a bloom filter for use later in Bloom.  This UDF is intended to run
>  * in a group all job.  For example:
>  * define bb BuildBloom('jenkins', '100', '0.1');
>  * A = load 'foo' as (x, y);
>  * B = group A all;
>  * C = foreach B generate BuildBloom(A.x);
>  * store C into 'mybloom';
>  * The bloom filter can be on multiple keys by passing more than one field
>  * (or the entire bag) to BuildBloom.
>  * The resulting file can then be used in a Bloom filter as:
>  * define bloom Bloom(mybloom);
>  * A = load 'foo' as (x, y);
>  * B = load 'bar' as (z);
>  * C = filter B by Bloom(z);
>  * D = join C by z, A by x;
>  * It uses {@link org.apache.hadoop.util.bloom.BloomFilter}.
>  */
> Pig script inside above doc strings doesn't work



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to