[
https://issues.apache.org/jira/browse/PIG-4217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Daniel Dai updated PIG-4217:
----------------------------
Resolution: Fixed
Fix Version/s: 0.14.0
Hadoop Flags: Reviewed
Status: Resolved (was: Patch Available)
Patch committed to trunk and 0.14 branch. Thanks Praveen!
> Fix documentation in BuildBloom
> -------------------------------
>
> Key: PIG-4217
> URL: https://issues.apache.org/jira/browse/PIG-4217
> Project: Pig
> Issue Type: Bug
> Reporter: Praveen Rachabattuni
> Assignee: Praveen Rachabattuni
> Fix For: 0.14.0
>
> Attachments: PIG-4217-1.patch
>
>
> /**
> * Build a bloom filter for use later in Bloom. This UDF is intended to run
> * in a group all job. For example:
> * define bb BuildBloom('jenkins', '100', '0.1');
> * A = load 'foo' as (x, y);
> * B = group A all;
> * C = foreach B generate BuildBloom(A.x);
> * store C into 'mybloom';
> * The bloom filter can be on multiple keys by passing more than one field
> * (or the entire bag) to BuildBloom.
> * The resulting file can then be used in a Bloom filter as:
> * define bloom Bloom(mybloom);
> * A = load 'foo' as (x, y);
> * B = load 'bar' as (z);
> * C = filter B by Bloom(z);
> * D = join C by z, A by x;
> * It uses {@link org.apache.hadoop.util.bloom.BloomFilter}.
> */
> Pig script inside above doc strings doesn't work
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)