Bloom should be able to take a relation or a file
-------------------------------------------------
Key: PIG-2348
URL: https://issues.apache.org/jira/browse/PIG-2348
Project: Pig
Issue Type: Improvement
Components: internal-udfs
Affects Versions: 0.11
Reporter: Alan Gates
Assignee: Alan Gates
Priority: Minor
Fix For: 0.11
Currently, Bloom can only be used after the user has stored the output of a
previous BuildBloom query to an HDFS file. As a result, if the store and the
filter that uses Bloom are in the same script, the user must place an "exec"
between them.
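
For illustration, a minimal sketch of the workflow described above. The
relation names, input paths, the 'mybloom' location, and the BuildBloom
arguments (hash type, expected element count, desired false positive rate) are
placeholder assumptions, not taken from a real script:

```pig
-- Build the filter from the smaller input (all names and paths are hypothetical).
define bb BuildBloom('jenkins', '1000', '0.01');
small = load 'small_input' as (x, y);
grpd = group small all;
fltr = foreach grpd generate bb(small.x);
store fltr into 'mybloom';

-- Without this exec, the store above may not have run before Bloom reads the file.
exec;

-- Apply the stored filter to the larger input.
define bloom Bloom('mybloom');
large = load 'large_input' as (a, b);
flarge = filter large by bloom(a);
store flarge into 'filtered_output';
```

The user has to pick and clean up 'mybloom' and remember the exec, which are
the two burdens this issue is about.
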
If Bloom could take a relation as its first input (as with a relation cast to
a scalar), users would not need to put an exec in their script or manage a
storage location on HDFS.
Sometimes storing the results in a file makes sense, so we don't want to
remove the current behavior, just add another option.