All,
I have some data that I would like to store into a file and then load it in a
UDF to do some operations in the next pig statement.
For example,
doc_ids = FOREACH docs GENERATE doc_id;
STORE doc_ids INTO '$TEMP';
modifieddocs = FOREACH docs GENERATE myUDF('$TEMP', doc_id);
where myUDF loads doc_ids stored in '$TEMP' and does some operation using $TEMP
and doc_id. Now I need to make sure that the "STORE doc_ids INTO '$TEMP';"
occurs before the FOREACH statement, so that loading the index occurs smoothly.
Is there anyway to guarantee that that can happen?
Thanks
Esh