Apologies if this is captured elsewhere.  In the Pig 0.11.1 documentation for 
the builtin function REGEX_EXTRACT 
(http://pig.apache.org/docs/r0.11.1/func.html#regex-extract), the third 
parameter is the index of the matched group to return.  The documentation says 
this is a "1-based parameter".  That's incorrect-it's zero-based.  E.g., to get 
the first match instance I used:
REGEX_EXTRACT(string,'regex',0)


Reply via email to