I've been working with the hive regexp_extract(string subject, string pattern,
int index) command. In the hive language manual,
https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF#LanguageManualUDF-StringFunctions,
the following description for this function is given:
Returns the string extracted using the pattern. e.g.
regexp_extract('foothebar', 'foo(.*?)(bar)', 2) returns 'bar.' Note that some
care is necessary in using predefined character classes: using '\s' as the
second argument will match the letter s; '
s' is necessary to match whitespace, etc. The 'index' parameter is the Java
regex Matcher group() method index. See docs/api/java/util/regex/Matcher.html
for more information on the 'index' or Java regex group() method.
I tried doing some basic web searches and cannot find what I seek,
understanding of what the Index value in the regexp() call does.
Where exactly is the information listed as, "See
docs/api/java/util/regex/Matcher.html?" Is it online somewhere?
---
Mark E. Sunderlin
Solutions Architect |AOL Data Warehouse
P: 703-256-6935 | C: 540-327-6222
AIM: MESunderlin
22000 AOL Way | Dulles, VA | 20166