[jira] [Commented] (PIG-5019) Pig generates tons of warnings for udf with enabled warnings aggregation
[ https://issues.apache.org/jira/browse/PIG-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15483614#comment-15483614 ]

Murshid Chalaev commented on PIG-5019:
--------------------------------------

[~rohini], thank you for the review. Attached PIG-5019_2.patch.

> Pig generates tons of warnings for udf with enabled warnings aggregation
> ------------------------------------------------------------------------
>
>                 Key: PIG-5019
>                 URL: https://issues.apache.org/jira/browse/PIG-5019
>             Project: Pig
>          Issue Type: Bug
>          Components: internal-udfs
>    Affects Versions: 0.14.0
>            Reporter: Murshid Chalaev
>            Assignee: Murshid Chalaev
>             Fix For: 0.16.1
>
>         Attachments: PIG-5019.patch, PIG-5019_2.patch, input_example.gz, test_pig14_udf .pig
>
>
> For a data set containing 9 lines, the aggregated warning message is displayed:
> {code}
> 2016-09-01 19:40:33,664 [main] WARN org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Encountered Warning UDF_WARNING_1 6 time(s).
> {code}
> but in the container logs I see a separate log message "Cannot extract group for input" for every non-matching value:
> {code}
> 2016-09-01 19:40:28,115 INFO [main] org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigMapOnly$Map: Aliases being processed per job phase (AliasName[line,offset]): M: b[10,4],b[-1,-1],extract_fields[17,17] C: R:
> 2016-09-01 19:40:28,122 WARN [main] org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigHadoopLogger: org.apache.pig.builtin.REGEX_EXTRACT(UDF_WARNING_1): RegexExtract : Cannot extract group for input /v1=1=9
> 2016-09-01 19:40:28,124 WARN [main] org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigHadoopLogger: org.apache.pig.builtin.REGEX_EXTRACT(UDF_WARNING_1): RegexExtract : Cannot extract group for input /v2=3=7
> 2016-09-01 19:40:28,124 WARN [main] org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigHadoopLogger: org.apache.pig.builtin.REGEX_EXTRACT(UDF_WARNING_1): RegexExtract : Cannot extract group for input /v1=4=6
> 2016-09-01 19:40:28,125 WARN [main] org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigHadoopLogger: org.apache.pig.builtin.REGEX_EXTRACT(UDF_WARNING_1): RegexExtract : Cannot extract group for input /v2=5=5
> 2016-09-01 19:40:28,125 WARN [main] org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigHadoopLogger: org.apache.pig.builtin.REGEX_EXTRACT(UDF_WARNING_1): RegexExtract : Cannot extract group for input /v1=8=2
> 2016-09-01 19:40:28,125 WARN [main] org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigHadoopLogger: org.apache.pig.builtin.REGEX_EXTRACT(UDF_WARNING_1): RegexExtract : Cannot extract group for input /v3=9=1
> {code}
> It does not log the warning messages in the task logs.
> The patch for PIG-2207 was committed to Pig 0.13+.
> In 0.12 we had a single counter for all UDF warnings, but in 0.13+ we have a separate counter and message for every unique warning log line.
> The two lines below are unique:
> /v2=3=7
> /v1=4=6
> That's why Pig prints both of them to the console.
> Printing a separate log message for every data line also slows down overall performance.

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
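
The difference described in the issue, one counter per warning kind versus one counter and one log line per unique message text, can be illustrated with a small sketch. This is not the PIG-5019 patch and does not use Pig's classes: `WarningAggregationSketch`, `WarningKind`, and `Aggregator` are hypothetical names introduced only for illustration. The assumption is that the message text (which embeds the input value) is kept out of the aggregation key, so six non-matching inputs produce one sample log line and one aggregated count.

```java
import java.util.EnumMap;
import java.util.Map;

// Hypothetical sketch; none of these names come from Pig itself.
class WarningAggregationSketch {

    // Stand-in for a warning kind such as the UDF_WARNING_1 seen in the logs.
    enum WarningKind { UDF_WARNING_1, UDF_WARNING_2 }

    static final class Aggregator {
        private final Map<WarningKind, Long> counts = new EnumMap<>(WarningKind.class);

        // Counts an occurrence keyed by kind only, never by message text.
        // Returns true only for the first occurrence of a kind, so the
        // caller can log a single sample message instead of one per input.
        boolean record(WarningKind kind) {
            return counts.merge(kind, 1L, Long::sum) == 1L;
        }

        long count(WarningKind kind) {
            return counts.getOrDefault(kind, 0L);
        }

        // Formats the aggregated line in the style shown in the issue.
        String summary(WarningKind kind) {
            return "Encountered Warning " + kind + " " + count(kind) + " time(s).";
        }
    }

    public static void main(String[] args) {
        Aggregator agg = new Aggregator();
        // The six non-matching inputs quoted in the issue.
        String[] inputs = {"/v1=1=9", "/v2=3=7", "/v1=4=6", "/v2=5=5", "/v1=8=2", "/v3=9=1"};
        for (String input : inputs) {
            if (agg.record(WarningKind.UDF_WARNING_1)) {
                System.out.println("WARN sample: Cannot extract group for input " + input);
            }
        }
        System.out.println(agg.summary(WarningKind.UDF_WARNING_1));
    }
}
```

Keying on the enum rather than on the formatted message is the design choice that matters here: the message contains the offending input, so a message-keyed map grows with the data and every distinct line is treated as a new warning.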
[jira] [Updated] (PIG-5019) Pig generates tons of warnings for udf with enabled warnings aggregation
[ https://issues.apache.org/jira/browse/PIG-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Murshid Chalaev updated PIG-5019:
---------------------------------
    Status: Patch Available  (was: Open)
[jira] [Updated] (PIG-5019) Pig generates tons of warnings for udf with enabled warnings aggregation
[ https://issues.apache.org/jira/browse/PIG-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Murshid Chalaev updated PIG-5019:
---------------------------------
    Attachment: PIG-5019_2.patch
[jira] [Commented] (PIG-5019) Pig generates tons of warnings for udf with enabled warnings aggregation
[ https://issues.apache.org/jira/browse/PIG-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15455093#comment-15455093 ]

Murshid Chalaev commented on PIG-5019:
--------------------------------------

[~aniket486], could you please check the patch? It's related to the commit for PIG-2207.
[jira] [Updated] (PIG-5019) Pig generates tons of warnings for udf with enabled warnings aggregation
[ https://issues.apache.org/jira/browse/PIG-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Murshid Chalaev updated PIG-5019:
---------------------------------
    Attachment: PIG-5019.patch
[jira] [Updated] (PIG-5019) Pig generates tons of warnings for udf with enabled warnings aggregation
[ https://issues.apache.org/jira/browse/PIG-5019?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Murshid Chalaev updated PIG-5019:
---------------------------------
    Attachment: test_pig14_udf .pig
                input_example.gz
[jira] [Created] (PIG-5019) Pig generates tons of warnings for udf with enabled warnings aggregation
Murshid Chalaev created PIG-5019:
------------------------------------

             Summary: Pig generates tons of warnings for udf with enabled warnings aggregation
                 Key: PIG-5019
                 URL: https://issues.apache.org/jira/browse/PIG-5019
             Project: Pig
          Issue Type: Bug
          Components: internal-udfs
    Affects Versions: 0.14.0
            Reporter: Murshid Chalaev