[jira] [Updated] (MAPREDUCE-7319) Log list of mappers at trace level in ShuffleHandler audit log
[ https://issues.apache.org/jira/browse/MAPREDUCE-7319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Badger updated MAPREDUCE-7319: --- Resolution: Fixed Status: Resolved (was: Patch Available) > Log list of mappers at trace level in ShuffleHandler audit log > -- > > Key: MAPREDUCE-7319 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7319 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: yarn >Affects Versions: 3.4.0 >Reporter: Jim Brennan >Assignee: Jim Brennan >Priority: Minor > Fix For: 3.4.0, 3.1.5, 3.3.1, 2.10.2, 3.2.3 > > Attachments: MAPREDUCE-7319.001.patch > > > [MAPREDUCE-6958] added the content length to ShuffleHandler audit log, which > is logged at DEBUG level. After enabling it, we found that the list of > mappers for large jobs was filling up our audit logs. It would be good to > move the list of mappers to TRACE level to reduce the logging impact without > disabling the log message entirely. > For example a log message like this: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > Would become this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=DEBUG}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > {noformat} > And this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=TRACE}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > 2018-01-25 23:43:02,669 [New I/O worker #1] TRACE ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > One question is whether there are any downstream consumers of this audit log > that might have a problem with this change? -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org
[jira] [Updated] (MAPREDUCE-7319) Log list of mappers at trace level in ShuffleHandler audit log
[ https://issues.apache.org/jira/browse/MAPREDUCE-7319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eric Badger updated MAPREDUCE-7319: --- Fix Version/s: 3.2.3 2.10.2 3.3.1 3.1.5 3.4.0 +1 Thanks for the patch, [~Jim_Brennan]! I've committed this to trunk (3.4), branch-3.3, branch-3.2, branch-3.1, and branch-2.10 > Log list of mappers at trace level in ShuffleHandler audit log > -- > > Key: MAPREDUCE-7319 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7319 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: yarn >Affects Versions: 3.4.0 >Reporter: Jim Brennan >Assignee: Jim Brennan >Priority: Minor > Fix For: 3.4.0, 3.1.5, 3.3.1, 2.10.2, 3.2.3 > > Attachments: MAPREDUCE-7319.001.patch > > > [MAPREDUCE-6958] added the content length to ShuffleHandler audit log, which > is logged at DEBUG level. After enabling it, we found that the list of > mappers for large jobs was filling up our audit logs. It would be good to > move the list of mappers to TRACE level to reduce the logging impact without > disabling the log message entirely. > For example a log message like this: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > Would become this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=DEBUG}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > {noformat} > And this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=TRACE}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > 2018-01-25 23:43:02,669 [New I/O worker #1] TRACE ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > One question is whether there are any downstream consumers of this audit log > that might have a problem with this change? -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org
[jira] [Updated] (MAPREDUCE-7319) Log list of mappers at trace level in ShuffleHandler audit log
[ https://issues.apache.org/jira/browse/MAPREDUCE-7319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Brennan updated MAPREDUCE-7319: --- Status: Patch Available (was: Open) > Log list of mappers at trace level in ShuffleHandler audit log > -- > > Key: MAPREDUCE-7319 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7319 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: yarn >Affects Versions: 3.4.0 >Reporter: Jim Brennan >Assignee: Jim Brennan >Priority: Minor > Attachments: MAPREDUCE-7319.001.patch > > > [MAPREDUCE-6958] added the content length to ShuffleHandler audit log, which > is logged at DEBUG level. After enabling it, we found that the list of > mappers for large jobs was filling up our audit logs. It would be good to > move the list of mappers to TRACE level to reduce the logging impact without > disabling the log message entirely. > For example a log message like this: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > Would become this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=DEBUG}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > {noformat} > And this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=TRACE}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > 2018-01-25 23:43:02,669 [New I/O worker #1] TRACE ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > One question is whether there are any downstream consumers of this audit log > that might have a problem with this change? -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org
[jira] [Updated] (MAPREDUCE-7319) Log list of mappers at trace level in ShuffleHandler audit log
[ https://issues.apache.org/jira/browse/MAPREDUCE-7319?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jim Brennan updated MAPREDUCE-7319: --- Attachment: MAPREDUCE-7319.001.patch > Log list of mappers at trace level in ShuffleHandler audit log > -- > > Key: MAPREDUCE-7319 > URL: https://issues.apache.org/jira/browse/MAPREDUCE-7319 > Project: Hadoop Map/Reduce > Issue Type: Improvement > Components: yarn >Affects Versions: 3.4.0 >Reporter: Jim Brennan >Assignee: Jim Brennan >Priority: Minor > Attachments: MAPREDUCE-7319.001.patch > > > [MAPREDUCE-6958] added the content length to ShuffleHandler audit log, which > is logged at DEBUG level. After enabling it, we found that the list of > mappers for large jobs was filling up our audit logs. It would be good to > move the list of mappers to TRACE level to reduce the logging impact without > disabling the log message entirely. > For example a log message like this: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > Would become this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=DEBUG}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > {noformat} > And this with > {{log4j.logger.org.apache.hadoop.mapred.ShuffleHandler.audit=TRACE}}: > {noformat} > 2018-01-25 23:43:02,669 [New I/O worker #1] DEBUG ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 reducer 241 length 482072 > 2018-01-25 23:43:02,669 [New I/O worker #1] TRACE ShuffleHandler.audit: > shuffle for job_1512479762132_1318600 mappers: > [attempt_1512479762132_1318600_1_00_004852_0_10003, > attempt_1512479762132_1318600_1_00_004190_0_10003, > attempt_1512479762132_1318600_1_00_004393_0_10003, > attempt_1512479762132_1318600_1_00_005057_0_10003, > attempt_1512479762132_1318600_1_00_004855_0_10002, > attempt_1512479762132_1318600_1_00_003976_0_10003, > attempt_1512479762132_1318600_1_00_004058_0_10003, > attempt_1512479762132_1318600_1_00_004355_0_10003, > attempt_1512479762132_1318600_1_00_004436_0_10002, > attempt_1512479762132_1318600_1_00_004854_0_10003, > attempt_1512479762132_1318600_1_00_005174_0_10004, > attempt_1512479762132_1318600_1_00_003972_0_10002, > attempt_1512479762132_1318600_1_00_004853_0_10002, > attempt_1512479762132_1318600_1_00_004856_0_10002] > {noformat} > One question is whether there are any downstream consumers of this audit log > that might have a problem with this change? -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: mapreduce-issues-unsubscr...@hadoop.apache.org For additional commands, e-mail: mapreduce-issues-h...@hadoop.apache.org