RE: Tez log location?
Perhaps we should consider creating a TEE in the AM and always do SimpleHistoryLogging? From: Jianfeng (Jeff) Zhang [mailto:jzh...@hortonworks.com] Sent: Thursday, May 21, 2015 5:33 PM To: user@tez.apache.org Subject: Re: Tez log location? In that case you are using SimpleHistoryLoggingService rather than ATSHistoryLoggingService. SimpleHistoryLoggingService will log all the messages to the container logs. So if you will find something like the following which is the data same as in /ws/v1/timeline/TEZ_DAG_ID. But these data are only for diagnosis, if you are trying to consume these data for offline analysis, I would encourage you to use data from ATS 2015-05-21 18:52:06,245 INFO [Dispatcher thread: Central] history.HistoryEventHandler: [HISTORY][DAG:dag_1432205503669_0001_1][Event:DAG_FINISHED]: dagId=dag_1432205503669_0001_1, startTime=1432205516089, finishTime=1432205526204, timeTaken=10115, status=SUCCEEDED, diagnostics=, counters=Counters: 14, org.apache.tez.common.counters.DAGCounter, NUM_KILLED_TASKS=1, NUM_SUCCEEDED_TASKS=2, TOTAL_LAUNCHED_TASKS=3, AM_CPU_MILLISECONDS=0, AM_GC_TIME_MILLIS=0, File System Counters, HDFS_BYTES_READ=0, HDFS_BYTES_WRITTEN=24, HDFS_READ_OPS=6, HDFS_LARGE_READ_OPS=0, HDFS_WRITE_OPS=4, org.apache.tez.common.counters.TaskCounter, NUM_SPECULATIONS=1, GC_TIME_MILLIS=21, COMMITTED_HEAP_BYTES=514850816, OUTPUT_RECORDS=2 Best Regard, Jeff Zhang From: Xiaoyong Zhu mailto:xiaoy...@microsoft.com>> Reply-To: "user@tez.apache.org<mailto:user@tez.apache.org>" mailto:user@tez.apache.org>> Date: Friday, May 22, 2015 at 7:14 AM To: "user@tez.apache.org<mailto:user@tez.apache.org>" mailto:user@tez.apache.org>> Subject: Tez log location? Hi, I am wondering if I didn't configure YARN ATS integration, where would be the tez log (I mean the data available in /ws/v1/timeline/TEZ_DAG_ID if there is an integration happening) go to HDFS? Is there a configuration for that? Thanks! Xiaoyong
Re: Tez log location?
In that case you are using SimpleHistoryLoggingService rather than ATSHistoryLoggingService. SimpleHistoryLoggingService will log all the messages to the container logs. So if you will find something like the following which is the data same as in /ws/v1/timeline/TEZ_DAG_ID. But these data are only for diagnosis, if you are trying to consume these data for offline analysis, I would encourage you to use data from ATS 2015-05-21 18:52:06,245 INFO [Dispatcher thread: Central] history.HistoryEventHandler: [HISTORY][DAG:dag_1432205503669_0001_1][Event:DAG_FINISHED]: dagId=dag_1432205503669_0001_1, startTime=1432205516089, finishTime=1432205526204, timeTaken=10115, status=SUCCEEDED, diagnostics=, counters=Counters: 14, org.apache.tez.common.counters.DAGCounter, NUM_KILLED_TASKS=1, NUM_SUCCEEDED_TASKS=2, TOTAL_LAUNCHED_TASKS=3, AM_CPU_MILLISECONDS=0, AM_GC_TIME_MILLIS=0, File System Counters, HDFS_BYTES_READ=0, HDFS_BYTES_WRITTEN=24, HDFS_READ_OPS=6, HDFS_LARGE_READ_OPS=0, HDFS_WRITE_OPS=4, org.apache.tez.common.counters.TaskCounter, NUM_SPECULATIONS=1, GC_TIME_MILLIS=21, COMMITTED_HEAP_BYTES=514850816, OUTPUT_RECORDS=2 Best Regard, Jeff Zhang From: Xiaoyong Zhu mailto:xiaoy...@microsoft.com>> Reply-To: "user@tez.apache.org<mailto:user@tez.apache.org>" mailto:user@tez.apache.org>> Date: Friday, May 22, 2015 at 7:14 AM To: "user@tez.apache.org<mailto:user@tez.apache.org>" mailto:user@tez.apache.org>> Subject: Tez log location? Hi, I am wondering if I didn't configure YARN ATS integration, where would be the tez log (I mean the data available in /ws/v1/timeline/TEZ_DAG_ID if there is an integration happening) go to HDFS? Is there a configuration for that? Thanks! Xiaoyong
Re: Tez log location?
There is some history logging done that can be enabled via the SimpleHistoryLogger. This activates by default if ATS logger is not enabled. This is not fully compatible with the ATS data and also as it is mostly experimental, it may not have all the data. To use it, you can configure the “tez.history.logging.service.class” to “” or “org.apache.tez.dag.history.logging.impl.SimpleHistoryLoggingService”. The config property “tez.simple.history.logging.dir” controls the path on HDFS where the history is written to. If the dir path is not configured, it writes the logs as part of the Application Master container logs which can then be pulled via “bin/yarn logs -application” Using the HDFS logger does imply that the UI will no longer be functional. thanks — Hitesh On May 21, 2015, at 4:14 PM, Xiaoyong Zhu wrote: > Hi, I am wondering if I didn’t configure YARN ATS integration, where would be > the tez log (I mean the data available in /ws/v1/timeline/TEZ_DAG_ID if there > is an integration happening) go to HDFS? Is there a configuration for that? > > Thanks! > > Xiaoyong
Tez log location?
Hi, I am wondering if I didn't configure YARN ATS integration, where would be the tez log (I mean the data available in /ws/v1/timeline/TEZ_DAG_ID if there is an integration happening) go to HDFS? Is there a configuration for that? Thanks! Xiaoyong