Hello Impala Public Jenkins,

I'd like you to reexamine a change. Please visit

    http://gerrit.cloudera.org:8080/15914

to look at the new patch set (#3).

Change subject: IMPALA-9435: Usability enhancements for data cache access trace
......................................................................

IMPALA-9435: Usability enhancements for data cache access trace

The data cache access trace was added in IMPALA-8542 as a way
to capture a workload's cache accesses to allow later analysis.

This modifies the data cache access trace to improve usability:
1. The access trace now uses a SimpleLogger to limit the total
   number of trace entries per file and total number of trace
   files. This caps the disk usage for the access trace. The
   behavior is controlled by the data_cache_trace_dir,
   max_data_cache_trace_file_size, and max_data_cache_trace_files
   startup parameters.
2. This introduces the data_cache_trace_percentage, which allows
   tracing only a subset of the entries produced. It traces
   accesses for a consistent subset of the cache (i.e. accesses
   for a filename/mtime/offset are either always traced or
   never traced). This allows for better analysis than a random
   sample. Tracing a subset of accesses can reduce any performance
   overhead from tracing. It also provides a way to trace a longer
   time period in the same number of entries.

This also implements the ability to replay traces against a
specific cache configuration. The replayer can produce JSON output
with cache hit/miss information for the original trace and the
replay. This provides a building block for building analysis
comparing different cache sizes or cache eviction policies.

Testing:
 - New backend tests in data-cache-test, data-cache-trace-test
 - Manually testing the data-cache-trace-replayer

Change-Id: I0f84204d8e5145f5fa8d4851d9c19ac317db168e
---
M be/src/runtime/io/CMakeLists.txt
M be/src/runtime/io/data-cache-test.cc
A be/src/runtime/io/data-cache-trace-replayer.cc
A be/src/runtime/io/data-cache-trace-test.cc
A be/src/runtime/io/data-cache-trace.cc
A be/src/runtime/io/data-cache-trace.h
M be/src/runtime/io/data-cache.cc
M be/src/runtime/io/data-cache.h
M bin/start-impala-cluster.py
9 files changed, 1,473 insertions(+), 271 deletions(-)


  git pull ssh://gerrit.cloudera.org:29418/Impala-ASF refs/changes/14/15914/3
--
To view, visit http://gerrit.cloudera.org:8080/15914
To unsubscribe, visit http://gerrit.cloudera.org:8080/settings

Gerrit-Project: Impala-ASF
Gerrit-Branch: master
Gerrit-MessageType: newpatchset
Gerrit-Change-Id: I0f84204d8e5145f5fa8d4851d9c19ac317db168e
Gerrit-Change-Number: 15914
Gerrit-PatchSet: 3
Gerrit-Owner: Joe McDonnell <joemcdonn...@cloudera.com>
Gerrit-Reviewer: Impala Public Jenkins <impala-public-jenk...@cloudera.com>

Reply via email to