> On 15 Sep 2025, at 3:59 PM, Athira Rajeev <atraj...@linux.ibm.com> wrote:
>
> The pseries Shared Processor Logical Partition (SPLPAR) machines can
> retrieve a log of dispatch and preempt events from the hypervisor
> using data from the Dispatch Trace Log (DTL) buffer. With this
> information, the user can determine when and why each dispatch and
> preempt occurred. The vpa-dtl PMU exposes the Virtual Processor Area
> (VPA) DTL counters via perf.
>
> - Patches 1 to 6 have the powerpc PMU driver code changes to capture
> the DTL trace in perf.data, and patch 7 has the documentation update.
>
> Infrastructure used
> ===================
>
> The VPA DTL PMU counters do not interrupt on overflow or generate any
> PMIs. Therefore, an hrtimer is used to poll the DTL data. The timer
> interval can be provided by the user via the sample_period field, in
> nanoseconds. One hrtimer is added per vpa-dtl PMU thread. The DTL
> (Dispatch Trace Log) contains information about dispatch/preempt,
> enqueue time, etc. The DTL buffer data is copied directly into the
> auxiliary buffer and processed later. This avoids the time taken to
> create samples in kernel space. The PMU driver collecting the DTL
> entries makes use of the AUX support in the perf infrastructure. On the
> tools side, this data is made available as PERF_RECORD_AUXTRACE records.
>
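
The polling scheme described above can be illustrated with a minimal
Python sketch. This is not the driver code from the series: the class
name DtlPoller, the ring layout, and the running write index are all
assumptions made for illustration. It only shows the idea of copying
whatever entries are new on each timer tick, without building samples.

```python
from dataclasses import dataclass, field


@dataclass
class DtlPoller:
    """Hypothetical poller: copies new ring entries into an AUX-like list."""
    ring_size: int
    last_index: int = 0                      # entries already copied (running count)
    aux: list = field(default_factory=list)  # stands in for the AUX buffer

    def poll(self, ring, write_index):
        """Run on each timer tick; write_index is a running count of writes.

        Returns the number of entries that were overwritten before this
        poll could copy them.
        """
        pending = write_index - self.last_index
        lost = max(0, pending - self.ring_size)
        start = write_index - min(pending, self.ring_size)
        for i in range(start, write_index):
            # Raw copy only; decoding is left for later processing.
            self.aux.append(ring[i % self.ring_size])
        self.last_index = write_index
        return lost
```

A shorter polling period lowers the chance of the ring wrapping between
ticks, at the cost of more timer activity.
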
> To correlate each DTL entry with other events across CPUs, an
> auxtrace_queue is created for each CPU. Each auxtrace queue has an
> array/list of auxtrace buffers. All auxtrace queues are maintained in
> an auxtrace heap, sorted by timestamp. When the different
> PERF_RECORD_XX records are processed, the timestamp of the perf record
> is compared with the timestamp of the top element in the auxtrace heap
> so that DTL events can be correlated with other events. The auxtrace
> queue is processed if the timestamp of the element from the heap is
> lower than the timestamp of the perf record. Sometimes a buffer is only
> partially processed: once the next entry in the queue is no longer
> older than the current perf record, processing moves on to the next
> perf record. So the position within the buffer is tracked in order to
> continue processing from there next time. The timestamp of the queue in
> the auxtrace heap is updated with the timestamp of the last processed
> entry from the auxtrace buffer.
>
> This infrastructure ensures dispatch trace log entries can be correlated
> and presented along with other events like sched.
>
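
The timestamp-ordered processing described above can be sketched in
Python as follows. This is only an illustration, not the perf tools
code: AuxQueue and merge_by_timestamp are hypothetical names, and each
queue is assumed to hold (timestamp, payload) pairs already sorted by
timestamp. The sketch emits one DTL entry at a time and re-inserts the
queue into the heap with its next timestamp, which keeps the per-queue
position across perf records.

```python
import heapq


class AuxQueue:
    """Hypothetical per-CPU queue of decoded DTL entries."""

    def __init__(self, cpu, entries):
        self.cpu = cpu
        self.entries = entries  # list of (timestamp, payload), sorted
        self.pos = 0            # resume point for partially processed data

    def head_ts(self):
        if self.pos < len(self.entries):
            return self.entries[self.pos][0]
        return None


def merge_by_timestamp(records, queues):
    """records: list of (timestamp, name), sorted by timestamp."""
    heap = [(q.head_ts(), q.cpu, q) for q in queues if q.head_ts() is not None]
    heapq.heapify(heap)
    out = []

    def drain(limit):
        # Emit DTL entries older than `limit` (all of them if limit is None).
        while heap and (limit is None or heap[0][0] < limit):
            ts, cpu, q = heapq.heappop(heap)
            out.append((ts, f"dtl/cpu{cpu}", q.entries[q.pos][1]))
            q.pos += 1
            if q.head_ts() is not None:
                # Re-insert the queue keyed by its next entry's timestamp.
                heapq.heappush(heap, (q.head_ts(), cpu, q))

    for ts, name in records:
        drain(ts)
        out.append((ts, "record", name))
    drain(None)  # flush whatever DTL entries remain
    return out
```

For example, with one queue per CPU and a couple of sched records, the
output interleaves DTL entries and records in increasing timestamp
order.
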
> With the kernel changes:
>
> # ls /sys/devices/vpa_dtl/
> events format perf_event_mux_interval_ms power subsystem type uevent
>
> Thanks
> Athira
>
> Aboorva Devarajan (1):
> powerpc/time: Expose boot_tb via accessor
>
> Athira Rajeev (4):
> powerpc/perf/vpa-dtl: Add support to setup and free aux buffer for
> capturing DTL data
> powerpc/perf/vpa-dtl: Add support to capture DTL data in aux buffer
> powerpc/perf/vpa-dtl: Handle the writing of perf record when aux wake
> up is needed
> powerpc/perf/vpa-dtl: Add documentation for VPA dispatch trace log PMU
>
> Kajol Jain (2):
> powerpc/vpa_dtl: Add interface to expose vpa dtl counters via perf
> docs: ABI: sysfs-bus-event_source-devices-vpa-dtl: Document sysfs
> event format entries for vpa_dtl pmu
>
> .../sysfs-bus-event_source-devices-vpa-dtl | 25 +
> Documentation/arch/powerpc/index.rst | 1 +
> Documentation/arch/powerpc/vpa-dtl.rst | 156 +++++
> arch/powerpc/include/asm/time.h | 4 +
> arch/powerpc/kernel/time.c | 8 +-
> arch/powerpc/perf/Makefile | 2 +-
> arch/powerpc/perf/vpa-dtl.c | 596 ++++++++++++++++++
> 7 files changed, 790 insertions(+), 2 deletions(-)
> create mode 100644 Documentation/ABI/testing/sysfs-bus-event_source-devices-vpa-dtl
> create mode 100644 Documentation/arch/powerpc/vpa-dtl.rst
> create mode 100644 arch/powerpc/perf/vpa-dtl.c
>
> --
> 2.47.1
>
Tested this patch set by applying it on top of today’s mainline kernel,
and it is working as expected.
Please add the below tag for the patch set.
Tested-by: Venkat Rao Bagalkote <venka...@linux.ibm.com>
Regards,
Venkat.