On 10/27/15 6:33 AM, Arnaldo Carvalho de Melo wrote:

Correlating data to user readable information is a key part of perf.

Indeed, as best as it can.

One option that might be able to solve this problem is to have perf
kernel side walk the task list and generate the task events into the
ring buffer (task diag code could be leveraged). This would be a lot

It would have to do this over multiple iterations, locklessly wrt the
task list, in a non-intrusive way, which, in this case, could take
forever, no? :-)

taskdiag struggles to keep up because netlink messages have a limited size, the skb's have to be pushed to userspace and ack'ed and then the walk proceeds to the next task.

Filenames for the maps are the biggest killer on throughput wrt kernel side processing.

With a multi-MB ring buffer you have a much larger buffer to fill. In addition perf userspace can be kicked at a low watermark so it is draining that buffer as fast as it can:

   kernel   --->   ring buffer  --->  perf  -->  what?

The limiter here is perf userspace draining the buffer such that the kernel side does not have to take much if any break.

If the "What" is a file (e.g., perf record) then file I/O becomes a limiter. If the "What" is processing the data (e.g., perf top) we should be able to come up with some design that at least pulls the data into memory so the buffer never fills.

Sure there would need to be some progress limiters put to keep the kernel side from killing a cpu but I think this kind of design has the best chance of getting the most information for this class of problem.

And then for all of the much smaller more typical perf use cases this kind of data collection is much less expensive than walking proc. taskdiag shows that and this design is faster and more efficient than taskdiag.

David
--
To unsubscribe from this list: send the line "unsubscribe linux-perf-users" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to