Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-13 Thread Jiri Olsa
On Tue, Sep 11, 2018 at 04:42:09PM +0300, Alexey Budankov wrote: > Hi, > > On 11.09.2018 11:34, Jiri Olsa wrote: > > On Tue, Sep 11, 2018 at 11:16:45AM +0300, Alexey Budankov wrote: > >> > >> Hi Ingo, > >> > >> On 11.09.2018 9:35, Ingo Molnar wrote: > >>> > >>> * Alexey Budankov wrote: > >>> >

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-13 Thread Jiri Olsa
On Tue, Sep 11, 2018 at 04:42:09PM +0300, Alexey Budankov wrote: > Hi, > > On 11.09.2018 11:34, Jiri Olsa wrote: > > On Tue, Sep 11, 2018 at 11:16:45AM +0300, Alexey Budankov wrote: > >> > >> Hi Ingo, > >> > >> On 11.09.2018 9:35, Ingo Molnar wrote: > >>> > >>> * Alexey Budankov wrote: > >>> >

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-12 Thread Alexey Budankov
Hi, On 11.09.2018 17:19, Peter Zijlstra wrote: > On Tue, Sep 11, 2018 at 08:35:12AM +0200, Ingo Molnar wrote: >>> Well, explicit threading in the tool for AIO, in the simplest case, means >>> incorporating some POSIX API implementation into the tool, avoiding >>> code reuse in the first

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-12 Thread Alexey Budankov
Hi, On 11.09.2018 17:19, Peter Zijlstra wrote: > On Tue, Sep 11, 2018 at 08:35:12AM +0200, Ingo Molnar wrote: >>> Well, explicit threading in the tool for AIO, in the simplest case, means >>> incorporating some POSIX API implementation into the tool, avoiding >>> code reuse in the first

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Peter Zijlstra
On Tue, Sep 11, 2018 at 08:35:12AM +0200, Ingo Molnar wrote: > > Well, explicit threading in the tool for AIO, in the simplest case, means > > incorporating some POSIX API implementation into the tool, avoiding > > code reuse in the first place. That tends to be error prone and costly. > > It's

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Peter Zijlstra
On Tue, Sep 11, 2018 at 08:35:12AM +0200, Ingo Molnar wrote: > > Well, explicit threading in the tool for AIO, in the simplest case, means > > incorporating some POSIX API implementation into the tool, avoiding > > code reuse in the first place. That tends to be error prone and costly. > > It's

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Alexey Budankov
Hi, On 11.09.2018 11:34, Jiri Olsa wrote: > On Tue, Sep 11, 2018 at 11:16:45AM +0300, Alexey Budankov wrote: >> >> Hi Ingo, >> >> On 11.09.2018 9:35, Ingo Molnar wrote: >>> >>> * Alexey Budankov wrote: >>> It may sound too optimistic but glibc API is expected to be backward compatible

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Alexey Budankov
Hi, On 11.09.2018 11:34, Jiri Olsa wrote: > On Tue, Sep 11, 2018 at 11:16:45AM +0300, Alexey Budankov wrote: >> >> Hi Ingo, >> >> On 11.09.2018 9:35, Ingo Molnar wrote: >>> >>> * Alexey Budankov wrote: >>> It may sound too optimistic but glibc API is expected to be backward compatible

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Jiri Olsa
On Tue, Sep 11, 2018 at 11:16:45AM +0300, Alexey Budankov wrote: > > Hi Ingo, > > On 11.09.2018 9:35, Ingo Molnar wrote: > > > > * Alexey Budankov wrote: > > > >> It may sound too optimistic but glibc API is expected to be backward > >> compatible > >> and for POSIX AIO API part too.

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Jiri Olsa
On Tue, Sep 11, 2018 at 11:16:45AM +0300, Alexey Budankov wrote: > > Hi Ingo, > > On 11.09.2018 9:35, Ingo Molnar wrote: > > > > * Alexey Budankov wrote: > > > >> It may sound too optimistic but glibc API is expected to be backward > >> compatible > >> and for POSIX AIO API part too.

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Alexey Budankov
Hi Ingo, On 11.09.2018 9:35, Ingo Molnar wrote: > > * Alexey Budankov wrote: > >> It may sound too optimistic but glibc API is expected to be backward >> compatible >> and for POSIX AIO API part too. Internal implementation also tends to evolve >> to >> better option overtime, more

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Alexey Budankov
Hi Ingo, On 11.09.2018 9:35, Ingo Molnar wrote: > > * Alexey Budankov wrote: > >> It may sound too optimistic but glibc API is expected to be backward >> compatible >> and for POSIX AIO API part too. Internal implementation also tends to evolve >> to >> better option overtime, more

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Ingo Molnar
* Alexey Budankov wrote: > It may sound too optimistic but glibc API is expected to be backward > compatible > and for POSIX AIO API part too. Internal implementation also tends to evolve > to > better option overtime, more probably basing on modern kernel capabilities > mentioned here:

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-11 Thread Ingo Molnar
* Alexey Budankov wrote: > It may sound too optimistic but glibc API is expected to be backward > compatible > and for POSIX AIO API part too. Internal implementation also tends to evolve > to > better option overtime, more probably basing on modern kernel capabilities > mentioned here:

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi, On 10.09.2018 16:58, Arnaldo Carvalho de Melo wrote: > Em Mon, Sep 10, 2018 at 02:06:43PM +0200, Ingo Molnar escreveu: >> * Alexey Budankov wrote: >>> On 10.09.2018 12:18, Ingo Molnar wrote: * Alexey Budankov wrote: > Currently in record mode the tool implements trace writing

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi, On 10.09.2018 16:58, Arnaldo Carvalho de Melo wrote: > Em Mon, Sep 10, 2018 at 02:06:43PM +0200, Ingo Molnar escreveu: >> * Alexey Budankov wrote: >>> On 10.09.2018 12:18, Ingo Molnar wrote: * Alexey Budankov wrote: > Currently in record mode the tool implements trace writing

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi Ingo, On 10.09.2018 15:06, Ingo Molnar wrote: > > * Alexey Budankov wrote: > >> Hi Ingo, >> >> On 10.09.2018 12:18, Ingo Molnar wrote: >>> >>> * Alexey Budankov wrote: >>> Currently in record mode the tool implements trace writing serially. The algorithm loops over mapped

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi Ingo, On 10.09.2018 15:06, Ingo Molnar wrote: > > * Alexey Budankov wrote: > >> Hi Ingo, >> >> On 10.09.2018 12:18, Ingo Molnar wrote: >>> >>> * Alexey Budankov wrote: >>> Currently in record mode the tool implements trace writing serially. The algorithm loops over mapped

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Arnaldo Carvalho de Melo
Em Mon, Sep 10, 2018 at 02:06:43PM +0200, Ingo Molnar escreveu: > * Alexey Budankov wrote: > > On 10.09.2018 12:18, Ingo Molnar wrote: > > > * Alexey Budankov wrote: > > >> Currently in record mode the tool implements trace writing serially. > > >> The algorithm loops over mapped per-cpu data

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Arnaldo Carvalho de Melo
Em Mon, Sep 10, 2018 at 02:06:43PM +0200, Ingo Molnar escreveu: > * Alexey Budankov wrote: > > On 10.09.2018 12:18, Ingo Molnar wrote: > > > * Alexey Budankov wrote: > > >> Currently in record mode the tool implements trace writing serially. > > >> The algorithm loops over mapped per-cpu data

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Alexey Budankov wrote: > Hi Ingo, > > On 10.09.2018 12:18, Ingo Molnar wrote: > > > > * Alexey Budankov wrote: > > > >> > >> Currently in record mode the tool implements trace writing serially. > >> The algorithm loops over mapped per-cpu data buffers and stores > >> ready data chunks

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Alexey Budankov wrote: > Hi Ingo, > > On 10.09.2018 12:18, Ingo Molnar wrote: > > > > * Alexey Budankov wrote: > > > >> > >> Currently in record mode the tool implements trace writing serially. > >> The algorithm loops over mapped per-cpu data buffers and stores > >> ready data chunks

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi, On 10.09.2018 13:23, Jiri Olsa wrote: > On Mon, Sep 10, 2018 at 12:13:25PM +0200, Ingo Molnar wrote: >> >> * Jiri Olsa wrote: >> >>> On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: * Jiri Olsa wrote: >> Per-CPU threading the record session would have so many

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi, On 10.09.2018 13:23, Jiri Olsa wrote: > On Mon, Sep 10, 2018 at 12:13:25PM +0200, Ingo Molnar wrote: >> >> * Jiri Olsa wrote: >> >>> On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: * Jiri Olsa wrote: >> Per-CPU threading the record session would have so many

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi Ingo, On 10.09.2018 12:18, Ingo Molnar wrote: > > * Alexey Budankov wrote: > >> >> Currently in record mode the tool implements trace writing serially. >> The algorithm loops over mapped per-cpu data buffers and stores >> ready data chunks into a trace file using write() system call. >>

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Alexey Budankov
Hi Ingo, On 10.09.2018 12:18, Ingo Molnar wrote: > > * Alexey Budankov wrote: > >> >> Currently in record mode the tool implements trace writing serially. >> The algorithm loops over mapped per-cpu data buffers and stores >> ready data chunks into a trace file using write() system call. >>

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Jiri Olsa
On Mon, Sep 10, 2018 at 12:13:25PM +0200, Ingo Molnar wrote: > > * Jiri Olsa wrote: > > > On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: > > > > > > * Jiri Olsa wrote: > > > > > > > > Per-CPU threading the record session would have so many other > > > > > advantages as well

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Jiri Olsa
On Mon, Sep 10, 2018 at 12:13:25PM +0200, Ingo Molnar wrote: > > * Jiri Olsa wrote: > > > On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: > > > > > > * Jiri Olsa wrote: > > > > > > > > Per-CPU threading the record session would have so many other > > > > > advantages as well

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Jiri Olsa wrote: > On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: > > > > * Jiri Olsa wrote: > > > > > > Per-CPU threading the record session would have so many other > > > > advantages as well (scalability, > > > > etc.). > > > > > > > > Jiri did per-CPU recording

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Jiri Olsa wrote: > On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: > > > > * Jiri Olsa wrote: > > > > > > Per-CPU threading the record session would have so many other > > > > advantages as well (scalability, > > > > etc.). > > > > > > > > Jiri did per-CPU recording

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Jiri Olsa
On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: > > * Jiri Olsa wrote: > > > > Per-CPU threading the record session would have so many other advantages > > > as well (scalability, > > > etc.). > > > > > > Jiri did per-CPU recording patches a couple of months ago, not sure how >

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Jiri Olsa
On Mon, Sep 10, 2018 at 12:03:03PM +0200, Ingo Molnar wrote: > > * Jiri Olsa wrote: > > > > Per-CPU threading the record session would have so many other advantages > > > as well (scalability, > > > etc.). > > > > > > Jiri did per-CPU recording patches a couple of months ago, not sure how >

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Jiri Olsa wrote: > > Per-CPU threading the record session would have so many other advantages as > > well (scalability, > > etc.). > > > > Jiri did per-CPU recording patches a couple of months ago, not sure how > > usable they are at the > > moment? > > it's still usable, I can rebase

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Jiri Olsa wrote: > > Per-CPU threading the record session would have so many other advantages as > > well (scalability, > > etc.). > > > > Jiri did per-CPU recording patches a couple of months ago, not sure how > > usable they are at the > > moment? > > it's still usable, I can rebase

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Jiri Olsa
On Mon, Sep 10, 2018 at 11:18:41AM +0200, Ingo Molnar wrote: > > * Alexey Budankov wrote: > > > > > Currently in record mode the tool implements trace writing serially. > > The algorithm loops over mapped per-cpu data buffers and stores > > ready data chunks into a trace file using write()

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Jiri Olsa
On Mon, Sep 10, 2018 at 11:18:41AM +0200, Ingo Molnar wrote: > > * Alexey Budankov wrote: > > > > > Currently in record mode the tool implements trace writing serially. > > The algorithm loops over mapped per-cpu data buffers and stores > > ready data chunks into a trace file using write()

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Alexey Budankov wrote: > > Currently in record mode the tool implements trace writing serially. > The algorithm loops over mapped per-cpu data buffers and stores > ready data chunks into a trace file using write() system call. > > At some circumstances the kernel may lack free space in a

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-10 Thread Ingo Molnar
* Alexey Budankov wrote: > > Currently in record mode the tool implements trace writing serially. > The algorithm loops over mapped per-cpu data buffers and stores > ready data chunks into a trace file using write() system call. > > At some circumstances the kernel may lack free space in a

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-07 Thread Alexey Budankov
On 07.09.2018 10:07, Alexey Budankov wrote: > > Currently in record mode the tool implements trace writing serially. > The algorithm loops over mapped per-cpu data buffers and stores > ready data chunks into a trace file using write() system call. > > At some circumstances the kernel may

Re: [PATCH v8 0/3]: perf: reduce data loss when profiling highly parallel CPU bound workloads

2018-09-07 Thread Alexey Budankov
On 07.09.2018 10:07, Alexey Budankov wrote: > > Currently in record mode the tool implements trace writing serially. > The algorithm loops over mapped per-cpu data buffers and stores > ready data chunks into a trace file using write() system call. > > At some circumstances the kernel may