Re: [GIT PULL 0/2] perf/urgent fixes for 4.12
* Arnaldo Carvalho de Melo wrote:

> Hi Ingo,
>
>         Please consider pulling,
>
> - Arnaldo
>
> Test results at the end of this message, as usual.
>
> The following changes since commit 47c1ded7fef108c730b803cd386241beffcdd15c:
>
>   Merge tag 'perf-urgent-for-mingo-4.12-20170608' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/urgent (2017-06-09 00:41:33 +0200)
>
> are available in the git repository at:
>
>   git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-urgent-for-mingo-4.12-20170613
>
> for you to fetch changes up to 9e0c6fd15fcaea39784d1fb3e9fc573f1cf0ae60:
>
>   perf tools: Fix build with ARCH=x86_64 (2017-06-13 16:20:37 -0300)
>
> perf/urgent fixes:
>
> - Fix probing of precise_ip level for default cycles event, that
>   got broken recently on x86_64 when its arch code started
>   considering invalid requesting precise samples when not sampling
>   (i.e. when attr.sample_period == 0).
>
>   This also fixes another problem in s/390 where the precision
>   probing with sample_period == 0 returned precise_ip > 0, that
>   then, when setting up the real cycles event (not probing) would
>   return EOPNOTSUPP for precise_ip > 0 (as determined previously
>   by probing) and sample_period > 0.
>
>   These problems resulted in attr.precise_ip not being set to the
>   highest precision available on x86_64 when no event was specified,
>   i.e. the canonical:
>
>     perf record ./workload
>
>   would end up using attr.precise_ip = 0. As a workaround this would
>   need to be done:
>
>     perf record -e cycles:P ./workload
>
>   And on s/390 it would plain not work, requiring using:
>
>     perf record -e cycles ./workload
>
>   as a workaround. (Arnaldo Carvalho de Melo)
>
> - Fix perf build with ARCH=x86_64, when ARCH should be transformed
>   into ARCH=x86, just like with the main kernel Makefile and
>   tools/objtool's, i.e. use SRCARCH. (Jiada Wang)
>
> Signed-off-by: Arnaldo Carvalho de Melo
>
> Arnaldo Carvalho de Melo (1):
>       perf evsel: Fix probing of precise_ip level for default cycles event
>
> Jiada Wang (1):
>       perf tools: Fix build with ARCH=x86_64
>
>  tools/perf/Makefile.config   | 38 +++---
>  tools/perf/Makefile.perf     |  2 +-
>  tools/perf/arch/Build        |  2 +-
>  tools/perf/pmu-events/Build  |  4 ++--
>  tools/perf/tests/Build       |  2 +-
>  tools/perf/tests/task-exit.c |  2 +-
>  tools/perf/util/evsel.c      |  5 +
>  tools/perf/util/header.c     |  2 +-
>  8 files changed, 31 insertions(+), 26 deletions(-)

Pulled, thanks a lot Arnaldo!

        Ingo
[GIT PULL 0/2] perf/urgent fixes for 4.12
Hi Ingo,

        Please consider pulling,

- Arnaldo

Test results at the end of this message, as usual.

The following changes since commit 47c1ded7fef108c730b803cd386241beffcdd15c:

  Merge tag 'perf-urgent-for-mingo-4.12-20170608' of git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux into perf/urgent (2017-06-09 00:41:33 +0200)

are available in the git repository at:

  git://git.kernel.org/pub/scm/linux/kernel/git/acme/linux.git tags/perf-urgent-for-mingo-4.12-20170613

for you to fetch changes up to 9e0c6fd15fcaea39784d1fb3e9fc573f1cf0ae60:

  perf tools: Fix build with ARCH=x86_64 (2017-06-13 16:20:37 -0300)

perf/urgent fixes:

- Fix probing of precise_ip level for default cycles event, that
  got broken recently on x86_64 when its arch code started
  considering invalid requesting precise samples when not sampling
  (i.e. when attr.sample_period == 0).

  This also fixes another problem in s/390 where the precision
  probing with sample_period == 0 returned precise_ip > 0, that
  then, when setting up the real cycles event (not probing) would
  return EOPNOTSUPP for precise_ip > 0 (as determined previously
  by probing) and sample_period > 0.

  These problems resulted in attr.precise_ip not being set to the
  highest precision available on x86_64 when no event was specified,
  i.e. the canonical:

    perf record ./workload

  would end up using attr.precise_ip = 0. As a workaround this would
  need to be done:

    perf record -e cycles:P ./workload

  And on s/390 it would plain not work, requiring using:

    perf record -e cycles ./workload

  as a workaround. (Arnaldo Carvalho de Melo)

- Fix perf build with ARCH=x86_64, when ARCH should be transformed
  into ARCH=x86, just like with the main kernel Makefile and
  tools/objtool's, i.e. use SRCARCH. (Jiada Wang)

Signed-off-by: Arnaldo Carvalho de Melo

Arnaldo Carvalho de Melo (1):
      perf evsel: Fix probing of precise_ip level for default cycles event

Jiada Wang (1):
      perf tools: Fix build with ARCH=x86_64

 tools/perf/Makefile.config   | 38 +++---
 tools/perf/Makefile.perf     |  2 +-
 tools/perf/arch/Build        |  2 +-
 tools/perf/pmu-events/Build  |  4 ++--
 tools/perf/tests/Build       |  2 +-
 tools/perf/tests/task-exit.c |  2 +-
 tools/perf/util/evsel.c      |  5 +
 tools/perf/util/header.c     |  2 +-
 8 files changed, 31 insertions(+), 26 deletions(-)

Test results:

The first ones are container (docker) based builds of tools/perf with and
without libelf support, objtool where it is supported and samples/bpf/,
ditto. Where clang is available, it is also used to build perf
with/without libelf.

Several are cross builds, the ones with -x-ARCH, and the android one, and
those may not have all the features built, due to lack of multi-arch devel
packages, available and being used so far on just a few, like
debian:experimental-x-{arm64,mipsel}.

The 'perf test' one will perform a variety of tests exercising
tools/perf/util/, tools/lib/{bpf,traceevent,etc}, as well as run perf
commands with a variety of command line event specifications to then
intercept the sys_perf_event_open syscall to check that the perf_event_attr
fields are set up as expected, among a variety of other unit tests.

Then there are the 'make -C tools/perf build-test' ones, which build
tools/perf/ with a variety of feature sets, exercising the build with an
incomplete set of features as well as with a complete one. It is planned to
have it run on each of the containers mentioned above, using some container
orchestration infrastructure. Get in contact if you are interested in
helping to put this in place.
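To make the probing problem from the first fix more concrete, here is a minimal Python sketch; it is not the tools/perf code. The two `*_like_open` functions are hypothetical stand-ins for how the kernel answers a perf_event_open() request on each architecture, modeled only on the behaviour described in this message. The EINVAL return in the x86_64 model and its hardware maximum of 2 are assumptions made for illustration.

```python
import errno

# perf_event_attr.precise_ip is a 2-bit field, so valid levels are 0..3.
MAX_PRECISE_IP = 3


def x86_like_open(precise_ip, sample_period, hw_max=2):
    """Hypothetical model of x86_64: precise requests need sampling."""
    if precise_ip > 0 and sample_period == 0:
        return -errno.EINVAL  # assumed errno for "considered invalid"
    if precise_ip > hw_max:  # hw_max is an illustrative value
        return -errno.EOPNOTSUPP
    return 0


def s390_like_open(precise_ip, sample_period):
    """Hypothetical model of s/390: precise sampling is refused."""
    if precise_ip > 0 and sample_period > 0:
        return -errno.EOPNOTSUPP
    return 0


def probe_max_precise_ip(open_fn, sample_period):
    """Walk down from the highest level until one request is accepted."""
    for level in range(MAX_PRECISE_IP, 0, -1):
        if open_fn(level, sample_period) == 0:
            return level
    return 0


if __name__ == "__main__":
    for name, fn in (("x86_64", x86_like_open), ("s390", s390_like_open)):
        for period in (0, 1):
            level = probe_max_precise_ip(fn, period)
            print(f"{name}: probe with sample_period={period} -> {level}")
```

In this model, probing with sample_period == 0 finds no precise level on the x86_64 stand-in and finds one on the s/390 stand-in that is refused once a real sampling event is set up. Probing with a nonzero period gives, in both models, a level that is also accepted for the real event.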
  # dm
   1 alpine:3.4: Ok
   2 alpine:3.5: Ok
   3 alpine:3.6: Ok
   4 alpine:edge: Ok
   5 android-ndk:r12b-arm: Ok
   6 archlinux:latest: Ok
   7 centos:5: Ok
   8 centos:6: Ok
   9 centos:7: Ok
  10 debian:7: Ok
  11 debian:8: Ok
  12 debian:9: Ok
  13 debian:experimental: Ok
  14 debian:experimental-x-arm64: Ok
  15 debian:experimental-x-mips: Ok
  16 debian:experimental-x-mips64: Ok
  17 debian:experimental-x-mipsel: Ok
  18 fedora:20: Ok
  19 fedora:21: Ok
  20 fedora:22: Ok
  21 fedora:23: Ok
  22 fedora:24: Ok
  23 fedora:24-x-ARC-uClibc: Ok
  24 fedora:25: Ok
  25 fedora:rawhide: Ok
  26 mageia:5: Ok
  27 opensuse:13.2: Ok
  28 opensuse:42.1: Ok
  29 opensuse:tumbleweed: Ok
  30 ubuntu:12.04.5: Ok
  31 ubuntu:14.04.4: Ok
  32 ubuntu:14.04.4-x-linaro-arm64: Ok
  33 ubuntu:15.10: Ok
  34 ubuntu:16.04: Ok
  35 ubuntu:16.04-x-arm: Ok
  36 ubuntu:16.04-x-arm64: Ok
  37 ubuntu:16.04-x-powerpc: Ok
  38 ubuntu:16.04-x-powerpc64: Ok
  39 ubuntu:16.04-x-powerpc64el: Ok
  40 ubuntu:16.04-x-s390: Ok
  41 ubuntu:16.10: Ok
  42 ubuntu:17.04: Ok
  #

  # uname -a
  Linux jouet 4.12.0-rc4+ #1 SMP Fri Jun 9 12:59:23 -03 2017 x86_64 x86_64 x86_64 GNU/Linux
  # perf test
   1: vmlinux symtab
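For the second fix, the ARCH to SRCARCH transformation can be pictured with a small Python sketch. This is an illustration only, not the Makefile logic from the patch: `SRCARCH_ALIASES`, `srcarch()` and `arch_source_dir()` are hypothetical names, and only the x86_64 to x86 mapping is taken from this message. The i386 entry is an assumption based on how the top-level kernel Makefile treats that value.

```python
# Hypothetical alias table: user-facing ARCH values that share one
# source directory. Only "x86_64" -> "x86" is stated in the message.
SRCARCH_ALIASES = {
    "x86_64": "x86",
    "i386": "x86",  # assumption: mirrors the top-level kernel Makefile
}


def srcarch(arch):
    """Return the source-tree architecture name for a given ARCH value."""
    return SRCARCH_ALIASES.get(arch, arch)


def arch_source_dir(arch):
    """Build the per-architecture directory under tools/perf/arch/."""
    return f"tools/perf/arch/{srcarch(arch)}"


if __name__ == "__main__":
    for arch in ("x86_64", "i386", "arm64"):
        print(f"ARCH={arch} -> SRCARCH={srcarch(arch)} ({arch_source_dir(arch)})")
```

The point of the sketch is that paths are derived from the normalized name, so a build started with ARCH=x86_64 looks in the same arch directory as one started with ARCH=x86.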