[rcu] 7a7becb4d0: BUG:sleeping_function_called_from_invalid_context_at_include/linux/sched/mm.h
by kernel test robot
Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: 7a7becb4d01e99471523ac38adf3ed64f8be092e ("rcu-tasks: Create per-CPU callback lists")
https://git.kernel.org/cgit/linux/kernel/git/paulmck/linux-rcu.git dev.2021.11.01a
in testcase: boot
on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
+-------------------------------------------------------------------------------+------------+------------+
| | f62a1317c8 | 7a7becb4d0 |
+-------------------------------------------------------------------------------+------------+------------+
| boot_successes | 19 | 0 |
| boot_failures | 0 | 19 |
| BUG:sleeping_function_called_from_invalid_context_at_include/linux/sched/mm.h | 0 | 19 |
+-------------------------------------------------------------------------------+------------+------------+
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 0.469207][ T1] BUG: sleeping function called from invalid context at include/linux/sched/mm.h:201
[ 0.469207][ T1] in_atomic(): 0, irqs_disabled(): 1, non_block: 0, pid: 1, name: swapper/0
[ 0.469207][ T1] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.15.0-rc1-00145-g7a7becb4d01e #1
[ 0.469207][ T1] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.12.0-1 04/01/2014
[ 0.469207][ T1] Call Trace:
[ 0.469207][ T1] dump_stack_lvl (lib/dump_stack.c:107)
[ 0.469207][ T1] ___might_sleep.cold (kernel/sched/core.c:9539 kernel/sched/core.c:9496)
[ 0.469207][ T1] kmem_cache_alloc_trace (include/linux/sched/mm.h:201 mm/slab.h:492 mm/slub.c:3120 mm/slub.c:3214 mm/slub.c:3231)
[ 0.469207][ T1] ? rcu_tasks_wait_gp (kernel/rcu/tasks.h:228)
[ 0.469207][ T1] __kthread_create_on_node (kernel/kthread.c:367)
[ 0.469207][ T1] kthread_create_on_node (kernel/kthread.c:457)
[ 0.469207][ T1] ? cpumask_next (lib/cpumask.c:23)
[ 0.469207][ T1] ? cblist_init_generic (kernel/rcu/tasks.h:172 (discriminator 1))
[ 0.469207][ T1] rcu_spawn_tasks_kthread_generic (kernel/rcu/tasks.h:295)
[ 0.469207][ T1] rcu_init_tasks_generic (kernel/rcu/tasks.h:1317 kernel/rcu/tasks.h:1452)
[ 0.469207][ T1] kernel_init_freeable (init/main.c:1418 init/main.c:1603)
[ 0.469207][ T1] ? rest_init (init/main.c:1497)
[ 0.469207][ T1] kernel_init (init/main.c:1507)
[ 0.469207][ T1] ret_from_fork (arch/x86/entry/entry_64.S:301)
[ 0.469276][ T1] cblist_init_generic initializing CPU 0 rcu_tasks_percpu structure for RCU Tasks Trace
[ 0.470207][ T1] cblist_init_generic initializing CPU 1 rcu_tasks_percpu structure for RCU Tasks Trace
[ 0.470273][ T1] Performance Events: unsupported p6 CPU model 42 no PMU driver, software events only.
[ 0.471321][ T1] rcu: Hierarchical SRCU implementation.
[ 0.472610][ T1] NMI watchdog: Perf NMI watchdog permanently disabled
[ 0.473334][ T1] smp: Bringing up secondary CPUs ...
[ 0.474401][ T1] x86: Booting SMP configuration:
[ 0.475218][ T1] .... node #0, CPUs: #1
[ 0.138457][ T0] kvm-clock: cpu 1, msr 3d914d041, secondary cpu clock
[ 0.138457][ T0] masked ExtINT on CPU#1
[ 0.138457][ T0] smpboot: CPU 1 Converting physical 0 to logical die 1
[ 0.479245][ T16] kvm-guest: stealtime: cpu 1, msr 42fd17180
[ 0.480258][ T1] smp: Brought up 1 node, 2 CPUs
[ 0.481218][ T1] smpboot: Max logical packages: 2
[ 0.482215][ T1] smpboot: Total of 2 processors activated (11705.35 BogoMIPS)
[ 0.557489][ T21] node 0 deferred pages initialised in 74ms
[ 0.559462][ T1] devtmpfs: initialized
[ 0.560291][ T1] x86/mm: Memory block size: 128MB
[ 0.563554][ T1] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 1911260446275000 ns
[ 0.564239][ T1] futex hash table entries: 512 (order: 3, 32768 bytes, linear)
[ 0.565300][ T1] pinctrl core: initialized pinctrl subsystem
[ 0.566487][ T1] NET: Registered PF_NETLINK/PF_ROUTE protocol family
[ 0.567568][ T1] audit: initializing netlink subsys (disabled)
[ 0.568288][ T26] audit: type=2000 audit(1636649402.272:1): state=initialized audit_enabled=0 res=1
[ 0.568368][ T1] thermal_sys: Registered thermal governor 'fair_share'
[ 0.569218][ T1] thermal_sys: Registered thermal governor 'bang_bang'
[ 0.570217][ T1] thermal_sys: Registered thermal governor 'step_wise'
[ 0.571217][ T1] thermal_sys: Registered thermal governor 'user_space'
[ 0.572247][ T1] cpuidle: using governor menu
[ 0.574593][ T1] ACPI: bus type PCI registered
[ 0.575215][ T1] acpiphp: ACPI Hot Plug PCI Controller Driver version: 0.5
[ 0.576371][ T1] PCI: Using configuration type 1 for base access
[ 0.580010][ T1] Kprobes globally optimized
[ 0.580284][ T1] HugeTLB registered 2.00 MiB page size, pre-allocated 0 pages
[ 0.581290][ T1] cryptd: max_cpu_qlen set to 1000
[ 0.584290][ T1] ACPI: Added _OSI(Module Device)
[ 0.585219][ T1] ACPI: Added _OSI(Processor Device)
[ 0.586215][ T1] ACPI: Added _OSI(3.0 _SCP Extensions)
[ 0.587214][ T1] ACPI: Added _OSI(Processor Aggregator Device)
[ 0.588217][ T1] ACPI: Added _OSI(Linux-Dell-Video)
[ 0.589215][ T1] ACPI: Added _OSI(Linux-Lenovo-NV-HDMI-Audio)
[ 0.590215][ T1] ACPI: Added _OSI(Linux-HPI-Hybrid-Graphics)
[ 0.592147][ T1] ACPI: 1 ACPI AML tables successfully acquired and loaded
[ 0.593481][ T1] ACPI: Interpreter enabled
[ 0.594250][ T1] ACPI: PM: (supports S0 S3 S4 S5)
[ 0.595215][ T1] ACPI: Using IOAPIC for interrupt routing
[ 0.596242][ T1] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
[ 0.597394][ T1] ACPI: Enabled 2 GPEs in block 00 to 0F
[ 0.601755][ T1] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff])
[ 0.602223][ T1] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments MSI HPX-Type3]
[ 0.603232][ T1] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge.
[ 0.604718][ T1] acpiphp: Slot [3] registered
[ 0.605248][ T1] acpiphp: Slot [4] registered
[ 0.606247][ T1] acpiphp: Slot [5] registered
[ 0.607247][ T1] acpiphp: Slot [6] registered
[ 0.608245][ T1] acpiphp: Slot [7] registered
[ 0.609250][ T1] acpiphp: Slot [8] registered
[ 0.610245][ T1] acpiphp: Slot [9] registered
[ 0.611261][ T1] acpiphp: Slot [10] registered
[ 0.612246][ T1] acpiphp: Slot [11] registered
[ 0.613244][ T1] acpiphp: Slot [12] registered
[ 0.614247][ T1] acpiphp: Slot [13] registered
[ 0.615245][ T1] acpiphp: Slot [14] registered
[ 0.616254][ T1] acpiphp: Slot [15] registered
[ 0.617245][ T1] acpiphp: Slot [16] registered
[ 0.618246][ T1] acpiphp: Slot [17] registered
[ 0.619246][ T1] acpiphp: Slot [18] registered
[ 0.620244][ T1] acpiphp: Slot [19] registered
[ 0.621251][ T1] acpiphp: Slot [20] registered
[ 0.622252][ T1] acpiphp: Slot [21] registered
[ 0.623246][ T1] acpiphp: Slot [22] registered
[ 0.624243][ T1] acpiphp: Slot [23] registered
[ 0.625248][ T1] acpiphp: Slot [24] registered
[ 0.626248][ T1] acpiphp: Slot [25] registered
[ 0.627245][ T1] acpiphp: Slot [26] registered
[ 0.628248][ T1] acpiphp: Slot [27] registered
[ 0.629248][ T1] acpiphp: Slot [28] registered
[ 0.630243][ T1] acpiphp: Slot [29] registered
[ 0.631242][ T1] acpiphp: Slot [30] registered
[ 0.632247][ T1] acpiphp: Slot [31] registered
[ 0.633234][ T1] PCI host bridge to bus 0000:00
[ 0.634217][ T1] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window]
To reproduce:
# build kernel
cd linux
cp config-5.15.0-rc1-00145-g7a7becb4d01e .config
make HOSTCC=gcc-9 CC=gcc-9 ARCH=x86_64 olddefconfig prepare modules_prepare bzImage
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> job-script # job-script is attached in this email
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
8 months, 3 weeks
[block] 900e080752: WARNING:at_block/mq-deadline.c:#dd_exit_sched
by kernel test robot
Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: 900e080752025f0016128f07c9ed4c50eba3654b ("block: move queue enter logic into blk_mq_submit_bio()")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: blktests
version: blktests-x86_64-3be7849-1_20211102
with following parameters:
disk: 1SSD
test: block-group-15
ucode: 0xe2
on test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz with 32G memory
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
please be noted we reported this as [1] when it's still on linux-next.
since it's now on mainline, we report it again FYI
[1] https://lists.01.org/hyperkitty/list/[email protected]/thread/W3L5D35B6CQL...
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 54.012480][ T1243] WARNING: CPU: 0 PID: 1243 at block/mq-deadline.c:597 dd_exit_sched (block/mq-deadline.c:597 (discriminator 3))
[ 54.021976][ T1243] Modules linked in: null_blk loop dm_multipath dm_mod ipmi_devintf btrfs ipmi_msghandler blake2b_generic xor zstd_compress raid6_pq libcrc32c sd_mod t10_pi sg intel_rapl_msr intel_rapl_common x86_pkg_temp_thermal intel_powerclamp coretemp i915 kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul crc32c_intel intel_gtt ttm mei_wdt wmi_bmof ghash_clmulni_intel rapl drm_kms_helper ahci libahci intel_cstate syscopyarea sysfillrect mei_me sysimgblt fb_sys_fops intel_pch_thermal libata mei drm joydev intel_uncore wmi intel_pmc_core video acpi_pad ip_tables [last unloaded: null_blk]
[ 54.077296][ T1243] CPU: 0 PID: 1243 Comm: rmdir Tainted: G I 5.15.0-rc6-00186-g900e08075202 #1
[ 54.087981][ T1243] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.1.1 10/07/2015
[ 54.096756][ T1243] RIP: 0010:dd_exit_sched (block/mq-deadline.c:597 (discriminator 3))
[ 54.102564][ T1243] Code: fb 1a 7f 01 00 75 d7 44 8b 0e 8b 4b 34 44 89 ee 48 c7 c7 f8 93 58 82 8b 53 30 44 8b 43 38 c6 05 db 1a 7f 01 01 e8 93 e5 66 00 <0f> 0b eb b0 0f 0b eb 85 66 66 2e 0f 1f 84 00 00 00 00 00 66 66 2e
All code
========
0: fb sti
1: 1a 7f 01 sbb 0x1(%rdi),%bh
4: 00 75 d7 add %dh,-0x29(%rbp)
7: 44 8b 0e mov (%rsi),%r9d
a: 8b 4b 34 mov 0x34(%rbx),%ecx
d: 44 89 ee mov %r13d,%esi
10: 48 c7 c7 f8 93 58 82 mov $0xffffffff825893f8,%rdi
17: 8b 53 30 mov 0x30(%rbx),%edx
1a: 44 8b 43 38 mov 0x38(%rbx),%r8d
1e: c6 05 db 1a 7f 01 01 movb $0x1,0x17f1adb(%rip) # 0x17f1b00
25: e8 93 e5 66 00 callq 0x66e5bd
2a:* 0f 0b ud2 <-- trapping instruction
2c: eb b0 jmp 0xffffffffffffffde
2e: 0f 0b ud2
30: eb 85 jmp 0xffffffffffffffb7
32: 66 66 2e 0f 1f 84 00 data16 nopw %cs:0x0(%rax,%rax,1)
39: 00 00 00 00
3d: 66 data16
3e: 66 data16
3f: 2e cs
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: eb b0 jmp 0xffffffffffffffb4
4: 0f 0b ud2
6: eb 85 jmp 0xffffffffffffff8d
8: 66 66 2e 0f 1f 84 00 data16 nopw %cs:0x0(%rax,%rax,1)
f: 00 00 00 00
13: 66 data16
14: 66 data16
15: 2e cs
[ 54.123408][ T1243] RSP: 0018:ffffc90000673cf8 EFLAGS: 00010282
[ 54.130167][ T1243] RAX: 0000000000000000 RBX: ffff88886c5ee480 RCX: 0000000000000000
[ 54.138688][ T1243] RDX: ffff88882d8239c0 RSI: ffff88882d817b50 RDI: ffff88882d817b50
[ 54.147419][ T1243] RBP: ffff88886c5ee400 R08: ffff88882d817b50 R09: ffffc90000673b18
[ 54.156016][ T1243] R10: 0000000000000001 R11: 0000000000000001 R12: ffff88886c5ee548
[ 54.164578][ T1243] R13: 0000000000000001 R14: 0000000000000000 R15: 0000000000000000
[ 54.173185][ T1243] FS: 00007f4931c77540(0000) GS:ffff88882d800000(0000) knlGS:0000000000000000
[ 54.182692][ T1243] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 54.190036][ T1243] CR2: 00007f4931920e74 CR3: 000000086ec2a001 CR4: 00000000003706f0
[ 54.198636][ T1243] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 54.207383][ T1243] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 54.216011][ T1243] Call Trace:
[ 54.219979][ T1243] blk_mq_exit_sched (block/blk-mq-sched.c:681)
[ 54.225419][ T1243] __elevator_exit (block/elevator.c:195)
[ 54.230662][ T1243] blk_release_queue (block/blk-sysfs.c:758 block/blk-sysfs.c:804)
[ 54.236184][ T1243] kobject_release (lib/kobject.c:709 lib/kobject.c:736)
[ 54.241555][ T1243] disk_release (block/genhd.c:1109 (discriminator 3))
[ 54.246577][ T1243] device_release (drivers/base/core.c:2236)
[ 54.251735][ T1243] kobject_release (lib/kobject.c:709 lib/kobject.c:736)
[ 54.257118][ T1243] null_del_dev+0x63/0x140 null_blk
[ 54.263768][ T1243] nullb_group_drop_item (drivers/block/null_blk/main.c:535) null_blk
[ 54.270538][ T1243] configfs_rmdir (fs/configfs/dir.c:1527)
[ 54.275928][ T1243] ? make_kgid (kernel/user_namespace.c:463)
[ 54.280886][ T1243] vfs_rmdir (fs/namei.c:3970 fs/namei.c:3948)
[ 54.285708][ T1243] do_rmdir (fs/namei.c:4032)
[ 54.290582][ T1243] __x64_sys_rmdir (fs/namei.c:4051 fs/namei.c:4049 fs/namei.c:4049)
[ 54.295885][ T1243] do_syscall_64 (arch/x86/entry/common.c:50 arch/x86/entry/common.c:80)
[ 54.300997][ T1243] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:113)
[ 54.307546][ T1243] RIP: 0033:0x7f4931ba1027
[ 54.312602][ T1243] Code: 73 01 c3 48 8b 0d 69 ee 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 b8 54 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 39 ee 0c 00 f7 d8 64 89 01 48
All code
========
0: 73 01 jae 0x3
2: c3 retq
3: 48 8b 0d 69 ee 0c 00 mov 0xcee69(%rip),%rcx # 0xcee73
a: f7 d8 neg %eax
c: 64 89 01 mov %eax,%fs:(%rcx)
f: 48 83 c8 ff or $0xffffffffffffffff,%rax
13: c3 retq
14: 66 2e 0f 1f 84 00 00 nopw %cs:0x0(%rax,%rax,1)
1b: 00 00 00
1e: 0f 1f 44 00 00 nopl 0x0(%rax,%rax,1)
23: b8 54 00 00 00 mov $0x54,%eax
28: 0f 05 syscall
2a:* 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax <-- trapping instruction
30: 73 01 jae 0x33
32: c3 retq
33: 48 8b 0d 39 ee 0c 00 mov 0xcee39(%rip),%rcx # 0xcee73
3a: f7 d8 neg %eax
3c: 64 89 01 mov %eax,%fs:(%rcx)
3f: 48 rex.W
Code starting with the faulting instruction
===========================================
0: 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax
6: 73 01 jae 0x9
8: c3 retq
9: 48 8b 0d 39 ee 0c 00 mov 0xcee39(%rip),%rcx # 0xcee49
10: f7 d8 neg %eax
12: 64 89 01 mov %eax,%fs:(%rcx)
15: 48 rex.W
[ 54.333584][ T1243] RSP: 002b:00007ffeed858548 EFLAGS: 00000206 ORIG_RAX: 0000000000000054
[ 54.342624][ T1243] RAX: ffffffffffffffda RBX: 00007ffeed859a24 RCX: 00007f4931ba1027
[ 54.351235][ T1243] RDX: 00007f4931c73000 RSI: 0000000000000001 RDI: 00007ffeed859a24
[ 54.359775][ T1243] RBP: 00007ffeed858678 R08: 0000000000000000 R09: 0000000000000000
[ 54.368376][ T1243] R10: fffffffffffffb94 R11: 0000000000000206 R12: 0000000000000002
[ 54.377017][ T1243] R13: 00007ffeed858670 R14: 0000000000000000 R15: 000055cb3cc8b0c7
[ 54.385624][ T1243] ---[ end trace bcff36f03b4a9e0a ]---
[ 54.422612][ T318] block/031 (do IO on null-blk with a host tag set) [failed]
[ 54.422615][ T318]
[ 54.435282][ T318] runtime ... 30.957s
[ 54.435285][ T318]
[ 54.444112][ T318] something found in dmesg:
[ 54.444114][ T318]
[ 54.454179][ T318]
[ 23.442135] run blktests block/031 at 2021-11-07 22:02:52
[ 54.454181][ T318]
[ 54.466434][ T318]
[ 23.457811] null_blk: module loaded
[ 54.466436][ T318]
[ 54.483322][ T318]
[ 26.191123] result_service: raw_upload, RESULT_MNT: /internal-lkp-server/result, RESULT_ROOT: /internal-lkp-server/result/blktests/1SSD-block-group-15-ucode=0xe2/lkp-skl-d04/debian-10.4-x86_64-20200603.cgz/x86_64-rhel-8.3-func/gcc-9/900e080752025f0016128f07c9ed4c50eba3654b/3, TMP_RESULT_ROOT: /tmp/lkp/result
[ 54.483325][ T318]
[ 54.518620][ T318]
[ 54.518622][ T318]
[ 54.529177][ T318]
[ 26.226596] run-job /lkp/jobs/scheduled/lkp-skl-d04/blktests-1SSD-block-group-15-ucode=0xe2-debian-10.4-x86_64-20200603.cgz-900e080752025f0016128f07c9ed4c50eba3654b-20211108-67459-gvgohg-2.yaml
[ 54.529179][ T318]
[ 54.553370][ T318]
[ 54.553372][ T318]
[ 54.915049][ T318]
[ 27.259606] /usr/bin/wget -q --timeout=1800 --tries=1 --local-encoding=UTF-8 http://internal-lkp-server:80/~lkp/cgi-bin/lkp-jobfile-append-var?job_fil... -O /dev/null
[ 54.915053][ T318]
[ 54.954058][ T318]
[ 54.954060][ T318]
[ 54.960929][ T318]
[ 27.296165] target ucode: 0xe2
[ 54.960931][ T318]
[ 54.970135][ T318]
[ 54.970136][ T318]
[ 54.976465][ T318] ...
[ 54.976467][ T318]
[ 54.984895][ T318] (See '/lkp/benchmarks/blktests/results/nodev/block/031.dmesg' for the entire message)
[ 54.984896][ T318]
[ 55.007006][ T318] /usr/bin/wget -q --timeout=1800 --tries=1 --local-encoding=UTF-8 http://internal-lkp-server:80/~lkp/cgi-bin/lkp-jobfile-append-var?job_fil... -O /dev/null
[ 55.007009][ T318]
[ 55.976112][ T318] kill 875 vmstat --timestamp -n 10
[ 55.976115][ T318]
[ 55.985610][ T318] kill 873 dmesg --follow --decode
[ 55.985612][ T318]
[ 55.995516][ T318] wait for background processes: 878 881 meminfo oom-killer
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
8 months, 3 weeks
[x86/asm] 0507503671: will-it-scale.per_process_ops -4.9% regression
by kernel test robot
Greeting,
FYI, we noticed a -4.9% regression of will-it-scale.per_process_ops due to commit:
commit: 0507503671f9b1c867e889cbec0f43abf904f23c ("x86/asm: Avoid adding register pressure for the init case in static_cpu_has()")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: will-it-scale
on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz with 256G memory
with following parameters:
nr_task: 50%
mode: process
test: mmap2
cpufreq_governor: performance
ucode: 0xd000280
test-description: Will It Scale takes a testcase and runs it from 1 through to n parallel copies to see if the testcase will scale. It builds both a process and threads based test in order to see any differences between the two.
test-url: https://github.com/antonblanchard/will-it-scale
please be noted, since we don't have clue why this commit could cause
performance drop, so we did further tests on other platforms or with
different parameters, and got below results.
except the 1% improvement from the first test on a 4 sockets Haswell-EX,
others all show similar regression:
+------------------+----------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops +1.0% improvement |
| test machine | 144 threads 4 sockets Intel(R) Xeon(R) CPU E7-8890 v3 @ 2.50GHz with 512G memory |
| test parameters | cpufreq_governor=performance |
| | mode=process |
| | nr_task=50% |
| | test=mmap2 |
| | ucode=0x16 |
+------------------+----------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops -3.7% regression |
| test machine | 144 threads 4 sockets Intel(R) Xeon(R) Gold 5318H CPU @ 2.50GHz with 128G memory |
| test parameters | cpufreq_governor=performance |
| | mode=process |
| | nr_task=50% |
| | test=mmap2 |
| | ucode=0x700001e |
+------------------+----------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops -5.1% regression |
| test machine | 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz with 256G memory |
| test parameters | cpufreq_governor=performance |
| | mode=process |
| | nr_task=16 |
| | test=mmap2 |
| | ucode=0xd000280 |
+------------------+----------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops -5.9% regression |
| test machine | 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory |
| test parameters | cpufreq_governor=performance |
| | mode=process |
| | nr_task=16 |
| | test=mmap1 |
| | ucode=0x5003006 |
+------------------+----------------------------------------------------------------------------------+
| testcase: change | will-it-scale: will-it-scale.per_process_ops -3.5% regression |
| test machine | 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory |
| test parameters | cpufreq_governor=performance |
| | mode=process |
| | nr_task=50% |
| | test=mmap2 |
| | ucode=0x5003006 |
+------------------+----------------------------------------------------------------------------------+
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/50%/debian-10.4-x86_64-20200603.cgz/lkp-icl-2sp2/mmap2/will-it-scale/0xd000280
commit:
f87bc8dc7a ("x86/asm: Add _ASM_RIP() macro for x86-64 (%rip) suffix")
0507503671 ("x86/asm: Avoid adding register pressure for the init case in static_cpu_has()")
f87bc8dc7a7c438c 0507503671f9b1c867e889cbec0
---------------- ---------------------------
%stddev %change %stddev
\ | \
41898923 -4.9% 39829159 will-it-scale.64.processes
654670 -4.9% 622330 will-it-scale.per_process_ops
41898923 -4.9% 39829159 will-it-scale.workload
6918 ± 54% +116.5% 14975 ± 14% softirqs.CPU20.SCHED
240.00 ± 18% +57.8% 378.67 ± 20% slabinfo.biovec-64.active_objs
240.00 ± 18% +57.8% 378.67 ± 20% slabinfo.biovec-64.num_objs
0.01 ± 28% -36.1% 0.01 ± 14% perf-sched.sch_delay.max.ms.__x64_sys_pause.do_syscall_64.entry_SYSCALL_64_after_hwframe.[unknown]
6114 ± 24% -46.1% 3296 ± 46% perf-sched.wait_and_delay.max.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_sys_poll
6114 ± 24% -46.1% 3296 ± 46% perf-sched.wait_time.max.ms.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop.0.do_sys_poll
1409 ± 30% -30.7% 977.00 ± 23% interrupts.CPU1.CAL:Function_call_interrupts
3001 ± 58% -70.3% 892.50 ± 69% interrupts.CPU1.RES:Rescheduling_interrupts
669.83 ±172% +696.9% 5338 ±102% interrupts.CPU108.NMI:Non-maskable_interrupts
669.83 ±172% +696.9% 5338 ±102% interrupts.CPU108.PMI:Performance_monitoring_interrupts
797.17 ± 6% +18.7% 946.00 ± 12% interrupts.CPU18.CAL:Function_call_interrupts
740.00 +33.3% 986.33 ± 25% interrupts.CPU20.CAL:Function_call_interrupts
530.17 ± 52% +336.3% 2313 ± 88% interrupts.CPU20.RES:Rescheduling_interrupts
793.33 ± 8% +16.2% 922.17 ± 9% interrupts.CPU54.CAL:Function_call_interrupts
3625 ± 61% -54.4% 1653 ± 23% interrupts.CPU63.RES:Rescheduling_interrupts
916.50 ± 38% +211.1% 2851 ± 40% interrupts.CPU68.RES:Rescheduling_interrupts
11834 ± 13% -63.0% 4374 ±124% interrupts.CPU7.NMI:Non-maskable_interrupts
11834 ± 13% -63.0% 4374 ±124% interrupts.CPU7.PMI:Performance_monitoring_interrupts
656.33 ±113% +817.2% 6020 ± 92% interrupts.CPU70.NMI:Non-maskable_interrupts
656.33 ±113% +817.2% 6020 ± 92% interrupts.CPU70.PMI:Performance_monitoring_interrupts
1.114e+11 -4.9% 1.06e+11 perf-stat.i.branch-instructions
0.06 ± 2% -0.0 0.06 perf-stat.i.branch-miss-rate%
68214361 -8.5% 62423727 perf-stat.i.branch-misses
2827 -2.3% 2762 perf-stat.i.context-switches
0.36 +5.2% 0.38 perf-stat.i.cpi
527757 ± 2% -7.0% 490681 perf-stat.i.dTLB-load-misses
1.162e+11 -4.9% 1.105e+11 perf-stat.i.dTLB-loads
0.00 ± 3% -0.0 0.00 ± 3% perf-stat.i.dTLB-store-miss-rate%
85596 -18.0% 70173 perf-stat.i.dTLB-store-misses
5.247e+10 -5.0% 4.985e+10 perf-stat.i.dTLB-stores
4.617e+11 -4.9% 4.39e+11 perf-stat.i.instructions
2.77 -4.9% 2.63 perf-stat.i.ipc
2187 -4.9% 2080 perf-stat.i.metric.M/sec
89058 ± 15% +74.1% 155016 ± 42% perf-stat.i.node-stores
0.06 ± 2% -0.0 0.06 perf-stat.overall.branch-miss-rate%
0.36 +5.2% 0.38 perf-stat.overall.cpi
0.00 -0.0 0.00 perf-stat.overall.dTLB-load-miss-rate%
0.00 -0.0 0.00 perf-stat.overall.dTLB-store-miss-rate%
2.77 -4.9% 2.64 perf-stat.overall.ipc
97.34 -3.4 93.95 ± 3% perf-stat.overall.node-store-miss-rate%
1.11e+11 -4.8% 1.056e+11 perf-stat.ps.branch-instructions
67990924 -8.4% 62247713 perf-stat.ps.branch-misses
2826 -2.2% 2763 perf-stat.ps.context-switches
525402 ± 2% -7.0% 488847 perf-stat.ps.dTLB-load-misses
1.158e+11 -4.9% 1.101e+11 perf-stat.ps.dTLB-loads
85273 -18.0% 69928 perf-stat.ps.dTLB-store-misses
5.227e+10 -4.9% 4.969e+10 perf-stat.ps.dTLB-stores
4.599e+11 -4.9% 4.376e+11 perf-stat.ps.instructions
89414 ± 15% +73.0% 154650 ± 41% perf-stat.ps.node-stores
1.39e+14 -4.9% 1.322e+14 perf-stat.total.instructions
40.27 -2.5 37.82 perf-profile.calltrace.cycles-pp.__mmap
36.89 -2.2 34.72 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
36.56 -2.2 34.40 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
35.68 -2.1 33.56 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
33.56 -2.1 31.50 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
30.23 -1.9 28.36 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
25.68 -1.6 24.13 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
11.94 -0.8 11.17 perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
7.44 -0.7 6.74 perf-profile.calltrace.cycles-pp.free_pgd_range.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
7.05 -0.6 6.41 perf-profile.calltrace.cycles-pp.free_p4d_range.free_pgd_range.unmap_region.__do_munmap.__vm_munmap
1.69 ± 2% -0.3 1.38 perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff
2.70 -0.3 2.40 ± 2% perf-profile.calltrace.cycles-pp.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
2.32 -0.3 2.03 ± 3% perf-profile.calltrace.cycles-pp.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.14 ± 2% -0.3 0.87 perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap
3.61 -0.2 3.37 ± 2% perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
2.77 -0.2 2.53 ± 2% perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.15 ± 5% -0.2 0.95 ± 3% perf-profile.calltrace.cycles-pp.kfree.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.43 -0.2 1.24 ± 5% perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
2.12 -0.2 1.94 perf-profile.calltrace.cycles-pp.__entry_text_start.__mmap
4.07 -0.1 3.96 perf-profile.calltrace.cycles-pp.d_path.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
0.94 ± 3% -0.1 0.84 ± 3% perf-profile.calltrace.cycles-pp.strlen.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
2.12 -0.1 2.02 perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
1.16 -0.1 1.06 perf-profile.calltrace.cycles-pp.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
2.32 -0.1 2.24 perf-profile.calltrace.cycles-pp.__entry_text_start.__munmap
1.53 -0.1 1.46 perf-profile.calltrace.cycles-pp.prepend_name.prepend_path.d_path.perf_event_mmap.mmap_region
1.26 -0.0 1.22 perf-profile.calltrace.cycles-pp.lru_add_drain.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
1.06 -0.0 1.02 perf-profile.calltrace.cycles-pp.prepend_copy.prepend_name.prepend_path.d_path.perf_event_mmap
0.58 ± 2% -0.0 0.54 ± 2% perf-profile.calltrace.cycles-pp.common_file_perm.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
0.92 -0.0 0.88 perf-profile.calltrace.cycles-pp.invalidate_bh_lrus_cpu.lru_add_drain.unmap_region.__do_munmap.__vm_munmap
0.97 -0.0 0.94 perf-profile.calltrace.cycles-pp.down_write_killable.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.84 -0.0 0.81 perf-profile.calltrace.cycles-pp.down_write_killable.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.65 -0.0 0.63 perf-profile.calltrace.cycles-pp.tlb_gather_mmu.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
1.69 +0.1 1.78 perf-profile.calltrace.cycles-pp.shmem_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.48 +0.1 1.56 perf-profile.calltrace.cycles-pp.touch_atime.shmem_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.26 +0.1 1.36 perf-profile.calltrace.cycles-pp.atime_needs_update.touch_atime.shmem_mmap.mmap_region.do_mmap
3.10 +0.2 3.32 perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
0.00 +0.5 0.53 ± 2% perf-profile.calltrace.cycles-pp._raw_spin_lock.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region
3.74 ± 2% +0.6 4.34 perf-profile.calltrace.cycles-pp.rcu_all_qs.__cond_resched.unmap_page_range.unmap_vmas.unmap_region
7.31 ± 2% +1.5 8.85 perf-profile.calltrace.cycles-pp.__cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
8.20 ± 2% +1.6 9.82 perf-profile.calltrace.cycles-pp.___might_sleep.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
59.20 +2.4 61.56 perf-profile.calltrace.cycles-pp.__munmap
55.60 +2.5 58.12 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
55.26 +2.5 57.79 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
54.50 +2.6 57.06 perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
54.15 +2.6 56.72 perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
51.90 +2.6 54.54 perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
43.18 +2.9 46.12 perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
30.28 +3.8 34.10 perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
28.97 +4.2 33.14 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.__do_munmap.__vm_munmap
40.57 -2.4 38.14 perf-profile.children.cycles-pp.__mmap
35.82 -2.1 33.70 perf-profile.children.cycles-pp.ksys_mmap_pgoff
33.73 -2.1 31.66 perf-profile.children.cycles-pp.vm_mmap_pgoff
30.36 -1.9 28.47 perf-profile.children.cycles-pp.do_mmap
26.06 -1.6 24.50 perf-profile.children.cycles-pp.mmap_region
12.09 -0.8 11.32 perf-profile.children.cycles-pp.perf_event_mmap
7.49 -0.7 6.82 perf-profile.children.cycles-pp.free_pgd_range
7.10 -0.6 6.46 perf-profile.children.cycles-pp.free_p4d_range
1.82 ± 2% -0.3 1.50 perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
2.76 -0.3 2.46 ± 2% perf-profile.children.cycles-pp.get_unmapped_area
2.36 -0.3 2.07 ± 3% perf-profile.children.cycles-pp.shmem_get_unmapped_area
1.16 ± 2% -0.3 0.90 perf-profile.children.cycles-pp.vm_unmapped_area
3.77 -0.2 3.52 ± 2% perf-profile.children.cycles-pp.vma_link
2.88 -0.2 2.64 ± 2% perf-profile.children.cycles-pp.perf_iterate_sb
1.17 ± 6% -0.2 0.98 ± 3% perf-profile.children.cycles-pp.kfree
1.46 -0.2 1.26 ± 5% perf-profile.children.cycles-pp.__vma_link_rb
2.51 -0.1 2.36 perf-profile.children.cycles-pp.__entry_text_start
2.08 ± 2% -0.1 1.93 ± 2% perf-profile.children.cycles-pp.syscall_return_via_sysret
4.18 -0.1 4.06 perf-profile.children.cycles-pp.d_path
0.97 ± 3% -0.1 0.86 ± 3% perf-profile.children.cycles-pp.strlen
2.21 -0.1 2.10 perf-profile.children.cycles-pp.free_pgtables
1.23 -0.1 1.13 perf-profile.children.cycles-pp.security_mmap_file
1.78 -0.1 1.70 perf-profile.children.cycles-pp.prepend_copy
1.95 -0.1 1.88 perf-profile.children.cycles-pp.down_write_killable
1.61 -0.1 1.55 perf-profile.children.cycles-pp.prepend_name
1.39 -0.1 1.33 perf-profile.children.cycles-pp.__might_sleep
1.53 -0.1 1.47 perf-profile.children.cycles-pp.up_write
1.28 -0.0 1.24 perf-profile.children.cycles-pp.unlink_file_vma
1.31 -0.0 1.26 perf-profile.children.cycles-pp.lru_add_drain
0.94 -0.0 0.90 perf-profile.children.cycles-pp.invalidate_bh_lrus_cpu
0.63 -0.0 0.60 ± 2% perf-profile.children.cycles-pp.common_file_perm
0.43 ± 2% -0.0 0.40 ± 2% perf-profile.children.cycles-pp.current_time
0.67 ± 2% -0.0 0.63 ± 2% perf-profile.children.cycles-pp.mod_objcg_state
0.12 ± 22% -0.0 0.08 ± 16% perf-profile.children.cycles-pp.mem_cgroup_from_task
0.89 -0.0 0.86 perf-profile.children.cycles-pp.perf_event_mmap_output
0.19 ± 6% -0.0 0.16 ± 8% [email protected]
0.22 ± 2% -0.0 0.19 ± 2% perf-profile.children.cycles-pp.unlink_anon_vmas
0.67 -0.0 0.65 perf-profile.children.cycles-pp.tlb_gather_mmu
0.39 ± 2% -0.0 0.37 ± 2% perf-profile.children.cycles-pp.syscall_enter_from_user_mode
0.42 -0.0 0.40 perf-profile.children.cycles-pp.copy_from_kernel_nofault_allowed
0.42 -0.0 0.40 perf-profile.children.cycles-pp.security_vm_enough_memory_mm
0.40 -0.0 0.38 ± 2% perf-profile.children.cycles-pp.userfaultfd_unmap_complete
0.31 -0.0 0.29 ± 3% perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
0.25 ± 2% -0.0 0.23 ± 3% perf-profile.children.cycles-pp.cap_vm_enough_memory
0.20 ± 3% -0.0 0.18 ± 3% perf-profile.children.cycles-pp.vma_interval_tree_insert
0.34 ± 2% -0.0 0.32 perf-profile.children.cycles-pp.aa_file_perm
0.18 ± 2% -0.0 0.17 ± 2% perf-profile.children.cycles-pp.timestamp_truncate
0.12 ± 4% -0.0 0.10 ± 3% perf-profile.children.cycles-pp.__vma_link_file
0.15 -0.0 0.14 ± 2% perf-profile.children.cycles-pp.cap_capable
0.12 ± 4% -0.0 0.10 ± 4% perf-profile.children.cycles-pp.get_mmap_base
0.20 ± 2% -0.0 0.18 ± 2% perf-profile.children.cycles-pp.syscall_exit_to_user_mode_prepare
0.10 ± 3% +0.0 0.13 ± 2% perf-profile.children.cycles-pp.tlb_table_flush
0.29 +0.0 0.32 perf-profile.children.cycles-pp.tlb_flush_mmu
0.53 +0.0 0.57 perf-profile.children.cycles-pp._raw_spin_lock
1.73 +0.1 1.82 perf-profile.children.cycles-pp.shmem_mmap
1.54 +0.1 1.62 perf-profile.children.cycles-pp.touch_atime
1.40 +0.1 1.49 perf-profile.children.cycles-pp.atime_needs_update
0.58 +0.1 0.69 perf-profile.children.cycles-pp.map_id_range_down
0.36 ± 2% +0.1 0.48 ± 2% perf-profile.children.cycles-pp.make_kuid
3.19 +0.2 3.42 perf-profile.children.cycles-pp.zap_pte_range
92.69 +0.3 93.02 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
92.20 +0.4 92.56 perf-profile.children.cycles-pp.do_syscall_64
4.31 +0.5 4.85 perf-profile.children.cycles-pp.rcu_all_qs
8.66 ± 2% +1.2 9.89 perf-profile.children.cycles-pp.__cond_resched
9.05 +2.2 11.24 perf-profile.children.cycles-pp.___might_sleep
59.26 +2.3 61.59 perf-profile.children.cycles-pp.__munmap
54.60 +2.6 57.16 perf-profile.children.cycles-pp.__x64_sys_munmap
54.32 +2.6 56.88 perf-profile.children.cycles-pp.__vm_munmap
52.18 +2.6 54.81 perf-profile.children.cycles-pp.__do_munmap
43.35 +2.9 46.29 perf-profile.children.cycles-pp.unmap_region
30.38 +3.8 34.20 perf-profile.children.cycles-pp.unmap_vmas
29.40 +3.8 33.24 perf-profile.children.cycles-pp.unmap_page_range
7.04 -0.7 6.38 perf-profile.self.cycles-pp.free_p4d_range
1.14 ± 2% -0.3 0.88 perf-profile.self.cycles-pp.vm_unmapped_area
2.03 -0.2 1.82 ± 2% perf-profile.self.cycles-pp.perf_iterate_sb
1.14 ± 6% -0.2 0.94 ± 3% perf-profile.self.cycles-pp.kfree
1.42 -0.2 1.24 ± 5% perf-profile.self.cycles-pp.__vma_link_rb
2.08 ± 2% -0.1 1.92 ± 2% perf-profile.self.cycles-pp.syscall_return_via_sysret
1.48 ± 2% -0.1 1.34 ± 2% perf-profile.self.cycles-pp.kmem_cache_alloc
0.94 ± 3% -0.1 0.83 ± 3% perf-profile.self.cycles-pp.strlen
0.99 -0.1 0.92 ± 2% perf-profile.self.cycles-pp.__munmap
1.96 -0.1 1.90 perf-profile.self.cycles-pp.__do_munmap
1.44 -0.1 1.38 perf-profile.self.cycles-pp.up_write
0.43 -0.1 0.37 perf-profile.self.cycles-pp.security_mmap_file
1.14 -0.0 1.09 perf-profile.self.cycles-pp.__might_sleep
0.64 ± 2% -0.0 0.60 perf-profile.self.cycles-pp.tlb_finish_mmu
0.58 -0.0 0.54 ± 2% perf-profile.self.cycles-pp.__entry_text_start
0.46 -0.0 0.43 perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
0.91 -0.0 0.88 perf-profile.self.cycles-pp.invalidate_bh_lrus_cpu
0.68 -0.0 0.65 perf-profile.self.cycles-pp.vm_area_alloc
0.88 -0.0 0.85 perf-profile.self.cycles-pp.fput_many
0.58 -0.0 0.55 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
0.63 -0.0 0.60 perf-profile.self.cycles-pp.tlb_gather_mmu
0.52 -0.0 0.49 perf-profile.self.cycles-pp.vm_mmap_pgoff
0.27 ± 3% -0.0 0.24 ± 4% perf-profile.self.cycles-pp.free_pgtables
0.48 -0.0 0.46 perf-profile.self.cycles-pp.do_syscall_64
0.18 ± 3% -0.0 0.16 ± 3% perf-profile.self.cycles-pp.unlink_anon_vmas
0.18 ± 3% -0.0 0.16 ± 4% perf-profile.self.cycles-pp.vma_interval_tree_insert
0.32 -0.0 0.30 perf-profile.self.cycles-pp.syscall_enter_from_user_mode
0.31 -0.0 0.29 ± 3% perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
0.28 -0.0 0.26 ± 2% perf-profile.self.cycles-pp.aa_file_perm
0.10 ± 5% -0.0 0.08 ± 4% perf-profile.self.cycles-pp.get_mmap_base
0.09 ± 4% -0.0 0.08 ± 4% perf-profile.self.cycles-pp.__vma_link_file
0.31 ± 4% +0.0 0.34 ± 2% perf-profile.self.cycles-pp.atime_needs_update
0.08 ± 6% +0.0 0.11 ± 3% perf-profile.self.cycles-pp.tlb_table_flush
0.50 +0.0 0.54 perf-profile.self.cycles-pp._raw_spin_lock
1.08 +0.0 1.12 ± 2% perf-profile.self.cycles-pp.find_vma
0.51 +0.1 0.62 perf-profile.self.cycles-pp.map_id_range_down
2.33 +0.2 2.53 perf-profile.self.cycles-pp.zap_pte_range
2.95 +0.3 3.23 perf-profile.self.cycles-pp.rcu_all_qs
13.58 +0.5 14.07 perf-profile.self.cycles-pp.unmap_page_range
4.46 ± 2% +0.5 5.00 ± 2% perf-profile.self.cycles-pp.__cond_resched
7.30 ± 2% +2.2 9.48 perf-profile.self.cycles-pp.___might_sleep
will-it-scale.64.processes
4.25e+07 +----------------------------------------------------------------+
| .+.....+ |
4.2e+07 |.....+..... ...+.....+... .+..... ... |
4.15e+07 |-+ +.....+.. . ... +. |
| +. |
4.1e+07 |-+ |
| |
4.05e+07 |-+ |
| |
4e+07 |-+ O O O O O |
3.95e+07 |-+ O O |
| O O |
3.9e+07 |-+ |
| |
3.85e+07 +----------------------------------------------------------------+
will-it-scale.per_process_ops
670000 +------------------------------------------------------------------+
| |
660000 |-+ .+.....+ |
|.....+..... ...+.....+..... .+..... ... |
650000 |-+ +.....+.. . ... +. |
| +. |
640000 |-+ |
| |
630000 |-+ |
| O O O O O |
620000 |-+ O O |
| O O |
610000 |-+ |
| |
600000 +------------------------------------------------------------------+
will-it-scale.workload
4.25e+07 +----------------------------------------------------------------+
| .+.....+ |
4.2e+07 |.....+..... ...+.....+... .+..... ... |
4.15e+07 |-+ +.....+.. . ... +. |
| +. |
4.1e+07 |-+ |
| |
4.05e+07 |-+ |
| |
4e+07 |-+ O O O O O |
3.95e+07 |-+ O O |
| O O |
3.9e+07 |-+ |
| |
3.85e+07 +----------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
***************************************************************************************************
lkp-hsw-4ex1: 144 threads 4 sockets Intel(R) Xeon(R) CPU E7-8890 v3 @ 2.50GHz with 512G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/50%/debian-10.4-x86_64-20200603.cgz/lkp-hsw-4ex1/mmap2/will-it-scale/0x16
commit:
f87bc8dc7a ("x86/asm: Add _ASM_RIP() macro for x86-64 (%rip) suffix")
0507503671 ("x86/asm: Avoid adding register pressure for the init case in static_cpu_has()")
f87bc8dc7a7c438c 0507503671f9b1c867e889cbec0
---------------- ---------------------------
%stddev %change %stddev
\ | \
25435342 +1.0% 25683534 will-it-scale.72.processes
48.98 +0.0% 48.99 will-it-scale.72.processes_idle
353268 +1.0% 356715 will-it-scale.per_process_ops
301.26 +0.0% 301.27 will-it-scale.time.elapsed_time
301.26 +0.0% 301.27 will-it-scale.time.elapsed_time.max
2.33 ± 91% +60.7% 3.75 ± 39% will-it-scale.time.involuntary_context_switches
0.50 ±152% +200.0% 1.50 ±110% will-it-scale.time.major_page_faults
9592 +0.6% 9648 will-it-scale.time.maximum_resident_set_size
6400 +0.1% 6408 will-it-scale.time.minor_page_faults
4096 +0.0% 4096 will-it-scale.time.page_size
0.03 ± 11% -13.2% 0.03 ± 30% will-it-scale.time.system_time
0.04 +12.5% 0.04 ± 11% will-it-scale.time.user_time
78.83 ± 2% +4.0% 82.00 ± 2% will-it-scale.time.voluntary_context_switches
25435342 +1.0% 25683534 will-it-scale.workload
2.138e+10 -0.1% 2.137e+10 cpuidle..time
44006849 -0.0% 43986831 cpuidle..usage
354.14 -0.5% 352.27 uptime.boot
28008 -0.9% 27752 uptime.idle
50.44 -3.5% 48.69 ± 4% boot-time.boot
25.66 -5.6% 24.24 ± 6% boot-time.dhcp
227.17 ± 5% +8.8% 247.25 ± 4% perf-sched.wait_and_delay.count.__traceiter_sched_switch.__traceiter_sched_switch.exit_to_user_mode_prepare.syscall_exit_to_user_mode.do_syscall_64
142.17 ± 10% -11.9% 125.25 ± 3% perf-sched.wait_and_delay.count.__traceiter_sched_switch.__traceiter_sched_switch.schedule_hrtimeout_range_clock.poll_schedule_timeout.constprop
391.00 ± 9% -21.7% 306.00 ± 11% slabinfo.Acpi-State.active_objs
391.00 ± 9% -21.7% 306.00 ± 11% slabinfo.Acpi-State.num_objs
70879937 ± 3% -9.7% 64039460 ± 3% perf-stat.i.iTLB-load-misses
4261 +12.1% 4775 ± 2% perf-stat.i.instructions-per-iTLB-miss
4231 +11.4% 4715 ± 3% perf-stat.overall.instructions-per-iTLB-miss
70534879 ± 3% -9.6% 63745525 ± 3% perf-stat.ps.iTLB-load-misses
3450 ± 13% +20.0% 4139 ± 3% syscalls.sys_read.med
1.537e+12 ± 19% -4.7e+11 1.067e+12 ± 9% syscalls.sys_read.noise.100%
1.537e+12 ± 19% -4.7e+11 1.067e+12 ± 9% syscalls.sys_read.noise.2%
1.537e+12 ± 19% -4.7e+11 1.067e+12 ± 9% syscalls.sys_read.noise.25%
1.537e+12 ± 19% -4.7e+11 1.067e+12 ± 9% syscalls.sys_read.noise.5%
1.537e+12 ± 19% -4.7e+11 1.067e+12 ± 9% syscalls.sys_read.noise.50%
1.537e+12 ± 19% -4.7e+11 1.067e+12 ± 9% syscalls.sys_read.noise.75%
9946 ± 18% +28.5% 12781 ± 14% softirqs.CPU10.RCU
14409 ± 52% +93.6% 27904 ± 18% softirqs.CPU123.SCHED
11664 ± 10% -24.0% 8868 ± 12% softirqs.CPU131.RCU
18565 ± 47% +76.3% 32735 ± 11% softirqs.CPU131.SCHED
12450 ± 16% -25.8% 9235 ± 11% softirqs.CPU137.RCU
14726 ± 52% +97.2% 29042 ± 20% softirqs.CPU137.SCHED
11700 ± 48% +112.4% 24849 ± 26% softirqs.CPU49.SCHED
9004 ± 13% +23.8% 11150 ± 9% softirqs.CPU51.RCU
27900 ± 26% -50.2% 13892 ± 48% softirqs.CPU51.SCHED
23905 ± 30% -65.5% 8244 ± 34% softirqs.CPU59.SCHED
10702 ± 11% -22.1% 8341 ± 4% softirqs.CPU64.RCU
16766 ± 46% +81.8% 30480 ± 14% softirqs.CPU64.SCHED
11227 ± 12% -20.5% 8930 ± 2% softirqs.CPU73.RCU
9594 ± 7% +17.1% 11231 ± 8% softirqs.CPU99.RCU
4958 ± 43% -46.0% 2677 ± 28% interrupts.CPU100.NMI:Non-maskable_interrupts
4958 ± 43% -46.0% 2677 ± 28% interrupts.CPU100.PMI:Performance_monitoring_interrupts
1416 ± 39% +38.1% 1955 ± 16% interrupts.CPU101.CAL:Function_call_interrupts
1362 ± 42% +36.8% 1864 ± 13% interrupts.CPU103.CAL:Function_call_interrupts
3843 ± 40% +98.8% 7639 ± 9% interrupts.CPU105.NMI:Non-maskable_interrupts
3843 ± 40% +98.8% 7639 ± 9% interrupts.CPU105.PMI:Performance_monitoring_interrupts
1424 ± 40% +36.5% 1945 ± 13% interrupts.CPU113.CAL:Function_call_interrupts
7245 ± 18% -39.6% 4378 ± 49% interrupts.CPU115.NMI:Non-maskable_interrupts
7245 ± 18% -39.6% 4378 ± 49% interrupts.CPU115.PMI:Performance_monitoring_interrupts
1410 ± 40% +32.3% 1865 ± 8% interrupts.CPU121.CAL:Function_call_interrupts
66.17 ± 81% +175.1% 182.00 ± 30% interrupts.CPU121.RES:Rescheduling_interrupts
205.67 ± 26% -45.2% 112.75 ± 30% interrupts.CPU123.RES:Rescheduling_interrupts
6571 ± 22% -63.2% 2420 ± 22% interrupts.CPU131.NMI:Non-maskable_interrupts
6571 ± 22% -63.2% 2420 ± 22% interrupts.CPU131.PMI:Performance_monitoring_interrupts
1369 ± 29% +44.3% 1976 ± 17% interrupts.CPU132.CAL:Function_call_interrupts
1424 ± 39% +107.5% 2955 ± 36% interrupts.CPU142.CAL:Function_call_interrupts
5617 ± 37% -50.4% 2785 ± 27% interrupts.CPU24.NMI:Non-maskable_interrupts
5617 ± 37% -50.4% 2785 ± 27% interrupts.CPU24.PMI:Performance_monitoring_interrupts
1319 ± 46% +43.8% 1896 ± 9% interrupts.CPU25.CAL:Function_call_interrupts
4789 ± 28% -55.2% 2145 ± 33% interrupts.CPU33.NMI:Non-maskable_interrupts
4789 ± 28% -55.2% 2145 ± 33% interrupts.CPU33.PMI:Performance_monitoring_interrupts
78.00 ± 82% +175.0% 214.50 ± 27% interrupts.CPU51.RES:Rescheduling_interrupts
112.17 ± 68% +119.3% 246.00 ± 13% interrupts.CPU59.RES:Rescheduling_interrupts
75.50 ± 96% +188.4% 217.75 ± 24% interrupts.CPU65.RES:Rescheduling_interrupts
1415 ± 40% +32.9% 1881 ± 12% interrupts.CPU67.CAL:Function_call_interrupts
182.50 ± 42% -57.4% 77.75 ± 82% interrupts.CPU73.RES:Rescheduling_interrupts
4942 ± 30% -55.7% 2188 ± 27% interrupts.CPU78.NMI:Non-maskable_interrupts
4942 ± 30% -55.7% 2188 ± 27% interrupts.CPU78.PMI:Performance_monitoring_interrupts
222.33 ± 19% -47.4% 117.00 ± 48% interrupts.CPU79.RES:Rescheduling_interrupts
3537 ± 33% -59.4% 1437 interrupts.CPU83.NMI:Non-maskable_interrupts
3537 ± 33% -59.4% 1437 interrupts.CPU83.PMI:Performance_monitoring_interrupts
1429 ± 40% +36.5% 1951 ± 7% interrupts.CPU9.CAL:Function_call_interrupts
691110 ± 10% -11.8% 609473 interrupts.NMI:Non-maskable_interrupts
691110 ± 10% -11.8% 609473 interrupts.PMI:Performance_monitoring_interrupts
0.94 ± 11% -0.2 0.77 ± 3% perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
0.99 ± 9% +0.1 1.13 ± 2% perf-profile.calltrace.cycles-pp.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.43 ±143% +1.2 1.61 ± 31% perf-profile.calltrace.cycles-pp.page_counter_cancel.page_counter_uncharge.obj_cgroup_uncharge_pages.kmem_cache_free.remove_vma
1.41 ± 42% +1.2 2.60 ± 20% perf-profile.calltrace.cycles-pp.kmem_cache_free.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap
1.75 ± 33% +1.2 2.96 ± 17% perf-profile.calltrace.cycles-pp.remove_vma.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
0.60 ±114% +1.2 1.84 ± 30% perf-profile.calltrace.cycles-pp.page_counter_uncharge.obj_cgroup_uncharge_pages.kmem_cache_free.remove_vma.__do_munmap
0.62 ±114% +1.3 1.88 ± 29% perf-profile.calltrace.cycles-pp.obj_cgroup_uncharge_pages.kmem_cache_free.remove_vma.__do_munmap.__vm_munmap
2.65 ± 30% +1.5 4.12 ± 17% perf-profile.calltrace.cycles-pp.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
2.33 ± 34% +1.5 3.81 ± 19% perf-profile.calltrace.cycles-pp.kmem_cache_alloc.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff
0.78 ±116% +1.7 2.44 ± 30% perf-profile.calltrace.cycles-pp.page_counter_try_charge.obj_cgroup_charge_pages.obj_cgroup_charge.kmem_cache_alloc.vm_area_alloc
0.80 ±116% +1.7 2.47 ± 30% perf-profile.calltrace.cycles-pp.obj_cgroup_charge_pages.obj_cgroup_charge.kmem_cache_alloc.vm_area_alloc.mmap_region
0.83 ±115% +1.7 2.55 ± 29% perf-profile.calltrace.cycles-pp.obj_cgroup_charge.kmem_cache_alloc.vm_area_alloc.mmap_region.do_mmap
15.14 ± 9% +1.9 17.04 ± 3% perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
12.28 ± 9% +2.0 14.31 ± 3% perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
16.82 ± 9% +2.0 18.86 ± 2% perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
17.60 ± 9% +2.1 19.70 ± 2% perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
26.62 ± 9% +2.8 29.39 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
26.81 ± 9% +2.8 29.60 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
30.88 ± 9% +3.1 33.99 perf-profile.calltrace.cycles-pp.__mmap
0.64 ± 10% -0.2 0.46 ± 3% perf-profile.children.cycles-pp.unlink_file_vma
0.96 ± 10% -0.2 0.78 ± 3% perf-profile.children.cycles-pp.free_pgtables
0.24 ± 11% -0.2 0.09 ± 9% perf-profile.children.cycles-pp.vma_interval_tree_remove
0.10 ± 12% -0.0 0.07 perf-profile.children.cycles-pp.blocking_notifier_call_chain
0.16 ± 8% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.vmacache_find
0.12 ± 7% +0.0 0.14 perf-profile.children.cycles-pp.make_kgid
0.08 ± 13% +0.0 0.11 ± 4% perf-profile.children.cycles-pp.common_mmap
0.15 ± 11% +0.0 0.18 ± 2% perf-profile.children.cycles-pp.cap_mmap_addr
0.15 ± 11% +0.0 0.19 ± 3% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.12 ± 10% +0.0 0.16 ± 2% perf-profile.children.cycles-pp.copy_from_kernel_nofault_allowed
0.20 ± 7% +0.0 0.24 ± 2% perf-profile.children.cycles-pp.security_mmap_addr
0.00 +0.1 0.05 perf-profile.children.cycles-pp.should_failslab
0.07 ± 25% +0.1 0.12 ± 6% perf-profile.children.cycles-pp.cmd_sched
0.07 ± 25% +0.1 0.12 ± 6% perf-profile.children.cycles-pp.record__finish_output
0.07 ± 25% +0.1 0.12 ± 6% perf-profile.children.cycles-pp.perf_session__process_events
0.07 ± 25% +0.1 0.12 ± 4% perf-profile.children.cycles-pp.cmd_record
0.07 ± 27% +0.1 0.13 ± 5% perf-profile.children.cycles-pp.__libc_start_main
0.07 ± 27% +0.1 0.13 ± 5% perf-profile.children.cycles-pp.main
0.07 ± 27% +0.1 0.13 ± 5% perf-profile.children.cycles-pp.run_builtin
0.05 ± 51% +0.1 0.11 ± 7% perf-profile.children.cycles-pp.process_simple
0.02 ±142% +0.1 0.08 ± 10% perf-profile.children.cycles-pp.queue_event
0.02 ±144% +0.1 0.08 ± 8% perf-profile.children.cycles-pp.ordered_events__queue
0.50 ± 9% +0.1 0.58 ± 3% perf-profile.children.cycles-pp.kmem_cache_alloc_trace
0.10 ± 6% +0.1 0.19 ± 2% perf-profile.children.cycles-pp.cap_mmap_file
1.01 ± 9% +0.1 1.14 ± 2% perf-profile.children.cycles-pp.security_mmap_file
0.23 ± 97% +0.4 0.63 ± 30% perf-profile.children.cycles-pp.propagate_protected_usage
0.61 ± 84% +1.0 1.61 ± 31% perf-profile.children.cycles-pp.page_counter_cancel
0.72 ± 84% +1.1 1.85 ± 30% perf-profile.children.cycles-pp.page_counter_uncharge
0.74 ± 82% +1.1 1.88 ± 29% perf-profile.children.cycles-pp.obj_cgroup_uncharge_pages
1.43 ± 42% +1.2 2.62 ± 20% perf-profile.children.cycles-pp.kmem_cache_free
1.78 ± 33% +1.2 3.00 ± 17% perf-profile.children.cycles-pp.remove_vma
2.40 ± 33% +1.4 3.84 ± 19% perf-profile.children.cycles-pp.kmem_cache_alloc
2.65 ± 30% +1.5 4.12 ± 17% perf-profile.children.cycles-pp.vm_area_alloc
0.91 ± 90% +1.5 2.45 ± 30% perf-profile.children.cycles-pp.page_counter_try_charge
0.93 ± 88% +1.5 2.48 ± 30% perf-profile.children.cycles-pp.obj_cgroup_charge_pages
1.01 ± 81% +1.6 2.56 ± 29% perf-profile.children.cycles-pp.obj_cgroup_charge
15.17 ± 9% +1.9 17.06 ± 3% perf-profile.children.cycles-pp.do_mmap
16.83 ± 9% +2.1 18.88 ± 2% perf-profile.children.cycles-pp.vm_mmap_pgoff
12.32 ± 9% +2.1 14.38 ± 3% perf-profile.children.cycles-pp.mmap_region
17.64 ± 9% +2.1 19.74 ± 2% perf-profile.children.cycles-pp.ksys_mmap_pgoff
31.15 ± 9% +3.1 34.27 perf-profile.children.cycles-pp.__mmap
0.22 ± 10% -0.1 0.10 ± 5% perf-profile.self.cycles-pp.get_unmapped_area
0.15 ± 12% -0.1 0.06 ± 7% perf-profile.self.cycles-pp.vma_interval_tree_remove
0.10 ± 12% -0.0 0.07 perf-profile.self.cycles-pp.blocking_notifier_call_chain
0.10 ± 11% +0.0 0.13 ± 5% perf-profile.self.cycles-pp.vma_link
0.08 ± 13% +0.0 0.11 ± 4% perf-profile.self.cycles-pp.common_mmap
0.14 ± 9% +0.0 0.18 ± 2% perf-profile.self.cycles-pp.cap_mmap_addr
0.11 ± 11% +0.0 0.14 ± 3% perf-profile.self.cycles-pp.copy_from_kernel_nofault_allowed
0.13 ± 11% +0.0 0.17 ± 4% perf-profile.self.cycles-pp.exit_to_user_mode_prepare
0.32 ± 9% +0.0 0.37 ± 3% perf-profile.self.cycles-pp.kmem_cache_alloc_trace
0.26 ± 10% +0.0 0.30 ± 2% perf-profile.self.cycles-pp.__x64_sys_munmap
0.19 ± 13% +0.0 0.23 perf-profile.self.cycles-pp.shmem_mmap
0.02 ±142% +0.1 0.08 ± 6% perf-profile.self.cycles-pp.queue_event
0.09 ± 9% +0.1 0.18 perf-profile.self.cycles-pp.cap_mmap_file
0.22 ± 98% +0.4 0.62 ± 30% perf-profile.self.cycles-pp.propagate_protected_usage
0.60 ± 84% +1.0 1.60 ± 31% perf-profile.self.cycles-pp.page_counter_cancel
0.78 ± 88% +1.3 2.04 ± 29% perf-profile.self.cycles-pp.page_counter_try_charge
***************************************************************************************************
lkp-cpl-4sp1: 144 threads 4 sockets Intel(R) Xeon(R) Gold 5318H CPU @ 2.50GHz with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/50%/debian-10.4-x86_64-20200603.cgz/lkp-cpl-4sp1/mmap2/will-it-scale/0x700001e
commit:
f87bc8dc7a ("x86/asm: Add _ASM_RIP() macro for x86-64 (%rip) suffix")
0507503671 ("x86/asm: Avoid adding register pressure for the init case in static_cpu_has()")
f87bc8dc7a7c438c 0507503671f9b1c867e889cbec0
---------------- ---------------------------
%stddev %change %stddev
\ | \
44230314 -3.7% 42591837 will-it-scale.72.processes
614309 -3.7% 591552 will-it-scale.per_process_ops
44230314 -3.7% 42591837 will-it-scale.workload
122674 ± 6% -14.9% 104412 ± 9% proc-vmstat.numa_pte_updates
1243 ± 8% -15.2% 1054 ± 7% slabinfo.file_lock_cache.active_objs
1243 ± 8% -15.2% 1054 ± 7% slabinfo.file_lock_cache.num_objs
0.01 ± 17% +33.7% 0.02 ± 16% perf-sched.sch_delay.max.ms.syslog_print.do_syslog.part.0.kmsg_read
317.00 ± 8% +13.2% 358.83 ± 5% perf-sched.wait_and_delay.count.preempt_schedule_common.__cond_resched.unmap_page_range.unmap_vmas.unmap_region
10.00 ± 45% +185.0% 28.50 ± 22% perf-sched.wait_and_delay.count.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
14116 ± 44% +148.1% 35030 ± 13% softirqs.CPU1.SCHED
17448 ± 38% +96.2% 34241 ± 20% softirqs.CPU114.SCHED
28696 ± 25% -57.1% 12318 ± 60% softirqs.CPU42.SCHED
18990 ± 48% +60.9% 30559 ± 20% softirqs.CPU43.SCHED
7046 ± 19% -19.3% 5689 ± 10% softirqs.CPU46.RCU
33819 ± 16% -60.1% 13489 ± 37% softirqs.CPU73.SCHED
2534 ± 25% -27.6% 1835 ± 25% interrupts.CPU0.CAL:Function_call_interrupts
231.17 ± 22% -67.3% 75.50 ± 52% interrupts.CPU1.RES:Rescheduling_interrupts
191.83 ± 27% -71.2% 55.17 ±103% interrupts.CPU114.RES:Rescheduling_interrupts
6904 ± 26% -64.3% 2467 ± 37% interrupts.CPU13.NMI:Non-maskable_interrupts
6904 ± 26% -64.3% 2467 ± 37% interrupts.CPU13.PMI:Performance_monitoring_interrupts
5739 ± 16% -37.8% 3568 ± 12% interrupts.CPU15.NMI:Non-maskable_interrupts
5739 ± 16% -37.8% 3568 ± 12% interrupts.CPU15.PMI:Performance_monitoring_interrupts
3709 ± 37% +99.0% 7381 ± 11% interrupts.CPU50.NMI:Non-maskable_interrupts
3709 ± 37% +99.0% 7381 ± 11% interrupts.CPU50.PMI:Performance_monitoring_interrupts
72.50 ± 57% +239.1% 245.83 ± 18% interrupts.CPU73.RES:Rescheduling_interrupts
3269 ± 39% +61.3% 5274 ± 34% interrupts.CPU79.NMI:Non-maskable_interrupts
3269 ± 39% +61.3% 5274 ± 34% interrupts.CPU79.PMI:Performance_monitoring_interrupts
2097 ± 6% -11.5% 1855 ± 11% interrupts.CPU97.CAL:Function_call_interrupts
0.68 ± 4% -0.3 0.41 ± 70% perf-profile.calltrace.cycles-pp.get_obj_cgroup_from_current.kmem_cache_alloc.vm_area_alloc.mmap_region.do_mmap
8.33 ± 10% +2.1 10.43 ± 9% perf-profile.calltrace.cycles-pp.___might_sleep.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
0.71 ± 4% -0.1 0.60 ± 13% perf-profile.children.cycles-pp.get_obj_cgroup_from_current
0.14 ± 10% +0.1 0.21 ± 11% perf-profile.children.cycles-pp.tlb_flush_mmu
0.08 ± 11% +0.1 0.15 ± 13% perf-profile.children.cycles-pp.cap_mmap_file
0.00 +0.1 0.14 ± 10% perf-profile.children.cycles-pp.cap_mmap_addr
0.09 ± 8% +0.1 0.23 ± 8% perf-profile.children.cycles-pp.security_mmap_addr
0.34 ± 9% +0.2 0.55 ± 11% perf-profile.children.cycles-pp.security_vm_enough_memory_mm
0.26 ± 10% +0.2 0.46 ± 11% perf-profile.children.cycles-pp.cap_vm_enough_memory
0.47 ± 13% -0.2 0.31 ± 11% perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
0.26 ± 11% -0.1 0.16 ± 9% perf-profile.self.cycles-pp.get_unmapped_area
0.07 ± 11% +0.0 0.09 ± 7% perf-profile.self.cycles-pp.security_mmap_addr
0.08 ± 8% +0.1 0.15 ± 11% perf-profile.self.cycles-pp.tlb_flush_mmu
0.06 ± 45% +0.1 0.14 ± 14% perf-profile.self.cycles-pp.cap_mmap_file
0.00 +0.1 0.13 ± 11% perf-profile.self.cycles-pp.cap_mmap_addr
0.13 ± 11% +0.2 0.36 ± 11% perf-profile.self.cycles-pp.cap_vm_enough_memory
1.177e+11 -3.6% 1.134e+11 perf-stat.i.branch-instructions
2.875e+08 -5.2% 2.726e+08 perf-stat.i.branch-misses
0.44 +4.5% 0.46 perf-stat.i.cpi
1.224e+11 -3.7% 1.179e+11 perf-stat.i.dTLB-loads
5.54e+10 -3.7% 5.336e+10 perf-stat.i.dTLB-stores
4.855e+11 -3.7% 4.676e+11 perf-stat.i.instructions
2.29 -4.3% 2.19 perf-stat.i.ipc
2051 -3.7% 1976 perf-stat.i.metric.M/sec
0.44 +4.5% 0.46 perf-stat.overall.cpi
2.29 -4.3% 2.19 perf-stat.overall.ipc
1.173e+11 -3.6% 1.13e+11 perf-stat.ps.branch-instructions
2.866e+08 -5.2% 2.717e+08 perf-stat.ps.branch-misses
1.22e+11 -3.7% 1.175e+11 perf-stat.ps.dTLB-loads
5.522e+10 -3.7% 5.319e+10 perf-stat.ps.dTLB-stores
4.839e+11 -3.7% 4.66e+11 perf-stat.ps.instructions
1.46e+14 -3.7% 1.407e+14 perf-stat.total.instructions
***************************************************************************************************
lkp-icl-2sp2: 128 threads 2 sockets Intel(R) Xeon(R) Gold 6338 CPU @ 2.00GHz with 256G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/16/debian-10.4-x86_64-20200603.cgz/lkp-icl-2sp2/mmap2/will-it-scale/0xd000280
commit:
f87bc8dc7a ("x86/asm: Add _ASM_RIP() macro for x86-64 (%rip) suffix")
0507503671 ("x86/asm: Avoid adding register pressure for the init case in static_cpu_has()")
f87bc8dc7a7c438c 0507503671f9b1c867e889cbec0
---------------- ---------------------------
%stddev %change %stddev
\ | \
10605471 -5.1% 10059884 will-it-scale.16.processes
662841 -5.1% 628742 will-it-scale.per_process_ops
10605471 -5.1% 10059884 will-it-scale.workload
37850 ± 5% -10.0% 34062 ± 7% softirqs.CPU103.SCHED
1833056 ± 6% +10.2% 2020339 ± 2% numa-vmstat.node0.numa_hit
929507 ± 13% -21.4% 730330 ± 5% numa-vmstat.node1.numa_hit
314.67 +374.9% 1494 ±135% interrupts.CPU113.RES:Rescheduling_interrupts
1457 ± 45% -44.6% 807.33 ± 24% interrupts.CPU3.CAL:Function_call_interrupts
617.33 ± 95% +296.1% 2445 ± 95% interrupts.CPU46.RES:Rescheduling_interrupts
312.33 +1308.8% 4400 ±187% interrupts.CPU58.RES:Rescheduling_interrupts
312.33 +1984.4% 6510 ±195% interrupts.CPU60.RES:Rescheduling_interrupts
125.67 ± 24% +54.4% 194.00 ± 31% interrupts.CPU8.RES:Rescheduling_interrupts
2393 ± 14% -24.9% 1797 ± 4% slabinfo.UNIX-DGRAM.active_objs
2393 ± 14% -24.9% 1797 ± 4% slabinfo.UNIX-DGRAM.num_objs
31869 ± 4% -8.7% 29097 slabinfo.anon_vma.active_objs
31869 ± 4% -8.7% 29097 slabinfo.anon_vma.num_objs
3906 ± 12% -19.7% 3138 ± 3% slabinfo.sock_inode_cache.active_objs
3906 ± 12% -19.7% 3138 ± 3% slabinfo.sock_inode_cache.num_objs
2.838e+10 -5.1% 2.695e+10 perf-stat.i.branch-instructions
18626394 -5.7% 17569866 perf-stat.i.branch-misses
0.39 +4.9% 0.40 perf-stat.i.cpi
0.00 ± 2% +0.0 0.00 perf-stat.i.dTLB-load-miss-rate%
142422 ± 2% +5.0% 149521 ± 2% perf-stat.i.dTLB-load-misses
2.963e+10 -5.0% 2.814e+10 perf-stat.i.dTLB-loads
31137 -10.0% 28016 perf-stat.i.dTLB-store-misses
1.339e+10 -5.0% 1.272e+10 perf-stat.i.dTLB-stores
1.177e+11 -5.1% 1.117e+11 perf-stat.i.instructions
2.60 -4.7% 2.48 perf-stat.i.ipc
0.91 +4.0% 0.94 ± 2% perf-stat.i.major-faults
557.84 -5.0% 529.69 perf-stat.i.metric.M/sec
0.38 +4.9% 0.40 perf-stat.overall.cpi
0.00 ± 2% +0.0 0.00 ± 2% perf-stat.overall.dTLB-load-miss-rate%
0.00 -0.0 0.00 perf-stat.overall.dTLB-store-miss-rate%
2.60 -4.7% 2.48 perf-stat.overall.ipc
2.829e+10 -5.1% 2.686e+10 perf-stat.ps.branch-instructions
18561435 -5.7% 17511090 perf-stat.ps.branch-misses
141908 ± 2% +5.0% 149002 ± 2% perf-stat.ps.dTLB-load-misses
2.953e+10 -5.0% 2.804e+10 perf-stat.ps.dTLB-loads
31009 -10.0% 27905 perf-stat.ps.dTLB-store-misses
1.335e+10 -5.1% 1.268e+10 perf-stat.ps.dTLB-stores
1.174e+11 -5.1% 1.114e+11 perf-stat.ps.instructions
0.90 +3.9% 0.94 ± 2% perf-stat.ps.major-faults
3.546e+13 -5.1% 3.366e+13 perf-stat.total.instructions
37.04 -2.1 34.95 perf-profile.calltrace.cycles-pp.__mmap
33.84 -1.8 32.01 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
33.52 -1.8 31.70 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
32.72 -1.8 30.92 perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
30.75 -1.7 29.02 perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
27.70 -1.6 26.12 perf-profile.calltrace.cycles-pp.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
23.50 -1.4 22.15 perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
11.02 -0.7 10.33 perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
6.93 -0.6 6.31 perf-profile.calltrace.cycles-pp.free_pgd_range.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
6.56 -0.6 6.01 perf-profile.calltrace.cycles-pp.free_p4d_range.free_pgd_range.unmap_region.__do_munmap.__vm_munmap
3.39 -0.3 3.10 ± 3% perf-profile.calltrace.cycles-pp.vma_link.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.56 ± 3% -0.3 1.28 ± 2% perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff
2.57 ± 2% -0.3 2.31 ± 2% perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.04 ± 4% -0.2 0.81 ± 2% perf-profile.calltrace.cycles-pp.vm_unmapped_area.arch_get_unmapped_area_topdown.shmem_get_unmapped_area.get_unmapped_area.do_mmap
2.51 -0.2 2.28 ± 3% perf-profile.calltrace.cycles-pp.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
2.16 -0.2 1.94 ± 3% perf-profile.calltrace.cycles-pp.shmem_get_unmapped_area.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
1.31 ± 2% -0.2 1.13 ± 2% perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
1.01 ± 6% -0.2 0.85 ± 7% perf-profile.calltrace.cycles-pp.kfree.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.94 -0.1 1.83 ± 2% perf-profile.calltrace.cycles-pp.__entry_text_start.__mmap
0.54 ± 2% -0.1 0.44 ± 44% perf-profile.calltrace.cycles-pp.prepend_copy.d_path.perf_event_mmap.mmap_region.do_mmap
1.42 -0.1 1.35 ± 3% perf-profile.calltrace.cycles-pp.prepend_name.prepend_path.d_path.perf_event_mmap.mmap_region
1.02 -0.1 0.96 perf-profile.calltrace.cycles-pp.security_mmap_file.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
1.01 -0.1 0.96 ± 2% perf-profile.calltrace.cycles-pp.kmem_cache_alloc_trace.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
0.80 ± 2% -0.0 0.75 ± 3% perf-profile.calltrace.cycles-pp.down_write_killable.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
0.82 -0.0 0.77 ± 3% perf-profile.calltrace.cycles-pp.copy_from_kernel_nofault.prepend_copy.prepend_name.prepend_path.d_path
2.90 +0.2 3.07 ± 2% perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
3.44 ± 2% +0.6 4.03 ± 2% perf-profile.calltrace.cycles-pp.rcu_all_qs.__cond_resched.unmap_page_range.unmap_vmas.unmap_region
6.75 +1.4 8.11 perf-profile.calltrace.cycles-pp.__cond_resched.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
7.58 +1.5 9.12 perf-profile.calltrace.cycles-pp.___might_sleep.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
54.60 +2.5 57.12 perf-profile.calltrace.cycles-pp.__munmap
51.24 +2.6 53.86 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap
50.93 +2.6 53.56 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
50.25 +2.6 52.90 perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
49.92 +2.7 52.59 perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
47.82 +2.7 50.53 perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
40.04 +2.7 42.77 perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
28.05 +3.5 31.57 perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
26.87 +3.9 30.72 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.__do_munmap.__vm_munmap
37.31 -2.1 35.24 perf-profile.children.cycles-pp.__mmap
32.85 -1.8 31.05 perf-profile.children.cycles-pp.ksys_mmap_pgoff
30.91 -1.7 29.16 perf-profile.children.cycles-pp.vm_mmap_pgoff
27.81 -1.6 26.23 perf-profile.children.cycles-pp.do_mmap
23.86 -1.4 22.49 perf-profile.children.cycles-pp.mmap_region
11.16 -0.7 10.46 perf-profile.children.cycles-pp.perf_event_mmap
6.97 -0.6 6.38 perf-profile.children.cycles-pp.free_pgd_range
6.61 -0.6 6.06 perf-profile.children.cycles-pp.free_p4d_range
3.54 -0.3 3.25 ± 3% perf-profile.children.cycles-pp.vma_link
1.68 ± 2% -0.3 1.41 ± 2% perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
2.67 ± 2% -0.3 2.41 ± 2% perf-profile.children.cycles-pp.perf_iterate_sb
2.57 -0.2 2.34 ± 3% perf-profile.children.cycles-pp.get_unmapped_area
1.06 ± 4% -0.2 0.83 ± 2% perf-profile.children.cycles-pp.vm_unmapped_area
2.20 -0.2 1.98 ± 3% perf-profile.children.cycles-pp.shmem_get_unmapped_area
1.34 ± 2% -0.2 1.16 ± 2% perf-profile.children.cycles-pp.__vma_link_rb
1.03 ± 6% -0.2 0.88 ± 7% perf-profile.children.cycles-pp.kfree
1.94 ± 2% -0.1 1.85 ± 2% perf-profile.children.cycles-pp.syscall_return_via_sysret
1.50 -0.1 1.43 ± 2% perf-profile.children.cycles-pp.prepend_name
1.64 -0.1 1.58 ± 2% perf-profile.children.cycles-pp.prepend_copy
1.40 -0.1 1.33 ± 2% perf-profile.children.cycles-pp.copy_from_kernel_nofault
0.66 ± 3% -0.1 0.61 ± 2% perf-profile.children.cycles-pp.mod_objcg_state
1.30 -0.1 1.25 perf-profile.children.cycles-pp.__might_sleep
1.11 -0.1 1.06 ± 2% perf-profile.children.cycles-pp.kmem_cache_alloc_trace
1.08 -0.1 1.02 perf-profile.children.cycles-pp.security_mmap_file
0.58 ± 3% -0.0 0.55 ± 2% perf-profile.children.cycles-pp.common_file_perm
0.21 ± 4% -0.0 0.18 ± 3% perf-profile.children.cycles-pp.unlink_anon_vmas
0.27 ± 4% -0.0 0.24 ± 3% perf-profile.children.cycles-pp.userfaultfd_unmap_prep
0.11 ± 3% -0.0 0.09 ± 8% perf-profile.children.cycles-pp.__vma_link_file
0.35 -0.0 0.33 ± 2% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.39 -0.0 0.37 perf-profile.children.cycles-pp.prepend
0.09 ± 7% +0.0 0.12 ± 8% perf-profile.children.cycles-pp.tlb_table_flush
0.28 ± 2% +0.0 0.31 ± 4% perf-profile.children.cycles-pp.tlb_flush_mmu
2.98 +0.2 3.16 ± 2% perf-profile.children.cycles-pp.zap_pte_range
4.00 +0.5 4.51 ± 2% perf-profile.children.cycles-pp.rcu_all_qs
7.99 +1.1 9.08 ± 2% perf-profile.children.cycles-pp.__cond_resched
8.39 +2.1 10.45 perf-profile.children.cycles-pp.___might_sleep
54.65 +2.5 57.14 perf-profile.children.cycles-pp.__munmap
50.33 +2.7 52.98 perf-profile.children.cycles-pp.__x64_sys_munmap
50.07 +2.7 52.73 perf-profile.children.cycles-pp.__vm_munmap
48.08 +2.7 50.79 perf-profile.children.cycles-pp.__do_munmap
40.20 +2.7 42.93 perf-profile.children.cycles-pp.unmap_region
28.15 +3.5 31.68 perf-profile.children.cycles-pp.unmap_vmas
27.25 +3.5 30.78 perf-profile.children.cycles-pp.unmap_page_range
6.56 -0.6 5.99 perf-profile.self.cycles-pp.free_p4d_range
1.05 ± 4% -0.2 0.81 ± 2% perf-profile.self.cycles-pp.vm_unmapped_area
1.88 ± 2% -0.2 1.65 ± 3% perf-profile.self.cycles-pp.perf_iterate_sb
1.31 ± 2% -0.2 1.13 ± 2% perf-profile.self.cycles-pp.__vma_link_rb
1.00 ± 6% -0.2 0.85 ± 7% perf-profile.self.cycles-pp.kfree
1.98 -0.1 1.88 perf-profile.self.cycles-pp.mmap_region
1.94 ± 2% -0.1 1.85 ± 2% perf-profile.self.cycles-pp.syscall_return_via_sysret
0.62 ± 3% -0.1 0.56 ± 2% perf-profile.self.cycles-pp.mod_objcg_state
1.82 -0.1 1.76 ± 2% perf-profile.self.cycles-pp.__do_munmap
0.91 ± 3% -0.1 0.86 ± 2% perf-profile.self.cycles-pp.__munmap
0.61 ± 2% -0.0 0.56 ± 3% perf-profile.self.cycles-pp.tlb_finish_mmu
1.07 -0.0 1.02 perf-profile.self.cycles-pp.__might_sleep
0.60 ± 2% -0.0 0.56 ± 3% perf-profile.self.cycles-pp.d_path
1.00 -0.0 0.96 ± 2% perf-profile.self.cycles-pp.copy_from_kernel_nofault
0.49 ± 3% -0.0 0.46 ± 2% perf-profile.self.cycles-pp.vm_mmap_pgoff
0.09 ± 6% -0.0 0.06 ± 7% perf-profile.self.cycles-pp.__vma_link_file
0.25 ± 5% -0.0 0.22 ± 4% perf-profile.self.cycles-pp.userfaultfd_unmap_prep
0.21 ± 4% -0.0 0.19 ± 5% perf-profile.self.cycles-pp.remove_vma
0.28 ± 4% -0.0 0.26 ± 2% perf-profile.self.cycles-pp.common_file_perm
0.17 ± 5% -0.0 0.15 ± 6% perf-profile.self.cycles-pp.unlink_anon_vmas
0.25 ± 4% -0.0 0.23 ± 2% perf-profile.self.cycles-pp.vma_set_page_prot
0.07 ± 9% +0.0 0.10 ± 7% perf-profile.self.cycles-pp.tlb_table_flush
2.18 +0.2 2.33 ± 2% perf-profile.self.cycles-pp.zap_pte_range
2.77 +0.2 2.99 ± 2% perf-profile.self.cycles-pp.rcu_all_qs
4.10 +0.4 4.53 ± 2% perf-profile.self.cycles-pp.__cond_resched
12.61 +0.5 13.08 perf-profile.self.cycles-pp.unmap_page_range
6.76 +2.1 8.83 perf-profile.self.cycles-pp.___might_sleep
***************************************************************************************************
lkp-csl-2sp9: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/16/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp9/mmap1/will-it-scale/0x5003006
commit:
f87bc8dc7a ("x86/asm: Add _ASM_RIP() macro for x86-64 (%rip) suffix")
0507503671 ("x86/asm: Avoid adding register pressure for the init case in static_cpu_has()")
f87bc8dc7a7c438c 0507503671f9b1c867e889cbec0
---------------- ---------------------------
%stddev %change %stddev
\ | \
10583704 -5.9% 9961518 will-it-scale.16.processes
661481 -5.9% 622594 will-it-scale.per_process_ops
10583704 -5.9% 9961518 will-it-scale.workload
42965 ± 53% -34.2% 28286 ± 88% numa-vmstat.node1.numa_other
0.03 ± 94% -62.2% 0.01 ± 46% perf-sched.sch_delay.max.ms.schedule_hrtimeout_range_clock.ep_poll.do_epoll_wait.__x64_sys_epoll_wait
19740 ± 41% -46.8% 10503 ± 54% softirqs.CPU12.SCHED
180.86 -1.1% 178.80 turbostat.PkgWatt
2071 ± 17% -18.4% 1690 ± 4% slabinfo.khugepaged_mm_slot.active_objs
2071 ± 17% -18.4% 1690 ± 4% slabinfo.khugepaged_mm_slot.num_objs
3707 ±123% -94.7% 196.00 ±199% interrupts.80:PCI-MSI.31981613-edge.i40e-eth0-TxRx-44
3706 ±123% -94.7% 195.80 ±199% interrupts.CPU44.80:PCI-MSI.31981613-edge.i40e-eth0-TxRx-44
5381 ± 46% -52.1% 2575 ± 38% interrupts.CPU45.NMI:Non-maskable_interrupts
5381 ± 46% -52.1% 2575 ± 38% interrupts.CPU45.PMI:Performance_monitoring_interrupts
5384 ± 46% -51.2% 2629 ± 32% interrupts.CPU50.NMI:Non-maskable_interrupts
5384 ± 46% -51.2% 2629 ± 32% interrupts.CPU50.PMI:Performance_monitoring_interrupts
4696 ± 45% -40.2% 2809 ± 28% interrupts.CPU52.NMI:Non-maskable_interrupts
4696 ± 45% -40.2% 2809 ± 28% interrupts.CPU52.PMI:Performance_monitoring_interrupts
2.549e+10 -5.8% 2.402e+10 perf-stat.i.branch-instructions
0.23 ± 5% +0.1 0.29 ± 3% perf-stat.i.branch-miss-rate%
57801771 ± 5% +20.1% 69394658 ± 3% perf-stat.i.branch-misses
0.45 +5.8% 0.47 perf-stat.i.cpi
2.565e+10 -5.5% 2.424e+10 perf-stat.i.dTLB-loads
1.142e+10 -5.8% 1.075e+10 perf-stat.i.dTLB-stores
31644150 ± 3% +36.2% 43090676 ± 2% perf-stat.i.iTLB-load-misses
1.044e+11 -5.8% 9.833e+10 perf-stat.i.instructions
3328 ± 3% -31.3% 2286 ± 2% perf-stat.i.instructions-per-iTLB-miss
2.23 -5.5% 2.11 perf-stat.i.ipc
710.89 -5.7% 670.51 perf-stat.i.metric.M/sec
0.23 ± 5% +0.1 0.29 ± 3% perf-stat.overall.branch-miss-rate%
0.45 +5.9% 0.47 perf-stat.overall.cpi
3303 ± 3% -30.9% 2283 ± 2% perf-stat.overall.instructions-per-iTLB-miss
2.24 -5.5% 2.11 perf-stat.overall.ipc
2.54e+10 -5.8% 2.394e+10 perf-stat.ps.branch-instructions
57619683 ± 5% +20.0% 69156262 ± 3% perf-stat.ps.branch-misses
2.557e+10 -5.5% 2.416e+10 perf-stat.ps.dTLB-loads
1.138e+10 -5.8% 1.072e+10 perf-stat.ps.dTLB-stores
31543790 ± 3% +36.2% 42949028 ± 2% perf-stat.ps.iTLB-load-misses
1.041e+11 -5.8% 9.8e+10 perf-stat.ps.instructions
3.146e+13 -5.9% 2.961e+13 perf-stat.total.instructions
11.71 ± 5% -1.4 10.30 ± 8% perf-profile.calltrace.cycles-pp.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
8.88 ± 2% -0.8 8.08 ± 9% perf-profile.calltrace.cycles-pp.free_pgd_range.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
8.57 ± 2% -0.7 7.82 ± 9% perf-profile.calltrace.cycles-pp.free_p4d_range.free_pgd_range.unmap_region.__do_munmap.__vm_munmap
3.66 ± 21% -0.6 3.03 ± 8% perf-profile.calltrace.cycles-pp.vm_area_alloc.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
3.77 ± 2% -0.5 3.30 ± 9% perf-profile.calltrace.cycles-pp.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff
2.10 ± 3% -0.3 1.83 ± 10% perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap.mmap_region.do_mmap.vm_mmap_pgoff
1.17 ± 3% -0.1 1.05 ± 10% perf-profile.calltrace.cycles-pp.__vma_link_rb.vma_link.mmap_region.do_mmap.vm_mmap_pgoff
0.74 -0.1 0.68 ± 8% perf-profile.calltrace.cycles-pp.__vma_rb_erase.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
1.66 ± 4% +0.4 2.03 ± 9% perf-profile.calltrace.cycles-pp.get_unmapped_area.do_mmap.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
21.42 ± 2% -1.6 19.81 ± 8% perf-profile.children.cycles-pp.__mmap
11.79 ± 5% -1.4 10.37 ± 8% perf-profile.children.cycles-pp.mmap_region
8.92 ± 2% -0.8 8.13 ± 9% perf-profile.children.cycles-pp.free_pgd_range
8.61 ± 2% -0.7 7.86 ± 9% perf-profile.children.cycles-pp.free_p4d_range
3.68 ± 21% -0.6 3.05 ± 8% perf-profile.children.cycles-pp.vm_area_alloc
3.89 ± 2% -0.5 3.41 ± 9% perf-profile.children.cycles-pp.perf_event_mmap
2.16 ± 3% -0.3 1.90 ± 10% perf-profile.children.cycles-pp.perf_iterate_sb
1.51 ± 3% -0.1 1.36 ± 9% perf-profile.children.cycles-pp.vma_link
1.17 ± 3% -0.1 1.06 ± 10% perf-profile.children.cycles-pp.__vma_link_rb
0.27 ± 4% -0.1 0.19 ± 6% perf-profile.children.cycles-pp.ima_file_mmap
0.74 -0.1 0.68 ± 8% perf-profile.children.cycles-pp.__vma_rb_erase
0.26 ± 6% -0.1 0.20 ± 9% perf-profile.children.cycles-pp.apparmor_mmap_file
0.24 ± 5% -0.1 0.18 ± 21% perf-profile.children.cycles-pp.task_tick_fair
0.21 ± 4% -0.0 0.17 ± 23% perf-profile.children.cycles-pp.perf_tp_event
0.20 ± 4% -0.0 0.15 ± 23% perf-profile.children.cycles-pp.perf_trace_sched_stat_runtime
0.47 -0.0 0.42 ± 12% perf-profile.children.cycles-pp.downgrade_write
0.25 ± 9% -0.0 0.20 ± 4% perf-profile.children.cycles-pp.may_expand_vm
0.20 ± 5% -0.0 0.17 ± 15% perf-profile.children.cycles-pp.userfaultfd_unmap_prep
0.26 ± 2% -0.0 0.23 ± 11% perf-profile.children.cycles-pp.syscall_enter_from_user_mode
0.10 ± 4% -0.0 0.09 perf-profile.children.cycles-pp.kfree
0.10 ± 11% +0.1 0.15 ± 13% perf-profile.children.cycles-pp.get_mmap_base
0.10 ± 4% +0.2 0.32 ± 11% perf-profile.children.cycles-pp.security_mmap_addr
0.00 +0.2 0.23 ± 8% perf-profile.children.cycles-pp.cap_mmap_addr
1.67 ± 4% +0.4 2.06 ± 9% perf-profile.children.cycles-pp.get_unmapped_area
8.53 ± 2% -0.7 7.80 ± 9% perf-profile.self.cycles-pp.free_p4d_range
1.40 ± 3% -0.2 1.21 ± 8% perf-profile.self.cycles-pp.perf_event_mmap
1.54 ± 4% -0.2 1.37 ± 9% perf-profile.self.cycles-pp.perf_iterate_sb
0.97 ± 4% -0.1 0.86 ± 9% perf-profile.self.cycles-pp.__mmap
1.16 ± 3% -0.1 1.05 ± 10% perf-profile.self.cycles-pp.__vma_link_rb
1.04 ± 4% -0.1 0.96 ± 4% perf-profile.self.cycles-pp.kmem_cache_free
0.26 ± 4% -0.1 0.18 ± 6% perf-profile.self.cycles-pp.ima_file_mmap
0.72 -0.1 0.66 ± 8% perf-profile.self.cycles-pp.__vma_rb_erase
0.25 ± 4% -0.1 0.19 ± 9% perf-profile.self.cycles-pp.apparmor_mmap_file
0.46 -0.0 0.42 ± 11% perf-profile.self.cycles-pp.downgrade_write
0.20 ± 4% -0.0 0.16 ± 14% perf-profile.self.cycles-pp.userfaultfd_unmap_prep
0.23 ± 10% -0.0 0.19 ± 5% perf-profile.self.cycles-pp.may_expand_vm
0.24 ± 2% -0.0 0.21 ± 11% perf-profile.self.cycles-pp.syscall_enter_from_user_mode
0.14 ± 3% -0.0 0.12 ± 9% perf-profile.self.cycles-pp.free_pgtables
0.18 ± 5% -0.0 0.16 ± 6% perf-profile.self.cycles-pp.vma_link
0.17 ± 4% -0.0 0.15 ± 12% perf-profile.self.cycles-pp.lru_add_drain_cpu
0.08 ± 5% -0.0 0.07 ± 6% perf-profile.self.cycles-pp.kfree
0.10 ± 10% +0.1 0.15 ± 15% perf-profile.self.cycles-pp.get_mmap_base
0.35 ± 18% +0.2 0.52 ± 9% perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
0.00 +0.2 0.22 ± 9% perf-profile.self.cycles-pp.cap_mmap_addr
***************************************************************************************************
lkp-csl-2sp9: 88 threads 2 sockets Intel(R) Xeon(R) Gold 6238M CPU @ 2.10GHz with 128G memory
=========================================================================================
compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase/ucode:
gcc-9/performance/x86_64-rhel-8.3/process/50%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp9/mmap2/will-it-scale/0x5003006
commit:
f87bc8dc7a ("x86/asm: Add _ASM_RIP() macro for x86-64 (%rip) suffix")
0507503671 ("x86/asm: Avoid adding register pressure for the init case in static_cpu_has()")
f87bc8dc7a7c438c 0507503671f9b1c867e889cbec0
---------------- ---------------------------
%stddev %change %stddev
\ | \
24670575 -3.5% 23801056 will-it-scale.44.processes
560694 -3.5% 540932 will-it-scale.per_process_ops
24670575 -3.5% 23801056 will-it-scale.workload
21496 ± 6% -9.4% 19470 ± 5% numa-vmstat.node0.nr_slab_reclaimable
1803982 ± 5% -18.1% 1477517 ± 11% numa-vmstat.node0.numa_hit
20706 ± 15% +22.3% 25318 numa-vmstat.node1.nr_slab_unreclaimable
54197 ± 11% +13.8% 61667 ± 5% slabinfo.anon_vma_chain.num_objs
3515 ± 3% -8.5% 3216 slabinfo.kmalloc-cg-1k.active_objs
3515 ± 3% -8.5% 3216 slabinfo.kmalloc-cg-1k.num_objs
28262 ± 23% -46.0% 15275 ± 57% softirqs.CPU27.SCHED
14729 ± 43% +90.5% 28061 ± 31% softirqs.CPU71.SCHED
8600 ±103% +229.7% 28352 ± 46% softirqs.CPU9.SCHED
1477 ± 16% +22.6% 1810 ± 19% interrupts.CPU1.CAL:Function_call_interrupts
1330 ± 2% +18.2% 1571 ± 17% interrupts.CPU50.CAL:Function_call_interrupts
56.17 ±122% +288.7% 218.33 ± 46% interrupts.CPU53.RES:Rescheduling_interrupts
6712 ± 22% -35.7% 4313 ± 55% interrupts.CPU9.NMI:Non-maskable_interrupts
6712 ± 22% -35.7% 4313 ± 55% interrupts.CPU9.PMI:Performance_monitoring_interrupts
270.00 ± 26% -59.8% 108.67 ± 90% interrupts.CPU9.RES:Rescheduling_interrupts
2406 ± 8% -42.7% 1379 ± 66% interrupts.TLB:TLB_shootdowns
85988 ± 6% -9.4% 77884 ± 5% numa-meminfo.node0.KReclaimable
3556982 ± 3% -9.7% 3213562 ± 2% numa-meminfo.node0.MemUsed
85988 ± 6% -9.4% 77884 ± 5% numa-meminfo.node0.SReclaimable
190750 ± 9% -13.0% 165986 ± 2% numa-meminfo.node0.Slab
7596 ± 38% +51.9% 11539 ± 12% numa-meminfo.node1.Mapped
685580 ± 19% +42.9% 979923 ± 10% numa-meminfo.node1.MemUsed
82826 ± 15% +22.3% 101275 numa-meminfo.node1.SUnreclaim
0.16 ± 15% -0.1 0.11 ± 13% perf-profile.children.cycles-pp.vma_interval_tree_remove
0.08 ± 16% +0.1 0.14 ± 12% perf-profile.children.cycles-pp.cap_mmap_file
0.00 +0.1 0.12 ± 12% perf-profile.children.cycles-pp.cap_mmap_addr
0.08 ± 15% +0.1 0.21 ± 11% perf-profile.children.cycles-pp.security_mmap_addr
0.54 ± 12% -0.2 0.33 ± 13% perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
0.25 ± 14% -0.1 0.15 ± 12% perf-profile.self.cycles-pp.get_unmapped_area
0.12 ± 13% -0.0 0.08 ± 13% perf-profile.self.cycles-pp.vma_interval_tree_remove
0.08 ± 18% +0.0 0.11 ± 13% perf-profile.self.cycles-pp.tlb_flush_mmu
0.06 ± 11% +0.1 0.13 ± 13% perf-profile.self.cycles-pp.cap_mmap_file
0.26 ± 12% +0.1 0.36 ± 11% perf-profile.self.cycles-pp.cap_vm_enough_memory
0.00 +0.1 0.12 ± 14% perf-profile.self.cycles-pp.cap_mmap_addr
6.568e+10 -3.4% 6.344e+10 perf-stat.i.branch-instructions
0.24 +0.0 0.27 perf-stat.i.branch-miss-rate%
1.549e+08 +12.0% 1.734e+08 perf-stat.i.branch-misses
0.45 +3.7% 0.47 perf-stat.i.cpi
0.00 ± 8% +0.0 0.00 ± 17% perf-stat.i.dTLB-load-miss-rate%
6.836e+10 -3.5% 6.595e+10 perf-stat.i.dTLB-loads
14037 -3.9% 13484 ± 2% perf-stat.i.dTLB-store-misses
3.096e+10 -3.5% 2.988e+10 perf-stat.i.dTLB-stores
98953573 +21.0% 1.198e+08 perf-stat.i.iTLB-load-misses
2.71e+11 -3.4% 2.617e+11 perf-stat.i.instructions
2739 -20.1% 2188 perf-stat.i.instructions-per-iTLB-miss
2.21 -3.6% 2.13 perf-stat.i.ipc
1874 -3.5% 1809 perf-stat.i.metric.M/sec
0.24 +0.0 0.27 perf-stat.overall.branch-miss-rate%
0.45 +3.7% 0.47 perf-stat.overall.cpi
2738 -20.2% 2185 perf-stat.overall.instructions-per-iTLB-miss
2.21 -3.6% 2.13 perf-stat.overall.ipc
6.546e+10 -3.4% 6.323e+10 perf-stat.ps.branch-instructions
1.544e+08 +12.0% 1.729e+08 perf-stat.ps.branch-misses
6.813e+10 -3.5% 6.573e+10 perf-stat.ps.dTLB-loads
14005 -4.0% 13446 ± 2% perf-stat.ps.dTLB-store-misses
3.086e+10 -3.5% 2.978e+10 perf-stat.ps.dTLB-stores
98636701 +21.0% 1.194e+08 perf-stat.ps.iTLB-load-misses
2.701e+11 -3.4% 2.608e+11 perf-stat.ps.instructions
8.157e+13 -3.5% 7.872e+13 perf-stat.total.instructions
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
8 months, 4 weeks
lkp
by oiloncanvas@foxmail.com
You email us a picture,
we make an oil painting.
2021-11-17 02:08:01
Oil on canvas, 100% hand-painted.
lkp
Any picture will do.
D0F69507-5CDC-437C-852F-78DF800CAB59
Free shipping to your home address.
8 months, 4 weeks
[block] 72d1b2aab7: BUG:KASAN:use-after-free_in__rq_qos_issue
by kernel test robot
Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: 72d1b2aab791ef905294fe97afcf2e7c5a5fa865 ("block: use separate links for rq_qos tracking")
https://git.kernel.org/cgit/linux/kernel/git/axboe/linux-block.git perf-wip
in testcase: blktests
version: blktests-x86_64-3be7849-1_20211102
with following parameters:
disk: 1HDD
test: scsi-group-02
ucode: 0x7000019
on test machine: 16 threads 1 sockets Intel(R) Xeon(R) CPU D-1541 @ 2.10GHz with 48G memory
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 81.967998][ T1300] BUG: KASAN: use-after-free in __rq_qos_issue (kbuild/src/x86_64/block/blk-rq-qos.c:53)
[ 81.977072][ T1300] Read of size 8 at addr ffff888c77e6f458 by task modprobe/1300
[ 81.986742][ T1300]
[ 81.991103][ T1300] CPU: 0 PID: 1300 Comm: modprobe Not tainted 5.16.0-rc1-00026-g72d1b2aab791 #1
[ 82.002187][ T1300] Hardware name: Supermicro SYS-5018D-FN4T/X10SDV-8C-TLN4F, BIOS 1.1 03/02/2016
[ 82.013298][ T1300] Call Trace:
[ 82.018681][ T1300] <TASK>
[ 82.023655][ T1300] dump_stack_lvl (kbuild/src/x86_64/lib/dump_stack.c:107)
[ 82.030146][ T1300] print_address_description+0x21/0x140
[ 82.038715][ T1300] ? __rq_qos_issue (kbuild/src/x86_64/block/blk-rq-qos.c:53)
[ 82.045319][ T1300] kasan_report.cold (kbuild/src/x86_64/mm/kasan/report.c:434 kbuild/src/x86_64/mm/kasan/report.c:450)
[ 82.052056][ T1300] ? __rq_qos_issue (kbuild/src/x86_64/block/blk-rq-qos.c:53)
[ 82.058581][ T1300] __rq_qos_issue (kbuild/src/x86_64/block/blk-rq-qos.c:53)
[ 82.064891][ T1300] blk_mq_start_request (kbuild/src/x86_64/include/linux/blk-mq.h:742 kbuild/src/x86_64/block/blk-mq.c:1043)
[ 82.071850][ T1300] scsi_queue_rq (kbuild/src/x86_64/drivers/scsi/scsi_lib.c:1464 kbuild/src/x86_64/drivers/scsi/scsi_lib.c:1708)
[ 82.078307][ T1300] ? scsi_mq_get_budget (kbuild/src/x86_64/arch/x86/include/asm/atomic.h:29 kbuild/src/x86_64/include/linux/atomic/atomic-instrumented.h:28 kbuild/src/x86_64/drivers/scsi/scsi_lib.c:1253 kbuild/src/x86_64/drivers/scsi/scsi_lib.c:1617)
[ 82.085110][ T1300] blk_mq_dispatch_rq_list (kbuild/src/x86_64/block/blk-mq.c:1660)
[ 82.092312][ T1300] ? sbitmap_get (kbuild/src/x86_64/lib/sbitmap.c:189 kbuild/src/x86_64/lib/sbitmap.c:216 kbuild/src/x86_64/lib/sbitmap.c:241)
[ 82.098499][ T1300] ? __blk_mq_try_issue_directly (kbuild/src/x86_64/block/blk-mq.c:1612)
[ 82.106048][ T1300] ? _raw_spin_lock (kbuild/src/x86_64/arch/x86/include/asm/atomic.h:202 kbuild/src/x86_64/include/linux/atomic/atomic-instrumented.h:513 kbuild/src/x86_64/include/asm-generic/qspinlock.h:82 kbuild/src/x86_64/include/linux/spinlock.h:185 kbuild/src/x86_64/include/linux/spinlock_api_smp.h:134 kbuild/src/x86_64/kernel/locking/spinlock.c:154)
[ 82.112356][ T1300] ? _raw_write_lock_irq (kbuild/src/x86_64/kernel/locking/spinlock.c:153)
[ 82.119135][ T1300] ? blk_mq_get_tag (kbuild/src/x86_64/arch/x86/include/asm/bitops.h:207 kbuild/src/x86_64/include/asm-generic/bitops/instrumented-non-atomic.h:135 kbuild/src/x86_64/block/blk-mq-tag.c:189)
[ 82.125430][ T1300] ? sdev_prefix_printk (kbuild/src/x86_64/drivers/scsi/scsi_logging.c:56)
[ 82.132003][ T1300] __blk_mq_sched_dispatch_requests (kbuild/src/x86_64/block/blk-mq-sched.c:326)
[ 82.139630][ T1300] ? blk_mq_put_tag (kbuild/src/x86_64/block/blk-mq-tag.c:105)
[ 82.145869][ T1300] ? wake_up_klogd (kbuild/src/x86_64/kernel/printk/printk.c:3243)
[ 82.152430][ T1300] ? vprintk_emit (kbuild/src/x86_64/kernel/printk/printk.c:2251)
[ 82.158479][ T1300] ? blk_mq_sched_assign_ioc (kbuild/src/x86_64/block/blk-mq-sched.c:294)
[ 82.165483][ T1300] ? recalibrate_cpu_khz (kbuild/src/x86_64/arch/x86/include/asm/msr.h:234 kbuild/src/x86_64/arch/x86/kernel/tsc.c:1095)
[ 82.171965][ T1300] ? ktime_get (kbuild/src/x86_64/kernel/time/timekeeping.c:290 kbuild/src/x86_64/kernel/time/timekeeping.c:386 kbuild/src/x86_64/kernel/time/timekeeping.c:829 kbuild/src/x86_64/kernel/time/timekeeping.c:817)
[ 82.177650][ T1300] ? kasan_save_stack (kbuild/src/x86_64/mm/kasan/common.c:41)
[ 82.183855][ T1300] blk_mq_sched_dispatch_requests (kbuild/src/x86_64/block/blk-mq-sched.c:359)
[ 82.191188][ T1300] __blk_mq_run_hw_queue (kbuild/src/x86_64/block/blk-mq.c:998 kbuild/src/x86_64/block/blk-mq.c:1782)
[ 82.197784][ T1300] __blk_mq_delay_run_hw_queue (kbuild/src/x86_64/arch/x86/include/asm/preempt.h:85 kbuild/src/x86_64/block/blk-mq.c:1859)
[ 82.204952][ T1300] blk_mq_sched_insert_request (kbuild/src/x86_64/block/blk-mq-sched.c:479)
[ 82.212117][ T1300] ? blk_mq_sched_bio_merge (kbuild/src/x86_64/block/blk-mq-sched.c:430)
[ 82.218998][ T1300] ? dev_vprintk_emit (kbuild/src/x86_64/drivers/base/core.c:4599)
[ 82.225325][ T1300] blk_execute_rq (kbuild/src/x86_64/block/blk-exec.c:105)
[ 82.231236][ T1300] ? blk_end_sync_rq (kbuild/src/x86_64/block/blk-exec.c:97)
[ 82.237294][ T1300] __scsi_execute (kbuild/src/x86_64/drivers/scsi/scsi_lib.c:252)
[ 82.243257][ T1300] ? kfree (kbuild/src/x86_64/mm/slub.c:1749 kbuild/src/x86_64/mm/slub.c:3513 kbuild/src/x86_64/mm/slub.c:4561)
[ 82.248474][ T1300] sd_sync_cache (kbuild/src/x86_64/drivers/scsi/sd.c:1713) sd_mod
[ 82.255071][ T1300] ? sd_resume_system (kbuild/src/x86_64/drivers/scsi/sd.c:1689) sd_mod
[ 82.261917][ T1300] ? mutex_unlock (kbuild/src/x86_64/arch/x86/include/asm/atomic64_64.h:190 kbuild/src/x86_64/include/linux/atomic/atomic-long.h:449 kbuild/src/x86_64/include/linux/atomic/atomic-instrumented.h:1677 kbuild/src/x86_64/kernel/locking/mutex.c:178 kbuild/src/x86_64/kernel/locking/mutex.c:544)
[ 82.267671][ T1300] sd_shutdown (kbuild/src/x86_64/drivers/scsi/sd.c:3741) sd_mod
[ 82.274031][ T1300] sd_remove (kbuild/src/x86_64/drivers/scsi/sd.c:3641) sd_mod
[ 82.280083][ T1300] device_release_driver_internal (kbuild/src/x86_64/drivers/base/dd.c:1205 kbuild/src/x86_64/drivers/base/dd.c:1236)
[ 82.287257][ T1300] bus_remove_device (kbuild/src/x86_64/drivers/base/bus.c:530)
[ 82.293307][ T1300] device_del (kbuild/src/x86_64/drivers/base/core.c:3582)
[ 82.298732][ T1300] ? __device_link_del (kbuild/src/x86_64/drivers/base/core.c:3537)
[ 82.304902][ T1300] ? kobject_put (kbuild/src/x86_64/arch/x86/include/asm/atomic.h:190 kbuild/src/x86_64/include/linux/atomic/atomic-instrumented.h:168 kbuild/src/x86_64/include/linux/refcount.h:272 kbuild/src/x86_64/include/linux/refcount.h:315 kbuild/src/x86_64/include/linux/refcount.h:333 kbuild/src/x86_64/include/linux/kref.h:64 kbuild/src/x86_64/lib/kobject.c:753)
[ 82.310345][ T1300] __scsi_remove_device (kbuild/src/x86_64/drivers/scsi/scsi_sysfs.c:1437)
[ 82.316569][ T1300] scsi_forget_host (kbuild/src/x86_64/drivers/scsi/scsi_scan.c:1916)
[ 82.322315][ T1300] scsi_remove_host (kbuild/src/x86_64/drivers/scsi/hosts.c:181)
[ 82.328078][ T1300] sdebug_driver_remove (kbuild/src/x86_64/drivers/scsi/scsi_debug.c:7712) scsi_debug
[ 82.335322][ T1300] ? up_write (kbuild/src/x86_64/arch/x86/include/asm/atomic64_64.h:172 kbuild/src/x86_64/include/linux/atomic/atomic-long.h:95 kbuild/src/x86_64/include/linux/atomic/atomic-instrumented.h:1261 kbuild/src/x86_64/kernel/locking/rwsem.c:1331 kbuild/src/x86_64/kernel/locking/rwsem.c:1580)
[ 82.340479][ T1300] device_release_driver_internal (kbuild/src/x86_64/drivers/base/dd.c:1207 kbuild/src/x86_64/drivers/base/dd.c:1236)
[ 82.347543][ T1300] bus_remove_device (kbuild/src/x86_64/drivers/base/bus.c:530)
[ 82.353478][ T1300] device_del (kbuild/src/x86_64/drivers/base/core.c:3582)
[ 82.358800][ T1300] ? __device_link_del (kbuild/src/x86_64/drivers/base/core.c:3537)
[ 82.364910][ T1300] ? _raw_spin_lock (kbuild/src/x86_64/arch/x86/include/asm/atomic.h:202 kbuild/src/x86_64/include/linux/atomic/atomic-instrumented.h:513 kbuild/src/x86_64/include/asm-generic/qspinlock.h:82 kbuild/src/x86_64/include/linux/spinlock.h:185 kbuild/src/x86_64/include/linux/spinlock_api_smp.h:134 kbuild/src/x86_64/kernel/locking/spinlock.c:154)
[ 82.370664][ T1300] ? _raw_write_lock_irq (kbuild/src/x86_64/kernel/locking/spinlock.c:153)
[ 82.377000][ T1300] device_unregister (kbuild/src/x86_64/drivers/base/core.c:3500 kbuild/src/x86_64/drivers/base/core.c:3615)
[ 82.382753][ T1300] sdebug_do_remove_host (kbuild/src/x86_64/drivers/scsi/scsi_debug.c:7175) scsi_debug
[ 82.390216][ T1300] scsi_debug_exit (kbuild/src/x86_64/drivers/scsi/scsi_debug.c:7726) scsi_debug
[ 82.397083][ T1300] __x64_sys_delete_module (kbuild/src/x86_64/kernel/module.c:970 kbuild/src/x86_64/kernel/module.c:912 kbuild/src/x86_64/kernel/module.c:912)
[ 82.403571][ T1300] ? __ia32_sys_delete_module (kbuild/src/x86_64/kernel/module.c:912)
[ 82.410314][ T1300] ? task_work_run (kbuild/src/x86_64/kernel/task_work.c:167 (discriminator 1))
[ 82.416017][ T1300] ? exit_to_user_mode_prepare (kbuild/src/x86_64/include/linux/sched.h:2207 kbuild/src/x86_64/include/linux/tracehook.h:201 kbuild/src/x86_64/kernel/entry/common.c:175 kbuild/src/x86_64/kernel/entry/common.c:207)
[ 82.422837][ T1300] do_syscall_64 (kbuild/src/x86_64/arch/x86/entry/common.c:50 kbuild/src/x86_64/arch/x86/entry/common.c:80)
[ 82.428262][ T1300] entry_SYSCALL_64_after_hwframe (kbuild/src/x86_64/arch/x86/entry/entry_64.S:113)
[ 82.435195][ T1300] RIP: 0033:0x7fb6b8d62dd7
[ 82.440621][ T1300] Code: 73 01 c3 48 8b 0d b9 10 0c 00 f7 d8 64 89 01 48 83 c8 ff c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 b8 b0 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 89 10 0c 00 f7 d8 64 89 01 48
All code
========
0: 73 01 jae 0x3
2: c3 retq
3: 48 8b 0d b9 10 0c 00 mov 0xc10b9(%rip),%rcx # 0xc10c3
a: f7 d8 neg %eax
c: 64 89 01 mov %eax,%fs:(%rcx)
f: 48 83 c8 ff or $0xffffffffffffffff,%rax
13: c3 retq
14: 66 2e 0f 1f 84 00 00 nopw %cs:0x0(%rax,%rax,1)
1b: 00 00 00
1e: 0f 1f 44 00 00 nopl 0x0(%rax,%rax,1)
23: b8 b0 00 00 00 mov $0xb0,%eax
28: 0f 05 syscall
2a:* 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax <-- trapping instruction
30: 73 01 jae 0x33
32: c3 retq
33: 48 8b 0d 89 10 0c 00 mov 0xc1089(%rip),%rcx # 0xc10c3
3a: f7 d8 neg %eax
3c: 64 89 01 mov %eax,%fs:(%rcx)
3f: 48 rex.W
Code starting with the faulting instruction
===========================================
0: 48 3d 01 f0 ff ff cmp $0xfffffffffffff001,%rax
6: 73 01 jae 0x9
8: c3 retq
9: 48 8b 0d 89 10 0c 00 mov 0xc1089(%rip),%rcx # 0xc1099
10: f7 d8 neg %eax
12: 64 89 01 mov %eax,%fs:(%rcx)
15: 48 rex.W
[ 82.462607][ T1300] RSP: 002b:00007fffff3de988 EFLAGS: 00000206 ORIG_RAX: 00000000000000b0
[ 82.472158][ T1300] RAX: ffffffffffffffda RBX: 0000563ddccfadf0 RCX: 00007fb6b8d62dd7
[ 82.481309][ T1300] RDX: 0000000000000000 RSI: 0000000000000800 RDI: 0000563ddccfae58
[ 82.490478][ T1300] RBP: 0000563ddccfae58 R08: 00007fffff3dd931 R09: 0000000000000000
[ 82.499649][ T1300] R10: 00007fb6b8dd4ae0 R11: 0000000000000206 R12: 0000000000000000
[ 82.508809][ T1300] R13: 0000000000000000 R14: 0000563ddccfae58 R15: 0000563ddccfaf70
[ 82.517953][ T1300] </TASK>
[ 82.522147][ T1300]
[ 82.525648][ T1300] Allocated by task 7:
[ 82.530883][ T1300] kasan_save_stack (kbuild/src/x86_64/mm/kasan/common.c:38)
[ 82.536721][ T1300] __kasan_kmalloc (kbuild/src/x86_64/mm/kasan/common.c:46 kbuild/src/x86_64/mm/kasan/common.c:434 kbuild/src/x86_64/mm/kasan/common.c:513 kbuild/src/x86_64/mm/kasan/common.c:522)
[ 82.542453][ T1300] wbt_init (kbuild/src/x86_64/include/linux/slab.h:590 kbuild/src/x86_64/include/linux/slab.h:724 kbuild/src/x86_64/block/blk-wbt.c:824)
[ 82.547686][ T1300] blk_register_queue (kbuild/src/x86_64/block/blk-sysfs.c:890)
[ 82.553875][ T1300] device_add_disk (kbuild/src/x86_64/block/genhd.c:489)
[ 82.559787][ T1300] sd_probe (kbuild/src/x86_64/drivers/scsi/sd.c:3582) sd_mod
[ 82.565870][ T1300] really_probe (kbuild/src/x86_64/drivers/base/dd.c:748)
[ 82.572127][ T1300] __driver_probe_device (kbuild/src/x86_64/drivers/base/dd.c:751)
[ 82.578574][ T1300] driver_probe_device (kbuild/src/x86_64/drivers/base/dd.c:781)
[ 82.584763][ T1300] __device_attach_driver (kbuild/src/x86_64/drivers/base/dd.c:899)
[ 82.591280][ T1300] bus_for_each_drv (kbuild/src/x86_64/drivers/base/bus.c:385 kbuild/src/x86_64/drivers/base/bus.c:426)
[ 82.597295][ T1300] __device_attach_async_helper (kbuild/src/x86_64/arch/x86/include/asm/jump_label.h:27 kbuild/src/x86_64/drivers/base/dd.c:928)
[ 82.604358][ T1300] async_run_entry_fn (kbuild/src/x86_64/arch/x86/include/asm/jump_label.h:27 kbuild/src/x86_64/kernel/async.c:129)
[ 82.610467][ T1300] process_one_work (kbuild/src/x86_64/arch/x86/include/asm/jump_label.h:27 kbuild/src/x86_64/include/linux/jump_label.h:212 kbuild/src/x86_64/include/trace/events/workqueue.h:108 kbuild/src/x86_64/kernel/workqueue.c:2303)
[ 82.616579][ T1300] worker_thread (kbuild/src/x86_64/include/linux/list.h:284 kbuild/src/x86_64/kernel/workqueue.c:2446)
[ 82.622246][ T1300] kthread (kbuild/src/x86_64/kernel/kthread.c:327)
[ 82.627480][ T1300] ret_from_fork (kbuild/src/x86_64/arch/x86/entry/entry_64.S:301)
[ 82.633053][ T1300]
[ 82.636510][ T1300] Freed by task 1300:
[ 82.641634][ T1300] kasan_save_stack (kbuild/src/x86_64/mm/kasan/common.c:38)
[ 82.647448][ T1300] kasan_set_track (kbuild/src/x86_64/mm/kasan/common.c:46)
[ 82.653168][ T1300] kasan_set_free_info (kbuild/src/x86_64/mm/kasan/generic.c:372)
[ 82.659218][ T1300] __kasan_slab_free (kbuild/src/x86_64/mm/kasan/common.c:368 kbuild/src/x86_64/mm/kasan/common.c:328 kbuild/src/x86_64/mm/kasan/common.c:374)
[ 82.665180][ T1300] kfree (kbuild/src/x86_64/mm/slub.c:1749 kbuild/src/x86_64/mm/slub.c:3513 kbuild/src/x86_64/mm/slub.c:4561)
[ 82.670093][ T1300] rq_qos_exit (kbuild/src/x86_64/block/blk-rq-qos.c:299)
[ 82.675503][ T1300] del_gendisk (kbuild/src/x86_64/block/genhd.c:626)
[ 82.681023][ T1300] sd_remove (kbuild/src/x86_64/drivers/scsi/sd.c:3637) sd_mod
[ 82.687047][ T1300] device_release_driver_internal (kbuild/src/x86_64/drivers/base/dd.c:1205 kbuild/src/x86_64/drivers/base/dd.c:1236)
[ 82.694196][ T1300] bus_remove_device (kbuild/src/x86_64/drivers/base/bus.c:530)
[ 82.700238][ T1300] device_del (kbuild/src/x86_64/drivers/base/core.c:3582)
[ 82.705661][ T1300] __scsi_remove_device (kbuild/src/x86_64/drivers/scsi/scsi_sysfs.c:1437)
[ 82.711945][ T1300] scsi_forget_host (kbuild/src/x86_64/drivers/scsi/scsi_scan.c:1916)
[ 82.717769][ T1300] scsi_remove_host (kbuild/src/x86_64/drivers/scsi/hosts.c:181)
[ 82.723584][ T1300] sdebug_driver_remove (kbuild/src/x86_64/drivers/scsi/scsi_debug.c:7712) scsi_debug
[ 82.730907][ T1300] device_release_driver_internal (kbuild/src/x86_64/drivers/base/dd.c:1207 kbuild/src/x86_64/drivers/base/dd.c:1236)
[ 82.738031][ T1300] bus_remove_device (kbuild/src/x86_64/drivers/base/bus.c:530)
[ 82.744029][ T1300] device_del (kbuild/src/x86_64/drivers/base/core.c:3582)
[ 82.749376][ T1300] device_unregister (kbuild/src/x86_64/drivers/base/core.c:3500 kbuild/src/x86_64/drivers/base/core.c:3615)
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
8 months, 4 weeks
[tracing/selftests] 4e9f63c9e5: BUG:KASAN:use-after-free_in_destroy_hist_field
by kernel test robot
Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: 4e9f63c9e5c2597692567ee1cb0851a21104a531 ("tracing/selftests: Add tests for hist trigger expression parsing")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
in testcase: kernel-selftests
version: kernel-selftests-x86_64-c8c9111a-1_20210929
with following parameters:
group: ftrace
ucode: 0xe2
test-description: The kernel contains a set of "self tests" under the tools/testing/selftests/ directory. These are intended to be small unit tests to exercise individual code paths in the kernel.
test-url: https://www.kernel.org/doc/Documentation/kselftest.txt
on test machine: 4 threads Intel(R) Core(TM) i5-6500 CPU @ 3.20GHz with 32G memory
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 466.629917][ T8233] BUG: KASAN: use-after-free in destroy_hist_field+0x138/0x140
[ 466.637943][ T8233] Read of size 8 at addr ffff88880dd64e08 by task ftracetest/8233
[ 466.645618][ T8233]
[ 466.647813][ T8233] CPU: 0 PID: 8233 Comm: ftracetest Not tainted 5.15.0-rc3-00114-g4e9f63c9e5c2 #1
[ 466.656879][ T8233] Hardware name: Dell Inc. OptiPlex 7040/0Y7WYT, BIOS 1.8.1 12/05/2017
[ 466.664985][ T8233] Call Trace:
[ 466.668135][ T8233] dump_stack_lvl (lib/dump_stack.c:107)
[ 466.672518][ T8233] print_address_description+0x21/0x140
[ 466.678977][ T8233] ? destroy_hist_field+0x138/0x140
[ 466.684647][ T8233] ? destroy_hist_field+0x138/0x140
[ 466.690320][ T8233] kasan_report.cold (mm/kasan/report.c:443 mm/kasan/report.c:459)
[ 466.695041][ T8233] ? destroy_hist_field+0x138/0x140
[ 466.700719][ T8233] destroy_hist_field+0x138/0x140
[ 466.706219][ T8233] parse_expr (kernel/trace/trace_events_hist.c:2722)
[ 466.710514][ T8233] ? hist_field_execname (kernel/trace/trace_events_hist.c:301)
[ 466.715500][ T8233] ? parse_atom (kernel/trace/trace_events_hist.c:2551)
[ 466.720042][ T8233] ? rcu_read_lock_sched_held (include/linux/lockdep.h:283 kernel/rcu/update.c:125)
[ 466.725544][ T8233] ? rcu_read_lock_bh_held (kernel/rcu/update.c:120)
[ 466.730698][ T8233] ? kasan_unpoison (mm/kasan/shadow.c:108 mm/kasan/shadow.c:142)
[ 466.735253][ T8233] __create_val_field (kernel/trace/trace_events_hist.c:4091)
[ 466.740063][ T8233] ? parse_expr (kernel/trace/trace_events_hist.c:4086)
[ 466.744615][ T8233] ? find_var (kernel/trace/trace_events_hist.c:1075 (discriminator 9))
[ 466.748731][ T8233] create_hist_fields (kernel/trace/trace_events_hist.c:4175 kernel/trace/trace_events_hist.c:4326 kernel/trace/trace_events_hist.c:4412)
[ 466.753722][ T8233] ? __create_val_field (kernel/trace/trace_events_hist.c:4401)
[ 466.758787][ T8233] ? track_data_parse (kernel/trace/trace_events_hist.c:4546)
[ 466.763676][ T8233] ? kasan_unpoison (mm/kasan/shadow.c:108 mm/kasan/shadow.c:142)
[ 466.768221][ T8233] ? __kasan_slab_alloc (mm/kasan/common.c:429 mm/kasan/common.c:467)
[ 466.773125][ T8233] event_hist_trigger_func (kernel/trace/trace_events_hist.c:4859 kernel/trace/trace_events_hist.c:6199)
[ 466.778547][ T8233] ? mutex_lock_io_nested (kernel/locking/mutex.c:728)
[ 466.783958][ T8233] ? preempt_count_sub (kernel/sched/core.c:5418 kernel/sched/core.c:5415 kernel/sched/core.c:5437)
[ 466.788937][ T8233] ? __mutex_lock (arch/x86/include/asm/preempt.h:103 kernel/locking/mutex.c:711 kernel/locking/mutex.c:729)
[ 466.793567][ T8233] ? rcu_read_lock_bh_held (kernel/rcu/update.c:120)
[ 466.798722][ T8233] ? parse_actions (kernel/trace/trace_events_hist.c:6125)
[ 466.803359][ T8233] ? mutex_lock_io_nested (kernel/locking/mutex.c:728)
[ 466.808785][ T8233] trigger_process_regex (kernel/trace/trace_events_trigger.c:248)
[ 466.813950][ T8233] ? event_trigger_callback (kernel/trace/trace_events_trigger.c:231)
[ 466.819403][ T8233] event_trigger_write (kernel/trace/trace_events_trigger.c:286 kernel/trace/trace_events_trigger.c:314)
[ 466.824306][ T8233] vfs_write (fs/read_write.c:592)
[ 466.828460][ T8233] ksys_write (fs/read_write.c:647)
[ 466.832578][ T8233] ? __ia32_sys_read (fs/read_write.c:637)
[ 466.837219][ T8233] ? syscall_enter_from_user_mode (arch/x86/include/asm/irqflags.h:45 arch/x86/include/asm/irqflags.h:80 kernel/entry/common.c:107)
[ 466.842987][ T8233] do_syscall_64 (arch/x86/entry/common.c:50 arch/x86/entry/common.c:80)
[ 466.847268][ T8233] ? asm_exc_page_fault (arch/x86/include/asm/idtentry.h:568)
[ 466.852075][ T8233] ? rcu_read_lock_sched_held (include/linux/lockdep.h:283 kernel/rcu/update.c:125)
[ 466.857577][ T8233] ? rcu_read_lock_bh_held (kernel/rcu/update.c:120)
[ 466.862737][ T8233] ? asm_exc_page_fault (arch/x86/include/asm/idtentry.h:568)
[ 466.867628][ T8233] ? asm_exc_page_fault (arch/x86/include/asm/idtentry.h:568)
[ 466.872445][ T8233] ? lockdep_hardirqs_on (kernel/locking/lockdep.c:4344)
[ 466.877524][ T8233] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:113)
[ 466.883283][ T8233] RIP: 0033:0x7f85961ef504
[ 466.887566][ T8233] Code: 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b3 0f 1f 80 00 00 00 00 48 8d 05 f9 61 0d 00 8b 00 85 c0 75 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 41 54 49 89 d4 55 48 89 f5 53
All code
========
0: 00 f7 add %dh,%bh
2: d8 64 89 02 fsubs 0x2(%rcx,%rcx,4)
6: 48 c7 c0 ff ff ff ff mov $0xffffffffffffffff,%rax
d: eb b3 jmp 0xffffffffffffffc2
f: 0f 1f 80 00 00 00 00 nopl 0x0(%rax)
16: 48 8d 05 f9 61 0d 00 lea 0xd61f9(%rip),%rax # 0xd6216
1d: 8b 00 mov (%rax),%eax
1f: 85 c0 test %eax,%eax
21: 75 13 jne 0x36
23: b8 01 00 00 00 mov $0x1,%eax
28: 0f 05 syscall
2a:* 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax <-- trapping instruction
30: 77 54 ja 0x86
32: c3 retq
33: 0f 1f 00 nopl (%rax)
36: 41 54 push %r12
38: 49 89 d4 mov %rdx,%r12
3b: 55 push %rbp
3c: 48 89 f5 mov %rsi,%rbp
3f: 53 push %rbx
Code starting with the faulting instruction
===========================================
0: 48 3d 00 f0 ff ff cmp $0xfffffffffffff000,%rax
6: 77 54 ja 0x5c
8: c3 retq
9: 0f 1f 00 nopl (%rax)
c: 41 54 push %r12
e: 49 89 d4 mov %rdx,%r12
11: 55 push %rbp
12: 48 89 f5 mov %rsi,%rbp
15: 53 push %rbx
[ 466.907066][ T8233] RSP: 002b:00007ffec34c7b08 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[ 466.915347][ T8233] RAX: ffffffffffffffda RBX: 000000000000001a RCX: 00007f85961ef504
[ 466.923194][ T8233] RDX: 000000000000001a RSI: 000056007e94df80 RDI: 0000000000000001
[ 466.931039][ T8233] RBP: 000056007e94df80 R08: 00007f85962c28c0 R09: 00007f8596102740
[ 466.938884][ T8233] R10: 0000000000000000 R11: 0000000000000246 R12: 00007f85962c1760
[ 466.946733][ T8233] R13: 000000000000001a R14: 00007f85962bc760 R15: 000000000000001a
[ 466.954603][ T8233]
[ 466.956793][ T8233] Allocated by task 8233:
[ 466.960984][ T8233] kasan_save_stack (mm/kasan/common.c:38)
[ 466.965536][ T8233] __kasan_kmalloc (mm/kasan/common.c:46 mm/kasan/common.c:434 mm/kasan/common.c:513 mm/kasan/common.c:522)
[ 466.969988][ T8233] create_hist_field (include/linux/slab.h:591 include/linux/slab.h:721 kernel/trace/trace_events_hist.c:1880)
[ 466.974700][ T8233] parse_atom (kernel/trace/trace_events_hist.c:2331 kernel/trace/trace_events_hist.c:2351)
[ 466.978978][ T8233] parse_expr (kernel/trace/trace_events_hist.c:2568)
[ 466.983256][ T8233] parse_expr (kernel/trace/trace_events_hist.c:2590)
[ 466.987550][ T8233] __create_val_field (kernel/trace/trace_events_hist.c:4091)
[ 466.992351][ T8233] create_hist_fields (kernel/trace/trace_events_hist.c:4175 kernel/trace/trace_events_hist.c:4326 kernel/trace/trace_events_hist.c:4412)
[ 466.997326][ T8233] event_hist_trigger_func (kernel/trace/trace_events_hist.c:4859 kernel/trace/trace_events_hist.c:6199)
[ 467.002734][ T8233] trigger_process_regex (kernel/trace/trace_events_trigger.c:248)
[ 467.007883][ T8233] event_trigger_write (kernel/trace/trace_events_trigger.c:286 kernel/trace/trace_events_trigger.c:314)
[ 467.012770][ T8233] vfs_write (fs/read_write.c:592)
[ 467.016874][ T8233] ksys_write (fs/read_write.c:647)
[ 467.020980][ T8233] do_syscall_64 (arch/x86/entry/common.c:50 arch/x86/entry/common.c:80)
[ 467.025260][ T8233] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:113)
[ 467.031017][ T8233]
[ 467.033208][ T8233] Freed by task 8233:
[ 467.037051][ T8233] kasan_save_stack (mm/kasan/common.c:38)
[ 467.041591][ T8233] kasan_set_track (mm/kasan/common.c:46)
[ 467.046044][ T8233] kasan_set_free_info (mm/kasan/generic.c:362)
[ 467.050850][ T8233] __kasan_slab_free (mm/kasan/common.c:368 mm/kasan/common.c:328 mm/kasan/common.c:374)
[ 467.055571][ T8233] kfree (mm/slub.c:1725 mm/slub.c:3483 mm/slub.c:4543)
[ 467.059246][ T8233] parse_expr (kernel/trace/trace_events_hist.c:1861 kernel/trace/trace_events_hist.c:2718)
[ 467.063531][ T8233] __create_val_field (kernel/trace/trace_events_hist.c:4091)
[ 467.068336][ T8233] create_hist_fields (kernel/trace/trace_events_hist.c:4175 kernel/trace/trace_events_hist.c:4326 kernel/trace/trace_events_hist.c:4412)
[ 467.073320][ T8233] event_hist_trigger_func (kernel/trace/trace_events_hist.c:4859 kernel/trace/trace_events_hist.c:6199)
[ 467.078733][ T8233] trigger_process_regex (kernel/trace/trace_events_trigger.c:248)
[ 467.083881][ T8233] event_trigger_write (kernel/trace/trace_events_trigger.c:286 kernel/trace/trace_events_trigger.c:314)
[ 467.088769][ T8233] vfs_write (fs/read_write.c:592)
[ 467.092875][ T8233] ksys_write (fs/read_write.c:647)
[ 467.096979][ T8233] do_syscall_64 (arch/x86/entry/common.c:50 arch/x86/entry/common.c:80)
[ 467.101257][ T8233] entry_SYSCALL_64_after_hwframe (arch/x86/entry/entry_64.S:113)
[ 467.107014][ T8233]
[ 467.109203][ T8233] The buggy address belongs to the object at ffff88880dd64e00
[ 467.109203][ T8233] which belongs to the cache kmalloc-192 of size 192
[ 467.123134][ T8233] The buggy address is located 8 bytes inside of
[ 467.123134][ T8233] 192-byte region [ffff88880dd64e00, ffff88880dd64ec0)
[ 467.136110][ T8233] The buggy address belongs to the page:
[ 467.141605][ T8233] page:00000000bbfb9b6e refcount:1 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x80dd64
[ 467.151713][ T8233] flags: 0x17ffffc0000200(slab|node=0|zone=2|lastcpupid=0x1fffff)
[ 467.159406][ T8233] raw: 0017ffffc0000200 dead000000000100 dead000000000122 ffff888100042a00
[ 467.167862][ T8233] raw: 0000000000000000 0000000000100010 00000001ffffffff 0000000000000000
[ 467.176314][ T8233] page dumped because: kasan: bad access detected
[ 467.182590][ T8233]
[ 467.184779][ T8233] Memory state around the buggy address:
[ 467.190276][ T8233] ffff88880dd64d00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 467.198209][ T8233] ffff88880dd64d80: 00 00 00 00 00 00 00 00 fc fc fc fc fc fc fc fc
[ 467.206141][ T8233] >ffff88880dd64e00: fa fb fb fb fb fb fb fb fb fb fb fb fb fb fb fb
[ 467.214072][ T8233] ^
[ 467.218263][ T8233] ffff88880dd64e80: fb fb fb fb fb fb fb fb fc fc fc fc fc fc fc fc
[ 467.226195][ T8233] ffff88880dd64f00: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
[ 467.234129][ T8233] ==================================================================
[ 467.242061][ T8233] Disabling lock debugging due to kernel taint
[ 467.270441][ T385] # [81] event trigger - test histogram expression parsing [PASS]
[ 467.270453][ T385]
[ 468.662599][ T385] # [82] event trigger - test histogram modifiers [PASS]
[ 468.662612][ T385]
[ 469.302698][ T385] # [83] event trigger - test histogram parser errors [PASS]
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
8 months, 4 weeks
[xfs] daa7a8913d: stress-ng.inode-flags.ops_per_sec 32.6% improvement
by kernel test robot
Greeting,
FYI, we noticed a 32.6% improvement of stress-ng.inode-flags.ops_per_sec due to commit:
commit: daa7a8913d631b64c32e1e99101ad8c2568240a6 ("xfs: Add order IDs to log items in CIL")
https://git.kernel.org/cgit/linux/kernel/git/dgc/linux-xfs.git xfs-cil-scale-3
in testcase: stress-ng
on test machine: 96 threads 2 sockets Intel(R) Xeon(R) Gold 6252 CPU @ 2.10GHz with 512G memory
with following parameters:
nr_threads: 10%
disk: 1HDD
testtime: 60s
fs: xfs
class: filesystem
test: inode-flags
cpufreq_governor: performance
ucode: 0x5003006
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
=========================================================================================
class/compiler/cpufreq_governor/disk/fs/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode:
filesystem/gcc-9/performance/1HDD/xfs/x86_64-rhel-8.3/10%/debian-10.4-x86_64-20200603.cgz/lkp-csl-2sp7/inode-flags/stress-ng/60s/0x5003006
commit:
9600e278a3 ("xfs: convert CIL busy extents to per-cpu")
daa7a8913d ("xfs: Add order IDs to log items in CIL")
9600e278a342b7a5 daa7a8913d631b64c32e1e99101
---------------- ---------------------------
%stddev %change %stddev
\ | \
2992417 +32.6% 3968046 ± 3% stress-ng.inode-flags.ops
49873 +32.6% 66133 ± 3% stress-ng.inode-flags.ops_per_sec
3919028 -8.2% 3596273 ± 4% stress-ng.time.voluntary_context_switches
2657 ± 80% +120.8% 5865 ± 52% numa-vmstat.node1.nr_mapped
0.09 ± 2% -0.0 0.08 mpstat.cpu.all.soft%
0.22 +0.0 0.25 mpstat.cpu.all.usr%
186702 ± 8% -20.4% 148670 ± 14% numa-meminfo.node0.Slab
10621 ± 80% +120.8% 23448 ± 52% numa-meminfo.node1.Mapped
4027388 ± 35% -55.5% 1792134 ± 8% turbostat.C1
1.91 ±175% -1.7 0.20 ± 14% turbostat.C1%
122151 -7.9% 112492 ± 4% vmstat.system.cs
207020 +2.0% 211252 vmstat.system.in
470.33 ± 4% -73.6% 124.00 slabinfo.numa_policy.active_objs
470.33 ± 4% -73.6% 124.00 slabinfo.numa_policy.num_objs
129.00 -100.0% 0.00 slabinfo.xfs_icr.active_objs
129.00 -100.0% 0.00 slabinfo.xfs_icr.num_objs
16964 ± 7% -23.4% 12999 ± 14% softirqs.CPU10.SCHED
18156 ± 11% -24.2% 13764 ± 7% softirqs.CPU18.SCHED
16387 ± 11% -20.6% 13012 ± 5% softirqs.CPU32.SCHED
19093 ± 12% -28.2% 13708 ± 8% softirqs.CPU42.SCHED
18358 ± 12% -24.7% 13820 ± 9% softirqs.CPU45.SCHED
21238 ± 6% -20.3% 16925 ± 6% softirqs.CPU49.SCHED
21216 ± 5% -15.7% 17882 ± 10% softirqs.CPU52.SCHED
19236 ± 11% -15.6% 16241 ± 11% softirqs.CPU65.SCHED
16940 ± 12% -17.8% 13925 ± 14% softirqs.CPU7.SCHED
20677 ± 8% -15.5% 17480 ± 11% softirqs.CPU75.SCHED
17689 ± 13% -18.6% 14390 ± 8% softirqs.CPU9.SCHED
7.81 ± 5% +10.2% 8.61 ± 6% perf-stat.i.MPKI
0.49 +0.1 0.57 perf-stat.i.branch-miss-rate%
26831667 +21.5% 32593779 perf-stat.i.branch-misses
2.66e+08 ± 5% +10.7% 2.945e+08 ± 6% perf-stat.i.cache-references
126238 -8.1% 116014 ± 4% perf-stat.i.context-switches
167.26 ± 2% -3.8% 160.89 perf-stat.i.cpu-migrations
8.365e+09 +1.8% 8.513e+09 perf-stat.i.dTLB-loads
1.397e+09 +25.3% 1.75e+09 ± 2% perf-stat.i.dTLB-stores
63.65 +9.8 73.42 ± 2% perf-stat.i.iTLB-load-miss-rate%
11887350 ± 4% +34.7% 16013705 ± 6% perf-stat.i.iTLB-load-misses
6547834 ± 2% -16.6% 5463762 ± 3% perf-stat.i.iTLB-loads
3059 ± 4% -23.3% 2347 ± 6% perf-stat.i.instructions-per-iTLB-miss
178.93 +2.8% 183.88 perf-stat.i.metric.M/sec
90.13 -2.1 88.01 perf-stat.i.node-load-miss-rate%
11416381 ± 11% +21.7% 13891749 ± 6% perf-stat.i.node-store-misses
7.94 ± 5% +10.2% 8.74 ± 6% perf-stat.overall.MPKI
0.37 ± 2% +0.1 0.46 perf-stat.overall.branch-miss-rate%
64.46 +10.0 74.49 ± 2% perf-stat.overall.iTLB-load-miss-rate%
2827 ± 5% -25.2% 2113 ± 7% perf-stat.overall.instructions-per-iTLB-miss
90.92 -2.2 88.75 perf-stat.overall.node-load-miss-rate%
26353607 +21.6% 32047573 perf-stat.ps.branch-misses
2.619e+08 ± 5% +10.6% 2.898e+08 ± 6% perf-stat.ps.cache-references
124353 -8.2% 114142 ± 4% perf-stat.ps.context-switches
165.52 ± 2% -4.2% 158.53 perf-stat.ps.cpu-migrations
8.235e+09 +1.7% 8.374e+09 perf-stat.ps.dTLB-loads
1.375e+09 +25.2% 1.721e+09 ± 2% perf-stat.ps.dTLB-stores
11702757 ± 4% +34.6% 15754716 ± 6% perf-stat.ps.iTLB-load-misses
6446287 ± 2% -16.6% 5374537 ± 3% perf-stat.ps.iTLB-loads
11243473 ± 11% +21.5% 13665329 ± 6% perf-stat.ps.node-store-misses
11.90 ± 6% -10.6 1.26 ± 45% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.xlog_cil_insert_items.xlog_cil_commit.__xfs_trans_commit
12.24 ± 6% -10.5 1.79 ± 33% perf-profile.calltrace.cycles-pp._raw_spin_lock.xlog_cil_insert_items.xlog_cil_commit.__xfs_trans_commit.xfs_fileattr_set
13.84 ± 6% -10.3 3.57 ± 16% perf-profile.calltrace.cycles-pp.xlog_cil_insert_items.xlog_cil_commit.__xfs_trans_commit.xfs_fileattr_set.vfs_fileattr_set
17.23 ± 7% -5.6 11.66 ± 9% perf-profile.calltrace.cycles-pp.xlog_cil_commit.__xfs_trans_commit.xfs_fileattr_set.vfs_fileattr_set.do_vfs_ioctl
17.32 ± 7% -5.5 11.77 ± 9% perf-profile.calltrace.cycles-pp.__xfs_trans_commit.xfs_fileattr_set.vfs_fileattr_set.do_vfs_ioctl.__x64_sys_ioctl
0.17 ±141% +0.5 0.63 ± 12% perf-profile.calltrace.cycles-pp.down_read.xlog_cil_commit.__xfs_trans_commit.xfs_fileattr_set.vfs_fileattr_set
0.00 +1.0 0.98 ± 21% perf-profile.calltrace.cycles-pp.xlog_space_left.xlog_grant_push_threshold.xlog_grant_push_ail.xfs_log_reserve.xfs_trans_reserve
0.00 +1.0 1.00 ± 21% perf-profile.calltrace.cycles-pp.xlog_grant_push_ail.xfs_log_reserve.xfs_trans_reserve.xfs_trans_alloc.xfs_trans_alloc_ichange
0.00 +1.0 1.00 ± 21% perf-profile.calltrace.cycles-pp.xlog_grant_push_threshold.xlog_grant_push_ail.xfs_log_reserve.xfs_trans_reserve.xfs_trans_alloc
0.00 +1.2 1.20 ± 21% perf-profile.calltrace.cycles-pp.xfs_log_space_wake.xfs_log_ticket_ungrant.xlog_cil_commit.__xfs_trans_commit.xfs_fileattr_set
0.56 ± 45% +2.7 3.24 ± 19% perf-profile.calltrace.cycles-pp.xlog_grant_add_space.xfs_log_reserve.xfs_trans_reserve.xfs_trans_alloc.xfs_trans_alloc_ichange
1.27 ± 16% +3.8 5.03 ± 18% perf-profile.calltrace.cycles-pp.xfs_trans_reserve.xfs_trans_alloc.xfs_trans_alloc_ichange.xfs_fileattr_set.vfs_fileattr_set
1.26 ± 16% +3.8 5.02 ± 18% perf-profile.calltrace.cycles-pp.xfs_log_reserve.xfs_trans_reserve.xfs_trans_alloc.xfs_trans_alloc_ichange.xfs_fileattr_set
1.84 ± 14% +3.8 5.64 ± 17% perf-profile.calltrace.cycles-pp.xfs_trans_alloc_ichange.xfs_fileattr_set.vfs_fileattr_set.do_vfs_ioctl.__x64_sys_ioctl
1.46 ± 15% +3.8 5.28 ± 18% perf-profile.calltrace.cycles-pp.xfs_trans_alloc.xfs_trans_alloc_ichange.xfs_fileattr_set.vfs_fileattr_set.do_vfs_ioctl
1.35 ± 15% +4.1 5.48 ± 18% perf-profile.calltrace.cycles-pp.xfs_log_ticket_ungrant.xlog_cil_commit.__xfs_trans_commit.xfs_fileattr_set.vfs_fileattr_set
11.91 ± 6% -10.6 1.26 ± 45% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
12.57 ± 6% -10.4 2.19 ± 26% perf-profile.children.cycles-pp._raw_spin_lock
13.84 ± 6% -10.3 3.58 ± 16% perf-profile.children.cycles-pp.xlog_cil_insert_items
17.24 ± 7% -5.6 11.67 ± 9% perf-profile.children.cycles-pp.xlog_cil_commit
17.32 ± 7% -5.5 11.77 ± 9% perf-profile.children.cycles-pp.__xfs_trans_commit
0.07 ± 7% -0.0 0.04 ± 71% perf-profile.children.cycles-pp._raw_spin_trylock
0.10 ± 14% +0.1 0.16 ± 29% perf-profile.children.cycles-pp.xlog_calc_unit_res
0.05 ± 45% +0.1 0.11 ± 12% perf-profile.children.cycles-pp.poll_idle
0.28 ± 9% +0.1 0.42 ± 13% perf-profile.children.cycles-pp.apparmor_capable
0.29 ± 9% +0.1 0.44 ± 12% perf-profile.children.cycles-pp.security_capable
0.00 +0.1 0.15 ± 23% perf-profile.children.cycles-pp.xlog_verify_grant_tail
0.34 ± 12% +0.2 0.49 ± 12% perf-profile.children.cycles-pp.ns_capable_common
0.50 ± 7% +0.2 0.66 ± 10% perf-profile.children.cycles-pp.up_read
0.50 ± 8% +0.2 0.67 ± 10% perf-profile.children.cycles-pp.xfs_iunlock
0.40 ± 13% +0.2 0.58 ± 12% perf-profile.children.cycles-pp.up_write
0.04 ± 73% +0.2 0.22 ± 43% perf-profile.children.cycles-pp.xlog_grant_head_check
0.80 ± 12% +0.2 1.04 ± 11% perf-profile.children.cycles-pp.down_read
0.26 ± 16% +0.7 1.00 ± 21% perf-profile.children.cycles-pp.xlog_grant_push_ail
0.26 ± 16% +0.7 1.00 ± 21% perf-profile.children.cycles-pp.xlog_grant_push_threshold
0.28 ± 19% +0.9 1.15 ± 23% perf-profile.children.cycles-pp.xlog_space_left
0.17 ± 29% +1.0 1.20 ± 21% perf-profile.children.cycles-pp.xfs_log_space_wake
0.63 ± 16% +2.6 3.25 ± 20% perf-profile.children.cycles-pp.xlog_grant_add_space
1.26 ± 16% +3.8 5.02 ± 18% perf-profile.children.cycles-pp.xfs_log_reserve
1.27 ± 16% +3.8 5.03 ± 18% perf-profile.children.cycles-pp.xfs_trans_reserve
1.84 ± 14% +3.8 5.64 ± 17% perf-profile.children.cycles-pp.xfs_trans_alloc_ichange
1.46 ± 15% +3.8 5.28 ± 18% perf-profile.children.cycles-pp.xfs_trans_alloc
1.35 ± 15% +4.1 5.48 ± 18% perf-profile.children.cycles-pp.xfs_log_ticket_ungrant
11.84 ± 6% -10.6 1.25 ± 45% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
0.07 ± 7% -0.0 0.04 ± 71% perf-profile.self.cycles-pp._raw_spin_trylock
0.10 ± 14% +0.1 0.15 ± 28% perf-profile.self.cycles-pp.xlog_calc_unit_res
0.05 ± 45% +0.1 0.11 ± 14% perf-profile.self.cycles-pp.poll_idle
0.00 +0.1 0.14 ± 23% perf-profile.self.cycles-pp.xlog_verify_grant_tail
0.28 ± 9% +0.1 0.42 ± 12% perf-profile.self.cycles-pp.apparmor_capable
0.49 ± 8% +0.2 0.65 ± 10% perf-profile.self.cycles-pp.up_read
0.40 ± 13% +0.2 0.57 ± 12% perf-profile.self.cycles-pp.up_write
0.72 ± 12% +0.2 0.96 ± 11% perf-profile.self.cycles-pp.down_read
0.66 ± 11% +0.3 0.92 ± 11% perf-profile.self.cycles-pp._raw_spin_lock
0.80 ± 11% +0.3 1.11 ± 11% perf-profile.self.cycles-pp.xlog_cil_commit
0.91 ± 9% +0.4 1.33 ± 9% perf-profile.self.cycles-pp.xlog_cil_insert_items
0.28 ± 19% +0.9 1.14 ± 23% perf-profile.self.cycles-pp.xlog_space_left
0.17 ± 29% +1.0 1.19 ± 21% perf-profile.self.cycles-pp.xfs_log_space_wake
0.62 ± 15% +2.6 3.23 ± 20% perf-profile.self.cycles-pp.xlog_grant_add_space
1.17 ± 13% +3.1 4.26 ± 17% perf-profile.self.cycles-pp.xfs_log_ticket_ungrant
374478 ± 2% +57.1% 588327 ± 6% interrupts.CAL:Function_call_interrupts
3422 ± 17% +59.3% 5452 ± 19% interrupts.CPU11.CAL:Function_call_interrupts
3473 ± 23% +59.3% 5531 ± 18% interrupts.CPU13.CAL:Function_call_interrupts
3118 ± 16% +77.4% 5530 ± 25% interrupts.CPU17.CAL:Function_call_interrupts
4394 ± 15% +40.6% 6179 ± 16% interrupts.CPU19.CAL:Function_call_interrupts
2882 ± 19% +67.1% 4815 ± 22% interrupts.CPU2.CAL:Function_call_interrupts
3732 ± 24% +44.2% 5380 ± 13% interrupts.CPU20.NMI:Non-maskable_interrupts
3732 ± 24% +44.2% 5380 ± 13% interrupts.CPU20.PMI:Performance_monitoring_interrupts
3599 ± 24% +91.6% 6897 ± 21% interrupts.CPU22.CAL:Function_call_interrupts
3053 ± 21% +45.2% 4435 ± 8% interrupts.CPU25.CAL:Function_call_interrupts
101.67 ± 20% -37.4% 63.67 ± 44% interrupts.CPU26.RES:Rescheduling_interrupts
2878 ± 12% +71.8% 4944 ± 18% interrupts.CPU27.CAL:Function_call_interrupts
129.50 ± 19% -48.9% 66.17 ± 37% interrupts.CPU28.RES:Rescheduling_interrupts
2949 ± 16% +55.2% 4577 ± 24% interrupts.CPU29.CAL:Function_call_interrupts
3422 ± 19% +44.0% 4927 ± 26% interrupts.CPU31.CAL:Function_call_interrupts
3477 ± 14% +60.9% 5596 ± 16% interrupts.CPU33.CAL:Function_call_interrupts
3799 ± 25% +62.6% 6179 ± 16% interrupts.CPU39.CAL:Function_call_interrupts
3527 ± 13% +81.8% 6412 ± 25% interrupts.CPU40.CAL:Function_call_interrupts
3794 ± 16% +44.5% 5481 ± 7% interrupts.CPU43.CAL:Function_call_interrupts
139.17 ± 20% -39.0% 84.83 ± 33% interrupts.CPU45.RES:Rescheduling_interrupts
3897 ± 20% +64.9% 6428 ± 22% interrupts.CPU46.CAL:Function_call_interrupts
5251 ± 13% +61.0% 8454 ± 10% interrupts.CPU48.CAL:Function_call_interrupts
5327 ± 9% +39.1% 7411 ± 25% interrupts.CPU49.CAL:Function_call_interrupts
3065 ± 21% +67.6% 5137 ± 20% interrupts.CPU5.CAL:Function_call_interrupts
5050 ± 12% +47.9% 7469 ± 8% interrupts.CPU50.CAL:Function_call_interrupts
4924 ± 22% +62.0% 7977 ± 7% interrupts.CPU51.CAL:Function_call_interrupts
5049 ± 8% +61.4% 8147 ± 23% interrupts.CPU52.CAL:Function_call_interrupts
4611 ± 19% +86.2% 8586 ± 19% interrupts.CPU54.CAL:Function_call_interrupts
4134 ± 25% +117.3% 8985 ± 29% interrupts.CPU55.CAL:Function_call_interrupts
4812 ± 9% +67.0% 8037 ± 24% interrupts.CPU56.CAL:Function_call_interrupts
3977 ± 19% +79.7% 7148 ± 16% interrupts.CPU57.CAL:Function_call_interrupts
4077 ± 14% +103.0% 8277 ± 22% interrupts.CPU58.CAL:Function_call_interrupts
4565 ± 39% +48.3% 6770 ± 14% interrupts.CPU58.NMI:Non-maskable_interrupts
4565 ± 39% +48.3% 6770 ± 14% interrupts.CPU58.PMI:Performance_monitoring_interrupts
4830 ± 13% +63.2% 7884 ± 24% interrupts.CPU60.CAL:Function_call_interrupts
4044 ± 18% +70.4% 6891 ± 26% interrupts.CPU61.CAL:Function_call_interrupts
3631 ± 19% +84.6% 6702 ± 27% interrupts.CPU62.CAL:Function_call_interrupts
3736 ± 11% +79.2% 6696 ± 20% interrupts.CPU63.CAL:Function_call_interrupts
4097 ± 9% +105.3% 8413 ± 22% interrupts.CPU64.CAL:Function_call_interrupts
4465 ± 14% +38.2% 6170 ± 28% interrupts.CPU65.CAL:Function_call_interrupts
3463 ± 18% +107.3% 7179 ± 30% interrupts.CPU66.CAL:Function_call_interrupts
3870 ± 17% +56.9% 6070 ± 34% interrupts.CPU68.CAL:Function_call_interrupts
3482 ± 31% +102.5% 7051 ± 27% interrupts.CPU69.CAL:Function_call_interrupts
4691 ± 11% +83.1% 8591 ± 9% interrupts.CPU72.CAL:Function_call_interrupts
4581 ± 14% +76.3% 8078 ± 10% interrupts.CPU73.CAL:Function_call_interrupts
4880 ± 9% +83.0% 8931 ± 15% interrupts.CPU74.CAL:Function_call_interrupts
4924 ± 10% +48.5% 7310 ± 16% interrupts.CPU75.CAL:Function_call_interrupts
4336 ± 9% +95.0% 8455 ± 19% interrupts.CPU76.CAL:Function_call_interrupts
4309 ± 12% +83.3% 7897 ± 16% interrupts.CPU77.CAL:Function_call_interrupts
4217 ± 16% +53.4% 6468 ± 18% interrupts.CPU79.CAL:Function_call_interrupts
4109 ± 13% +87.4% 7700 ± 14% interrupts.CPU80.CAL:Function_call_interrupts
3797 ± 14% +62.9% 6185 ± 17% interrupts.CPU81.CAL:Function_call_interrupts
4109 ± 12% +82.2% 7486 ± 9% interrupts.CPU82.CAL:Function_call_interrupts
4087 ± 18% +81.1% 7403 ± 22% interrupts.CPU83.CAL:Function_call_interrupts
3889 ± 13% +99.7% 7766 ± 20% interrupts.CPU84.CAL:Function_call_interrupts
4079 ± 13% +93.1% 7878 ± 22% interrupts.CPU85.CAL:Function_call_interrupts
3059 ± 13% +94.0% 5935 ± 19% interrupts.CPU86.CAL:Function_call_interrupts
3506 ± 30% +74.0% 6100 ± 16% interrupts.CPU87.CAL:Function_call_interrupts
3624 ± 27% +124.7% 8143 ± 18% interrupts.CPU89.CAL:Function_call_interrupts
3014 ± 22% +163.1% 7930 ± 22% interrupts.CPU90.CAL:Function_call_interrupts
3860 ± 15% +78.9% 6905 ± 11% interrupts.CPU91.CAL:Function_call_interrupts
3607 ± 25% +95.6% 7053 ± 25% interrupts.CPU92.CAL:Function_call_interrupts
3304 ± 21% +133.2% 7705 ± 15% interrupts.CPU93.CAL:Function_call_interrupts
3574 ± 27% +65.2% 5902 ± 23% interrupts.CPU94.CAL:Function_call_interrupts
2942 ± 16% +93.9% 5704 ± 25% interrupts.CPU95.CAL:Function_call_interrupts
stress-ng.inode-flags.ops
5e+06 +-----------------------------------------------------------------+
| O O O O O O O O |
| O O O O O O O |
4.5e+06 |-+ OO O O O O O O OO O O O O O |
| O O O O O O O |
| O |
4e+06 |-+ O OO O |
| |
3.5e+06 |-+ |
| |
| |
3e+06 |-+ + .+.++.++.++ |
|.++.++. +. +.++.++.++.+.++.+ +.+ : + |
| + + +.+ :.+ : |
2.5e+06 +-----------------------------------------------------------------+
stress-ng.inode-flags.ops_per_sec
85000 +-------------------------------------------------------------------+
| O O O O |
80000 |-OO O OO O O O O O |
75000 |-+ O O O O O O O O O O O O O O |
| O O O O O O O O |
70000 |-+ O O |
65000 |-+ O OO O |
| |
60000 |-+ |
55000 |-+ |
| |
50000 |-+ +.+. +.+ +.++.++.+.++.+ |
45000 |.++.++.++.+.++.+ + +.++. .++.+ : |
| + +.+.+ |
40000 +-------------------------------------------------------------------+
[*] bisect-good sample
[O] bisect-bad sample
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
8 months, 4 weeks
[mm/pte_ref] afcc9fb874: kernel_BUG_at_include/linux/pte_ref.h
by kernel test robot
Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: afcc9fb8741f26773a381ac1e159e0172344b7d5 ("[PATCH v3 13/15] mm/pte_ref: free user PTE page table pages")
url: https://github.com/0day-ci/linux/commits/Qi-Zheng/Free-user-PTE-page-tabl...
base: https://github.com/hnaz/linux-mm master
patch link: https://lore.kernel.org/linux-doc/[email protected]
in testcase: boot
on test machine: qemu-system-x86_64 -enable-kvm -cpu SandyBridge -smp 2 -m 16G
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
+------------------------------------------+------------+------------+
| | e249f0fa9a | afcc9fb874 |
+------------------------------------------+------------+------------+
| boot_successes | 16 | 0 |
| boot_failures | 0 | 14 |
| kernel_BUG_at_include/linux/pte_ref.h | 0 | 14 |
| invalid_opcode:#[##] | 0 | 14 |
| RIP:destroy_args | 0 | 14 |
| Kernel_panic-not_syncing:Fatal_exception | 0 | 14 |
+------------------------------------------+------------+------------+
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 7.245922][ T1] kernel BUG at include/linux/pte_ref.h:56!
[ 7.269161][ T1] invalid opcode: 0000 [#1] PREEMPT SMP PTI
[ 7.271019][ T1] CPU: 0 PID: 1 Comm: swapper/0 Not tainted 5.15.0-rc7-mm1-00448-gafcc9fb8741f #1
[ 7.273761][ T1] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.12.0-1 04/01/2014
[ 7.276418][ T1] RIP: 0010:destroy_args (include/linux/pte_ref.h:56 include/linux/pte_ref.h:123 mm/debug_vm_pgtable.c:1051)
[ 7.277992][ T1] Code: 6b 58 4c 8b 2b 49 8b 3c 24 e8 c6 38 b4 fe 48 c1 e0 06 48 03 05 aa eb 4c ff 8b 50 30 81 e2 00 02 00 f0 81 fa 00 00 00 f0 74 02 <0f> 0b f0 83 68 20 01 75 15 48 89 ea 4c 89 e6 4c 89 ef 48 81 e2 00
All code
========
0: 6b 58 4c 8b imul $0xffffff8b,0x4c(%rax),%ebx
4: 2b 49 8b sub -0x75(%rcx),%ecx
7: 3c 24 cmp $0x24,%al
9: e8 c6 38 b4 fe callq 0xfffffffffeb438d4
e: 48 c1 e0 06 shl $0x6,%rax
12: 48 03 05 aa eb 4c ff add -0xb31456(%rip),%rax # 0xffffffffff4cebc3
19: 8b 50 30 mov 0x30(%rax),%edx
1c: 81 e2 00 02 00 f0 and $0xf0000200,%edx
22: 81 fa 00 00 00 f0 cmp $0xf0000000,%edx
28: 74 02 je 0x2c
2a:* 0f 0b ud2 <-- trapping instruction
2c: f0 83 68 20 01 lock subl $0x1,0x20(%rax)
31: 75 15 jne 0x48
33: 48 89 ea mov %rbp,%rdx
36: 4c 89 e6 mov %r12,%rsi
39: 4c 89 ef mov %r13,%rdi
3c: 48 rex.W
3d: 81 .byte 0x81
3e: e2 00 loop 0x40
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: f0 83 68 20 01 lock subl $0x1,0x20(%rax)
7: 75 15 jne 0x1e
9: 48 89 ea mov %rbp,%rdx
c: 4c 89 e6 mov %r12,%rsi
f: 4c 89 ef mov %r13,%rdi
12: 48 rex.W
13: 81 .byte 0x81
14: e2 00 loop 0x16
[ 7.283473][ T1] RSP: 0000:ffffc90000013da0 EFLAGS: 00010206
[ 7.285295][ T1] RAX: ffffea0000000000 RBX: ffffc90000013dc8 RCX: 0000000000000000
[ 7.287675][ T1] RDX: 00000000f0000200 RSI: ffffffff823848b5 RDI: 0000000000000000
[ 7.290056][ T1] RBP: 000024b4af3bd000 R08: 0000000000000001 R09: 0000000000000040
[ 7.292449][ T1] R10: ffff88842fc2fb60 R11: ffffc90000013d00 R12: ffff88812da63000
[ 7.294926][ T1] R13: ffff88810ca08c00 R14: 0000000140000067 R15: 0000000000000027
[ 7.297349][ T1] FS: 0000000000000000(0000) GS:ffff88842fc00000(0000) knlGS:0000000000000000
[ 7.300020][ T1] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 7.301949][ T1] CR2: 0000000000000000 CR3: 0000000002612000 CR4: 00000000000006f0
[ 7.304153][ T1] Call Trace:
[ 7.306975][ T1] <TASK>
[ 7.307966][ T1] debug_vm_pgtable (mm/debug_vm_pgtable.c:1334)
[ 7.309435][ T1] ? init_args (mm/debug_vm_pgtable.c:1241)
[ 7.310773][ T1] do_one_initcall (init/main.c:1303)
[ 7.312212][ T1] kernel_init_freeable (init/main.c:1377 init/main.c:1394 init/main.c:1413 init/main.c:1618)
[ 7.313728][ T1] ? rest_init (init/main.c:1499)
[ 7.315002][ T1] kernel_init (init/main.c:1509)
[ 7.316368][ T1] ret_from_fork (arch/x86/entry/entry_64.S:301)
[ 7.317692][ T1] </TASK>
[ 7.318697][ T1] Modules linked in:
[ 7.320060][ T1] ---[ end trace 1f2bbe378e842286 ]---
[ 7.321766][ T1] RIP: 0010:destroy_args (include/linux/pte_ref.h:56 include/linux/pte_ref.h:123 mm/debug_vm_pgtable.c:1051)
[ 7.323325][ T1] Code: 6b 58 4c 8b 2b 49 8b 3c 24 e8 c6 38 b4 fe 48 c1 e0 06 48 03 05 aa eb 4c ff 8b 50 30 81 e2 00 02 00 f0 81 fa 00 00 00 f0 74 02 <0f> 0b f0 83 68 20 01 75 15 48 89 ea 4c 89 e6 4c 89 ef 48 81 e2 00
All code
========
0: 6b 58 4c 8b imul $0xffffff8b,0x4c(%rax),%ebx
4: 2b 49 8b sub -0x75(%rcx),%ecx
7: 3c 24 cmp $0x24,%al
9: e8 c6 38 b4 fe callq 0xfffffffffeb438d4
e: 48 c1 e0 06 shl $0x6,%rax
12: 48 03 05 aa eb 4c ff add -0xb31456(%rip),%rax # 0xffffffffff4cebc3
19: 8b 50 30 mov 0x30(%rax),%edx
1c: 81 e2 00 02 00 f0 and $0xf0000200,%edx
22: 81 fa 00 00 00 f0 cmp $0xf0000000,%edx
28: 74 02 je 0x2c
2a:* 0f 0b ud2 <-- trapping instruction
2c: f0 83 68 20 01 lock subl $0x1,0x20(%rax)
31: 75 15 jne 0x48
33: 48 89 ea mov %rbp,%rdx
36: 4c 89 e6 mov %r12,%rsi
39: 4c 89 ef mov %r13,%rdi
3c: 48 rex.W
3d: 81 .byte 0x81
3e: e2 00 loop 0x40
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: f0 83 68 20 01 lock subl $0x1,0x20(%rax)
7: 75 15 jne 0x1e
9: 48 89 ea mov %rbp,%rdx
c: 4c 89 e6 mov %r12,%rsi
f: 4c 89 ef mov %r13,%rdi
12: 48 rex.W
13: 81 .byte 0x81
14: e2 00 loop 0x16
To reproduce:
# build kernel
cd linux
cp config-5.15.0-rc7-mm1-00448-gafcc9fb8741f .config
make HOSTCC=gcc-9 CC=gcc-9 ARCH=x86_64 olddefconfig prepare modules_prepare bzImage
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> job-script # job-script is attached in this email
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
9 months