[mm/page_alloc] 39907a939a: netperf.Throughput_Mbps -18.1% regression
by kernel test robot
Greeting,
FYI, we noticed a -18.1% regression of netperf.Throughput_Mbps due to commit:
commit: 39907a939a34033eeea112751f0e4330628d3a9a ("mm/page_alloc: Limit number of high-order pages on PCP during bulk free")
https://git.kernel.org/cgit/linux/kernel/git/mel/linux.git mm-pcpllist-v1r2
in testcase: netperf
on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz with 128G memory
with following parameters:
ip: ipv4
runtime: 300s
nr_threads: 1
cluster: cs-localhost
test: UDP_STREAM
cpufreq_governor: performance
ucode: 0xd000331
test-description: Netperf is a benchmark that can be use to measure various aspect of networking performance.
test-url: http://www.netperf.org/netperf/
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
=========================================================================================
cluster/compiler/cpufreq_governor/ip/kconfig/nr_threads/rootfs/runtime/tbox_group/test/testcase/ucode:
cs-localhost/gcc-9/performance/ipv4/x86_64-rhel-8.3/1/debian-10.4-x86_64-20200603.cgz/300s/lkp-icl-2sp4/UDP_STREAM/netperf/0xd000331
commit:
2009ed59ab ("mm/page_alloc: Free pages in a single pass during bulk free")
39907a939a ("mm/page_alloc: Limit number of high-order pages on PCP during bulk free")
2009ed59ab8200e6 39907a939a34033eeea112751f0
---------------- ---------------------------
%stddev %change %stddev
\ | \
122291 -18.1% 100120 netperf.Throughput_Mbps
122291 -18.1% 100120 netperf.Throughput_total_Mbps
90.83 -2.0% 89.00 netperf.time.percent_of_cpu_this_job_got
70006621 -18.1% 57314514 netperf.workload
75331549 ± 3% +14.8% 86505421 cpuidle..usage
29422 +31.0% 38553 ± 2% meminfo.Shmem
9976 ± 36% +111.0% 21053 ± 30% numa-meminfo.node1.Shmem
77114773 +14.8% 88560654 turbostat.IRQ
1.371e+08 -28.6% 97904821 ± 44% numa-numastat.node0.local_node
1.366e+08 -29.0% 96946927 ± 44% numa-numastat.node0.numa_hit
1754 ± 2% +6848.5% 121876 ± 2% vmstat.system.cs
253706 +14.8% 291171 vmstat.system.in
35.54 ± 3% -7.2% 32.98 ± 3% boot-time.boot
17.61 ± 6% -14.1% 15.13 ± 8% boot-time.dhcp
4043 ± 3% -8.1% 3714 ± 4% boot-time.idle
69337635 -28.5% 49607209 ± 43% numa-vmstat.node0.numa_hit
69744503 -28.2% 50050712 ± 43% numa-vmstat.node0.numa_local
2499 ± 36% +112.0% 5297 ± 29% numa-vmstat.node1.nr_shmem
248090 ± 6% +24.4% 308574 ± 5% perf-stat.i.cache-misses
1683 ± 2% +7192.7% 122793 ± 2% perf-stat.i.context-switches
8145 ± 17% +37.1% 11170 ± 12% perf-stat.i.node-loads
35521 ± 18% +35.9% 48285 ± 18% perf-stat.i.node-stores
0.05 ± 14% +0.0 0.06 ± 8% perf-stat.overall.cache-miss-rate%
49509 ± 15% -20.5% 39345 ± 8% perf-stat.overall.cycles-between-cache-misses
18429 +22.1% 22495 perf-stat.overall.path-length
247210 ± 6% +24.4% 307535 ± 5% perf-stat.ps.cache-misses
1677 ± 2% +7194.0% 122383 ± 2% perf-stat.ps.context-switches
8114 ± 17% +37.2% 11131 ± 12% perf-stat.ps.node-loads
35383 ± 18% +36.0% 48111 ± 18% perf-stat.ps.node-stores
71035 +2.8% 73029 proc-vmstat.nr_inactive_anon
9465 +4.4% 9881 ± 2% proc-vmstat.nr_mapped
7362 +30.3% 9592 ± 3% proc-vmstat.nr_shmem
71035 +2.8% 73029 proc-vmstat.nr_zone_inactive_anon
1.371e+08 -14.3% 1.174e+08 ± 2% proc-vmstat.numa_hit
1.375e+08 -14.1% 1.182e+08 proc-vmstat.numa_local
15448 ±110% +207.4% 47492 ± 28% proc-vmstat.numa_pte_updates
8244 ± 4% +1194.8% 106745 ± 6% proc-vmstat.pgactivate
1.352e+08 -14.5% 1.155e+08 proc-vmstat.pgalloc_normal
1059186 +1.2% 1072281 proc-vmstat.pgfault
1.352e+08 -14.5% 1.156e+08 proc-vmstat.pgfree
24.78 ± 8% -5.4 19.42 ± 15% perf-profile.calltrace.cycles-pp.__skb_datagram_iter.skb_copy_datagram_iter.udp_recvmsg.inet_recvmsg.__sys_recvfrom
24.81 ± 8% -5.4 19.45 ± 15% perf-profile.calltrace.cycles-pp.skb_copy_datagram_iter.udp_recvmsg.inet_recvmsg.__sys_recvfrom.__x64_sys_recvfrom
23.21 ± 8% -5.0 18.16 ± 15% perf-profile.calltrace.cycles-pp._copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.udp_recvmsg.inet_recvmsg
22.81 ± 8% -5.0 17.80 ± 15% perf-profile.calltrace.cycles-pp.copyout._copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter.udp_recvmsg
22.64 ± 8% -5.0 17.68 ± 15% perf-profile.calltrace.cycles-pp.copy_user_enhanced_fast_string.copyout._copy_to_iter.__skb_datagram_iter.skb_copy_datagram_iter
1.63 ± 18% +0.5 2.16 ± 9% perf-profile.calltrace.cycles-pp.ip_rcv.__netif_receive_skb_one_core.process_backlog.__napi_poll.net_rx_action
1.50 ± 18% +0.5 2.05 ± 9% perf-profile.calltrace.cycles-pp.ip_local_deliver.ip_rcv.__netif_receive_skb_one_core.process_backlog.__napi_poll
1.48 ± 18% +0.6 2.03 ± 9% perf-profile.calltrace.cycles-pp.ip_local_deliver_finish.ip_local_deliver.ip_rcv.__netif_receive_skb_one_core.process_backlog
1.46 ± 18% +0.6 2.02 ± 9% perf-profile.calltrace.cycles-pp.ip_protocol_deliver_rcu.ip_local_deliver_finish.ip_local_deliver.ip_rcv.__netif_receive_skb_one_core
1.37 ± 18% +0.6 1.94 ± 9% perf-profile.calltrace.cycles-pp.__udp4_lib_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish.ip_local_deliver.ip_rcv
1.09 ± 19% +0.6 1.70 ± 10% perf-profile.calltrace.cycles-pp.udp_unicast_rcv_skb.__udp4_lib_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish.ip_local_deliver
1.03 ± 18% +0.7 1.68 ± 10% perf-profile.calltrace.cycles-pp.udp_queue_rcv_one_skb.udp_unicast_rcv_skb.__udp4_lib_rcv.ip_protocol_deliver_rcu.ip_local_deliver_finish
0.00 +0.7 0.66 ± 9% perf-profile.calltrace.cycles-pp.free_pcppages_bulk.free_unref_page.skb_release_data.__consume_stateless_skb.udp_recvmsg
0.00 +0.7 0.69 ± 12% perf-profile.calltrace.cycles-pp.try_to_wake_up.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.sock_def_readable
0.00 +0.7 0.70 ± 12% perf-profile.calltrace.cycles-pp.autoremove_wake_function.__wake_up_common.__wake_up_common_lock.sock_def_readable.__udp_enqueue_schedule_skb
0.00 +0.7 0.74 ± 16% perf-profile.calltrace.cycles-pp.__schedule.schedule_idle.do_idle.cpu_startup_entry.start_secondary
0.00 +0.8 0.77 ± 16% perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
0.00 +0.8 0.81 ± 13% perf-profile.calltrace.cycles-pp.__wake_up_common.__wake_up_common_lock.sock_def_readable.__udp_enqueue_schedule_skb.udp_queue_rcv_one_skb
0.00 +0.8 0.85 ± 12% perf-profile.calltrace.cycles-pp.__wake_up_common_lock.sock_def_readable.__udp_enqueue_schedule_skb.udp_queue_rcv_one_skb.udp_unicast_rcv_skb
0.00 +0.9 0.88 ± 16% perf-profile.calltrace.cycles-pp.__schedule.schedule.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_udp
0.00 +0.9 0.90 ± 15% perf-profile.calltrace.cycles-pp.schedule.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_udp.udp_recvmsg
0.00 +0.9 0.94 ± 12% perf-profile.calltrace.cycles-pp.sock_def_readable.__udp_enqueue_schedule_skb.udp_queue_rcv_one_skb.udp_unicast_rcv_skb.__udp4_lib_rcv
0.00 +1.0 0.97 ± 15% perf-profile.calltrace.cycles-pp.schedule_timeout.__skb_wait_for_more_packets.__skb_recv_udp.udp_recvmsg.inet_recvmsg
0.18 ±141% +1.1 1.28 ± 9% perf-profile.calltrace.cycles-pp.__udp_enqueue_schedule_skb.udp_queue_rcv_one_skb.udp_unicast_rcv_skb.__udp4_lib_rcv.ip_protocol_deliver_rcu
0.00 +1.2 1.24 ± 15% perf-profile.calltrace.cycles-pp.__skb_wait_for_more_packets.__skb_recv_udp.udp_recvmsg.inet_recvmsg.__sys_recvfrom
0.51 ± 45% +1.6 2.09 ± 16% perf-profile.calltrace.cycles-pp.__skb_recv_udp.udp_recvmsg.inet_recvmsg.__sys_recvfrom.__x64_sys_recvfrom
24.80 ± 8% -5.4 19.44 ± 15% perf-profile.children.cycles-pp.__skb_datagram_iter
24.81 ± 8% -5.4 19.46 ± 15% perf-profile.children.cycles-pp.skb_copy_datagram_iter
23.23 ± 8% -5.0 18.18 ± 15% perf-profile.children.cycles-pp._copy_to_iter
22.84 ± 8% -5.0 17.82 ± 15% perf-profile.children.cycles-pp.copyout
0.41 ± 16% -0.2 0.22 ± 27% perf-profile.children.cycles-pp.udp_rmem_release
0.52 ± 8% -0.1 0.39 ± 12% perf-profile.children.cycles-pp.free_pcp_prepare
0.18 ± 18% -0.1 0.06 ± 45% perf-profile.children.cycles-pp.free_unref_page_commit
0.08 ± 19% -0.0 0.04 ± 45% perf-profile.children.cycles-pp.kmem_cache_free
0.10 ± 16% +0.0 0.15 ± 8% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.03 ±100% +0.1 0.08 ± 13% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.00 +0.1 0.06 ± 13% perf-profile.children.cycles-pp.ttwu_do_wakeup
0.00 +0.1 0.06 ± 16% perf-profile.children.cycles-pp.__update_load_avg_se
0.00 +0.1 0.06 ± 16% perf-profile.children.cycles-pp.perf_trace_sched_wakeup_template
0.00 +0.1 0.06 ± 21% perf-profile.children.cycles-pp.nohz_run_idle_balance
0.00 +0.1 0.07 ± 15% perf-profile.children.cycles-pp.__switch_to_asm
0.00 +0.1 0.07 ± 21% perf-profile.children.cycles-pp.llist_add_batch
0.00 +0.1 0.07 ± 21% perf-profile.children.cycles-pp.__smp_call_single_queue
0.02 ±141% +0.1 0.10 ± 32% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
0.09 ± 20% +0.1 0.17 ± 7% perf-profile.children.cycles-pp.__list_add_valid
0.00 +0.1 0.08 ± 24% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
0.00 +0.1 0.08 ± 14% perf-profile.children.cycles-pp.prepare_to_wait_exclusive
0.20 ± 19% +0.1 0.29 ± 19% perf-profile.children.cycles-pp.skb_set_owner_w
0.00 +0.1 0.09 ± 27% perf-profile.children.cycles-pp.flush_smp_call_function_queue
0.08 ± 17% +0.1 0.17 ± 26% perf-profile.children.cycles-pp.__sk_mem_reduce_allocated
0.07 ± 80% +0.1 0.17 ± 27% perf-profile.children.cycles-pp._raw_spin_lock_bh
0.00 +0.1 0.14 ± 12% perf-profile.children.cycles-pp.set_next_entity
0.04 ± 72% +0.1 0.18 ± 15% perf-profile.children.cycles-pp.__zone_watermark_ok
0.00 +0.2 0.16 ± 18% perf-profile.children.cycles-pp.enqueue_entity
0.00 +0.2 0.17 ± 24% perf-profile.children.cycles-pp.sched_ttwu_pending
0.00 +0.2 0.18 ± 10% perf-profile.children.cycles-pp.__switch_to
0.00 +0.2 0.18 ± 14% perf-profile.children.cycles-pp.update_load_avg
0.00 +0.2 0.19 ± 16% perf-profile.children.cycles-pp.ttwu_queue_wakelist
0.00 +0.2 0.20 ± 12% perf-profile.children.cycles-pp.enqueue_task_fair
0.30 ± 7% +0.2 0.51 ± 7% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
0.00 +0.2 0.21 ± 12% perf-profile.children.cycles-pp.ttwu_do_activate
0.00 +0.2 0.22 ± 14% perf-profile.children.cycles-pp.update_curr
0.00 +0.2 0.23 ± 14% perf-profile.children.cycles-pp.pick_next_task_fair
0.00 +0.3 0.25 ± 26% perf-profile.children.cycles-pp.__sysvec_call_function_single
0.00 +0.3 0.31 ± 25% perf-profile.children.cycles-pp.sysvec_call_function_single
0.36 ± 20% +0.3 0.67 ± 9% perf-profile.children.cycles-pp.free_pcppages_bulk
0.00 +0.4 0.36 ± 27% perf-profile.children.cycles-pp.finish_task_switch
0.00 +0.4 0.38 ± 16% perf-profile.children.cycles-pp.dequeue_entity
0.00 +0.4 0.41 ± 16% perf-profile.children.cycles-pp.dequeue_task_fair
0.00 +0.5 0.47 ± 24% perf-profile.children.cycles-pp.asm_sysvec_call_function_single
3.39 ± 4% +0.5 3.87 ± 6% perf-profile.children.cycles-pp.__softirqentry_text_start
1.63 ± 18% +0.5 2.16 ± 9% perf-profile.children.cycles-pp.ip_rcv
1.50 ± 18% +0.5 2.05 ± 9% perf-profile.children.cycles-pp.ip_local_deliver
1.48 ± 18% +0.6 2.04 ± 9% perf-profile.children.cycles-pp.ip_local_deliver_finish
1.47 ± 18% +0.6 2.02 ± 9% perf-profile.children.cycles-pp.ip_protocol_deliver_rcu
1.38 ± 18% +0.6 1.95 ± 9% perf-profile.children.cycles-pp.__udp4_lib_rcv
1.10 ± 19% +0.6 1.71 ± 10% perf-profile.children.cycles-pp.udp_unicast_rcv_skb
1.04 ± 18% +0.7 1.69 ± 10% perf-profile.children.cycles-pp.udp_queue_rcv_one_skb
0.00 +0.7 0.71 ± 12% perf-profile.children.cycles-pp.autoremove_wake_function
0.00 +0.7 0.71 ± 12% perf-profile.children.cycles-pp.try_to_wake_up
0.00 +0.8 0.78 ± 16% perf-profile.children.cycles-pp.schedule_idle
0.00 +0.8 0.82 ± 13% perf-profile.children.cycles-pp.__wake_up_common
0.46 ± 15% +0.8 1.29 ± 9% perf-profile.children.cycles-pp.__udp_enqueue_schedule_skb
0.00 +0.9 0.86 ± 56% perf-profile.children.cycles-pp.poll_idle
0.00 +0.9 0.86 ± 12% perf-profile.children.cycles-pp.__wake_up_common_lock
0.03 ±100% +0.9 0.95 ± 12% perf-profile.children.cycles-pp.sock_def_readable
0.02 ±142% +0.9 0.94 ± 14% perf-profile.children.cycles-pp.schedule
0.00 +1.0 0.98 ± 14% perf-profile.children.cycles-pp.schedule_timeout
0.00 +1.2 1.24 ± 15% perf-profile.children.cycles-pp.__skb_wait_for_more_packets
0.59 ± 13% +1.5 2.11 ± 16% perf-profile.children.cycles-pp.__skb_recv_udp
0.07 ± 16% +1.6 1.68 ± 14% perf-profile.children.cycles-pp.__schedule
0.23 ± 18% -0.2 0.03 ±103% perf-profile.self.cycles-pp.udp_rmem_release
0.52 ± 9% -0.1 0.38 ± 12% perf-profile.self.cycles-pp.free_pcp_prepare
0.15 ± 15% -0.1 0.04 ± 71% perf-profile.self.cycles-pp.free_unref_page_commit
0.29 ± 11% -0.1 0.20 ± 19% perf-profile.self.cycles-pp.__skb_datagram_iter
0.25 ± 13% -0.1 0.16 ± 17% perf-profile.self.cycles-pp.udp_recvmsg
0.14 ± 16% -0.0 0.10 ± 10% perf-profile.self.cycles-pp.__alloc_pages
0.08 ± 20% -0.0 0.03 ± 70% perf-profile.self.cycles-pp.kmem_cache_free
0.00 +0.1 0.06 ± 13% perf-profile.self.cycles-pp.__update_load_avg_se
0.00 +0.1 0.07 ± 15% perf-profile.self.cycles-pp.__switch_to_asm
0.00 +0.1 0.07 ± 23% perf-profile.self.cycles-pp.llist_add_batch
0.02 ±142% +0.1 0.09 ± 23% perf-profile.self.cycles-pp.sock_def_readable
0.00 +0.1 0.07 ± 11% perf-profile.self.cycles-pp.schedule_timeout
0.00 +0.1 0.08 ± 20% perf-profile.self.cycles-pp.enqueue_entity
0.00 +0.1 0.08 ± 26% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
0.08 ± 21% +0.1 0.16 ± 5% perf-profile.self.cycles-pp.__list_add_valid
0.02 ±141% +0.1 0.10 ± 32% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
0.20 ± 17% +0.1 0.29 ± 19% perf-profile.self.cycles-pp.skb_set_owner_w
0.00 +0.1 0.09 ± 17% perf-profile.self.cycles-pp.set_next_entity
0.00 +0.1 0.09 ± 16% perf-profile.self.cycles-pp.update_curr
0.17 ± 19% +0.1 0.27 ± 11% perf-profile.self.cycles-pp.skb_page_frag_refill
0.07 ± 12% +0.1 0.17 ± 25% perf-profile.self.cycles-pp.__sk_mem_reduce_allocated
0.06 ± 79% +0.1 0.16 ± 28% perf-profile.self.cycles-pp._raw_spin_lock_bh
0.00 +0.1 0.11 ± 19% perf-profile.self.cycles-pp.__wake_up_common
0.00 +0.1 0.13 ± 24% perf-profile.self.cycles-pp.try_to_wake_up
0.00 +0.1 0.13 ± 21% perf-profile.self.cycles-pp.__skb_wait_for_more_packets
0.05 ± 75% +0.1 0.18 ± 6% perf-profile.self.cycles-pp.update_rq_clock
0.00 +0.2 0.15 ± 27% perf-profile.self.cycles-pp.finish_task_switch
0.17 ± 18% +0.2 0.32 ± 12% perf-profile.self.cycles-pp.skb_release_data
0.02 ±141% +0.2 0.18 ± 18% perf-profile.self.cycles-pp.__zone_watermark_ok
0.00 +0.2 0.17 ± 12% perf-profile.self.cycles-pp.__switch_to
0.28 ± 6% +0.2 0.47 ± 6% perf-profile.self.cycles-pp._raw_spin_lock_irqsave
0.00 +0.3 0.30 ± 18% perf-profile.self.cycles-pp.__schedule
0.04 ± 72% +0.4 0.40 ± 21% perf-profile.self.cycles-pp.__skb_recv_udp
0.00 +0.7 0.69 ± 70% perf-profile.self.cycles-pp.poll_idle
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
5 months, 2 weeks
[mm/slub] ba10d4b466: BUG:sleeping_function_called_from_invalid_context_at_mm/page_alloc.c
by kernel test robot
Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: ba10d4b46655ce4839061a815cac33245c2ebcc2 ("mm/slub: use stackdepot to save stack trace in objects")
https://git.kernel.org/cgit/linux/kernel/git/vbabka/linux.git slub-stackdepot-v1
in testcase: boot
on test machine: qemu-system-x86_64 -enable-kvm -cpu Icelake-Server -smp 4 -m 16G
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 3.221599][ T1] BUG: sleeping function called from invalid context at mm/page_alloc.c:5164
[ 3.222568][ T1] in_atomic(): 1, irqs_disabled(): 0, non_block: 0, pid: 1, name: swapper
[ 3.223483][ T1] preempt_count: 1, expected: 0
[ 3.224023][ T1] no locks held by swapper/1.
[ 3.224540][ T1] CPU: 0 PID: 1 Comm: swapper Not tainted 5.17.0-rc1-00002-gba10d4b46655 #1 25fb037e45765c0b5aa6dd8ac5af370d377555f5
[ 3.225961][ T1] Hardware name: QEMU Standard PC (i440FX + PIIX, 1996), BIOS 1.12.0-1 04/01/2014
[ 3.226970][ T1] Call Trace:
[ 3.227451][ T1] dump_stack_lvl (lib/dump_stack.c:108)
[ 3.227964][ T1] dump_stack (lib/dump_stack.c:114)
[ 3.228415][ T1] __might_resched.cold (kernel/sched/core.c:9584)
[ 3.228971][ T1] __might_sleep (kernel/sched/core.c:9513 (discriminator 14))
[ 3.229486][ T1] __alloc_pages (mm/page_alloc.c:5164 mm/page_alloc.c:5378)
[ 3.230000][ T1] __stack_depot_save (lib/stackdepot.c:396)
[ 3.230562][ T1] stack_depot_save (lib/stackdepot.c:453)
[ 3.231073][ T1] set_track (mm/slub.c:743)
[ 3.231451][ T1] ? __slab_alloc+0x10/0x20
[ 3.232133][ T1] ? kmem_cache_alloc (mm/slub.c:3193 mm/slub.c:3235 mm/slub.c:3240)
[ 3.232688][ T1] ? __kernfs_new_node (include/linux/slab.h:705 fs/kernfs/dir.c:585)
[ 3.233252][ T1] ? kernfs_new_node (fs/kernfs/dir.c:649)
[ 3.233798][ T1] ? __kernfs_create_file (fs/kernfs/file.c:989)
[ 3.234523][ T1] ? sysfs_add_file_mode_ns (fs/sysfs/file.c:296)
[ 3.235454][ T1] ? sysfs_create_file_ns (fs/sysfs/file.c:355)
[ 3.236324][ T1] ? bus_create_file (include/linux/sysfs.h:607 drivers/base/bus.c:135)
[ 3.237188][ T1] ? bus_register (drivers/base/bus.c:559 drivers/base/bus.c:831)
[ 3.238047][ T1] ? subsys_register (drivers/base/bus.c:1083)
[ 3.238584][ T1] ? subsys_system_register (drivers/base/bus.c:1140)
[ 3.239451][ T1] ? cpu_dev_init (drivers/base/cpu.c:608)
[ 3.239959][ T1] ? driver_init (drivers/base/init.c:38)
[ 3.240445][ T1] ? kernel_init_freeable (init/main.c:1406 init/main.c:1613)
[ 3.241027][ T1] ? kernel_init (init/main.c:1504)
[ 3.241620][ T1] ? ret_from_fork (arch/x86/entry/entry_32.S:772)
[ 3.242130][ T1] alloc_debug_processing (mm/slub.c:1316)
[ 3.242711][ T1] ? __kernfs_new_node (include/linux/slab.h:705 fs/kernfs/dir.c:585)
[ 3.243263][ T1] ___slab_alloc+0x7a7/0xa30
[ 3.243451][ T1] ? __kernfs_new_node (include/linux/slab.h:705 fs/kernfs/dir.c:585)
[ 3.244007][ T1] ? sched_clock_cpu (kernel/sched/clock.c:382)
[ 3.244535][ T1] ? __kernfs_new_node (include/linux/slab.h:705 fs/kernfs/dir.c:585)
[ 3.245111][ T1] __slab_alloc+0x10/0x20
[ 3.245765][ T1] kmem_cache_alloc (mm/slub.c:3193 mm/slub.c:3235 mm/slub.c:3240)
[ 3.246302][ T1] ? __kernfs_new_node (include/linux/slab.h:705 fs/kernfs/dir.c:585)
[ 3.246860][ T1] __kernfs_new_node (include/linux/slab.h:705 fs/kernfs/dir.c:585)
[ 3.247451][ T1] ? kernfs_activate (fs/kernfs/dir.c:1324)
[ 3.247979][ T1] ? lock_release (kernel/locking/lockdep.c:321 kernel/locking/lockdep.c:5321 kernel/locking/lockdep.c:5659)
[ 3.248504][ T1] ? up_write (kernel/locking/rwsem.c:1309 kernel/locking/rwsem.c:1567)
[ 3.249083][ T1] kernfs_new_node (fs/kernfs/dir.c:649)
[ 3.249587][ T1] __kernfs_create_file (fs/kernfs/file.c:989)
[ 3.250144][ T1] sysfs_add_file_mode_ns (fs/sysfs/file.c:296)
[ 3.250723][ T1] sysfs_create_file_ns (fs/sysfs/file.c:355)
[ 3.251451][ T1] bus_create_file (include/linux/sysfs.h:607 drivers/base/bus.c:135)
[ 3.251962][ T1] bus_register (drivers/base/bus.c:559 drivers/base/bus.c:831)
[ 3.252458][ T1] subsys_register (drivers/base/bus.c:1083)
[ 3.252960][ T1] subsys_system_register (drivers/base/bus.c:1140)
[ 3.253552][ T1] cpu_dev_init (drivers/base/cpu.c:608)
[ 3.254029][ T1] driver_init (drivers/base/init.c:38)
[ 3.254500][ T1] kernel_init_freeable (init/main.c:1406 init/main.c:1613)
[ 3.255451][ T1] ? rest_init (init/main.c:1494)
[ 3.255926][ T1] kernel_init (init/main.c:1504)
[ 3.256402][ T1] ret_from_fork (arch/x86/entry/entry_32.S:772)
[ 3.256965][ T10] Callback from call_rcu_tasks_trace() invoked.
[ 3.258372][ T1] version magic: 0x4139332a
[ 3.261460][ T1] clocksource: jiffies: mask: 0xffffffff max_cycles: 0xffffffff, max_idle_ns: 7645041785100000 ns
[ 3.262648][ T1] futex hash table entries: 256 (order: 1, 14336 bytes, linear)
[ 3.263896][ T1] prandom: seed boundary self test passed
[ 3.265307][ T1] prandom: 100 self tests passed
[ 3.268518][ T1] prandom32: self test passed (less than 6 bits correlated)
[ 3.269330][ T1] pinctrl core: initialized pinctrl subsystem
[ 3.271001][ T1] regulator-dummy: no parameters, enabled
[ 3.272379][ T1] NET: Registered PF_NETLINK/PF_ROUTE protocol family
[ 3.276474][ T1] thermal_sys: Registered thermal governor 'fair_share'
[ 3.276479][ T1] thermal_sys: Registered thermal governor 'bang_bang'
[ 3.277252][ T1] thermal_sys: Registered thermal governor 'step_wise'
[ 3.278030][ T1] thermal_sys: Registered thermal governor 'user_space'
[ 3.278885][ T1] EISA bus registered
[ 3.279899][ T1] cpuidle: using governor ladder
[ 3.280455][ T1] cpuidle: using governor menu
[ 3.282429][ T1] PCI: PCI BIOS area is rw and x. Use pci=nobios if you want it NX.
[ 3.283452][ T1] PCI: PCI BIOS revision 2.10 entry at 0xfd1bc, last bus=0
[ 3.284246][ T1] PCI: Using configuration type 1 for base access
[ 3.315570][ T9] Callback from call_rcu_tasks_rude() invoked.
[ 3.317303][ T1] cryptd: max_cpu_qlen set to 1000
[ 3.319314][ T1] ACPI: Added _OSI(Module Device)
[ 3.319452][ T1] ACPI: Added _OSI(Processor Device)
[ 3.320039][ T1] ACPI: Added _OSI(3.0 _SCP Extensions)
[ 3.320737][ T1] ACPI: Added _OSI(Processor Aggregator Device)
[ 3.321862][ T1] ACPI: Added _OSI(Linux-Dell-Video)
[ 3.322475][ T1] ACPI: Added _OSI(Linux-Lenovo-NV-HDMI-Audio)
[ 3.323477][ T1] ACPI: Added _OSI(Linux-HPI-Hybrid-Graphics)
[ 3.347453][ T1] ACPI: 1 ACPI AML tables successfully acquired and loaded
[ 3.355907][ T1] ACPI: Interpreter enabled
[ 3.356554][ T1] ACPI: PM: (supports S0 S3 S5)
[ 3.357106][ T1] ACPI: Using PIC for interrupt routing
[ 3.357885][ T1] PCI: Using host bridge windows from ACPI; if necessary, use "pci=nocrs" and report a bug
[ 3.360665][ T1] ACPI: Enabled 2 GPEs in block 00 to 0F
[ 3.415485][ T8] Callback from call_rcu_tasks() invoked.
[ 3.419676][ T1] ACPI: PCI Root Bridge [PCI0] (domain 0000 [bus 00-ff])
[ 3.420506][ T1] acpi PNP0A03:00: _OSC: OS supports [ASPM ClockPM Segments HPX-Type3]
[ 3.421444][ T1] acpi PNP0A03:00: PCIe port services disabled; not requesting _OSC control
[ 3.422525][ T1] acpi PNP0A03:00: fail to add MMCONFIG information, can't access extended PCI configuration space under this bridge.
[ 3.424010][ T1] PCI host bridge to bus 0000:00
[ 3.424572][ T1] pci_bus 0000:00: root bus resource [io 0x0000-0x0cf7 window]
[ 3.425442][ T1] pci_bus 0000:00: root bus resource [io 0x0d00-0xffff window]
[ 3.426287][ T1] pci_bus 0000:00: root bus resource [mem 0x000a0000-0x000bffff window]
[ 3.427456][ T1] pci_bus 0000:00: root bus resource [mem 0xc0000000-0xfebfffff window]
[ 3.428378][ T1] pci_bus 0000:00: root bus resource [mem 0x440000000-0x4bfffffff window]
[ 3.429354][ T1] pci_bus 0000:00: root bus resource [bus 00-ff]
[ 3.430252][ T1] pci 0000:00:00.0: [8086:1237] type 00 class 0x060000
[ 3.432716][ T1] pci 0000:00:01.0: [8086:7000] type 00 class 0x060100
[ 3.434723][ T1] pci 0000:00:01.1: [8086:7010] type 00 class 0x010180
[ 3.439047][ T1] pci 0000:00:01.1: reg 0x20: [io 0xc040-0xc04f]
[ 3.441009][ T1] pci 0000:00:01.1: legacy IDE quirk: reg 0x10: [io 0x01f0-0x01f7]
[ 3.441899][ T1] pci 0000:00:01.1: legacy IDE quirk: reg 0x14: [io 0x03f6]
[ 3.442709][ T1] pci 0000:00:01.1: legacy IDE quirk: reg 0x18: [io 0x0170-0x0177]
[ 3.443451][ T1] pci 0000:00:01.1: legacy IDE quirk: reg 0x1c: [io 0x0376]
[ 3.444748][ T1] pci 0000:00:01.3: [8086:7113] type 00 class 0x068000
[ 3.446517][ T1] pci 0000:00:01.3: quirk: [io 0x0600-0x063f] claimed by PIIX4 ACPI
[ 3.447460][ T1] pci 0000:00:01.3: quirk: [io 0x0700-0x070f] claimed by PIIX4 SMB
[ 3.449315][ T1] pci 0000:00:02.0: [1234:1111] type 00 class 0x030000
[ 3.452102][ T1] pci 0000:00:02.0: reg 0x10: [mem 0xfd000000-0xfdffffff pref]
[ 3.455458][ T1] pci 0000:00:02.0: reg 0x18: [mem 0xfebf0000-0xfebf0fff]
[ 3.462033][ T1] pci 0000:00:02.0: reg 0x30: [mem 0xfebe0000-0xfebeffff pref]
[ 3.464240][ T1] pci 0000:00:03.0: [8086:100e] type 00 class 0x020000
[ 3.466435][ T1] pci 0000:00:03.0: reg 0x10: [mem 0xfebc0000-0xfebdffff]
[ 3.468144][ T1] pci 0000:00:03.0: reg 0x14: [io 0xc000-0xc03f]
[ 3.474352][ T1] pci 0000:00:03.0: reg 0x30: [mem 0xfeb80000-0xfebbffff pref]
[ 3.476395][ T1] pci 0000:00:04.0: [8086:25ab] type 00 class 0x088000
[ 3.477767][ T1] pci 0000:00:04.0: reg 0x10: [mem 0xfebf1000-0xfebf100f]
[ 3.482779][ T1] pci_bus 0000:00: on NUMA node 0
[ 3.489518][ T1] ACPI: PCI: Interrupt link LNKA configured for IRQ 10
[ 3.491898][ T1] ACPI: PCI: Interrupt link LNKB configured for IRQ 10
[ 3.493813][ T1] ACPI: PCI: Interrupt link LNKC configured for IRQ 11
[ 3.495734][ T1] ACPI: PCI: Interrupt link LNKD configured for IRQ 11
[ 3.496992][ T1] ACPI: PCI: Interrupt link LNKS configured for IRQ 9
[ 3.503271][ T1] pci 0000:00:02.0: vgaarb: setting as boot VGA device
[ 3.503442][ T1] pci 0000:00:02.0: vgaarb: VGA device added: decodes=io+mem,owns=io+mem,locks=none
[ 3.503454][ T1] pci 0000:00:02.0: vgaarb: bridge control possible
[ 3.504181][ T1] vgaarb: loaded
[ 3.508615][ T1] pps_core: LinuxPPS API ver. 1 registered
[ 3.509269][ T1] pps_core: Software ver. 5.3.6 - Copyright 2005-2007 Rodolfo Giometti <giometti(a)linux.it>
[ 3.511195][ T1] FPGA manager framework
[ 3.512014][ T1] Advanced Linux Sound Architecture Driver Initialized.
[ 3.514031][ T1] PCI: Using ACPI for IRQ routing
[ 3.514604][ T1] PCI: pci_cache_line_size set to 64 bytes
[ 3.515545][ T1] e820: reserve RAM buffer [mem 0x0009fc00-0x0009ffff]
[ 3.516340][ T1] e820: reserve RAM buffer [mem 0xbffdf000-0xbfffffff]
[ 3.518469][ T1] clocksource: Switched to clocksource kvm-clock
[ 3.775596][ T1] VFS: Disk quotas dquot_6.6.0
[ 3.776288][ T1] VFS: Dquot-cache hash table entries: 1024 (order 0, 4096 bytes)
[ 3.777411][ T1] FS-Cache: Loaded
[ 3.778112][ T1] pnp: PnP ACPI init
[ 3.780104][ T1] pnp 00:03: [dma 2]
[ 3.785160][ T1] pnp: PnP ACPI: found 7 devices
To reproduce:
# build kernel
cd linux
cp config-5.17.0-rc1-00002-gba10d4b46655 .config
make HOSTCC=gcc-9 CC=gcc-9 ARCH=i386 olddefconfig prepare modules_prepare bzImage modules
make HOSTCC=gcc-9 CC=gcc-9 ARCH=i386 INSTALL_MOD_PATH=<mod-install-dir> modules_install
cd <mod-install-dir>
find lib/ | cpio -o -H newc --quiet | gzip > modules.cgz
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> -m modules.cgz job-script # job-script is attached in this email
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
5 months, 2 weeks
[drm/selftests] 39ec47bbfd: kernel_BUG_at_drivers/gpu/drm/drm_buddy.c
by kernel test robot
Greeting,
FYI, we noticed the following commit (built with gcc-9):
commit: 39ec47bbfd5dd3cea0b711ee9f1acdca37399c86 ("[PATCH v2 2/7] drm/selftests: add drm buddy alloc limit testcase")
url: https://github.com/0day-ci/linux/commits/Arunpravin/drm-selftests-Move-i9...
patch link: https://lore.kernel.org/dri-devel/20220222174845.2175-2-Arunpravin.Paneer...
in testcase: boot
on test machine: qemu-system-x86_64 -enable-kvm -cpu Icelake-Server -smp 4 -m 16G
caused below changes (please refer to attached dmesg/kmsg for entire log/backtrace):
+---------------------------------------------------+------------+------------+
| | be9e8c6c00 | 39ec47bbfd |
+---------------------------------------------------+------------+------------+
| boot_successes | 14 | 0 |
| boot_failures | 0 | 16 |
| UBSAN:shift-out-of-bounds_in_include/linux/log2.h | 0 | 16 |
| kernel_BUG_at_drivers/gpu/drm/drm_buddy.c | 0 | 16 |
| invalid_opcode:#[##] | 0 | 16 |
| EIP:drm_buddy_init | 0 | 16 |
| Kernel_panic-not_syncing:Fatal_exception | 0 | 16 |
+---------------------------------------------------+------------+------------+
If you fix the issue, kindly add following tag
Reported-by: kernel test robot <oliver.sang(a)intel.com>
[ 68.124177][ T1] UBSAN: shift-out-of-bounds in include/linux/log2.h:67:13
[ 68.125333][ T1] shift exponent 4294967295 is too large for 32-bit type 'long unsigned int'
[ 68.126563][ T1] CPU: 0 PID: 1 Comm: swapper Not tainted 5.17.0-rc2-00311-g39ec47bbfd5d #2
[ 68.127758][ T1] Call Trace:
[ 68.128187][ T1] dump_stack_lvl (lib/dump_stack.c:108)
[ 68.128793][ T1] dump_stack (lib/dump_stack.c:114)
[ 68.129331][ T1] ubsan_epilogue (lib/ubsan.c:152)
[ 68.129958][ T1] __ubsan_handle_shift_out_of_bounds.cold (arch/x86/include/asm/smap.h:85)
[ 68.130791][ T1] ? drm_block_alloc+0x28/0x80
[ 68.131582][ T1] ? rcu_read_lock_sched_held (kernel/rcu/update.c:125)
[ 68.132215][ T1] ? kmem_cache_alloc (include/trace/events/kmem.h:54 mm/slab.c:3501)
[ 68.132878][ T1] ? mark_free+0x2e/0x80
[ 68.133524][ T1] drm_buddy_init.cold (include/linux/log2.h:67 drivers/gpu/drm/drm_buddy.c:131)
[ 68.134145][ T1] ? test_drm_cmdline_init (drivers/gpu/drm/selftests/test-drm_buddy.c:87)
[ 68.134770][ T1] igt_buddy_alloc_limit (drivers/gpu/drm/selftests/test-drm_buddy.c:30)
[ 68.135472][ T1] ? vprintk_default (kernel/printk/printk.c:2257)
[ 68.136057][ T1] ? test_drm_cmdline_init (drivers/gpu/drm/selftests/test-drm_buddy.c:87)
[ 68.136812][ T1] test_drm_buddy_init (drivers/gpu/drm/selftests/drm_selftest.c:77 drivers/gpu/drm/selftests/test-drm_buddy.c:95)
[ 68.137475][ T1] do_one_initcall (init/main.c:1300)
[ 68.138111][ T1] ? parse_args (kernel/params.c:609 kernel/params.c:146 kernel/params.c:188)
[ 68.138717][ T1] do_basic_setup (init/main.c:1372 init/main.c:1389 init/main.c:1408)
[ 68.139366][ T1] kernel_init_freeable (init/main.c:1617)
[ 68.140040][ T1] ? rest_init (init/main.c:1494)
[ 68.140634][ T1] kernel_init (init/main.c:1504)
[ 68.141155][ T1] ret_from_fork (arch/x86/entry/entry_32.S:772)
[ 68.141607][ T1] ================================================================================
[ 68.146730][ T1] ------------[ cut here ]------------
[ 68.147460][ T1] kernel BUG at drivers/gpu/drm/drm_buddy.c:140!
[ 68.148280][ T1] invalid opcode: 0000 [#1]
[ 68.148895][ T1] CPU: 0 PID: 1 Comm: swapper Not tainted 5.17.0-rc2-00311-g39ec47bbfd5d #2
[ 68.149896][ T1] EIP: drm_buddy_init (drivers/gpu/drm/drm_buddy.c:140 (discriminator 1))
[ 68.149896][ T1] Code: 76 00 b8 ea ff ff ff 8d 65 f4 5b 5e 5f 5d c3 8d 76 00 0f bd 45 d8 75 05 b8 ff ff ff ff 83 c0 21 e9 5e ff ff ff 8d 74 26 00 90 <0f> 0b 8d b6 00 00 00 00 0f 0b 8d b6 00 00 00 00 8b 5d 0c 0f bd 45
All code
========
0: 76 00 jbe 0x2
2: b8 ea ff ff ff mov $0xffffffea,%eax
7: 8d 65 f4 lea -0xc(%rbp),%esp
a: 5b pop %rbx
b: 5e pop %rsi
c: 5f pop %rdi
d: 5d pop %rbp
e: c3 retq
f: 8d 76 00 lea 0x0(%rsi),%esi
12: 0f bd 45 d8 bsr -0x28(%rbp),%eax
16: 75 05 jne 0x1d
18: b8 ff ff ff ff mov $0xffffffff,%eax
1d: 83 c0 21 add $0x21,%eax
20: e9 5e ff ff ff jmpq 0xffffffffffffff83
25: 8d 74 26 00 lea 0x0(%rsi,%riz,1),%esi
29: 90 nop
2a:* 0f 0b ud2 <-- trapping instruction
2c: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
32: 0f 0b ud2
34: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
3a: 8b 5d 0c mov 0xc(%rbp),%ebx
3d: 0f .byte 0xf
3e: bd .byte 0xbd
3f: 45 rex.RB
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
8: 0f 0b ud2
a: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
10: 8b 5d 0c mov 0xc(%rbp),%ebx
13: 0f .byte 0xf
14: bd .byte 0xbd
15: 45 rex.RB
[ 68.149896][ T1] EAX: 8578e658 EBX: 8578e618 ECX: 8578e658 EDX: 83717c98
[ 68.149896][ T1] ESI: 83675ee0 EDI: 00000034 EBP: 83675ec0 ESP: 83675e94
[ 68.149896][ T1] DS: 007b ES: 007b FS: 0000 GS: 0000 SS: 0068 EFLAGS: 00010297
[ 68.149896][ T1] CR0: 80050033 CR2: 77f35844 CR3: 02a10000 CR4: 00150ed0
[ 68.149896][ T1] DR0: 00000000 DR1: 00000000 DR2: 00000000 DR3: 00000000
[ 68.149896][ T1] DR6: fffe0ff0 DR7: 00000400
[ 68.149896][ T1] Call Trace:
[ 68.149896][ T1] ? test_drm_cmdline_init (drivers/gpu/drm/selftests/test-drm_buddy.c:87)
[ 68.149896][ T1] igt_buddy_alloc_limit (drivers/gpu/drm/selftests/test-drm_buddy.c:30)
[ 68.149896][ T1] ? vprintk_default (kernel/printk/printk.c:2257)
[ 68.149896][ T1] ? test_drm_cmdline_init (drivers/gpu/drm/selftests/test-drm_buddy.c:87)
[ 68.149896][ T1] test_drm_buddy_init (drivers/gpu/drm/selftests/drm_selftest.c:77 drivers/gpu/drm/selftests/test-drm_buddy.c:95)
[ 68.149896][ T1] do_one_initcall (init/main.c:1300)
[ 68.149896][ T1] ? parse_args (kernel/params.c:609 kernel/params.c:146 kernel/params.c:188)
[ 68.149896][ T1] do_basic_setup (init/main.c:1372 init/main.c:1389 init/main.c:1408)
[ 68.149896][ T1] kernel_init_freeable (init/main.c:1617)
[ 68.149896][ T1] ? rest_init (init/main.c:1494)
[ 68.149896][ T1] kernel_init (init/main.c:1504)
[ 68.149896][ T1] ret_from_fork (arch/x86/entry/entry_32.S:772)
[ 68.149896][ T1] Modules linked in:
[ 68.167316][ T1] ---[ end trace 0000000000000000 ]---
[ 68.168062][ T1] EIP: drm_buddy_init (drivers/gpu/drm/drm_buddy.c:140 (discriminator 1))
[ 68.168739][ T1] Code: 76 00 b8 ea ff ff ff 8d 65 f4 5b 5e 5f 5d c3 8d 76 00 0f bd 45 d8 75 05 b8 ff ff ff ff 83 c0 21 e9 5e ff ff ff 8d 74 26 00 90 <0f> 0b 8d b6 00 00 00 00 0f 0b 8d b6 00 00 00 00 8b 5d 0c 0f bd 45
All code
========
0: 76 00 jbe 0x2
2: b8 ea ff ff ff mov $0xffffffea,%eax
7: 8d 65 f4 lea -0xc(%rbp),%esp
a: 5b pop %rbx
b: 5e pop %rsi
c: 5f pop %rdi
d: 5d pop %rbp
e: c3 retq
f: 8d 76 00 lea 0x0(%rsi),%esi
12: 0f bd 45 d8 bsr -0x28(%rbp),%eax
16: 75 05 jne 0x1d
18: b8 ff ff ff ff mov $0xffffffff,%eax
1d: 83 c0 21 add $0x21,%eax
20: e9 5e ff ff ff jmpq 0xffffffffffffff83
25: 8d 74 26 00 lea 0x0(%rsi,%riz,1),%esi
29: 90 nop
2a:* 0f 0b ud2 <-- trapping instruction
2c: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
32: 0f 0b ud2
34: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
3a: 8b 5d 0c mov 0xc(%rbp),%ebx
3d: 0f .byte 0xf
3e: bd .byte 0xbd
3f: 45 rex.RB
Code starting with the faulting instruction
===========================================
0: 0f 0b ud2
2: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
8: 0f 0b ud2
a: 8d b6 00 00 00 00 lea 0x0(%rsi),%esi
10: 8b 5d 0c mov 0xc(%rbp),%ebx
13: 0f .byte 0xf
14: bd .byte 0xbd
15: 45 rex.RB
To reproduce:
# build kernel
cd linux
cp config-5.17.0-rc2-00311-g39ec47bbfd5d .config
make HOSTCC=gcc-9 CC=gcc-9 ARCH=i386 olddefconfig prepare modules_prepare bzImage modules
make HOSTCC=gcc-9 CC=gcc-9 ARCH=i386 INSTALL_MOD_PATH=<mod-install-dir> modules_install
cd <mod-install-dir>
find lib/ | cpio -o -H newc --quiet | gzip > modules.cgz
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
bin/lkp qemu -k <bzImage> -m modules.cgz job-script # job-script is attached in this email
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
5 months, 2 weeks
[mm/munlock] 07ca760673: stress-ng.tmpfs.ops_per_sec 24.4% improvement
by kernel test robot
Greeting,
FYI, we noticed a 24.4% improvement of stress-ng.tmpfs.ops_per_sec due to commit:
commit: 07ca760673088f262da57ff42c15558688565aa2 ("mm/munlock: maintain page->mlock_count while unevictable")
https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git master
in testcase: stress-ng
on test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz with 128G memory
with following parameters:
nr_threads: 100%
testtime: 60s
class: memory
test: tmpfs
cpufreq_governor: performance
ucode: 0xd000331
Details are as below:
-------------------------------------------------------------------------------------------------->
To reproduce:
git clone https://github.com/intel/lkp-tests.git
cd lkp-tests
sudo bin/lkp install job.yaml # job file is attached in this email
bin/lkp split-job --compatible job.yaml # generate the yaml file for lkp run
sudo bin/lkp run generated-yaml-file
# if come across any failure that blocks the test,
# please remove ~/.lkp and /lkp dir to run from a clean state.
=========================================================================================
class/compiler/cpufreq_governor/kconfig/nr_threads/rootfs/tbox_group/test/testcase/testtime/ucode:
memory/gcc-9/performance/x86_64-rhel-8.3/100%/debian-10.4-x86_64-20200603.cgz/lkp-icl-2sp6/tmpfs/stress-ng/60s/0xd000331
commit:
b109b87050 ("mm/munlock: replace clear_page_mlock() by final clearance")
07ca760673 ("mm/munlock: maintain page->mlock_count while unevictable")
b109b87050df5438 07ca760673088f262da57ff42c1
---------------- ---------------------------
%stddev %change %stddev
\ | \
39.44 ± 5% -18.7% 32.08 ± 3% stress-ng.time.elapsed_time
39.44 ± 5% -18.7% 32.08 ± 3% stress-ng.time.elapsed_time.max
10529 ± 6% -22.7% 8134 ± 5% stress-ng.time.involuntary_context_switches
9988 ± 2% -3.4% 9646 stress-ng.time.percent_of_cpu_this_job_got
2663 ± 8% -33.3% 1776 ± 5% stress-ng.time.system_time
1276 +3.2% 1317 stress-ng.time.user_time
2955 ± 5% +24.4% 3677 ± 2% stress-ng.tmpfs.ops_per_sec
630625 ± 6% -13.9% 543204 ± 11% numa-numastat.node1.numa_hit
12458183 ± 9% -29.8% 8745870 ± 3% turbostat.IRQ
22.83 ± 6% +25.5% 28.67 ± 3% vmstat.cpu.us
3825 ± 2% +15.7% 4424 vmstat.system.cs
291800 ± 4% -15.1% 247782 vmstat.system.in
160426 +11.2% 178462 ± 2% meminfo.Active
160426 +11.2% 178462 ± 2% meminfo.Active(anon)
235963 ± 5% -16.1% 197931 ± 2% meminfo.Mapped
117810 ± 6% -24.1% 89437 ± 3% meminfo.Mlocked
23.66 ± 5% +3.1 26.73 ± 3% mpstat.cpu.all.idle%
0.02 ± 27% +0.0 0.03 ± 18% mpstat.cpu.all.soft%
50.99 ± 4% -9.5 41.53 ± 3% mpstat.cpu.all.sys%
24.59 ± 5% +6.3 30.87 ± 3% mpstat.cpu.all.usr%
78820 ± 3% +14.4% 90181 ± 3% numa-meminfo.node0.Active
78820 ± 3% +14.4% 90181 ± 3% numa-meminfo.node0.Active(anon)
59902 ± 10% -23.5% 45810 ± 4% numa-meminfo.node0.Mlocked
152030 ± 18% -25.1% 113836 ± 12% numa-meminfo.node1.Inactive
152030 ± 18% -25.1% 113836 ± 12% numa-meminfo.node1.Inactive(anon)
57489 ± 10% -26.6% 42196 ± 7% numa-meminfo.node1.Mlocked
233645 ± 5% -14.0% 201029 ± 3% numa-meminfo.node1.Shmem
19568 ± 2% +14.8% 22461 ± 3% numa-vmstat.node0.nr_active_anon
14599 ± 9% -23.2% 11217 ± 7% numa-vmstat.node0.nr_mlock
19562 ± 2% +14.8% 22462 ± 3% numa-vmstat.node0.nr_zone_active_anon
37654 ± 18% -24.1% 28589 ± 10% numa-vmstat.node1.nr_inactive_anon
14616 ± 10% -27.5% 10599 ± 10% numa-vmstat.node1.nr_mlock
58341 ± 5% -14.1% 50124 ± 3% numa-vmstat.node1.nr_shmem
37656 ± 18% -24.1% 28586 ± 10% numa-vmstat.node1.nr_zone_inactive_anon
39985 +10.5% 44174 ± 2% proc-vmstat.nr_active_anon
725611 -1.3% 716100 proc-vmstat.nr_file_pages
121814 ± 2% -6.2% 114239 proc-vmstat.nr_inactive_anon
59502 ± 5% -16.7% 49539 proc-vmstat.nr_mapped
29485 ± 6% -24.6% 22222 ± 3% proc-vmstat.nr_mlock
115710 ± 3% -8.2% 106199 proc-vmstat.nr_shmem
639255 -1.1% 632112 proc-vmstat.nr_unevictable
39985 +10.5% 44174 ± 2% proc-vmstat.nr_zone_active_anon
121814 ± 2% -6.2% 114239 proc-vmstat.nr_zone_inactive_anon
639256 -1.1% 632112 proc-vmstat.nr_zone_unevictable
1233200 -7.1% 1145498 proc-vmstat.numa_hit
1117527 -7.9% 1029792 proc-vmstat.numa_local
1233229 -7.1% 1145564 proc-vmstat.pgalloc_normal
1000156 -4.7% 952725 proc-vmstat.pgfree
17338 -7.4% 16058 ± 3% proc-vmstat.pgreuse
7.718e+10 ± 4% +20.3% 9.289e+10 ± 2% perf-stat.i.branch-instructions
2.104e+08 ± 5% +19.5% 2.515e+08 ± 3% perf-stat.i.branch-misses
1.232e+08 ± 2% +13.2% 1.395e+08 ± 2% perf-stat.i.cache-misses
5.521e+08 ± 6% +13.8% 6.284e+08 ± 3% perf-stat.i.cache-references
3388 ± 2% +15.3% 3905 perf-stat.i.context-switches
1.04 ± 4% -19.6% 0.83 ± 2% perf-stat.i.cpi
3.32e+11 ± 2% -4.3% 3.176e+11 perf-stat.i.cpu-cycles
281.54 ± 4% +11.5% 313.86 ± 3% perf-stat.i.cpu-migrations
2740 -8.8% 2497 ± 6% perf-stat.i.cycles-between-cache-misses
3.883e+10 ± 4% +17.8% 4.575e+10 ± 2% perf-stat.i.dTLB-loads
37861529 ± 5% +22.5% 46399080 ± 2% perf-stat.i.dTLB-store-misses
2.93e+10 ± 5% +22.1% 3.577e+10 ± 2% perf-stat.i.dTLB-stores
3.177e+11 ± 4% +20.4% 3.824e+11 ± 2% perf-stat.i.instructions
1.01 ± 5% +24.4% 1.25 ± 2% perf-stat.i.ipc
1385 ± 6% +28.2% 1776 ± 4% perf-stat.i.major-faults
2.59 ± 2% -4.3% 2.48 perf-stat.i.metric.GHz
670.12 +17.6% 788.18 perf-stat.i.metric.K/sec
1138 ± 4% +16.7% 1329 ± 2% perf-stat.i.metric.M/sec
4303675 ± 5% +22.3% 5264697 ± 2% perf-stat.i.minor-faults
4223176 ± 9% +21.5% 5132933 ± 5% perf-stat.i.node-loads
12619285 ± 9% +26.6% 15970682 ± 5% perf-stat.i.node-stores
4305061 ± 5% +22.3% 5266473 ± 2% perf-stat.i.page-faults
1.74 ± 2% -5.4% 1.64 perf-stat.overall.MPKI
1.05 ± 5% -20.6% 0.83 ± 3% perf-stat.overall.cpi
2694 ± 2% -15.5% 2276 perf-stat.overall.cycles-between-cache-misses
0.96 ± 5% +25.7% 1.20 ± 3% perf-stat.overall.ipc
7.499e+10 ± 4% +20.1% 9.003e+10 ± 2% perf-stat.ps.branch-instructions
2.044e+08 ± 5% +19.2% 2.437e+08 ± 3% perf-stat.ps.branch-misses
1.197e+08 ± 2% +13.0% 1.352e+08 ± 2% perf-stat.ps.cache-misses
5.364e+08 ± 6% +13.5% 6.09e+08 ± 3% perf-stat.ps.cache-references
3296 ± 2% +14.8% 3784 perf-stat.ps.context-switches
3.223e+11 ± 2% -4.5% 3.078e+11 perf-stat.ps.cpu-cycles
273.70 ± 4% +11.1% 304.20 ± 3% perf-stat.ps.cpu-migrations
3.773e+10 ± 4% +17.5% 4.434e+10 ± 2% perf-stat.ps.dTLB-loads
36783176 ± 5% +22.3% 44970404 ± 2% perf-stat.ps.dTLB-store-misses
2.847e+10 ± 5% +21.8% 3.467e+10 ± 2% perf-stat.ps.dTLB-stores
3.087e+11 ± 4% +20.1% 3.706e+11 ± 2% perf-stat.ps.instructions
1347 ± 6% +27.7% 1721 ± 4% perf-stat.ps.major-faults
4181461 ± 5% +22.0% 5102628 ± 2% perf-stat.ps.minor-faults
4097306 ± 9% +21.4% 4973600 ± 5% perf-stat.ps.node-loads
12251815 ± 9% +26.3% 15476164 ± 5% perf-stat.ps.node-stores
4182809 ± 5% +22.0% 5104350 ± 2% perf-stat.ps.page-faults
1.243e+13 -1.7% 1.222e+13 perf-stat.total.instructions
38.79 ± 13% -33.6 5.18 ±101% perf-profile.calltrace.cycles-pp.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region.__do_munmap
39.23 ± 13% -33.6 5.65 ± 96% perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap
39.10 ± 13% -33.6 5.54 ± 97% perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.__do_munmap.__vm_munmap
40.08 ± 12% -33.5 6.58 ± 88% perf-profile.calltrace.cycles-pp.unmap_region.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64
42.54 ± 10% -33.2 9.33 ± 75% perf-profile.calltrace.cycles-pp.__do_munmap.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe
42.84 ± 10% -33.2 9.65 ± 73% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap.stress_oomable_child
42.75 ± 10% -33.2 9.57 ± 74% perf-profile.calltrace.cycles-pp.__vm_munmap.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap
42.77 ± 10% -33.2 9.60 ± 74% perf-profile.calltrace.cycles-pp.__x64_sys_munmap.do_syscall_64.entry_SYSCALL_64_after_hwframe.__munmap.stress_oomable_child
42.96 ± 10% -33.1 9.81 ± 73% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__munmap.stress_oomable_child
43.23 ± 10% -33.1 10.12 ± 72% perf-profile.calltrace.cycles-pp.__munmap.stress_oomable_child
79.55 -30.9 48.66 ± 51% perf-profile.calltrace.cycles-pp.stress_oomable_child
22.84 ± 14% -21.7 1.14 ±162% perf-profile.calltrace.cycles-pp.free_pages_and_swap_cache.tlb_flush_mmu.zap_pte_range.unmap_page_range.unmap_vmas
22.81 ± 14% -21.7 1.12 ±163% perf-profile.calltrace.cycles-pp.lru_add_drain.free_pages_and_swap_cache.tlb_flush_mmu.zap_pte_range.unmap_page_range
23.00 ± 14% -21.7 1.33 ±142% perf-profile.calltrace.cycles-pp.tlb_flush_mmu.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region
13.86 ± 15% -13.9 0.00 perf-profile.calltrace.cycles-pp.lru_add_drain_cpu.lru_add_drain.free_pages_and_swap_cache.tlb_flush_mmu.zap_pte_range
13.83 ± 15% -13.8 0.00 perf-profile.calltrace.cycles-pp.__pagevec_lru_add.lru_add_drain_cpu.lru_add_drain.free_pages_and_swap_cache.tlb_flush_mmu
13.62 ± 15% -13.6 0.00 perf-profile.calltrace.cycles-pp.isolate_lru_page.munlock_page.zap_pte_range.unmap_page_range.unmap_vmas
13.62 ± 15% -13.6 0.00 perf-profile.calltrace.cycles-pp.isolate_lru_page.mlock_page.do_set_pte.filemap_map_pages.do_fault
15.07 ± 14% -13.5 1.59 ±168% perf-profile.calltrace.cycles-pp.handle_mm_fault.__get_user_pages.populate_vma_page_range.__mm_populate.vm_mmap_pgoff
15.07 ± 14% -13.5 1.59 ±168% perf-profile.calltrace.cycles-pp.__handle_mm_fault.handle_mm_fault.__get_user_pages.populate_vma_page_range.__mm_populate
15.06 ± 14% -13.5 1.59 ±167% perf-profile.calltrace.cycles-pp.do_fault.__handle_mm_fault.handle_mm_fault.__get_user_pages.populate_vma_page_range
15.06 ± 14% -13.5 1.59 ±167% perf-profile.calltrace.cycles-pp.filemap_map_pages.do_fault.__handle_mm_fault.handle_mm_fault.__get_user_pages
15.18 ± 14% -13.5 1.71 ±155% perf-profile.calltrace.cycles-pp.__mm_populate.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe
15.18 ± 14% -13.5 1.71 ±155% perf-profile.calltrace.cycles-pp.populate_vma_page_range.__mm_populate.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64
15.01 ± 14% -13.5 1.54 ±169% perf-profile.calltrace.cycles-pp.do_set_pte.filemap_map_pages.do_fault.__handle_mm_fault.handle_mm_fault
15.18 ± 14% -13.5 1.70 ±155% perf-profile.calltrace.cycles-pp.__get_user_pages.populate_vma_page_range.__mm_populate.vm_mmap_pgoff.ksys_mmap_pgoff
13.43 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irq.isolate_lru_page.munlock_page.zap_pte_range.unmap_page_range
13.40 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irq.isolate_lru_page.mlock_page.do_set_pte.filemap_map_pages
13.38 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irq.folio_lruvec_lock_irq.isolate_lru_page.munlock_page.zap_pte_range
13.38 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irq.folio_lruvec_lock_irq.isolate_lru_page.mlock_page.do_set_pte
13.35 ± 15% -13.3 0.00 perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.__pagevec_lru_add.lru_add_drain_cpu.lru_add_drain.free_pages_and_swap_cache
13.30 ± 15% -13.3 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.__pagevec_lru_add.lru_add_drain_cpu.lru_add_drain
13.28 ± 15% -13.3 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irq.folio_lruvec_lock_irq.isolate_lru_page.mlock_page
13.26 ± 15% -13.3 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irq.folio_lruvec_lock_irq.isolate_lru_page.munlock_page
13.17 ± 16% -13.2 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.__pagevec_lru_add.lru_add_drain_cpu
17.87 ± 9% -13.0 4.91 ± 76% perf-profile.calltrace.cycles-pp.vm_mmap_pgoff.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
17.97 ± 9% -12.9 5.02 ± 74% perf-profile.calltrace.cycles-pp.ksys_mmap_pgoff.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
18.02 ± 9% -12.9 5.09 ± 74% perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.__mmap
18.01 ± 9% -12.9 5.08 ± 74% perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.__mmap
18.15 ± 9% -12.9 5.25 ± 73% perf-profile.calltrace.cycles-pp.__mmap
13.83 ± 15% -12.2 1.60 ±169% perf-profile.calltrace.cycles-pp.munlock_page.zap_pte_range.unmap_page_range.unmap_vmas.unmap_region
13.65 ± 15% -12.1 1.51 ±169% perf-profile.calltrace.cycles-pp.mlock_page.do_set_pte.filemap_map_pages.do_fault.__handle_mm_fault
8.93 ± 14% -7.8 1.09 ±164% perf-profile.calltrace.cycles-pp.pagevec_lru_move_fn.lru_add_drain.free_pages_and_swap_cache.tlb_flush_mmu.zap_pte_range
8.53 ± 15% -7.6 0.93 ±170% perf-profile.calltrace.cycles-pp.folio_lruvec_lock_irqsave.pagevec_lru_move_fn.lru_add_drain.free_pages_and_swap_cache.tlb_flush_mmu
8.47 ± 15% -7.6 0.91 ±171% perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.pagevec_lru_move_fn.lru_add_drain.free_pages_and_swap_cache
8.38 ± 15% -7.5 0.88 ±172% perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.folio_lruvec_lock_irqsave.pagevec_lru_move_fn.lru_add_drain
0.00 +2.9 2.95 ± 92% perf-profile.calltrace.cycles-pp.menu_select.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
0.10 ±223% +3.9 3.98 ± 97% perf-profile.calltrace.cycles-pp.__hrtimer_run_queues.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
0.13 ±223% +5.9 6.02 ± 87% perf-profile.calltrace.cycles-pp.hrtimer_interrupt.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state
0.14 ±223% +6.0 6.18 ± 88% perf-profile.calltrace.cycles-pp.__sysvec_apic_timer_interrupt.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter
0.20 ±223% +9.5 9.66 ± 85% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.do_idle
0.22 ±223% +12.0 12.18 ± 84% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry
0.72 ±114% +21.5 22.26 ± 59% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry
1.34 ± 87% +33.8 35.10 ± 68% perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary
1.39 ± 87% +34.5 35.90 ± 67% perf-profile.calltrace.cycles-pp.cpuidle_enter.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
1.53 ± 89% +38.5 39.98 ± 70% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
1.53 ± 89% +38.5 40.04 ± 70% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.secondary_startup_64_no_verify
1.53 ± 89% +38.5 40.04 ± 70% perf-profile.calltrace.cycles-pp.start_secondary.secondary_startup_64_no_verify
1.54 ± 89% +38.8 40.32 ± 70% perf-profile.calltrace.cycles-pp.secondary_startup_64_no_verify
49.06 ± 15% -45.4 3.62 ±171% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
63.53 ± 9% -42.6 20.90 ± 55% perf-profile.children.cycles-pp.do_syscall_64
63.70 ± 9% -42.6 21.12 ± 55% perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
38.83 ± 13% -33.6 5.24 ±100% perf-profile.children.cycles-pp.zap_pte_range
39.24 ± 13% -33.6 5.69 ± 95% perf-profile.children.cycles-pp.unmap_vmas
39.12 ± 13% -33.5 5.58 ± 96% perf-profile.children.cycles-pp.unmap_page_range
40.10 ± 12% -33.5 6.60 ± 88% perf-profile.children.cycles-pp.unmap_region
42.57 ± 10% -33.2 9.39 ± 74% perf-profile.children.cycles-pp.__do_munmap
42.76 ± 10% -33.2 9.60 ± 74% perf-profile.children.cycles-pp.__vm_munmap
42.77 ± 10% -33.2 9.61 ± 74% perf-profile.children.cycles-pp.__x64_sys_munmap
43.32 ± 10% -33.1 10.21 ± 72% perf-profile.children.cycles-pp.__munmap
79.55 -30.9 48.66 ± 51% perf-profile.children.cycles-pp.stress_oomable_child
27.28 ± 15% -27.3 0.00 perf-profile.children.cycles-pp.isolate_lru_page
26.84 ± 15% -24.0 2.82 ±169% perf-profile.children.cycles-pp.folio_lruvec_lock_irq
26.77 ± 15% -23.9 2.87 ±165% perf-profile.children.cycles-pp._raw_spin_lock_irq
22.82 ± 15% -21.9 0.96 ±166% perf-profile.children.cycles-pp.folio_lruvec_lock_irqsave
22.90 ± 14% -21.6 1.30 ±136% perf-profile.children.cycles-pp.lru_add_drain
22.85 ± 14% -21.6 1.30 ±136% perf-profile.children.cycles-pp.free_pages_and_swap_cache
23.04 ± 14% -21.5 1.53 ±118% perf-profile.children.cycles-pp.tlb_flush_mmu
22.76 ± 15% -21.5 1.25 ±118% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
15.03 ± 14% -15.0 0.00 perf-profile.children.cycles-pp.__pagevec_lru_add
13.92 ± 14% -13.8 0.08 ± 55% perf-profile.children.cycles-pp.lru_add_drain_cpu
15.37 ± 14% -13.5 1.90 ±141% perf-profile.children.cycles-pp.do_set_pte
15.18 ± 14% -13.4 1.79 ±145% perf-profile.children.cycles-pp.__mm_populate
15.18 ± 14% -13.4 1.79 ±145% perf-profile.children.cycles-pp.populate_vma_page_range
15.18 ± 14% -13.4 1.80 ±143% perf-profile.children.cycles-pp.__get_user_pages
16.02 ± 13% -13.3 2.68 ±108% perf-profile.children.cycles-pp.do_fault
15.06 ± 14% -13.3 1.76 ±145% perf-profile.children.cycles-pp.filemap_map_pages
16.20 ± 12% -13.2 2.98 ± 97% perf-profile.children.cycles-pp.__handle_mm_fault
16.42 ± 12% -13.2 3.22 ± 92% perf-profile.children.cycles-pp.handle_mm_fault
17.88 ± 9% -12.9 4.96 ± 74% perf-profile.children.cycles-pp.vm_mmap_pgoff
17.97 ± 9% -12.9 5.06 ± 74% perf-profile.children.cycles-pp.ksys_mmap_pgoff
18.20 ± 9% -12.9 5.32 ± 72% perf-profile.children.cycles-pp.__mmap
13.83 ± 15% -12.1 1.75 ±150% perf-profile.children.cycles-pp.munlock_page
13.65 ± 15% -12.1 1.60 ±157% perf-profile.children.cycles-pp.mlock_page
8.94 ± 14% -7.8 1.18 ±148% perf-profile.children.cycles-pp.pagevec_lru_move_fn
0.23 ± 4% -0.1 0.10 ± 75% perf-profile.children.cycles-pp.__list_del_entry_valid
0.06 ± 16% +0.0 0.10 ± 20% perf-profile.children.cycles-pp.fput_many
0.01 ±223% +0.1 0.10 ± 52% perf-profile.children.cycles-pp.__fput
0.01 ±223% +0.1 0.10 ± 49% perf-profile.children.cycles-pp.task_work_run
0.02 ±141% +0.1 0.12 ± 37% perf-profile.children.cycles-pp._find_next_bit
0.10 ± 20% +0.1 0.22 ± 40% perf-profile.children.cycles-pp.exit_to_user_mode_prepare
0.15 ± 18% +0.1 0.28 ± 35% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.00 +0.1 0.14 ± 57% perf-profile.children.cycles-pp.rcu_core
0.00 +0.1 0.14 ± 92% perf-profile.children.cycles-pp.exit_mmap
0.00 +0.1 0.14 ± 91% perf-profile.children.cycles-pp.mmput
0.00 +0.2 0.15 ± 82% perf-profile.children.cycles-pp.__x64_sys_exit_group
0.00 +0.2 0.19 ±122% perf-profile.children.cycles-pp.__schedule
0.00 +0.2 0.19 ± 48% perf-profile.children.cycles-pp._raw_spin_trylock
0.00 +0.2 0.20 ±100% perf-profile.children.cycles-pp.do_group_exit
0.00 +0.2 0.20 ±100% perf-profile.children.cycles-pp.do_exit
0.00 +0.2 0.22 ± 62% perf-profile.children.cycles-pp.timekeeping_max_deferment
0.00 +0.2 0.22 ± 53% perf-profile.children.cycles-pp.__intel_pmu_enable_all
0.01 ±223% +0.3 0.28 ± 85% perf-profile.children.cycles-pp.update_rq_clock
0.00 +0.3 0.28 ± 63% perf-profile.children.cycles-pp.start_kernel
0.01 ±223% +0.3 0.29 ± 84% perf-profile.children.cycles-pp.rcu_sched_clock_irq
0.00 +0.3 0.28 ± 66% perf-profile.children.cycles-pp.arch_scale_freq_tick
0.00 +0.3 0.31 ± 83% perf-profile.children.cycles-pp.update_blocked_averages
0.00 +0.3 0.32 ± 81% perf-profile.children.cycles-pp.run_rebalance_domains
0.12 ± 22% +0.3 0.44 ± 57% perf-profile.children.cycles-pp._raw_spin_lock
0.00 +0.3 0.33 ± 77% perf-profile.children.cycles-pp.ktime_get_update_offsets_now
0.01 ±223% +0.3 0.34 ± 94% perf-profile.children.cycles-pp.update_sd_lb_stats
0.00 +0.3 0.34 ± 48% perf-profile.children.cycles-pp.tick_nohz_irq_exit
0.01 ±223% +0.3 0.35 ± 91% perf-profile.children.cycles-pp.find_busiest_group
0.00 +0.3 0.34 ±123% perf-profile.children.cycles-pp.hrtimer_next_event_without
0.00 +0.3 0.35 ±107% perf-profile.children.cycles-pp.get_next_timer_interrupt
0.00 +0.4 0.36 ± 87% perf-profile.children.cycles-pp.rcu_idle_exit
0.01 ±223% +0.4 0.38 ±103% perf-profile.children.cycles-pp.timerqueue_del
0.01 ±223% +0.4 0.43 ± 92% perf-profile.children.cycles-pp.__remove_hrtimer
0.01 ±223% +0.5 0.47 ±104% perf-profile.children.cycles-pp.irqtime_account_irq
0.01 ±223% +0.5 0.50 ± 80% perf-profile.children.cycles-pp.load_balance
0.01 ±223% +0.5 0.53 ± 95% perf-profile.children.cycles-pp.native_sched_clock
0.01 ±223% +0.6 0.62 ± 89% perf-profile.children.cycles-pp.lapic_next_deadline
0.01 ±223% +0.6 0.63 ± 93% perf-profile.children.cycles-pp.sched_clock_cpu
0.01 ±223% +0.6 0.65 ± 71% perf-profile.children.cycles-pp.perf_mux_hrtimer_handler
0.01 ±223% +0.6 0.66 ± 66% perf-profile.children.cycles-pp.rebalance_domains
0.01 ±223% +0.7 0.66 ±100% perf-profile.children.cycles-pp.read_tsc
0.47 ± 18% +0.7 1.19 ± 11% perf-profile.children.cycles-pp.native_irq_return_iret
0.02 ±223% +0.8 0.86 ± 62% perf-profile.children.cycles-pp.tick_nohz_next_event
0.02 ±223% +0.9 0.94 ± 99% perf-profile.children.cycles-pp.tick_irq_enter
0.02 ±223% +1.0 0.99 ±101% perf-profile.children.cycles-pp.irq_enter_rcu
0.13 ± 43% +1.0 1.11 ± 90% perf-profile.children.cycles-pp.scheduler_tick
0.08 ± 36% +1.1 1.13 ± 65% perf-profile.children.cycles-pp.clockevents_program_event
0.02 ±223% +1.3 1.30 ± 82% perf-profile.children.cycles-pp.tick_nohz_get_sleep_length
0.04 ±124% +1.3 1.34 ± 66% perf-profile.children.cycles-pp.__softirqentry_text_start
0.08 ± 32% +1.4 1.50 ± 54% perf-profile.children.cycles-pp.ktime_get
0.24 ± 55% +1.6 1.84 ± 44% perf-profile.children.cycles-pp.kthread
0.08 ± 63% +1.6 1.68 ± 69% perf-profile.children.cycles-pp.irq_exit_rcu
0.24 ± 55% +1.6 1.84 ± 44% perf-profile.children.cycles-pp.ret_from_fork
0.18 ± 61% +1.8 2.02 ± 95% perf-profile.children.cycles-pp.update_process_times
0.20 ± 64% +2.0 2.18 ± 97% perf-profile.children.cycles-pp.tick_sched_handle
0.21 ± 64% +2.2 2.43 ± 93% perf-profile.children.cycles-pp.tick_sched_timer
0.09 ±129% +2.9 2.99 ± 92% perf-profile.children.cycles-pp.menu_select
0.30 ± 66% +3.8 4.12 ± 93% perf-profile.children.cycles-pp.__hrtimer_run_queues
0.43 ± 60% +5.8 6.20 ± 84% perf-profile.children.cycles-pp.hrtimer_interrupt
0.44 ± 62% +5.9 6.36 ± 85% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
0.60 ± 64% +9.3 9.91 ± 83% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
0.71 ± 60% +10.7 11.44 ± 81% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
0.88 ± 82% +21.6 22.48 ± 59% perf-profile.children.cycles-pp.intel_idle
1.40 ± 87% +34.7 36.14 ± 67% perf-profile.children.cycles-pp.cpuidle_enter_state
1.40 ± 87% +34.8 36.18 ± 67% perf-profile.children.cycles-pp.cpuidle_enter
1.53 ± 89% +38.5 40.04 ± 70% perf-profile.children.cycles-pp.start_secondary
1.54 ± 89% +38.8 40.32 ± 70% perf-profile.children.cycles-pp.secondary_startup_64_no_verify
1.54 ± 89% +38.8 40.32 ± 70% perf-profile.children.cycles-pp.cpu_startup_entry
1.54 ± 89% +38.8 40.32 ± 70% perf-profile.children.cycles-pp.do_idle
49.06 ± 15% -45.4 3.62 ±171% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
0.23 ± 4% -0.1 0.10 ± 75% perf-profile.self.cycles-pp.__list_del_entry_valid
0.27 ± 8% -0.1 0.14 ± 37% perf-profile.self.cycles-pp.release_pages
0.01 ±223% +0.1 0.11 ± 28% perf-profile.self.cycles-pp._find_next_bit
0.07 ± 20% +0.1 0.18 ± 39% perf-profile.self.cycles-pp.error_entry
0.00 +0.2 0.18 ± 67% perf-profile.self.cycles-pp.asm_sysvec_apic_timer_interrupt
0.00 +0.2 0.19 ± 48% perf-profile.self.cycles-pp._raw_spin_trylock
0.00 +0.2 0.21 ± 71% perf-profile.self.cycles-pp.ktime_get_update_offsets_now
0.00 +0.2 0.21 ± 76% perf-profile.self.cycles-pp.tick_nohz_next_event
0.00 +0.2 0.22 ± 53% perf-profile.self.cycles-pp.__intel_pmu_enable_all
0.00 +0.2 0.23 ±121% perf-profile.self.cycles-pp.update_process_times
0.00 +0.2 0.24 ±100% perf-profile.self.cycles-pp.update_sd_lb_stats
0.01 ±223% +0.3 0.27 ± 82% perf-profile.self.cycles-pp.rcu_sched_clock_irq
0.00 +0.3 0.28 ± 67% perf-profile.self.cycles-pp.arch_scale_freq_tick
0.11 ± 24% +0.3 0.41 ± 57% perf-profile.self.cycles-pp._raw_spin_lock
0.01 ±223% +0.4 0.42 ±118% perf-profile.self.cycles-pp.do_idle
0.01 ±223% +0.5 0.51 ± 95% perf-profile.self.cycles-pp.native_sched_clock
0.01 ±223% +0.6 0.62 ± 89% perf-profile.self.cycles-pp.lapic_next_deadline
0.01 ±223% +0.6 0.64 ±100% perf-profile.self.cycles-pp.read_tsc
0.47 ± 18% +0.7 1.19 ± 11% perf-profile.self.cycles-pp.native_irq_return_iret
0.07 ± 33% +0.9 0.95 ± 35% perf-profile.self.cycles-pp.ktime_get
0.03 ±223% +1.4 1.39 ± 98% perf-profile.self.cycles-pp.menu_select
0.04 ±171% +2.0 2.02 ± 72% perf-profile.self.cycles-pp.cpuidle_enter_state
0.88 ± 82% +21.6 22.48 ± 59% perf-profile.self.cycles-pp.intel_idle
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
---
0DAY/LKP+ Test Infrastructure Open Source Technology Center
https://lists.01.org/hyperkitty/list/[email protected] Intel Corporation
Thanks,
Oliver Sang
5 months, 2 weeks