[linus:master] [nfs] 49b29a573d: filebench.sum_operations/s -85.6% regression
From: kernel test robot
Date: Thu Aug 08 2024 - 03:35:59 EST
Hello,
kernel test robot noticed a -85.6% regression of filebench.sum_operations/s on:
commit: 49b29a573da83b65d5f4ecf2db6619bab7aa910c ("nfs: add support for large folios")
https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
testcase: filebench
test machine: 128 threads 2 sockets Intel(R) Xeon(R) Platinum 8358 CPU @ 2.60GHz (Ice Lake) with 128G memory
parameters:
disk: 1HDD
fs: ext4
fs2: nfsv4
test: randomrw.f
cpufreq_governor: performance
If you fix the issue in a separate patch/commit (i.e. not just a new version of
the same patch/commit), kindly add the following tags
| Reported-by: kernel test robot <oliver.sang@xxxxxxxxx>
| Closes: https://lore.kernel.org/oe-lkp/202408081514.106c770e-oliver.sang@xxxxxxxxx
Details are as below:
-------------------------------------------------------------------------------------------------->
The kernel config and materials to reproduce are available at:
https://download.01.org/0day-ci/archive/20240808/202408081514.106c770e-oliver.sang@xxxxxxxxx
=========================================================================================
compiler/cpufreq_governor/disk/fs2/fs/kconfig/rootfs/tbox_group/test/testcase:
gcc-13/performance/1HDD/nfsv4/ext4/x86_64-rhel-8.3/debian-12-x86_64-20240206.cgz/lkp-icl-2sp6/randomrw.f/filebench
commit:
9aac777aaf ("filemap: Convert generic_perform_write() to support large folios")
49b29a573d ("nfs: add support for large folios")
9aac777aaf945978 49b29a573da83b65d5f4ecf2db6
---------------- ---------------------------
%stddev %change %stddev
\ | \
98.36 -1.0% 97.34 iostat.cpu.idle
1.28 ± 21% +89.3% 2.42 ± 10% iostat.cpu.iowait
203868 ± 9% -25.7% 151373 ± 17% numa-meminfo.node0.SUnreclaim
1026057 ± 40% -76.5% 241071 ± 70% numa-meminfo.node1.Dirty
1693537 ± 24% -74.2% 437697 ± 27% sched_debug.cpu.nr_switches.max
191731 ± 18% -61.9% 73070 ± 25% sched_debug.cpu.nr_switches.stddev
1849853 ± 66% +175.6% 5097620 ± 32% numa-vmstat.node0.nr_dirtied
50982 ± 9% -25.8% 37842 ± 17% numa-vmstat.node0.nr_slab_unreclaimable
1849844 ± 66% +175.6% 5097604 ± 32% numa-vmstat.node0.nr_written
256902 ± 40% -76.9% 59457 ± 71% numa-vmstat.node1.nr_dirty
43810 +35.9% 59529 ± 5% vmstat.io.bo
1.65 ± 22% +93.6% 3.19 ± 9% vmstat.procs.b
1.54 ± 3% -11.2% 1.37 vmstat.procs.r
4341 ± 3% -18.4% 3543 ± 4% vmstat.system.in
3713739 ± 3% +10.7% 4110439 ± 3% meminfo.Active
3688730 ± 2% +10.7% 4085171 ± 3% meminfo.Active(file)
1631748 ± 3% -63.1% 602285 ± 12% meminfo.Dirty
398961 -23.8% 304003 meminfo.SUnreclaim
655165 -16.2% 549118 meminfo.Slab
1411642 ± 3% +56.0% 2202614 ± 5% meminfo.Writeback
1.29 ± 21% +1.2 2.44 ± 10% mpstat.cpu.all.iowait%
0.02 ± 2% +0.0 0.02 ± 5% mpstat.cpu.all.soft%
0.28 ± 5% -0.1 0.15 ± 6% mpstat.cpu.all.sys%
0.05 ± 2% -0.0 0.04 ± 6% mpstat.cpu.all.usr%
22.67 ±184% -86.8% 3.00 mpstat.max_utilization.seconds
7.61 ± 9% -12.7% 6.65 mpstat.max_utilization_pct
919.42 ± 16% -85.6% 132.50 ± 34% filebench.sum_bytes_mb/s
7061641 ± 16% -85.6% 1017633 ± 34% filebench.sum_operations
117684 ± 16% -85.6% 16959 ± 34% filebench.sum_operations/s
59443 ± 16% -85.7% 8524 ± 34% filebench.sum_reads/s
0.02 ± 13% +611.2% 0.12 ± 31% filebench.sum_time_ms/op
58241 ± 16% -85.5% 8435 ± 34% filebench.sum_writes/s
21810424 +37.4% 29966336 ± 11% filebench.time.file_system_outputs
14251 -5.6% 13458 ± 2% filebench.time.minor_page_faults
17.67 ± 5% -90.6% 1.67 ± 66% filebench.time.percent_of_cpu_this_job_got
41.30 ± 5% -88.1% 4.90 ± 43% filebench.time.system_time
3582819 ± 16% -85.8% 509229 ± 33% filebench.time.voluntary_context_switches
921497 ± 2% +11.0% 1022743 ± 3% proc-vmstat.nr_active_file
5452715 +37.4% 7491524 ± 11% proc-vmstat.nr_dirtied
407500 ± 2% -62.8% 151456 ± 13% proc-vmstat.nr_dirty
1653629 -5.4% 1564747 ± 2% proc-vmstat.nr_inactive_file
64005 -4.2% 61291 proc-vmstat.nr_slab_reclaimable
99730 -23.8% 75999 proc-vmstat.nr_slab_unreclaimable
352757 ± 3% +55.8% 549420 ± 5% proc-vmstat.nr_writeback
5452662 +37.4% 7491481 ± 11% proc-vmstat.nr_written
921497 ± 2% +11.0% 1022743 ± 3% proc-vmstat.nr_zone_active_file
1653629 -5.4% 1564747 ± 2% proc-vmstat.nr_zone_inactive_file
1655 ± 15% -42.7% 947.83 ± 33% proc-vmstat.numa_hint_faults
1546 ± 16% -53.2% 723.33 ± 38% proc-vmstat.numa_hint_faults_local
4381534 ± 6% -25.2% 3276231 ± 16% proc-vmstat.numa_hit
4248725 ± 6% -26.0% 3143651 ± 17% proc-vmstat.numa_local
10725 ± 54% -57.7% 4532 ± 35% proc-vmstat.numa_pte_updates
1271493 ± 2% +3.6% 1317523 proc-vmstat.pgactivate
10906410 +37.4% 14983963 ± 11% proc-vmstat.pgpgout
1.982e+08 ± 5% -35.7% 1.274e+08 ± 3% perf-stat.i.branch-instructions
7.83 ± 3% -1.4 6.40 ± 7% perf-stat.i.cache-miss-rate%
5551160 ± 7% -48.1% 2879590 ± 5% perf-stat.i.cache-misses
25389854 ± 6% -30.6% 17609995 ± 2% perf-stat.i.cache-references
1.92 +3.1% 1.98 perf-stat.i.cpi
1.685e+09 ± 4% -35.6% 1.086e+09 ± 5% perf-stat.i.cpu-cycles
175.05 ± 3% -15.6% 147.75 ± 2% perf-stat.i.cpu-migrations
1277 ± 3% +27.3% 1625 ± 8% perf-stat.i.cycles-between-cache-misses
8.992e+08 ± 5% -32.6% 6.062e+08 ± 3% perf-stat.i.instructions
0.57 -3.2% 0.55 perf-stat.i.ipc
0.24 ± 14% -27.6% 0.17 ± 23% perf-stat.i.metric.K/sec
6.17 ± 2% -22.8% 4.76 ± 3% perf-stat.overall.MPKI
1.95 ± 4% +0.8 2.76 ± 4% perf-stat.overall.branch-miss-rate%
21.85 ± 2% -5.5 16.37 ± 4% perf-stat.overall.cache-miss-rate%
304.50 ± 4% +23.8% 377.10 ± 4% perf-stat.overall.cycles-between-cache-misses
0.53 ± 2% +4.7% 0.56 ± 4% perf-stat.overall.ipc
1.975e+08 ± 5% -35.7% 1.27e+08 ± 3% perf-stat.ps.branch-instructions
3849663 -8.9% 3506691 ± 4% perf-stat.ps.branch-misses
5532498 ± 7% -48.0% 2879018 ± 5% perf-stat.ps.cache-misses
25301762 ± 6% -30.5% 17579581 perf-stat.ps.cache-references
1.68e+09 ± 4% -35.4% 1.084e+09 ± 5% perf-stat.ps.cpu-cycles
174.27 ± 3% -15.6% 147.10 ± 2% perf-stat.ps.cpu-migrations
8.961e+08 ± 5% -32.5% 6.045e+08 ± 3% perf-stat.ps.instructions
2.211e+11 ± 4% -32.0% 1.504e+11 ± 6% perf-stat.total.instructions
52.52 ± 12% -45.4 7.08 ± 20% perf-profile.calltrace.cycles-pp.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
51.90 ± 12% -45.0 6.86 ± 20% perf-profile.calltrace.cycles-pp.process_one_work.worker_thread.kthread.ret_from_fork.ret_from_fork_asm
79.10 ± 9% -43.9 35.15 ± 29% perf-profile.calltrace.cycles-pp.ret_from_fork.ret_from_fork_asm
79.10 ± 9% -43.9 35.15 ± 29% perf-profile.calltrace.cycles-pp.ret_from_fork_asm
79.10 ± 9% -43.9 35.15 ± 29% perf-profile.calltrace.cycles-pp.kthread.ret_from_fork.ret_from_fork_asm
42.11 ± 12% -42.1 0.00 perf-profile.calltrace.cycles-pp.rpc_async_release.process_one_work.worker_thread.kthread.ret_from_fork
42.11 ± 12% -42.1 0.00 perf-profile.calltrace.cycles-pp.rpc_free_task.rpc_async_release.process_one_work.worker_thread.kthread
28.67 ± 19% -28.7 0.00 perf-profile.calltrace.cycles-pp.nfs_write_completion.rpc_free_task.rpc_async_release.process_one_work.worker_thread
16.89 ± 31% -16.9 0.00 perf-profile.calltrace.cycles-pp.nfs_page_end_writeback.nfs_write_completion.rpc_free_task.rpc_async_release.process_one_work
16.28 ± 32% -16.3 0.00 perf-profile.calltrace.cycles-pp.folio_end_writeback.nfs_page_end_writeback.nfs_write_completion.rpc_free_task.rpc_async_release
15.82 ± 32% -15.8 0.00 perf-profile.calltrace.cycles-pp.__folio_end_writeback.folio_end_writeback.nfs_page_end_writeback.nfs_write_completion.rpc_free_task
13.99 ± 35% -14.0 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.__folio_end_writeback.folio_end_writeback.nfs_page_end_writeback.nfs_write_completion
13.55 ± 35% -13.5 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.__folio_end_writeback.folio_end_writeback.nfs_page_end_writeback
13.42 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp.nfs_commit_release.rpc_free_task.rpc_async_release.process_one_work.worker_thread
13.42 ± 15% -13.4 0.00 perf-profile.calltrace.cycles-pp.nfs_commit_release_pages.nfs_commit_release.rpc_free_task.rpc_async_release.process_one_work
10.58 ± 16% -10.6 0.00 perf-profile.calltrace.cycles-pp.nfs_request_add_commit_list.nfs_write_completion.rpc_free_task.rpc_async_release.process_one_work
10.48 ± 15% -10.5 0.00 perf-profile.calltrace.cycles-pp.nfs_inode_remove_request.nfs_commit_release_pages.nfs_commit_release.rpc_free_task.rpc_async_release
8.89 ± 17% -8.9 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock.nfs_inode_remove_request.nfs_commit_release_pages.nfs_commit_release.rpc_free_task
8.19 ± 17% -8.2 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock.nfs_inode_remove_request.nfs_commit_release_pages.nfs_commit_release
7.56 ± 24% -7.6 0.00 perf-profile.calltrace.cycles-pp.__mutex_lock.nfs_request_add_commit_list.nfs_write_completion.rpc_free_task.rpc_async_release
6.57 ± 27% -6.6 0.00 perf-profile.calltrace.cycles-pp.osq_lock.__mutex_lock.nfs_request_add_commit_list.nfs_write_completion.rpc_free_task
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit.nfsd4_commit
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.ext4_writepages.do_writepages.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.calltrace.cycles-pp.filemap_fdatawrite_wbc.__filemap_fdatawrite_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit
4.45 ± 19% -2.1 2.31 ± 25% perf-profile.calltrace.cycles-pp.mpage_prepare_extent_to_map.ext4_do_writepages.ext4_writepages.do_writepages.filemap_fdatawrite_wbc
4.05 ± 13% -1.3 2.74 ± 21% perf-profile.calltrace.cycles-pp.intel_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.cpu_startup_entry.rest_init.start_kernel.x86_64_start_reservations.x86_64_start_kernel
0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.rest_init.start_kernel.x86_64_start_reservations
0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.rest_init.start_kernel.x86_64_start_reservations.x86_64_start_kernel.common_startup_64
0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.start_kernel.x86_64_start_reservations.x86_64_start_kernel.common_startup_64
0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.x86_64_start_kernel.common_startup_64
0.20 ±141% +0.6 0.75 ± 16% perf-profile.calltrace.cycles-pp.x86_64_start_reservations.x86_64_start_kernel.common_startup_64
0.00 +0.6 0.59 ± 14% perf-profile.calltrace.cycles-pp.tick_nohz_idle_exit.do_idle.cpu_startup_entry.start_secondary.common_startup_64
0.00 +1.1 1.12 ± 34% perf-profile.calltrace.cycles-pp.enqueue_task_fair.activate_task.ttwu_do_activate.sched_ttwu_pending.__flush_smp_call_function_queue
0.00 +1.2 1.16 ± 32% perf-profile.calltrace.cycles-pp.activate_task.ttwu_do_activate.sched_ttwu_pending.__flush_smp_call_function_queue.flush_smp_call_function_queue
0.31 ±102% +1.4 1.68 ± 15% perf-profile.calltrace.cycles-pp.schedule_idle.do_idle.cpu_startup_entry.start_secondary.common_startup_64
0.22 ±141% +1.4 1.62 ± 15% perf-profile.calltrace.cycles-pp.__schedule.schedule_idle.do_idle.cpu_startup_entry.start_secondary
0.00 +1.6 1.60 ± 25% perf-profile.calltrace.cycles-pp.ttwu_do_activate.sched_ttwu_pending.__flush_smp_call_function_queue.flush_smp_call_function_queue.do_idle
0.12 ±223% +2.4 2.50 ± 23% perf-profile.calltrace.cycles-pp.sched_ttwu_pending.__flush_smp_call_function_queue.flush_smp_call_function_queue.do_idle.cpu_startup_entry
0.23 ±146% +3.1 3.36 ± 23% perf-profile.calltrace.cycles-pp.__flush_smp_call_function_queue.flush_smp_call_function_queue.do_idle.cpu_startup_entry.start_secondary
0.00 +3.1 3.13 ± 95% perf-profile.calltrace.cycles-pp.folio_wait_bit_common.folio_wait_writeback.__filemap_fdatawait_range.file_write_and_wait_range.ext4_sync_file
0.00 +3.8 3.77 ± 94% perf-profile.calltrace.cycles-pp.folio_wait_writeback.__filemap_fdatawait_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit
1.51 ± 30% +4.3 5.78 ± 48% perf-profile.calltrace.cycles-pp.flush_smp_call_function_queue.do_idle.cpu_startup_entry.start_secondary.common_startup_64
0.00 +4.5 4.53 ± 94% perf-profile.calltrace.cycles-pp.__filemap_fdatawait_range.file_write_and_wait_range.ext4_sync_file.nfsd_commit.nfsd4_commit
0.71 ± 24% +10.0 10.76 ± 40% perf-profile.calltrace.cycles-pp.poll_idle.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle
10.00 ± 10% +12.6 22.59 ± 13% perf-profile.calltrace.cycles-pp.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary.common_startup_64
8.46 ± 10% +12.7 21.13 ± 14% perf-profile.calltrace.cycles-pp.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry.start_secondary
8.27 ± 11% +13.2 21.46 ± 14% perf-profile.calltrace.cycles-pp.cpuidle_enter_state.cpuidle_enter.cpuidle_idle_call.do_idle.cpu_startup_entry
12.87 ± 10% +18.6 31.52 ± 18% perf-profile.calltrace.cycles-pp.do_idle.cpu_startup_entry.start_secondary.common_startup_64
12.89 ± 10% +18.7 31.56 ± 18% perf-profile.calltrace.cycles-pp.cpu_startup_entry.start_secondary.common_startup_64
12.90 ± 10% +18.7 31.60 ± 18% perf-profile.calltrace.cycles-pp.start_secondary.common_startup_64
13.40 ± 10% +19.0 32.35 ± 17% perf-profile.calltrace.cycles-pp.common_startup_64
52.52 ± 12% -45.4 7.08 ± 20% perf-profile.children.cycles-pp.worker_thread
51.90 ± 12% -45.0 6.86 ± 20% perf-profile.children.cycles-pp.process_one_work
79.10 ± 9% -43.9 35.15 ± 29% perf-profile.children.cycles-pp.kthread
79.12 ± 9% -43.9 35.18 ± 29% perf-profile.children.cycles-pp.ret_from_fork
79.12 ± 9% -43.9 35.18 ± 29% perf-profile.children.cycles-pp.ret_from_fork_asm
42.11 ± 12% -42.0 0.08 ± 19% perf-profile.children.cycles-pp.rpc_async_release
42.11 ± 12% -42.0 0.08 ± 19% perf-profile.children.cycles-pp.rpc_free_task
28.68 ± 19% -28.6 0.06 ± 19% perf-profile.children.cycles-pp.nfs_write_completion
23.08 ± 18% -21.5 1.54 ±113% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
16.89 ± 31% -16.9 0.00 perf-profile.children.cycles-pp.nfs_page_end_writeback
16.54 ± 31% -15.9 0.62 ± 42% perf-profile.children.cycles-pp.__folio_end_writeback
17.05 ± 31% -14.9 2.20 ± 93% perf-profile.children.cycles-pp.folio_end_writeback
15.65 ± 32% -14.4 1.30 ± 49% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
13.42 ± 15% -13.4 0.00 perf-profile.children.cycles-pp.nfs_commit_release
13.42 ± 15% -13.4 0.00 perf-profile.children.cycles-pp.nfs_commit_release_pages
10.58 ± 16% -10.6 0.00 perf-profile.children.cycles-pp.nfs_request_add_commit_list
10.48 ± 15% -10.5 0.00 perf-profile.children.cycles-pp.nfs_inode_remove_request
9.54 ± 16% -7.8 1.70 ± 31% perf-profile.children.cycles-pp._raw_spin_lock
7.59 ± 24% -7.6 0.00 perf-profile.children.cycles-pp.__mutex_lock
7.33 ± 14% -4.7 2.62 ± 24% perf-profile.children.cycles-pp.do_writepages
2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.__writeback_inodes_wb
2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.__writeback_single_inode
2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.nfs_writepages
2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.wb_do_writeback
2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.wb_writeback
2.73 ± 25% -2.5 0.18 ± 48% perf-profile.children.cycles-pp.write_cache_pages
2.73 ± 25% -2.5 0.18 ± 47% perf-profile.children.cycles-pp.writeback_sb_inodes
2.73 ± 25% -2.5 0.18 ± 48% perf-profile.children.cycles-pp.wb_workfn
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.__filemap_fdatawrite_range
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.ext4_do_writepages
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.ext4_writepages
4.60 ± 18% -2.2 2.44 ± 25% perf-profile.children.cycles-pp.filemap_fdatawrite_wbc
4.46 ± 19% -2.1 2.32 ± 25% perf-profile.children.cycles-pp.mpage_prepare_extent_to_map
2.27 ± 28% -2.1 0.16 ± 46% perf-profile.children.cycles-pp.nfs_writepages_callback
2.20 ± 29% -2.0 0.16 ± 46% perf-profile.children.cycles-pp.nfs_page_async_flush
4.16 ± 12% -1.2 2.94 ± 21% perf-profile.children.cycles-pp.intel_idle
1.48 ± 28% -0.8 0.68 ± 22% perf-profile.children.cycles-pp.__folio_start_writeback
1.30 ± 14% -0.7 0.58 ± 19% perf-profile.children.cycles-pp.sched_balance_newidle
0.85 ± 29% -0.7 0.15 ± 37% perf-profile.children.cycles-pp.kmem_cache_free
1.44 ± 13% -0.7 0.77 ± 19% perf-profile.children.cycles-pp.sched_balance_rq
1.31 ± 13% -0.7 0.65 ± 18% perf-profile.children.cycles-pp.sched_balance_find_src_group
1.30 ± 14% -0.6 0.64 ± 17% perf-profile.children.cycles-pp.update_sd_lb_stats
1.16 ± 13% -0.6 0.57 ± 19% perf-profile.children.cycles-pp.update_sg_lb_stats
0.79 ± 22% -0.6 0.22 ±121% perf-profile.children.cycles-pp.nfs_folio_find_private_request
1.25 ± 13% -0.5 0.77 ± 22% perf-profile.children.cycles-pp.pick_next_task_fair
0.55 ± 20% -0.3 0.22 ± 39% perf-profile.children.cycles-pp.tick_nohz_stop_tick
0.56 ± 20% -0.3 0.23 ± 37% perf-profile.children.cycles-pp.tick_nohz_idle_stop_tick
0.34 ± 29% -0.3 0.05 ± 76% perf-profile.children.cycles-pp.__slab_free
0.68 ± 21% -0.3 0.40 ± 24% perf-profile.children.cycles-pp.__lruvec_stat_mod_folio
0.34 ± 25% -0.3 0.08 ± 27% perf-profile.children.cycles-pp.__xa_clear_mark
0.54 ± 10% -0.2 0.30 ± 17% perf-profile.children.cycles-pp.percpu_counter_add_batch
0.22 ± 15% -0.2 0.03 ± 70% perf-profile.children.cycles-pp.seq_read
0.38 ± 11% -0.2 0.20 ± 12% perf-profile.children.cycles-pp.seq_read_iter
0.21 ± 15% -0.2 0.04 ±102% perf-profile.children.cycles-pp.blk_mq_run_work_fn
0.62 ± 8% -0.2 0.45 ± 9% perf-profile.children.cycles-pp.ksys_read
0.24 ± 12% -0.2 0.07 ± 55% perf-profile.children.cycles-pp.blk_mq_sched_dispatch_requests
0.23 ± 12% -0.2 0.07 ± 55% perf-profile.children.cycles-pp.__blk_mq_sched_dispatch_requests
0.25 ± 22% -0.2 0.10 ± 25% perf-profile.children.cycles-pp.submit_bio_noacct_nocheck
0.22 ± 15% -0.2 0.07 ± 55% perf-profile.children.cycles-pp.__blk_mq_do_dispatch_sched
0.66 ± 8% -0.1 0.51 ± 9% perf-profile.children.cycles-pp.read
0.24 ± 25% -0.1 0.10 ± 26% perf-profile.children.cycles-pp.__submit_bio
0.24 ± 25% -0.1 0.10 ± 26% perf-profile.children.cycles-pp.blk_mq_submit_bio
0.21 ± 19% -0.1 0.07 ± 31% perf-profile.children.cycles-pp.__common_interrupt
0.19 ± 21% -0.1 0.05 ± 50% perf-profile.children.cycles-pp.__handle_irq_event_percpu
0.21 ± 20% -0.1 0.07 ± 28% perf-profile.children.cycles-pp.handle_edge_irq
0.20 ± 21% -0.1 0.06 ± 53% perf-profile.children.cycles-pp.handle_irq_event
0.18 ± 22% -0.1 0.05 ± 50% perf-profile.children.cycles-pp.ahci_single_level_irq_intr
0.24 ± 33% -0.1 0.10 ± 13% perf-profile.children.cycles-pp.idle_cpu
0.24 ± 18% -0.1 0.12 ± 30% perf-profile.children.cycles-pp.__get_next_timer_interrupt
0.28 ± 17% -0.1 0.16 ± 26% perf-profile.children.cycles-pp.__mod_memcg_lruvec_state
0.16 ± 26% -0.1 0.04 ± 73% perf-profile.children.cycles-pp.ahci_handle_port_intr
0.28 ± 22% -0.1 0.17 ± 10% perf-profile.children.cycles-pp.svc_send
0.28 ± 24% -0.1 0.17 ± 9% perf-profile.children.cycles-pp.svc_tcp_sendto
0.23 ± 19% -0.1 0.12 ± 19% perf-profile.children.cycles-pp.__mod_lruvec_state
0.16 ± 11% -0.1 0.06 ± 47% perf-profile.children.cycles-pp.blk_mq_dispatch_rq_list
0.27 ± 10% -0.1 0.17 ± 30% perf-profile.children.cycles-pp.tick_nohz_restart_sched_tick
0.14 ± 17% -0.1 0.04 ±112% perf-profile.children.cycles-pp.nfs_unlock_and_release_request
0.16 ± 13% -0.1 0.06 ± 47% perf-profile.children.cycles-pp.scsi_queue_rq
0.16 ± 22% -0.1 0.06 ± 50% perf-profile.children.cycles-pp.blk_mq_flush_plug_list
0.25 ± 26% -0.1 0.16 ± 4% perf-profile.children.cycles-pp.svc_tcp_sendmsg
0.19 ± 24% -0.1 0.10 ± 12% perf-profile.children.cycles-pp.__mod_node_page_state
0.16 ± 20% -0.1 0.06 ± 50% perf-profile.children.cycles-pp.blk_mq_dispatch_plug_list
0.21 ± 13% -0.1 0.13 ± 33% perf-profile.children.cycles-pp.tick_nohz_next_event
0.18 ± 25% -0.1 0.11 ± 25% perf-profile.children.cycles-pp.xas_start
0.11 ± 38% -0.1 0.04 ± 75% perf-profile.children.cycles-pp.inode_to_bdi
0.14 ± 23% -0.1 0.07 ± 52% perf-profile.children.cycles-pp.kick_pool
0.16 ± 27% -0.1 0.10 ± 15% perf-profile.children.cycles-pp.xs_stream_data_receive_workfn
0.19 ± 18% -0.1 0.12 ± 23% perf-profile.children.cycles-pp.__queue_work
0.10 ± 10% -0.1 0.04 ± 71% perf-profile.children.cycles-pp._find_next_and_bit
0.09 ± 11% -0.1 0.03 ±102% perf-profile.children.cycles-pp.hrtimer_cancel
0.17 ± 17% -0.1 0.12 ± 25% perf-profile.children.cycles-pp.sched_balance_update_blocked_averages
0.09 ± 18% -0.1 0.04 ±105% perf-profile.children.cycles-pp.__lock_sock
0.12 ± 10% -0.1 0.06 ± 45% perf-profile.children.cycles-pp.__hrtimer_start_range_ns
0.12 ± 19% -0.0 0.07 ± 34% perf-profile.children.cycles-pp.lock_sock_nested
0.04 ± 73% +0.0 0.09 ± 28% perf-profile.children.cycles-pp.copy_mc_enhanced_fast_string
0.07 ± 14% +0.1 0.12 ± 22% perf-profile.children.cycles-pp.balance_dirty_pages_ratelimited_flags
0.01 ±223% +0.1 0.06 ± 21% perf-profile.children.cycles-pp.pick_next_task_idle
0.00 +0.1 0.06 ± 19% perf-profile.children.cycles-pp.sched_clock_noinstr
0.08 ± 20% +0.1 0.14 ± 20% perf-profile.children.cycles-pp.ct_kernel_enter
0.03 ±143% +0.1 0.10 ± 23% perf-profile.children.cycles-pp.__collapse_huge_page_copy
0.00 +0.1 0.07 ± 15% perf-profile.children.cycles-pp.__dequeue_entity
0.02 ±141% +0.1 0.08 ± 23% perf-profile.children.cycles-pp.nohz_run_idle_balance
0.10 ± 17% +0.1 0.17 ± 18% perf-profile.children.cycles-pp.cpuidle_governor_latency_req
0.12 ± 19% +0.1 0.19 ± 17% perf-profile.children.cycles-pp.update_curr
0.12 ± 15% +0.1 0.20 ± 12% perf-profile.children.cycles-pp.read_tsc
0.02 ±142% +0.1 0.11 ± 24% perf-profile.children.cycles-pp.call_cpuidle
0.01 ±223% +0.1 0.10 ± 20% perf-profile.children.cycles-pp.avg_vruntime
0.00 +0.1 0.10 ± 34% perf-profile.children.cycles-pp.place_entity
0.02 ±142% +0.1 0.12 ± 23% perf-profile.children.cycles-pp.ct_kernel_exit_state
0.06 ± 63% +0.1 0.17 ± 37% perf-profile.children.cycles-pp.__update_load_avg_se
0.12 ± 14% +0.1 0.23 ± 12% perf-profile.children.cycles-pp.___perf_sw_event
0.02 ±144% +0.1 0.16 ± 69% perf-profile.children.cycles-pp.llist_add_batch
0.08 ± 64% +0.2 0.23 ± 17% perf-profile.children.cycles-pp.set_next_entity
0.02 ±223% +0.2 0.18 ± 24% perf-profile.children.cycles-pp.__wrgsbase_inactive
0.04 ±101% +0.2 0.22 ± 7% perf-profile.children.cycles-pp.tick_nohz_idle_enter
0.13 ± 20% +0.2 0.33 ± 15% perf-profile.children.cycles-pp.prepare_task_switch
0.15 ± 15% +0.2 0.35 ± 20% perf-profile.children.cycles-pp.sched_clock_cpu
0.11 ± 10% +0.2 0.30 ± 16% perf-profile.children.cycles-pp.local_clock_noinstr
0.40 ± 11% +0.2 0.62 ± 14% perf-profile.children.cycles-pp.tick_nohz_idle_exit
0.09 ± 23% +0.2 0.32 ± 21% perf-profile.children.cycles-pp.tick_nohz_stop_idle
0.07 ± 50% +0.2 0.31 ± 21% perf-profile.children.cycles-pp.nr_iowait_cpu
0.11 ± 24% +0.2 0.35 ± 6% perf-profile.children.cycles-pp.__switch_to_asm
0.15 ± 23% +0.2 0.39 ± 48% perf-profile.children.cycles-pp.update_rq_clock_task
0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.rest_init
0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.start_kernel
0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.x86_64_start_kernel
0.50 ± 15% +0.2 0.75 ± 16% perf-profile.children.cycles-pp.x86_64_start_reservations
0.09 ± 28% +0.3 0.35 ± 18% perf-profile.children.cycles-pp.__update_load_avg_cfs_rq
0.10 ± 81% +0.3 0.36 ± 36% perf-profile.children.cycles-pp.syscall_exit_to_user_mode
0.08 ± 32% +0.3 0.36 ± 46% perf-profile.children.cycles-pp.wake_affine
0.08 ± 33% +0.3 0.40 ± 44% perf-profile.children.cycles-pp.available_idle_cpu
0.22 ± 22% +0.4 0.60 ± 16% perf-profile.children.cycles-pp.dequeue_entity
0.21 ± 21% +0.4 0.61 ± 31% perf-profile.children.cycles-pp.select_task_rq_fair
0.12 ± 30% +0.4 0.52 ± 10% perf-profile.children.cycles-pp.__switch_to
0.26 ± 17% +0.4 0.67 ± 16% perf-profile.children.cycles-pp.dequeue_task_fair
0.22 ± 32% +0.5 0.68 ± 18% perf-profile.children.cycles-pp.update_load_avg
0.17 ± 44% +0.5 0.66 ± 19% perf-profile.children.cycles-pp.sched_clock
0.14 ± 25% +0.5 0.62 ± 18% perf-profile.children.cycles-pp.update_rq_clock
1.82 ± 9% +0.5 2.35 ± 12% perf-profile.children.cycles-pp.schedule
0.10 ± 50% +0.6 0.70 ± 20% perf-profile.children.cycles-pp.llist_reverse_order
0.22 ± 31% +0.6 0.83 ± 11% perf-profile.children.cycles-pp.native_sched_clock
0.19 ± 41% +0.6 0.83 ± 19% perf-profile.children.cycles-pp.select_task_rq
0.11 ± 30% +0.7 0.82 ± 40% perf-profile.children.cycles-pp.call_function_single_prep_ipi
0.24 ± 31% +0.8 1.01 ± 32% perf-profile.children.cycles-pp.enqueue_entity
0.15 ± 29% +0.8 1.00 ± 42% perf-profile.children.cycles-pp.__smp_call_single_queue
0.29 ± 26% +0.9 1.19 ± 31% perf-profile.children.cycles-pp.enqueue_task_fair
0.31 ± 23% +0.9 1.24 ± 30% perf-profile.children.cycles-pp.activate_task
0.18 ± 30% +1.0 1.16 ± 40% perf-profile.children.cycles-pp.ttwu_queue_wakelist
0.21 ± 26% +1.1 1.32 ±130% perf-profile.children.cycles-pp._raw_spin_lock_irq
0.58 ± 17% +1.1 1.73 ± 14% perf-profile.children.cycles-pp.schedule_idle
0.35 ± 24% +1.3 1.66 ± 24% perf-profile.children.cycles-pp.ttwu_do_activate
2.40 ± 8% +1.6 3.98 ± 9% perf-profile.children.cycles-pp.__schedule
0.76 ± 14% +1.7 2.45 ± 30% perf-profile.children.cycles-pp.try_to_wake_up
0.43 ± 33% +2.1 2.53 ± 22% perf-profile.children.cycles-pp.sched_ttwu_pending
1.10 ± 19% +2.3 3.37 ± 85% perf-profile.children.cycles-pp.folio_wait_bit_common
4.11 ± 24% +2.7 6.81 ± 14% perf-profile.children.cycles-pp.rwsem_spin_on_owner
0.56 ± 32% +2.8 3.40 ± 22% perf-profile.children.cycles-pp.__flush_smp_call_function_queue
0.93 ±159% +3.6 4.53 ± 89% perf-profile.children.cycles-pp.intel_idle_irq
0.00 +3.8 3.78 ± 94% perf-profile.children.cycles-pp.folio_wait_writeback
1.51 ± 30% +4.3 5.79 ± 48% perf-profile.children.cycles-pp.flush_smp_call_function_queue
0.00 +4.5 4.54 ± 94% perf-profile.children.cycles-pp.__filemap_fdatawait_range
0.72 ± 24% +10.0 10.77 ± 40% perf-profile.children.cycles-pp.poll_idle
10.42 ± 9% +12.8 23.27 ± 13% perf-profile.children.cycles-pp.cpuidle_idle_call
8.76 ± 10% +12.9 21.66 ± 14% perf-profile.children.cycles-pp.cpuidle_enter_state
8.81 ± 10% +12.9 21.73 ± 14% perf-profile.children.cycles-pp.cpuidle_enter
12.90 ± 10% +18.7 31.60 ± 18% perf-profile.children.cycles-pp.start_secondary
13.38 ± 10% +18.9 32.30 ± 17% perf-profile.children.cycles-pp.do_idle
13.40 ± 10% +19.0 32.35 ± 17% perf-profile.children.cycles-pp.common_startup_64
13.40 ± 10% +19.0 32.35 ± 17% perf-profile.children.cycles-pp.cpu_startup_entry
23.04 ± 18% -21.5 1.54 ±112% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
4.16 ± 12% -1.2 2.94 ± 21% perf-profile.self.cycles-pp.intel_idle
1.38 ± 16% -1.2 0.17 ± 70% perf-profile.self.cycles-pp.__folio_end_writeback
0.70 ± 22% -0.6 0.08 ± 84% perf-profile.self.cycles-pp.nfs_folio_find_private_request
0.87 ± 15% -0.4 0.44 ± 19% perf-profile.self.cycles-pp.update_sg_lb_stats
0.50 ± 21% -0.4 0.10 ±105% perf-profile.self.cycles-pp.folio_end_writeback
0.37 ± 29% -0.3 0.04 ±105% perf-profile.self.cycles-pp.kmem_cache_free
0.33 ± 30% -0.3 0.04 ±102% perf-profile.self.cycles-pp.__slab_free
0.49 ± 11% -0.2 0.26 ± 17% perf-profile.self.cycles-pp.percpu_counter_add_batch
0.26 ± 32% -0.2 0.10 ± 34% perf-profile.self.cycles-pp.mpage_prepare_extent_to_map
0.30 ± 26% -0.1 0.17 ± 33% perf-profile.self.cycles-pp.__folio_start_writeback
0.20 ± 20% -0.1 0.10 ± 14% perf-profile.self.cycles-pp.idle_cpu
0.18 ± 24% -0.1 0.09 ± 15% perf-profile.self.cycles-pp.__mod_node_page_state
0.20 ± 22% -0.1 0.11 ± 25% perf-profile.self.cycles-pp.__mod_memcg_lruvec_state
0.09 ± 17% -0.1 0.03 ±100% perf-profile.self.cycles-pp._find_next_and_bit
0.04 ± 73% +0.0 0.09 ± 28% perf-profile.self.cycles-pp.copy_mc_enhanced_fast_string
0.12 ± 16% +0.1 0.20 ± 15% perf-profile.self.cycles-pp.read_tsc
0.02 ±141% +0.1 0.09 ± 30% perf-profile.self.cycles-pp.sched_ttwu_pending
0.02 ±141% +0.1 0.09 ± 35% perf-profile.self.cycles-pp.balance_dirty_pages_ratelimited_flags
0.01 ±223% +0.1 0.08 ± 25% perf-profile.self.cycles-pp.schedule
0.01 ±223% +0.1 0.09 ± 30% perf-profile.self.cycles-pp.select_task_rq_fair
0.00 +0.1 0.08 ± 19% perf-profile.self.cycles-pp.nohz_run_idle_balance
0.03 ±101% +0.1 0.12 ± 20% perf-profile.self.cycles-pp.dequeue_entity
0.09 ± 26% +0.1 0.18 ± 16% perf-profile.self.cycles-pp.___perf_sw_event
0.01 ±223% +0.1 0.10 ± 17% perf-profile.self.cycles-pp.avg_vruntime
0.02 ±141% +0.1 0.10 ± 26% perf-profile.self.cycles-pp.call_cpuidle
0.00 +0.1 0.09 ± 24% perf-profile.self.cycles-pp.cpu_startup_entry
0.05 ± 88% +0.1 0.15 ± 36% perf-profile.self.cycles-pp.__update_load_avg_se
0.02 ±141% +0.1 0.12 ± 22% perf-profile.self.cycles-pp.ct_kernel_exit_state
0.06 ± 52% +0.1 0.17 ± 77% perf-profile.self.cycles-pp.ext4_finish_bio
0.02 ±142% +0.1 0.12 ± 36% perf-profile.self.cycles-pp.prepare_task_switch
0.02 ±141% +0.1 0.13 ± 27% perf-profile.self.cycles-pp.pick_next_task_fair
0.02 ±141% +0.1 0.13 ± 41% perf-profile.self.cycles-pp.ttwu_queue_wakelist
0.05 ± 79% +0.1 0.16 ± 20% perf-profile.self.cycles-pp.generic_perform_write
0.07 ± 15% +0.1 0.19 ± 24% perf-profile.self.cycles-pp.update_load_avg
0.09 ± 14% +0.1 0.22 ± 21% perf-profile.self.cycles-pp.cpuidle_idle_call
0.04 ± 73% +0.1 0.18 ± 45% perf-profile.self.cycles-pp.enqueue_task_fair
0.02 ±144% +0.1 0.16 ± 71% perf-profile.self.cycles-pp.llist_add_batch
0.01 ±223% +0.2 0.17 ± 47% perf-profile.self.cycles-pp.__flush_smp_call_function_queue
0.02 ±223% +0.2 0.18 ± 24% perf-profile.self.cycles-pp.__wrgsbase_inactive
0.05 ± 46% +0.2 0.23 ± 28% perf-profile.self.cycles-pp.enqueue_entity
0.05 ± 74% +0.2 0.26 ± 23% perf-profile.self.cycles-pp.do_idle
0.10 ± 23% +0.2 0.34 ± 8% perf-profile.self.cycles-pp.__switch_to_asm
0.07 ± 50% +0.2 0.31 ± 21% perf-profile.self.cycles-pp.nr_iowait_cpu
0.09 ± 28% +0.2 0.34 ± 17% perf-profile.self.cycles-pp.__update_load_avg_cfs_rq
0.11 ± 26% +0.2 0.36 ± 54% perf-profile.self.cycles-pp.update_rq_clock_task
0.03 ±163% +0.3 0.30 ± 35% perf-profile.self.cycles-pp.select_task_rq
0.07 ± 50% +0.3 0.36 ± 91% perf-profile.self.cycles-pp._raw_spin_lock_irq
0.00 +0.3 0.29 ± 98% perf-profile.self.cycles-pp.__filemap_fdatawait_range
0.08 ± 33% +0.3 0.39 ± 45% perf-profile.self.cycles-pp.available_idle_cpu
0.02 ±141% +0.3 0.34 ± 27% perf-profile.self.cycles-pp.flush_smp_call_function_queue
0.06 ± 57% +0.4 0.42 ± 18% perf-profile.self.cycles-pp.update_rq_clock
0.14 ± 22% +0.4 0.50 ± 6% perf-profile.self.cycles-pp.__schedule
0.12 ± 33% +0.4 0.50 ± 11% perf-profile.self.cycles-pp.__switch_to
0.01 ±223% +0.4 0.41 ± 55% perf-profile.self.cycles-pp.ttwu_do_activate
0.22 ± 28% +0.6 0.80 ± 11% perf-profile.self.cycles-pp.native_sched_clock
0.10 ± 50% +0.6 0.70 ± 20% perf-profile.self.cycles-pp.llist_reverse_order
0.00 +0.6 0.62 ± 89% perf-profile.self.cycles-pp.folio_wait_writeback
0.11 ± 30% +0.7 0.82 ± 41% perf-profile.self.cycles-pp.call_function_single_prep_ipi
0.20 ± 20% +1.1 1.27 ± 73% perf-profile.self.cycles-pp.folio_wait_bit_common
0.91 ±163% +3.6 4.50 ± 89% perf-profile.self.cycles-pp.intel_idle_irq
0.71 ± 25% +9.9 10.64 ± 40% perf-profile.self.cycles-pp.poll_idle
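For readers checking the comparison table above, the following is a minimal sketch of how the %change column appears to be derived, judging only from the values in this report. Most metrics show a relative change against the parent commit. Metrics whose names end in "%" and the perf-profile.* entries show an absolute difference in percentage points. The function names and the name-based heuristic are hypothetical and are not taken from the lkp-tests code. The report presumably computes from unrounded means, so recomputing from the rounded values shown here can differ slightly (for example filebench.sum_time_ms/op, shown as 0.02 and 0.12).

```python
def relative_change(base: float, new: float) -> float:
    """Relative change of `new` against `base`, in percent, rounded to 0.1."""
    return round((new - base) / base * 100.0, 1)


def point_change(base: float, new: float) -> float:
    """Absolute difference in percentage points, rounded to 0.1."""
    return round(new - base, 1)


def uses_point_change(metric: str) -> bool:
    """Hypothetical heuristic inferred from this report only: rows whose
    metric name ends in '%' or starts with 'perf-profile.' are shown as an
    absolute difference rather than a relative change."""
    return metric.endswith("%") or metric.startswith("perf-profile.")


def change(metric: str, base: float, new: float) -> str:
    """Format the change for one row in the style used by the table."""
    if uses_point_change(metric):
        return f"{point_change(base, new):+.1f}"
    return f"{relative_change(base, new):+.1f}%"


if __name__ == "__main__":
    # (metric, mean on 9aac777aaf, mean on 49b29a573d), copied from the table.
    rows = [
        ("filebench.sum_operations/s", 117684, 16959),
        ("meminfo.Dirty", 1631748, 602285),
        ("perf-stat.overall.cache-miss-rate%", 21.85, 16.37),
        ("perf-profile.children.cycles-pp.worker_thread", 52.52, 7.08),
    ]
    for metric, base, new in rows:
        print(f"{change(metric, base, new):>8}  {metric}")
```

Run as a script, it recomputes the change for the four sample rows, which can be compared against the corresponding lines of the table.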
Disclaimer:
Results have been estimated based on internal Intel analysis and are provided
for informational purposes only. Any difference in system hardware or software
design or configuration may affect actual performance.
--
0-DAY CI Kernel Test Service
https://github.com/intel/lkp-tests/wiki