Re: [linus:master] [mm/slab] fb1091febd: will-it-scale.per_process_ops 132.5% improvement
From: Vlastimil Babka
Date: Wed Mar 11 2026 - 13:12:47 EST
On 3/11/26 15:51, kernel test robot wrote:
>
>
> Hello,
>
> kernel test robot noticed a 132.5% improvement of will-it-scale.per_process_ops on:
>
>
> commit: fb1091febd668398aa84c161b8d9a1834321e021 ("mm/slab: allow sheaf refill if blocking is not allowed")
> https://git.kernel.org/cgit/linux/kernel/git/torvalds/linux.git master
+CC more people
> testcase: will-it-scale
> config: x86_64-rhel-9.4
> compiler: gcc-14
> test machine: 48 threads 2 sockets Intel(R) Xeon(R) CPU E5-2697 v2 @ 2.70GHz (Ivy Bridge-EP) with 64G memory
> parameters:
>
> nr_task: 100%
> mode: process
> test: brk1
> cpufreq_governor: performance
>
>
>
> Details are as below:
> -------------------------------------------------------------------------------------------------->
>
>
> The kernel config and materials to reproduce are available at:
> https://download.01.org/0day-ci/archive/20260311/202603112232.f53ebe5d-lkp@xxxxxxxxx
>
> =========================================================================================
> compiler/cpufreq_governor/kconfig/mode/nr_task/rootfs/tbox_group/test/testcase:
> gcc-14/performance/x86_64-rhel-9.4/process/100%/debian-13-x86_64-20250902.cgz/lkp-ivb-2ep2/brk1/will-it-scale
>
> commit:
> 48647d3f9a ("slab: distinguish lock and trylock for sheaf_flush_main()")
> fb1091febd ("mm/slab: allow sheaf refill if blocking is not allowed")
>
> 48647d3f9a644d1e fb1091febd668398aa84c161b8d
> ---------------- ---------------------------
> %stddev %change %stddev
> \ | \
> 9079134 +132.5% 21109974 will-it-scale.48.processes
> 189148 +132.5% 439790 will-it-scale.per_process_ops
> 9079134 +132.5% 21109974 will-it-scale.workload
> 5.83 +246.4% 20.21 vmstat.cpu.us
> 5015917 +17.1% 5875075 vmstat.memory.cache
> 1452 ± 2% +506.2% 8801 ± 2% vmstat.system.cs
> 1.17 +0.1 1.29 mpstat.cpu.all.irq%
> 2.92 ± 2% +1.9 4.77 ± 3% mpstat.cpu.all.soft%
> 89.32 -16.2 73.08 mpstat.cpu.all.sys%
> 5.98 +14.3 20.26 mpstat.cpu.all.usr%
> 172.58 +5.3% 181.69 turbostat.CorWatt
> 0.24 +58.3% 0.38 turbostat.IPC
> 202.24 +5.4% 213.25 turbostat.PkgWatt
> 12.35 +67.0% 20.63 turbostat.RAMWatt
> 200.00 ± 9% +86.8% 373.67 ± 7% perf-c2c.DRAM.local
> 145.83 ± 9% -45.7% 79.17 ± 16% perf-c2c.DRAM.remote
> 8096 ± 4% -94.1% 479.33 ± 4% perf-c2c.HITM.local
> 141.17 ± 8% -74.5% 36.00 ± 24% perf-c2c.HITM.remote
> 8237 ± 4% -93.7% 515.33 ± 5% perf-c2c.HITM.total
> 0.57 ± 9% -66.6% 0.19 ± 3% perf-sched.sch_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
> 0.57 ± 9% -66.6% 0.19 ± 3% perf-sched.total_sch_delay.average.ms
> 53.40 ± 10% -61.9% 20.35 ± 5% perf-sched.total_wait_and_delay.average.ms
> 9605 ± 11% +309.7% 39354 ± 5% perf-sched.total_wait_and_delay.count.ms
> 52.83 ± 10% -61.8% 20.16 ± 5% perf-sched.total_wait_time.average.ms
> 53.40 ± 10% -61.9% 20.35 ± 5% perf-sched.wait_and_delay.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
> 9605 ± 11% +309.7% 39354 ± 5% perf-sched.wait_and_delay.count.[unknown].[unknown].[unknown].[unknown].[unknown]
> 52.83 ± 10% -61.8% 20.16 ± 5% perf-sched.wait_time.avg.ms.[unknown].[unknown].[unknown].[unknown].[unknown]
> 0.18 ± 13% +333.5% 0.79 perf-stat.i.MPKI
> 1.089e+10 +48.8% 1.621e+10 perf-stat.i.branch-instructions
> 44860845 +49.9% 67266443 perf-stat.i.branch-misses
> 4.26 ± 13% +32.4 36.65 perf-stat.i.cache-miss-rate%
> 9391325 ± 13% +582.9% 64131079 perf-stat.i.cache-misses
> 2.22e+08 -21.2% 1.749e+08 perf-stat.i.cache-references
> 1418 ± 2% +521.6% 8819 ± 2% perf-stat.i.context-switches
> 2.77 -36.9% 1.75 perf-stat.i.cpi
> 196.34 ± 2% -5.8% 184.90 ± 4% perf-stat.i.cpu-migrations
> 18904 ± 14% -88.2% 2224 perf-stat.i.cycles-between-cache-misses
> 5.131e+10 +58.3% 8.12e+10 perf-stat.i.instructions
> 0.36 +58.0% 0.57 perf-stat.i.ipc
> 363.96 ± 4% +28.2% 466.62 ± 6% perf-stat.i.minor-faults
> 363.98 ± 4% +28.2% 466.65 ± 6% perf-stat.i.page-faults
> 0.18 ± 13% +331.0% 0.79 perf-stat.overall.MPKI
> 4.24 ± 13% +32.4 36.66 perf-stat.overall.cache-miss-rate%
> 2.77 -36.8% 1.75 perf-stat.overall.cpi
> 15320 ± 11% -85.6% 2213 perf-stat.overall.cycles-between-cache-misses
> 0.36 +58.3% 0.57 perf-stat.overall.ipc
> 1704731 -32.0% 1158643 perf-stat.overall.path-length
> 1.086e+10 +48.7% 1.615e+10 perf-stat.ps.branch-instructions
> 44644009 +50.1% 67000848 perf-stat.ps.branch-misses
> 9377842 ± 13% +581.8% 63942004 perf-stat.ps.cache-misses
> 2.214e+08 -21.2% 1.744e+08 perf-stat.ps.cache-references
> 1412 ± 2% +522.3% 8787 ± 2% perf-stat.ps.context-switches
> 195.57 ± 2% -5.8% 184.17 ± 3% perf-stat.ps.cpu-migrations
> 5.115e+10 +58.2% 8.093e+10 perf-stat.ps.instructions
> 362.16 ± 4% +28.3% 464.76 ± 6% perf-stat.ps.minor-faults
> 362.18 ± 4% +28.3% 464.78 ± 6% perf-stat.ps.page-faults
> 1.548e+13 +58.0% 2.446e+13 perf-stat.total.instructions
> 62.33 -62.3 0.00 perf-profile.calltrace.cycles-pp._raw_spin_lock_irqsave.get_from_partial_node.___slab_alloc.kmem_cache_alloc_noprof.mas_store_gfp
> 61.50 -61.5 0.00 perf-profile.calltrace.cycles-pp.native_queued_spin_lock_slowpath._raw_spin_lock_irqsave.get_from_partial_node.___slab_alloc.kmem_cache_alloc_noprof
> 33.79 -33.8 0.00 perf-profile.calltrace.cycles-pp.___slab_alloc.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk
> 33.71 -33.7 0.00 perf-profile.calltrace.cycles-pp.get_from_partial_node.___slab_alloc.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap
> 33.24 -33.2 0.00 perf-profile.calltrace.cycles-pp.___slab_alloc.kmem_cache_alloc_noprof.mas_store_gfp.do_brk_flags.__do_sys_brk
> 33.16 -33.2 0.00 perf-profile.calltrace.cycles-pp.get_from_partial_node.___slab_alloc.kmem_cache_alloc_noprof.mas_store_gfp.do_brk_flags
Seems like mas_store_gfp is doing those allocations that don't allow
blocking and thus were impacted. I see it calling mas_wr_preallocate()
and that doing mas_alloc_nodes(mas, GFP_NOWAIT);
This was AFAICS introduced by 9b05890a25d9 ("maple_tree: Prefilled sheaf
conversion and testing").
I vaguely recall we mighth have discussed this, but it's unclear to me, why
e.g. a perfectly cromulent do_brk_flags -> vma_iter_store_gfp(GFP_KERNEL) ->
mas_store(GFP_KERNEL) results in GFP_NOWAIT allocation.
Here mas_alloc_nodes() AFAIU allocates only one node via mt_alloc_one() and
results in a slowpath instead of sheaf refill. But that slowpath could fail
due to lack of reclaim? Or if we needed more nodes and did
mt_get_sheaf(GFP_NOWAIT), we could be failing too?
Oh I see, mas_nomem(actual_gfp) would have handled that, IIUC. So it was
just a potential performance issue. Maybe it explains some of the
regressions reports we saw for will-it-scale and attributed to the loss of
double caching with cpu (partial) slabs.
TL;DR should be fine now, but it means fb1091febd66 ("mm/slab: allow sheaf
refill if blocking is not allowed") is an even more important perf fix than
we thought.
> 34.48 -31.5 3.02 ± 9% perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 33.95 -31.2 2.74 ± 10% perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.mas_store_gfp.do_brk_flags.__do_sys_brk.do_syscall_64
> 37.06 -25.9 11.12 ± 2% perf-profile.calltrace.cycles-pp.mas_store_gfp.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 37.65 -25.8 11.83 ± 2% perf-profile.calltrace.cycles-pp.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 90.38 -21.0 69.37 perf-profile.calltrace.cycles-pp.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 91.24 -19.1 72.10 perf-profile.calltrace.cycles-pp.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 91.43 -18.8 72.66 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_after_hwframe.brk
> 41.96 -15.8 26.12 perf-profile.calltrace.cycles-pp.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 45.03 -12.0 33.08 perf-profile.calltrace.cycles-pp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 2.01 ± 4% -1.1 0.91 ± 3% perf-profile.calltrace.cycles-pp.rcu_core.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt
> 1.98 ± 4% -1.1 0.90 ± 4% perf-profile.calltrace.cycles-pp.rcu_do_batch.rcu_core.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt
> 99.45 -1.0 98.44 perf-profile.calltrace.cycles-pp.brk
> 1.81 ± 5% -1.0 0.86 ± 4% perf-profile.calltrace.cycles-pp.rcu_free_sheaf.rcu_do_batch.rcu_core.handle_softirqs.__irq_exit_rcu
> 1.50 ± 6% -0.8 0.74 ± 4% perf-profile.calltrace.cycles-pp.__kmem_cache_free_bulk.rcu_free_sheaf.rcu_do_batch.rcu_core.handle_softirqs
> 1.25 ± 7% -0.7 0.55 ± 5% perf-profile.calltrace.cycles-pp.__slab_free.__kmem_cache_free_bulk.rcu_free_sheaf.rcu_do_batch.rcu_core
> 0.00 +0.5 0.52 perf-profile.calltrace.cycles-pp.security_mmap_addr.__get_unmapped_area.check_brk_limits.__do_sys_brk.do_syscall_64
> 0.00 +0.5 0.52 perf-profile.calltrace.cycles-pp.mas_find.vms_complete_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 0.00 +0.6 0.56 perf-profile.calltrace.cycles-pp.mas_update_gap.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk
> 0.00 +0.6 0.56 ± 5% perf-profile.calltrace.cycles-pp.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store.mas_store_gfp.do_brk_flags
> 0.00 +0.6 0.56 ± 3% perf-profile.calltrace.cycles-pp.__kfree_rcu_sheaf.kvfree_call_rcu.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap
> 0.00 +0.6 0.58 ± 12% perf-profile.calltrace.cycles-pp.rcu_do_batch.rcu_core.handle_softirqs.run_ksoftirqd.smpboot_thread_fn
> 0.00 +0.6 0.58 perf-profile.calltrace.cycles-pp.arch_exit_to_user_mode_prepare.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 0.00 +0.6 0.58 ± 12% perf-profile.calltrace.cycles-pp.handle_softirqs.run_ksoftirqd.smpboot_thread_fn.kthread.ret_from_fork
> 0.00 +0.6 0.58 ± 12% perf-profile.calltrace.cycles-pp.rcu_core.handle_softirqs.run_ksoftirqd.smpboot_thread_fn.kthread
> 0.00 +0.6 0.58 ± 12% perf-profile.calltrace.cycles-pp.run_ksoftirqd.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
> 0.58 ± 2% +0.6 1.17 ± 2% perf-profile.calltrace.cycles-pp.kvfree_call_rcu.mas_wr_node_store.mas_store_gfp.do_brk_flags.__do_sys_brk
> 0.00 +0.6 0.61 perf-profile.calltrace.cycles-pp.mas_update_gap.mas_wr_node_store.mas_store_gfp.do_brk_flags.__do_sys_brk
> 0.00 +0.6 0.61 ± 11% perf-profile.calltrace.cycles-pp.smpboot_thread_fn.kthread.ret_from_fork.ret_from_fork_asm
> 0.00 +0.6 0.62 perf-profile.calltrace.cycles-pp.arch_get_unmapped_area_topdown.__get_unmapped_area.check_brk_limits.__do_sys_brk.do_syscall_64
> 0.00 +0.6 0.63 ± 2% perf-profile.calltrace.cycles-pp.static_key_count.security_vm_enough_memory_mm.do_brk_flags.__do_sys_brk.do_syscall_64
> 0.00 +0.7 0.67 ± 6% perf-profile.calltrace.cycles-pp.queue_event.ordered_events__queue.process_simple.reader__read_event.perf_session__process_events
> 0.00 +0.7 0.68 ± 7% perf-profile.calltrace.cycles-pp.ordered_events__queue.process_simple.reader__read_event.perf_session__process_events.record__finish_output
> 0.00 +0.7 0.68 perf-profile.calltrace.cycles-pp.can_vma_merge_left.vma_merge_new_range.do_brk_flags.__do_sys_brk.do_syscall_64
> 0.00 +0.7 0.68 ± 11% perf-profile.calltrace.cycles-pp.kthread.ret_from_fork.ret_from_fork_asm
> 0.00 +0.7 0.68 ± 11% perf-profile.calltrace.cycles-pp.ret_from_fork.ret_from_fork_asm
> 0.00 +0.7 0.68 ± 11% perf-profile.calltrace.cycles-pp.ret_from_fork_asm
> 0.00 +0.7 0.72 ± 6% perf-profile.calltrace.cycles-pp.process_simple.reader__read_event.perf_session__process_events.record__finish_output.cmd_record
> 0.00 +0.8 0.77 perf-profile.calltrace.cycles-pp.__vma_start_exclude_readers.__vma_start_write.vms_gather_munmap_vmas.do_vmi_align_munmap.__do_sys_brk
> 0.62 +0.9 1.49 perf-profile.calltrace.cycles-pp.kvfree_call_rcu.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk
> 0.00 +0.9 0.87 ± 6% perf-profile.calltrace.cycles-pp.perf_event_mmap_output.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.do_brk_flags
> 0.00 +0.9 0.88 perf-profile.calltrace.cycles-pp.pte_offset_map_lock.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas
> 0.00 +0.9 0.92 ± 3% perf-profile.calltrace.cycles-pp.handle_softirqs.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.brk
> 0.00 +0.9 0.92 ± 4% perf-profile.calltrace.cycles-pp.__irq_exit_rcu.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.brk
> 0.00 +0.9 0.93 ± 3% perf-profile.calltrace.cycles-pp.down_write_killable.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 0.00 +1.0 0.96 perf-profile.calltrace.cycles-pp.mas_wr_store_type.mas_store_gfp.do_brk_flags.__do_sys_brk.do_syscall_64
> 0.00 +1.0 0.97 perf-profile.calltrace.cycles-pp.vma_merge_new_range.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 0.00 +1.0 1.02 ± 3% perf-profile.calltrace.cycles-pp.sysvec_apic_timer_interrupt.asm_sysvec_apic_timer_interrupt.brk
> 0.55 ± 2% +1.0 1.57 perf-profile.calltrace.cycles-pp.mas_find.vms_gather_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 0.00 +1.0 1.02 ± 3% perf-profile.calltrace.cycles-pp.asm_sysvec_apic_timer_interrupt.brk
> 0.44 ± 44% +1.0 1.48 perf-profile.calltrace.cycles-pp.mas_prev_slot.vms_gather_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 0.00 +1.1 1.05 perf-profile.calltrace.cycles-pp.memcpy_orig.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk
> 0.00 +1.1 1.10 perf-profile.calltrace.cycles-pp.free_pgtables.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap.__do_sys_brk
> 0.57 +1.1 1.67 perf-profile.calltrace.cycles-pp.security_vm_enough_memory_mm.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 0.52 ± 5% +1.1 1.65 ± 4% perf-profile.calltrace.cycles-pp.__memcg_slab_free_hook.kmem_cache_free.vms_complete_munmap_vmas.do_vmi_align_munmap.__do_sys_brk
> 0.00 +1.1 1.13 perf-profile.calltrace.cycles-pp.memcpy_orig.mas_wr_node_store.mas_store_gfp.do_brk_flags.__do_sys_brk
> 0.00 +1.1 1.13 ± 6% perf-profile.calltrace.cycles-pp.reader__read_event.perf_session__process_events.record__finish_output.cmd_record
> 0.00 +1.1 1.13 ± 6% perf-profile.calltrace.cycles-pp.perf_session__process_events.record__finish_output.cmd_record
> 0.00 +1.1 1.13 ± 5% perf-profile.calltrace.cycles-pp.cmd_record
> 0.00 +1.1 1.13 ± 5% perf-profile.calltrace.cycles-pp.record__finish_output.cmd_record
> 0.00 +1.1 1.14 perf-profile.calltrace.cycles-pp.__vma_start_write.vms_gather_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 0.00 +1.1 1.15 perf-profile.calltrace.cycles-pp.mas_wr_store_type.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 0.58 ± 3% +1.2 1.76 ± 3% perf-profile.calltrace.cycles-pp.__memcg_slab_post_alloc_hook.kmem_cache_alloc_noprof.vm_area_alloc.do_brk_flags.__do_sys_brk
> 0.55 +1.2 1.74 perf-profile.calltrace.cycles-pp.check_brk_limits.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 0.00 +1.2 1.21 perf-profile.calltrace.cycles-pp.mas_next_slot.mas_find.vms_gather_munmap_vmas.do_vmi_align_munmap.__do_sys_brk
> 0.68 ± 2% +1.2 1.92 perf-profile.calltrace.cycles-pp.zap_pte_range.zap_pmd_range.unmap_page_range.unmap_vmas.unmap_region
> 0.34 ± 70% +1.3 1.61 perf-profile.calltrace.cycles-pp.__get_unmapped_area.check_brk_limits.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 0.00 +1.3 1.32 perf-profile.calltrace.cycles-pp.mas_store_gfp.vms_gather_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 0.84 ± 2% +1.5 2.36 perf-profile.calltrace.cycles-pp.zap_pmd_range.unmap_page_range.unmap_vmas.unmap_region.vms_complete_munmap_vmas
> 0.00 +1.6 1.55 ± 16% perf-profile.calltrace.cycles-pp.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp.do_brk_flags
> 0.77 ± 4% +1.6 2.42 ± 2% perf-profile.calltrace.cycles-pp.kmem_cache_free.vms_complete_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 0.00 +1.7 1.74 ± 16% perf-profile.calltrace.cycles-pp.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp.do_brk_flags.__do_sys_brk
> 0.89 ± 2% +1.8 2.68 ± 2% perf-profile.calltrace.cycles-pp.kmem_cache_alloc_noprof.vm_area_alloc.do_brk_flags.__do_sys_brk.do_syscall_64
> 0.00 +1.8 1.79 ± 13% perf-profile.calltrace.cycles-pp.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap
> 0.65 ± 6% +1.9 2.56 ± 2% perf-profile.calltrace.cycles-pp.perf_iterate_sb.perf_event_mmap_event.perf_event_mmap.do_brk_flags.__do_sys_brk
> 0.00 +2.0 2.01 ± 13% perf-profile.calltrace.cycles-pp.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk
> 1.06 ± 2% +2.0 3.07 perf-profile.calltrace.cycles-pp.mas_walk.mas_find.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 1.08 +2.2 3.26 perf-profile.calltrace.cycles-pp.vm_area_alloc.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 0.60 +2.4 2.97 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64_safe_stack.brk
> 1.31 +2.4 3.70 perf-profile.calltrace.cycles-pp.unmap_page_range.unmap_vmas.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap
> 1.31 ± 2% +2.5 3.86 perf-profile.calltrace.cycles-pp.mas_find.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe.brk
> 0.00 +2.9 2.94 perf-profile.calltrace.cycles-pp.__refill_objects_node.refill_objects.__pcs_replace_empty_main.kmem_cache_alloc_noprof.mas_store_gfp
> 1.65 +3.1 4.74 perf-profile.calltrace.cycles-pp.unmap_vmas.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap.__do_sys_brk
> 1.32 ± 2% +3.2 4.49 perf-profile.calltrace.cycles-pp.perf_event_mmap_event.perf_event_mmap.do_brk_flags.__do_sys_brk.do_syscall_64
> 1.53 ± 2% +3.6 5.12 perf-profile.calltrace.cycles-pp.perf_event_mmap.do_brk_flags.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 1.82 ± 2% +3.6 5.41 perf-profile.calltrace.cycles-pp.syscall_return_via_sysret.brk
> 1.60 +3.7 5.28 perf-profile.calltrace.cycles-pp.entry_SYSCALL_64.brk
> 2.30 +3.9 6.18 perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_store_gfp.do_brk_flags.__do_sys_brk.do_syscall_64
> 2.30 +4.1 6.38 perf-profile.calltrace.cycles-pp.mas_wr_node_store.mas_store_gfp.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 2.52 +4.7 7.20 perf-profile.calltrace.cycles-pp.unmap_region.vms_complete_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64
> 2.61 +5.0 7.56 perf-profile.calltrace.cycles-pp.vms_gather_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 3.36 +6.7 10.02 perf-profile.calltrace.cycles-pp.entry_SYSRETQ_unsafe_stack.brk
> 4.40 +8.2 12.62 perf-profile.calltrace.cycles-pp.vms_complete_munmap_vmas.do_vmi_align_munmap.__do_sys_brk.do_syscall_64.entry_SYSCALL_64_after_hwframe
> 67.04 -67.0 0.00 perf-profile.children.cycles-pp.___slab_alloc
> 66.95 -67.0 0.00 perf-profile.children.cycles-pp.get_from_partial_node
> 63.85 -61.7 2.16 ± 5% perf-profile.children.cycles-pp._raw_spin_lock_irqsave
> 62.92 -61.3 1.64 ± 7% perf-profile.children.cycles-pp.native_queued_spin_lock_slowpath
> 69.33 -60.9 8.46 perf-profile.children.cycles-pp.kmem_cache_alloc_noprof
> 75.21 -50.8 24.39 perf-profile.children.cycles-pp.mas_store_gfp
> 90.42 -20.9 69.48 perf-profile.children.cycles-pp.__do_sys_brk
> 91.30 -19.1 72.18 perf-profile.children.cycles-pp.do_syscall_64
> 91.49 -18.8 72.73 perf-profile.children.cycles-pp.entry_SYSCALL_64_after_hwframe
> 42.00 -15.8 26.24 perf-profile.children.cycles-pp.do_brk_flags
> 45.05 -11.9 33.12 perf-profile.children.cycles-pp.do_vmi_align_munmap
> 2.43 ± 4% -2.3 0.15 ± 3% perf-profile.children.cycles-pp._raw_spin_unlock_irqrestore
> 99.67 -1.8 97.90 perf-profile.children.cycles-pp.brk
> 0.29 ± 3% -0.1 0.19 ± 3% perf-profile.children.cycles-pp.barn_replace_empty_sheaf
> 0.16 ± 4% -0.1 0.10 ± 5% perf-profile.children.cycles-pp.barn_put_full_sheaf
> 0.09 ± 5% +0.0 0.10 ± 4% perf-profile.children.cycles-pp.task_tick_fair
> 0.05 +0.0 0.09 ± 4% perf-profile.children.cycles-pp.mas_nomem
> 0.26 +0.0 0.31 ± 4% perf-profile.children.cycles-pp.tick_nohz_handler
> 0.23 ± 2% +0.0 0.28 ± 4% perf-profile.children.cycles-pp.update_process_times
> 0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.__x86_indirect_thunk_rax
> 0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.arch_vma_name
> 0.00 +0.1 0.05 ± 7% perf-profile.children.cycles-pp.memset_orig
> 0.00 +0.1 0.06 ± 13% perf-profile.children.cycles-pp.process_one_work
> 0.00 +0.1 0.06 ± 6% perf-profile.children.cycles-pp.unlink_file_vma_batch_add
> 0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp.worker_thread
> 0.34 +0.1 0.40 ± 3% perf-profile.children.cycles-pp.__hrtimer_run_queues
> 0.00 +0.1 0.06 perf-profile.children.cycles-pp.__mt_destroy
> 0.00 +0.1 0.06 perf-profile.children.cycles-pp.__schedule
> 0.40 +0.1 0.46 ± 3% perf-profile.children.cycles-pp.__sysvec_apic_timer_interrupt
> 0.40 +0.1 0.46 ± 3% perf-profile.children.cycles-pp.hrtimer_interrupt
> 0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp.evlist__event2evsel
> 0.00 +0.1 0.06 ± 11% perf-profile.children.cycles-pp.evlist__parse_sample
> 0.08 +0.1 0.14 ± 4% perf-profile.children.cycles-pp.barn_get_empty_sheaf
> 0.00 +0.1 0.06 ± 7% perf-profile.children.cycles-pp.schedule
> 0.00 +0.1 0.06 ± 7% perf-profile.children.cycles-pp.unlink_file_vma_batch_init
> 0.06 ± 6% +0.1 0.13 ± 2% perf-profile.children.cycles-pp.remove_vma
> 0.00 +0.1 0.07 ± 10% perf-profile.children.cycles-pp.flush_tlb_batched_pending
> 0.00 +0.1 0.07 ± 14% perf-profile.children.cycles-pp.finish_rcuwait
> 0.00 +0.1 0.07 perf-profile.children.cycles-pp.build_id_parse_nofault
> 0.00 +0.1 0.07 ± 6% perf-profile.children.cycles-pp.__refill_objects_any
> 0.00 +0.1 0.08 ± 4% perf-profile.children.cycles-pp.unlink_file_vma_batch_final
> 0.00 +0.1 0.08 ± 4% perf-profile.children.cycles-pp._raw_spin_unlock
> 0.00 +0.1 0.08 ± 5% perf-profile.children.cycles-pp.__call_rcu_common
> 0.02 ±142% +0.1 0.11 ± 12% perf-profile.children.cycles-pp.map__new
> 0.06 ± 9% +0.1 0.15 ± 3% perf-profile.children.cycles-pp.__pte_offset_map
> 0.00 +0.1 0.09 ± 5% perf-profile.children.cycles-pp.rcu_cblist_dequeue
> 0.00 +0.1 0.10 ± 6% perf-profile.children.cycles-pp.ksm_vma_flags
> 0.07 +0.1 0.17 ± 2% perf-profile.children.cycles-pp.mas_next_setup
> 0.02 ±141% +0.1 0.12 ± 5% perf-profile.children.cycles-pp.userfaultfd_unmap_prep
> 0.05 ± 47% +0.1 0.16 ± 10% perf-profile.children.cycles-pp.brk@plt
> 0.06 ± 13% +0.1 0.18 ± 4% perf-profile.children.cycles-pp.mm_get_unmapped_area_vmflags
> 0.06 ± 7% +0.1 0.18 ± 2% perf-profile.children.cycles-pp.unlink_anon_vmas
> 0.02 ±141% +0.1 0.14 ± 11% perf-profile.children.cycles-pp.__x86_indirect_thunk_r12
> 0.08 ± 4% +0.1 0.22 ± 4% perf-profile.children.cycles-pp.cap_vm_enough_memory
> 0.08 ± 4% +0.1 0.22 ± 6% perf-profile.children.cycles-pp.__x64_sys_brk
> 0.08 ± 5% +0.1 0.23 ± 2% perf-profile.children.cycles-pp.strlen
> 0.00 +0.2 0.15 perf-profile.children.cycles-pp.__kmalloc_noprof
> 0.10 ± 3% +0.2 0.25 ± 2% perf-profile.children.cycles-pp.__pi_memcpy
> 0.00 +0.2 0.16 ± 3% perf-profile.children.cycles-pp.__alloc_empty_sheaf
> 0.07 ± 6% +0.2 0.23 ± 8% perf-profile.children.cycles-pp.testcase
> 0.07 ± 12% +0.2 0.24 ± 4% perf-profile.children.cycles-pp.is_vmalloc_addr
> 0.07 ± 84% +0.2 0.23 ± 13% perf-profile.children.cycles-pp.machine__process_mmap2_event
> 0.09 ± 5% +0.2 0.25 ± 3% perf-profile.children.cycles-pp.free_pgd_range
> 0.09 ± 5% +0.2 0.27 ± 2% perf-profile.children.cycles-pp.__account_obj_stock
> 0.00 +0.2 0.19 ± 9% perf-profile.children.cycles-pp.setup_object
> 0.10 ± 6% +0.2 0.29 ± 7% perf-profile.children.cycles-pp.cap_capable
> 0.10 ± 3% +0.2 0.29 ± 2% perf-profile.children.cycles-pp.is_mergeable_anon_vma
> 0.09 +0.2 0.29 perf-profile.children.cycles-pp.may_expand_vm
> 0.12 ± 9% +0.2 0.32 ± 4% perf-profile.children.cycles-pp.x64_sys_call
> 0.12 ± 4% +0.2 0.33 perf-profile.children.cycles-pp.userfaultfd_unmap_complete
> 0.12 ± 6% +0.2 0.34 ± 7% perf-profile.children.cycles-pp.vm_get_page_prot
> 0.00 +0.2 0.23 ± 9% perf-profile.children.cycles-pp.shuffle_freelist
> 0.13 ± 6% +0.2 0.36 ± 6% perf-profile.children.cycles-pp.percpu_counter_add_batch
> 0.11 ± 5% +0.2 0.34 ± 2% perf-profile.children.cycles-pp.strnlen
> 0.18 ± 2% +0.2 0.41 ± 3% perf-profile.children.cycles-pp.vm_area_free
> 0.10 ± 3% +0.2 0.34 perf-profile.children.cycles-pp.unmap_single_vma
> 0.34 +0.2 0.58 ± 4% perf-profile.children.cycles-pp.__rcu_free_sheaf_prepare
> 0.11 ± 85% +0.2 0.34 ± 13% perf-profile.children.cycles-pp.perf_session__deliver_event
> 0.14 ± 4% +0.3 0.40 perf-profile.children.cycles-pp.tlb_finish_mmu
> 0.00 +0.3 0.26 ± 8% perf-profile.children.cycles-pp.allocate_slab
> 0.13 ± 2% +0.3 0.39 perf-profile.children.cycles-pp.__vm_enough_memory
> 0.14 ± 2% +0.3 0.40 ± 2% perf-profile.children.cycles-pp.up_write
> 0.16 ± 2% +0.3 0.43 perf-profile.children.cycles-pp.mas_wr_store_entry
> 0.12 ± 85% +0.3 0.39 ± 12% perf-profile.children.cycles-pp.__ordered_events__flush
> 0.13 ± 3% +0.3 0.40 perf-profile.children.cycles-pp.downgrade_write
> 0.15 ± 2% +0.3 0.42 perf-profile.children.cycles-pp.mas_prev_setup
> 0.11 ± 85% +0.3 0.39 ± 11% perf-profile.children.cycles-pp.perf_session__process_user_event
> 0.16 ± 2% +0.3 0.46 ± 3% perf-profile.children.cycles-pp._raw_spin_lock
> 0.16 ± 3% +0.3 0.47 ± 4% perf-profile.children.cycles-pp.kfree
> 0.16 ± 4% +0.3 0.46 perf-profile.children.cycles-pp.mas_prev_range
> 0.18 ± 2% +0.3 0.48 perf-profile.children.cycles-pp.mas_next_range
> 0.17 ± 2% +0.3 0.48 ± 2% perf-profile.children.cycles-pp.tlb_gather_mmu
> 0.15 ± 2% +0.3 0.46 perf-profile.children.cycles-pp.cap_mmap_addr
> 0.17 +0.3 0.48 perf-profile.children.cycles-pp.sized_strscpy
> 0.14 ± 8% +0.3 0.46 ± 13% perf-profile.children.cycles-pp.obj_cgroup_charge_account
> 0.18 ± 2% +0.3 0.51 perf-profile.children.cycles-pp.up_read
> 0.18 ± 2% +0.4 0.55 perf-profile.children.cycles-pp.security_mmap_addr
> 0.21 ± 2% +0.4 0.60 perf-profile.children.cycles-pp.mas_prev
> 0.21 +0.4 0.60 perf-profile.children.cycles-pp.arch_exit_to_user_mode_prepare
> 0.19 ± 3% +0.4 0.63 perf-profile.children.cycles-pp.arch_get_unmapped_area_topdown
> 0.17 ± 9% +0.4 0.60 ± 13% perf-profile.children.cycles-pp.refill_obj_stock
> 0.25 +0.4 0.70 perf-profile.children.cycles-pp.can_vma_merge_left
> 0.22 ± 2% +0.5 0.67 perf-profile.children.cycles-pp.static_key_count
> 0.24 ± 3% +0.5 0.76 perf-profile.children.cycles-pp.mas_leaf_max_gap
> 0.31 ± 2% +0.6 0.90 perf-profile.children.cycles-pp.pte_offset_map_lock
> 0.00 +0.6 0.58 ± 12% perf-profile.children.cycles-pp.run_ksoftirqd
> 0.06 ± 87% +0.6 0.67 ± 6% perf-profile.children.cycles-pp.queue_event
> 0.00 +0.6 0.61 ± 11% perf-profile.children.cycles-pp.smpboot_thread_fn
> 0.06 ± 90% +0.6 0.68 ± 7% perf-profile.children.cycles-pp.ordered_events__queue
> 0.31 ± 2% +0.6 0.93 ± 2% perf-profile.children.cycles-pp.__vma_start_exclude_readers
> 0.30 ± 4% +0.6 0.93 ± 3% perf-profile.children.cycles-pp.down_write_killable
> 0.50 ± 2% +0.6 1.14 ± 4% perf-profile.children.cycles-pp.__kfree_rcu_sheaf
> 0.35 +0.6 1.00 perf-profile.children.cycles-pp.vma_merge_new_range
> 0.07 ± 86% +0.7 0.72 ± 6% perf-profile.children.cycles-pp.process_simple
> 0.23 ± 11% +0.7 0.89 ± 6% perf-profile.children.cycles-pp.perf_event_mmap_output
> 0.00 +0.7 0.68 ± 11% perf-profile.children.cycles-pp.kthread
> 0.00 +0.7 0.68 ± 11% perf-profile.children.cycles-pp.ret_from_fork
> 0.00 +0.7 0.68 ± 11% perf-profile.children.cycles-pp.ret_from_fork_asm
> 0.44 +0.8 1.21 perf-profile.children.cycles-pp.free_pgtables
> 0.41 ± 2% +0.8 1.21 perf-profile.children.cycles-pp.mas_update_gap
> 0.19 ± 86% +0.9 1.13 ± 6% perf-profile.children.cycles-pp.reader__read_event
> 0.19 ± 85% +0.9 1.13 ± 5% perf-profile.children.cycles-pp.perf_session__process_events
> 0.19 ± 85% +0.9 1.13 ± 5% perf-profile.children.cycles-pp.record__finish_output
> 0.21 ± 82% +1.0 1.16 ± 6% perf-profile.children.cycles-pp.cmd_record
> 0.49 +1.0 1.46 perf-profile.children.cycles-pp.__vma_start_write
> 0.53 ± 5% +1.1 1.65 ± 5% perf-profile.children.cycles-pp.__memcg_slab_free_hook
> 0.53 +1.1 1.67 perf-profile.children.cycles-pp.__get_unmapped_area
> 0.60 +1.2 1.76 perf-profile.children.cycles-pp.security_vm_enough_memory_mm
> 0.58 ± 3% +1.2 1.77 ± 3% perf-profile.children.cycles-pp.__memcg_slab_post_alloc_hook
> 0.56 +1.2 1.76 perf-profile.children.cycles-pp.check_brk_limits
> 0.60 +1.2 1.81 perf-profile.children.cycles-pp.entry_SYSCALL_64_safe_stack
> 0.69 ± 2% +1.3 1.95 perf-profile.children.cycles-pp.zap_pte_range
> 0.69 ± 2% +1.3 1.98 perf-profile.children.cycles-pp.mas_prev_slot
> 0.73 +1.3 2.05 perf-profile.children.cycles-pp.mas_next_slot
> 0.78 +1.4 2.20 perf-profile.children.cycles-pp.mas_wr_store_type
> 1.20 +1.5 2.70 perf-profile.children.cycles-pp.kvfree_call_rcu
> 0.86 ± 2% +1.5 2.39 perf-profile.children.cycles-pp.zap_pmd_range
> 0.73 +1.6 2.30 perf-profile.children.cycles-pp.memcpy_orig
> 0.77 ± 3% +1.7 2.43 ± 2% perf-profile.children.cycles-pp.kmem_cache_free
> 1.70 ± 8% +1.9 3.56 ± 2% perf-profile.children.cycles-pp.__slab_free
> 0.66 ± 5% +1.9 2.59 perf-profile.children.cycles-pp.perf_iterate_sb
> 2.67 ± 5% +2.2 4.84 ± 3% perf-profile.children.cycles-pp.__irq_exit_rcu
> 1.09 +2.2 3.28 perf-profile.children.cycles-pp.vm_area_alloc
> 1.18 ± 2% +2.2 3.41 perf-profile.children.cycles-pp.mas_walk
> 3.09 ± 4% +2.2 5.33 ± 2% perf-profile.children.cycles-pp.sysvec_apic_timer_interrupt
> 3.12 ± 4% +2.2 5.36 ± 2% perf-profile.children.cycles-pp.asm_sysvec_apic_timer_interrupt
> 1.97 ± 7% +2.4 4.35 ± 2% perf-profile.children.cycles-pp.__kmem_cache_free_bulk
> 1.32 +2.4 3.73 perf-profile.children.cycles-pp.unmap_page_range
> 2.39 ± 5% +2.6 5.04 ± 2% perf-profile.children.cycles-pp.rcu_free_sheaf
> 2.64 ± 5% +2.7 5.37 ± 2% perf-profile.children.cycles-pp.rcu_core
> 2.60 ± 5% +2.7 5.34 ± 2% perf-profile.children.cycles-pp.rcu_do_batch
> 2.66 ± 5% +2.8 5.42 ± 2% perf-profile.children.cycles-pp.handle_softirqs
> 0.00 +3.0 3.04 perf-profile.children.cycles-pp.__refill_objects_node
> 0.74 +3.1 3.80 perf-profile.children.cycles-pp.__pcs_replace_empty_main
> 1.66 +3.1 4.76 perf-profile.children.cycles-pp.unmap_vmas
> 1.38 ± 2% +3.3 4.64 perf-profile.children.cycles-pp.perf_event_mmap_event
> 0.00 +3.4 3.39 perf-profile.children.cycles-pp.refill_objects
> 1.54 ± 2% +3.6 5.13 perf-profile.children.cycles-pp.perf_event_mmap
> 1.85 +3.7 5.52 perf-profile.children.cycles-pp.syscall_return_via_sysret
> 1.99 +3.8 5.81 perf-profile.children.cycles-pp.entry_SYSCALL_64
> 2.27 +4.3 6.58 perf-profile.children.cycles-pp.mas_find
> 2.54 +4.7 7.23 perf-profile.children.cycles-pp.unmap_region
> 2.65 +5.0 7.67 perf-profile.children.cycles-pp.vms_gather_munmap_vmas
> 3.77 +7.4 11.20 perf-profile.children.cycles-pp.entry_SYSRETQ_unsafe_stack
> 4.68 +8.1 12.74 perf-profile.children.cycles-pp.mas_wr_node_store
> 4.43 +8.3 12.71 perf-profile.children.cycles-pp.vms_complete_munmap_vmas
> 62.91 -61.3 1.64 ± 7% perf-profile.self.cycles-pp.native_queued_spin_lock_slowpath
> 0.93 -0.4 0.49 perf-profile.self.cycles-pp._raw_spin_lock_irqsave
> 0.43 -0.3 0.09 perf-profile.self.cycles-pp.__pcs_replace_empty_main
> 0.09 ± 5% +0.0 0.11 ± 4% perf-profile.self.cycles-pp.barn_replace_empty_sheaf
> 0.05 ± 7% +0.1 0.10 ± 3% perf-profile.self.cycles-pp.remove_vma
> 0.00 +0.1 0.05 perf-profile.self.cycles-pp.__kmalloc_noprof
> 0.00 +0.1 0.05 perf-profile.self.cycles-pp.unlink_file_vma_batch_init
> 0.00 +0.1 0.05 ± 7% perf-profile.self.cycles-pp.build_id_parse_nofault
> 0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.__mt_destroy
> 0.00 +0.1 0.06 ± 11% perf-profile.self.cycles-pp.evlist__event2evsel
> 0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.__call_rcu_common
> 0.00 +0.1 0.06 ± 6% perf-profile.self.cycles-pp.flush_tlb_batched_pending
> 0.00 +0.1 0.07 ± 7% perf-profile.self.cycles-pp._raw_spin_unlock
> 0.00 +0.1 0.07 ± 13% perf-profile.self.cycles-pp.finish_rcuwait
> 0.00 +0.1 0.07 perf-profile.self.cycles-pp.mas_nomem
> 0.04 ± 44% +0.1 0.12 ± 6% perf-profile.self.cycles-pp.__x64_sys_brk
> 0.00 +0.1 0.08 ± 6% perf-profile.self.cycles-pp.barn_get_empty_sheaf
> 0.00 +0.1 0.08 ± 6% perf-profile.self.cycles-pp.security_mmap_addr
> 0.00 +0.1 0.08 perf-profile.self.cycles-pp.check_brk_limits
> 0.05 +0.1 0.13 ± 2% perf-profile.self.cycles-pp.__pte_offset_map
> 0.00 +0.1 0.09 ± 4% perf-profile.self.cycles-pp.ksm_vma_flags
> 0.06 +0.1 0.15 perf-profile.self.cycles-pp.mas_next_setup
> 0.00 +0.1 0.09 ± 5% perf-profile.self.cycles-pp.rcu_cblist_dequeue
> 0.00 +0.1 0.10 ± 8% perf-profile.self.cycles-pp.rcu_free_sheaf
> 0.10 ± 5% +0.1 0.19 ± 9% perf-profile.self.cycles-pp.vm_area_free
> 0.00 +0.1 0.10 ± 4% perf-profile.self.cycles-pp.__pi_memcpy
> 0.00 +0.1 0.10 ± 7% perf-profile.self.cycles-pp.userfaultfd_unmap_prep
> 0.00 +0.1 0.10 ± 13% perf-profile.self.cycles-pp.__x86_indirect_thunk_r12
> 0.06 +0.1 0.16 ± 3% perf-profile.self.cycles-pp.unlink_anon_vmas
> 0.03 ±100% +0.1 0.14 ± 15% perf-profile.self.cycles-pp.brk@plt
> 0.07 ± 6% +0.1 0.18 ± 4% perf-profile.self.cycles-pp.cap_vm_enough_memory
> 0.02 ±141% +0.1 0.14 ± 3% perf-profile.self.cycles-pp.mm_get_unmapped_area_vmflags
> 0.08 ± 6% +0.1 0.20 ± 2% perf-profile.self.cycles-pp.strlen
> 0.07 +0.1 0.20 ± 3% perf-profile.self.cycles-pp.__vm_enough_memory
> 0.08 ± 6% +0.1 0.22 ± 4% perf-profile.self.cycles-pp.free_pgd_range
> 0.07 ± 14% +0.1 0.21 ± 4% perf-profile.self.cycles-pp.is_vmalloc_addr
> 0.07 ± 5% +0.1 0.21 ± 7% perf-profile.self.cycles-pp.testcase
> 0.09 ± 4% +0.2 0.24 ± 3% perf-profile.self.cycles-pp.__account_obj_stock
> 0.08 ± 4% +0.2 0.24 ± 6% perf-profile.self.cycles-pp.obj_cgroup_charge_account
> 0.00 +0.2 0.16 ± 8% perf-profile.self.cycles-pp.setup_object
> 0.10 ± 4% +0.2 0.27 ± 2% perf-profile.self.cycles-pp.is_mergeable_anon_vma
> 0.10 ± 5% +0.2 0.27 ± 7% perf-profile.self.cycles-pp.cap_capable
> 0.08 ± 5% +0.2 0.26 perf-profile.self.cycles-pp.may_expand_vm
> 0.10 ± 4% +0.2 0.29 perf-profile.self.cycles-pp.pte_offset_map_lock
> 0.10 ± 3% +0.2 0.29 perf-profile.self.cycles-pp.mas_next_range
> 0.10 ± 7% +0.2 0.28 ± 2% perf-profile.self.cycles-pp.mas_prev_range
> 0.10 ± 5% +0.2 0.29 perf-profile.self.cycles-pp.vma_merge_new_range
> 0.12 ± 4% +0.2 0.31 perf-profile.self.cycles-pp.userfaultfd_unmap_complete
> 0.10 ± 4% +0.2 0.30 ± 4% perf-profile.self.cycles-pp.x64_sys_call
> 0.11 ± 4% +0.2 0.31 ± 8% perf-profile.self.cycles-pp.vm_get_page_prot
> 0.12 ± 4% +0.2 0.32 perf-profile.self.cycles-pp.mas_prev
> 0.12 ± 7% +0.2 0.32 ± 6% perf-profile.self.cycles-pp.percpu_counter_add_batch
> 0.10 ± 3% +0.2 0.31 ± 2% perf-profile.self.cycles-pp.strnlen
> 0.09 ± 4% +0.2 0.31 perf-profile.self.cycles-pp.unmap_single_vma
> 0.15 ± 3% +0.2 0.36 ± 2% perf-profile.self.cycles-pp.mas_wr_store_entry
> 0.14 ± 2% +0.2 0.38 ± 2% perf-profile.self.cycles-pp.can_vma_merge_left
> 0.34 +0.2 0.58 ± 4% perf-profile.self.cycles-pp.__rcu_free_sheaf_prepare
> 0.12 ± 6% +0.2 0.36 perf-profile.self.cycles-pp.tlb_finish_mmu
> 0.13 ± 2% +0.2 0.36 perf-profile.self.cycles-pp.mas_prev_setup
> 0.13 +0.2 0.37 ± 2% perf-profile.self.cycles-pp.up_write
> 0.15 ± 3% +0.2 0.39 ± 6% perf-profile.self.cycles-pp.kfree
> 0.12 ± 3% +0.2 0.37 ± 2% perf-profile.self.cycles-pp.__get_unmapped_area
> 0.14 +0.2 0.39 perf-profile.self.cycles-pp.unmap_region
> 0.10 ± 10% +0.2 0.35 ± 13% perf-profile.self.cycles-pp.refill_obj_stock
> 0.12 ± 3% +0.3 0.38 perf-profile.self.cycles-pp.downgrade_write
> 0.14 ± 3% +0.3 0.39 ± 2% perf-profile.self.cycles-pp.unmap_vmas
> 0.16 ± 4% +0.3 0.42 ± 3% perf-profile.self.cycles-pp.zap_pmd_range
> 0.16 ± 3% +0.3 0.43 ± 3% perf-profile.self.cycles-pp._raw_spin_lock
> 0.15 ± 2% +0.3 0.43 perf-profile.self.cycles-pp.sized_strscpy
> 0.14 ± 3% +0.3 0.41 perf-profile.self.cycles-pp.cap_mmap_addr
> 0.17 ± 4% +0.3 0.45 perf-profile.self.cycles-pp.mas_update_gap
> 0.16 ± 3% +0.3 0.45 perf-profile.self.cycles-pp.tlb_gather_mmu
> 0.19 +0.3 0.48 perf-profile.self.cycles-pp.free_pgtables
> 0.17 ± 2% +0.3 0.47 perf-profile.self.cycles-pp.up_read
> 0.16 ± 4% +0.3 0.47 perf-profile.self.cycles-pp.perf_event_mmap
> 0.18 ± 5% +0.3 0.50 perf-profile.self.cycles-pp.__vma_start_write
> 0.19 ± 2% +0.4 0.55 perf-profile.self.cycles-pp.static_key_count
> 0.20 ± 2% +0.4 0.56 ± 5% perf-profile.self.cycles-pp.vm_area_alloc
> 0.20 +0.4 0.58 perf-profile.self.cycles-pp.entry_SYSCALL_64_after_hwframe
> 0.20 ± 2% +0.4 0.58 perf-profile.self.cycles-pp.arch_exit_to_user_mode_prepare
> 0.18 ± 3% +0.4 0.58 perf-profile.self.cycles-pp.arch_get_unmapped_area_topdown
> 0.22 +0.4 0.63 ± 2% perf-profile.self.cycles-pp.security_vm_enough_memory_mm
> 0.21 ± 3% +0.4 0.64 perf-profile.self.cycles-pp.entry_SYSCALL_64_safe_stack
> 0.23 +0.5 0.69 perf-profile.self.cycles-pp.mas_leaf_max_gap
> 0.24 ± 6% +0.5 0.74 ± 9% perf-profile.self.cycles-pp.kmem_cache_free
> 0.29 ± 2% +0.5 0.80 perf-profile.self.cycles-pp.do_vmi_align_munmap
> 0.27 +0.5 0.80 ± 2% perf-profile.self.cycles-pp.__vma_start_exclude_readers
> 0.35 ± 3% +0.5 0.87 ± 5% perf-profile.self.cycles-pp.__kfree_rcu_sheaf
> 0.28 +0.5 0.82 perf-profile.self.cycles-pp.perf_event_mmap_event
> 0.32 ± 3% +0.5 0.86 ± 2% perf-profile.self.cycles-pp.zap_pte_range
> 0.33 +0.6 0.89 ± 2% perf-profile.self.cycles-pp.vms_complete_munmap_vmas
> 0.21 ± 11% +0.6 0.78 ± 6% perf-profile.self.cycles-pp.perf_event_mmap_output
> 0.05 ± 91% +0.6 0.62 ± 7% perf-profile.self.cycles-pp.queue_event
> 0.29 ± 3% +0.6 0.87 ± 2% perf-profile.self.cycles-pp.down_write_killable
> 0.61 +0.6 1.24 perf-profile.self.cycles-pp.kvfree_call_rcu
> 0.36 ± 5% +0.7 1.02 ± 11% perf-profile.self.cycles-pp.__memcg_slab_free_hook
> 0.41 ± 2% +0.7 1.14 perf-profile.self.cycles-pp.vms_gather_munmap_vmas
> 0.40 ± 4% +0.8 1.16 ± 4% perf-profile.self.cycles-pp.do_syscall_64
> 0.43 ± 3% +0.8 1.22 ± 8% perf-profile.self.cycles-pp.__memcg_slab_post_alloc_hook
> 0.46 ± 2% +0.8 1.28 ± 2% perf-profile.self.cycles-pp.unmap_page_range
> 0.32 +0.8 1.16 ± 2% perf-profile.self.cycles-pp.__kmem_cache_free_bulk
> 0.56 +1.0 1.55 perf-profile.self.cycles-pp.do_brk_flags
> 0.54 ± 2% +1.0 1.54 perf-profile.self.cycles-pp.mas_find
> 0.47 ± 5% +1.0 1.51 ± 6% perf-profile.self.cycles-pp.brk
> 0.66 ± 2% +1.2 1.82 perf-profile.self.cycles-pp.mas_prev_slot
> 0.41 ± 12% +1.2 1.58 ± 5% perf-profile.self.cycles-pp.perf_iterate_sb
> 0.69 +1.2 1.87 perf-profile.self.cycles-pp.mas_next_slot
> 0.76 +1.3 2.06 perf-profile.self.cycles-pp.mas_wr_store_type
> 0.68 +1.3 2.01 perf-profile.self.cycles-pp.__do_sys_brk
> 0.64 +1.4 2.04 ± 2% perf-profile.self.cycles-pp.__slab_free
> 0.70 +1.4 2.13 perf-profile.self.cycles-pp.memcpy_orig
> 0.95 +1.8 2.70 perf-profile.self.cycles-pp.kmem_cache_alloc_noprof
> 1.10 +2.0 3.09 perf-profile.self.cycles-pp.mas_store_gfp
> 1.13 ± 2% +2.0 3.16 perf-profile.self.cycles-pp.mas_walk
> 0.00 +2.1 2.06 perf-profile.self.cycles-pp.__refill_objects_node
> 1.76 +3.4 5.18 perf-profile.self.cycles-pp.entry_SYSCALL_64
> 1.85 +3.7 5.50 perf-profile.self.cycles-pp.syscall_return_via_sysret
> 2.26 +3.9 6.16 perf-profile.self.cycles-pp.mas_wr_node_store
> 3.73 +7.4 11.09 perf-profile.self.cycles-pp.entry_SYSRETQ_unsafe_stack
>
>
>
>
> Disclaimer:
> Results have been estimated based on internal Intel analysis and are provided
> for informational purposes only. Any difference in system hardware or software
> design or configuration may affect actual performance.
>
>