Re: [PATCH 0/5] hugetlbfs: Disable PMD sharing for large systems

From: Dave Chinner
Date: Thu Sep 12 2019 - 21:51:26 EST

Next message: Sean Christopherson: "Re: [PATCH] KVM: x86: Handle unexpected MMIO accesses using master abort semantics"
Previous message: Kees Cook: "Re: problem starting /sbin/init (32-bit 5.3-rc8)"
In reply to: Waiman Long: "Re: [PATCH 5/5] hugetlbfs: Limit wait time when trying to share huge PMD"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Wed, Sep 11, 2019 at 04:05:32PM +0100, Waiman Long wrote:
> A customer with large SMP systems (up to 16 sockets) with application
> that uses large amount of static hugepages (~500-1500GB) are experiencing
> random multisecond delays. These delays was caused by the long time it
> took to scan the VMA interval tree with mmap_sem held.
>
> To fix this problem while perserving existing behavior as much as
> possible, we need to allow timeout in down_write() and disabling PMD
> sharing when it is taking too long to do so. Since a transaction can
> involving touching multiple huge pages, timing out for each of the huge
> page interactions does not completely solve the problem. So a threshold
> is set to completely disable PMD sharing if too many timeouts happen.
>
> The first 4 patches of this 5-patch series adds a new
> down_write_timedlock() API which accepts a timeout argument and return
> true is locking is successful or false otherwise. It works more or less
> than a down_write_trylock() but the calling thread may sleep.

Just on general principle, this is a non-starter. If a lock is being
held too long, then whatever the lock is protecting needs fixing.
Adding timeouts to locks and sysctls to tune them is not a viable
solution to address latencies caused by algorithm scalability
issues.

Cheers,

Dave.
--
Dave Chinner
david@xxxxxxxxxxxxx

Next message: Sean Christopherson: "Re: [PATCH] KVM: x86: Handle unexpected MMIO accesses using master abort semantics"
Previous message: Kees Cook: "Re: problem starting /sbin/init (32-bit 5.3-rc8)"
In reply to: Waiman Long: "Re: [PATCH 5/5] hugetlbfs: Limit wait time when trying to share huge PMD"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]