Re: [PATCH 39/41] afs: Overhaul invalidation handling to better support RO volumes
From: Jeffrey E Altman
Date: Thu Nov 09 2023 - 14:00:56 EST
On 11/9/2023 10:40 AM, David Howells wrote:
This allows better handling of RO (and Backup) volumes. Since these are
snapshot of a RW volume that are updated atomically simultantanously across
all servers that host them, they only require a single callback promise for
the entire volume.
The atomic visibility of a volume cloning operation is limited to
individual volume locations.
It is untrue that a "vos release" updates all RO volume locations
simultaneously. If that were
to occur then there would be the potential for all volume locations to
be offline simultaneously
causing an outage. Instead, the traditional "volume release" process
updates the RO volume
site co-located with the RW volume site and makes that version visible
to cache managers.
Then 50% of the remaining sites are updated in parallel followed by the
remaining sites.
Each time a site is updated to the new version the volume location entry
is updated to flag
the site as VLSF_NEWREPSITE. However, changes to the volume location
site flags might
not be known to the cache manager. Prior to completion of the "volume
release" process
it is possible for a cache manager to failover from a newer snapshot to
an older one.
The currently upstream code assumes that RO volumes
operate in the same manner as RW volumes, and that each file has its own
individual callback - which means that it does a status fetch for *every*
file in a RO volume, whether or not the volume got "released" (volume
callback breaks can occur for other reasons too, such as the volumeserver
taking ownership of a volume from a fileserver).
Knowledge that the volume snapshot has not changed can be used to avoid
individual
FetchStatus queries when accessing vnodes anonymously. In that case
the vnode's
FetchStatus.AnonymousAccess can be trusted to be unchanged. However, for
authenticated
access the individual FetchStatus or InlineBulkStat queries must still
be issued because
the FetchStatus.CallerAccess rights are determined not only by the
access control list
contents but the caller identity's group memberships which might have
changed. This
distinction only matters when evaluating authorization decisions.
Jeffrey Altman
Attachment:
smime.p7s
Description: S/MIME Cryptographic Signature