Re: [PATCH 1b/7] dlm: core locking

From: David Teigland
Date: Tue Apr 26 2005 - 22:00:16 EST


On Tue, Apr 26, 2005 at 10:40:24AM -0700, Steven Dake wrote:
> Hate to admit ignorance, but I'm not really sure what SCTP does.. I
> guess point to point communication like tcp but with some other kind of
> characteristics.. I wanted to have some idea of how locking messages
> are related to the current membership. I think I understand the system
> from your descriptions and reading the code. One scenario I could see
> happeing is that there are 2 processors A, B.
>
> B drops out of membership
> A sends lock to lock master B (but A doens't know B has dropped out of
> membership yet)
> B gets lock request, but has dropped out of membership or failed in some
> way
>
> In this case the order of lock messages with the membership changes is
> important.

I think this might help clarify: no membership change is applied to the
lockspace on any nodes until the lockspace has first been suspended on
all. Suspending means no locking activity is processed. The lockspace on
all nodes is then told the new membership and does recovery. Locking is
then resumed.

If some lock message from a failed node is somehow still in the network
and arrives at one of the lockspace members now running again, it will
simply be ignored when they see it's from a non-member node.


> This is the essential race that describes almost every issue with
> distributed systems... virtual synchrony makes this scenario impossible
> by ensuring that messages are ordered in relationship to membership
> changes.

Yes, but the messages discussed in that context are a part of group
multicasts. Specifically, one message from a particular processor needs
to be delivered on all other processors, and all processors must agree on
the order of that group message with respect to membership changes. To do
that you do indeed need to employ VS algorithms.

I know you're more familiar with those details than I am. What I keep
trying to explain is that the dlm is in a different, simpler category.
The dlm's point-to-point messages (think client-server on a per lock
basis) just don't require the same rigorous approach used for _group_
communication.


> Do you intend to eventually move the point to point communication into
> userspace, or keep it within kernel?

In the kernel, there's nothing to it.


> I'd like to understand if there is a general need for cluster
> communication as a kernel service, or the intent is for all
> communication to be done in userspace...

I see no need for group communication in the kernel.


> Can I ask a performance question.. How many locks per second can be
> acquired and then released with a system of 2-3 processors?

We haven't done any measurements.


> In the case that the processor requesting the lock is on the lock server
> processor,

When a lock is requested on the master node, there's no communication, no
messages, it's all local processing. No one else in the cluster even
knows about the lock.


> and in the case that the processor requesting the lock is not
> on the lock server processor... (processor in this case is the system
> that processes the lock operations). Assuming all locks are
> uncontended...

OK, we'll try to do some measurements at some point.

Thanks,
Dave

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/