Re: [Linux-cluster] Re: GFS, what's remaining

From: Antonio Vargas
Date: Wed Sep 14 2005 - 04:46:54 EST


On 9/14/05, Patrick Caulfield <pcaulfie@xxxxxxxxxx> wrote:
> I've just returned from holiday so I'm late to this discussion so let me tell
> you what we do now and why and lets see what's wrong with it.
>
> [snip lots of useful stuff]
>
> While having an FD per lock sounds like a nice unixy idea I don't think it would
> work very well in practice. Applications with hundreds or thousands of locks
> (such as databases) would end up with huge pollfd structs to manage, and it

Or rather use epoll instead of poll/select to avoid scalability issues.

> while it helps the refcounting (currently the nastiest bit of the current
> dlm_device code) removes the possibility of having persistent locks that exist

The refcounting is already implemented for FDs and will be kept
working 100% of the time because if it failed the kernel would not
even boot for most people due to initscripts leaking massive memory at
exit time. Not that GFS will not be maintained, but the
kernel-at-large has more people behind it.

> after the process exits - a handy feature that some people do use, though I
> don't think it's in the currently submitted DLM code. One FD per lock also gives

Perhaps persistent locks could be registered with a daemon by passing
them via unix pipes.

> each lock two handles, the lock ID used internally by the DLM and the FD used
> externally by the application which I think is a little confusing.
>
> I don't think a dlmfs is useful, personally. The features you can export from it
> are either minimal compared to the full DLM functionality (so you have to export
> the rest by some other means anyway) or are going to be so un-filesystemlike as
> to be very awkward to use. Doing lock operations in shell scripts is all very
> cool but how often do you /really/ need to do that?
>
> I'm not saying that what we have is perfect - far from it - but we have thought
> about how this works and what we came up with seems like a good compromise
> between providing full DLM functionality to userspace using unix features. But
> we're very happy to listen to other ideas - and have been doing I hope.
>
> --
>
> patrick
>
>
> -
> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> the body of a message to majordomo@xxxxxxxxxxxxxxx
> More majordomo info at http://vger.kernel.org/majordomo-info.html
> Please read the FAQ at http://www.tux.org/lkml/
>


--
Greetz, Antonio Vargas aka winden of network

http://wind.codepixel.com/

Las cosas no son lo que parecen, excepto cuando parecen lo que si son.
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/