Re: [osd-dev] [PATCH 7/9] exofs: mkexofs

From: Benny Halevy
Date: Tue Jan 13 2009 - 08:32:54 EST


On Jan. 13, 2009, 15:24 +0200, Jeff Garzik <jeff@xxxxxxxxxx> wrote:
> Benny Halevy wrote:
>> IMO the main advantage of moving block allocation down to the OSD target
>> is more apparent with distributed file systems a-la pNFS over objects
>> where paralleling that task is a key for scalable performance.
>>
>> The thing is that the target needs to implement its own mapping from
>> object logical offsets into disk blocks and this is usually done
>> using some kind of a (possibly trimmed down) local file system.
>> Therefore the I/O performance of a single OSD is likely to be similar
>> to a single file server's.
>
> Well, modern SATA devices are already mini-filesystems internally, when
> you consider logical block remapping etc.
>
> And the claim by drive research guys at the filesystem/storage summit
> was that OSD offered the potential to better optimize storage based on
> access/usage patterns.
>
> (of course, whether or not reality bears out this guess is another question)

That's true for multi-user access where knowing the context for each I/O
request - i.e. the object that holds it provides a crucial hint for
read-ahead and write allocation, where for a dumb device that doesn't
know anything about the filesystem's internals, it's much harder to
associate different blocks with their respective containers, or "streams"
(in case the container is typically accessed in a sequential pattern).

>
>
>> I can understand representing a single object as a block device (although I
>> think that using a file for that should be good enough and easier) but
>> why representing the whole OSD as a block device? The OSD holds partitions
>> and objects each with attributes and OSD security related support. Hence
>> representing that in a namespace using a filesystem seems straight forward.
>
> I am actually considering writing a simple "osdblk" driver, that would
> represent a single object as a block device.
>
> This would NOT replace exofs or other OSD filesystems, but it would be
> nice to have, and it will give me more experience with OSDs.

That's awesome!
It be really interesting to benchmark one against the other.

Benny

>
> Jeff
>
>
> _______________________________________________
> osd-dev mailing list
> osd-dev@xxxxxxxxxxxx
> http://mailman.open-osd.org/mailman/listinfo/osd-dev

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/