Re: [PATCH v3 RESEND] zram: auto add new devices on demand

From: Timofey Titovets
Date: Tue Jul 29 2014 - 17:13:16 EST


Thanks for reply Minchan Kim.
2014-07-29 6:00 GMT+03:00 Minchan Kim <minchan@xxxxxxxxxx>:
> Hello Timofey,
>
> Sorry for late response and thanks for suggesting new feature.
>
> First of all, I'd like to know your usecase.
>
> I don't mean I am against this feature and I tend to agree it would
> be good if we can make new device dynamically but until now, I don't
> hear any requirement like that. So we need compelling usecase to
> justify maintainance overhead.

Use Case:
I step back from loop device, and try formulate a "purely".
Just as example, after born compcache idea for using zram as swap
device on low memory systems, different developers try to create
"tool" for setup zram devices as swap. On gentoo this is udev rule
with bash scripting, on debian systems compcache package (sh script)
on arch linux, several tools, of the most popular zramswap from aur
(awk script without author(?)).

I'm not much different from them. I see this zoo of solutions, and i
try to unify all swaps case in one utility. (systemd in this case is
only init system, because nobody requested it to porting on classic
init or upstart).
as child of my thoughts born systemd-swap (on github) for setup zram
swaps, partition swap or dynamic swap file.

After some time, people request to adding new setting "additional zram
devices", for creating static not used device for their own needs
Example: zramdev, tool setup zram device as cache, for file operations
img dd to zramX, working with data in memory, after unmount and save
data with dd from zram to img.
Example 2: zram device as storage for /tmp.

But after several attempts, i just create modprobe config for creating
maximum of possible free zram devices with module loading. Agree, it's
not pretty.
Then i understand what for "comfort" i must write kernel side fix (I
was inspired loop device, what creating on demand with losetup tool).

And then i try write this patch for kernel and create zramctl, as
start on C++, after i rework it on clear C for util-linux.

linux is a big cake, where everybody cook their own piece and in
result we have very powerful and diverse system.
I love linux and i just do attempt to make it better.

This is my story =)

> On Mon, Jul 21, 2014 at 01:46:14PM +0300, Timofey Titovets wrote:
>> From: Timofey Titovets <nefelim4ag@xxxxxxxxx>
>>
>> This add supporting of auto allocate new zram devices on demand,
>> like loop devices
>>
>> This working by following rules:
>> - Pre create zram devices specified by num_device
>> - if all device already in use -> add new free device
>>
>> From v1 -> v2:
>> Delete useless variable 'ret', added documentation for explain new
>> zram behavior
>>
>> From v2 -> v3
>> Delete logic to destroy not used devices, for avoid concurrency issues
>> Code refactoring for made patch small and clean as possible
>> Patch can pass the test:
>>
>> #!/bin/sh
>> modprobe zram
>> while true; do
>> echo 10485760 > /sys/block/zram0/disksize&
>> echo 1 > /sys/block/zram0/reset&
>> done
>>
>> Can be pulled from:
>> https://github.com/Nefelim4ag/linux.git
>>
>> Tested on 3.15.5-2-ARCH, can be applied on any kernel version
>> after this patch 'zram: add LZ4 compression support' - https://git.kernel.org/cgit/linux/kernel/git/next/linux-next.git/commit/?id=6e76668e415adf799839f0ab205142ad7002d260
>>
>> ---
>>
>> diff --git a/Documentation/blockdev/zram.txt
>> b/Documentation/blockdev/zram.txt
>> index 0595c3f..a090ac7 100644
>> --- a/Documentation/blockdev/zram.txt
>> +++ b/Documentation/blockdev/zram.txt
>> @@ -21,6 +21,9 @@ Following shows a typical sequence of steps for
>> using zram.
>> This creates 4 devices: /dev/zram{0,1,2,3}
>> (num_devices parameter is optional. Default: 1)
>>
>> + If all device in use kernel will create new zram device
>> + (between num_devices and 31)
>> +
>> 2) Set max number of compression streams
>> Compression backend may use up to max_comp_streams compression streams,
>> thus allowing up to max_comp_streams concurrent compression operations.
>> diff --git a/drivers/block/zram/zram_drv.c b/drivers/block/zram/zram_drv.c
>> index 089e72c..cc78779 100644
>> --- a/drivers/block/zram/zram_drv.c
>> +++ b/drivers/block/zram/zram_drv.c
>> @@ -43,6 +43,8 @@ static const char *default_compressor = "lzo";
>> /* Module params (documentation at end) */
>> static unsigned int num_devices = 1;
>>
>> +static inline int zram_add_dev(void);
>> +
>> #define ZRAM_ATTR_RO(name) \
>> static ssize_t zram_attr_##name##_show(struct device *d, \
>> struct device_attribute *attr, char *b) \
>> @@ -168,6 +170,7 @@ static ssize_t comp_algorithm_store(struct device *dev,
>> struct device_attribute *attr, const char *buf, size_t len)
>> {
>> struct zram *zram = dev_to_zram(dev);
>> +
>
> unnecessary change
Linux coding style requires using checkpatch.pl before send it to mail.
>> down_write(&zram->init_lock);
>> if (init_done(zram)) {
>> up_write(&zram->init_lock);
>> @@ -239,6 +242,7 @@ static struct zram_meta *zram_meta_alloc(u64 disksize)
>> {
>> size_t num_pages;
>> struct zram_meta *meta = kmalloc(sizeof(*meta), GFP_KERNEL);
>> +
>
> Ditto
>
>> if (!meta)
>> goto out;
>>
>> @@ -374,6 +378,7 @@ static int zram_bvec_read(struct zram *zram,
>> struct bio_vec *bvec,
>> struct page *page;
>> unsigned char *user_mem, *uncmem = NULL;
>> struct zram_meta *meta = zram->meta;
>> +
>
> Ditto
>
>> page = bvec->bv_page;
>>
>> read_lock(&meta->tb_lock);
>> @@ -607,6 +612,7 @@ static void zram_reset_device(struct zram *zram,
>> bool reset_capacity)
>> /* Free all pages that are still in this zram device */
>> for (index = 0; index < zram->disksize >> PAGE_SHIFT; index++) {
>> unsigned long handle = meta->table[index].handle;
>> +
>
> Ditto
>
>> if (!handle)
>> continue;
>>
>> @@ -667,6 +673,7 @@ static ssize_t disksize_store(struct device *dev,
>> zram->disksize = disksize;
>> set_capacity(zram->disk, zram->disksize >> SECTOR_SHIFT);
>> revalidate_disk(zram->disk);
>> + zram_add_dev();
>
> Why do you add new device unconditionally?
Because we haven't userspace tool to call something, to add new
device. If we try to setup new device.

> Maybe we need new konb on sysfs or ioctl for adding new device?
> Any thought, guys?
I do not want to clutter up the ioctl. =[
sysfs looks more attractive, but how can i do it prettily?

>
>
>> up_write(&zram->init_lock);
>> return len;
>>
>> @@ -954,6 +961,34 @@ static void destroy_device(struct zram *zram)
>> blk_cleanup_queue(zram->queue);
>> }
>>
>> +/*
>> + * Automatically add empty zram device,
>> + * if all created devices already initialized
>> + */
>> +static inline int zram_add_dev(void)
>> +{
>> + int dev_id;
>> +
>> + for (dev_id = 0; dev_id < num_devices; dev_id++) {
>> + if (!zram_devices[dev_id].disksize)
>> + return 0;
>> + }
>> +
>> + if (max_num_devices <= num_devices) {
>> + pr_warn("Can't add zram%u, max device number reached\n", num_devices);
>> + return 1;
>> + }
>> +
>> + if (create_device(&zram_devices[num_devices], num_devices)) {
>> + destroy_device(&zram_devices[num_devices]);
>> + return 1;
>> + }
>> + pr_info("Created zram%u device\n", num_devices);
>> + num_devices++;
>
> Who protects num_devices race?
I am can't understand what you mean, but i create another test:
My changes can't pass this test:
#!/usr/bin/bash
while :; do
echo 1024M | tee /sys/block/zram*/disksize &
echo 1 | tee /sys/block/zram*/reset &
ls /sys/block/ | grep zram
done
Working some time, after OOM (in my case)
And i think what if we using sysfs or ioctl entry (for add/remove), he
also can't pass stress test on several threads with requests add
or/and delete device in same time =\

>> +
>> + return 0;
>> +}
>
> There is only adding function. Where is removing function?

If leave my simple hack, remove function causes race and kernel panic,
and i delete this function.


> Sorry, I am on vacation tomorrow so pz, understand my late response.
>
>> +
>> static int __init zram_init(void)
>> {
>> int ret, dev_id;
>> @@ -972,13 +1007,14 @@ static int __init zram_init(void)
>> goto out;
>> }
>>
>> - /* Allocate the device array and initialize each one */
>> - zram_devices = kzalloc(num_devices * sizeof(struct zram), GFP_KERNEL);
>> + /* Allocate the device array */
>> + zram_devices = kcalloc(max_num_devices, sizeof(struct zram), GFP_KERNEL);
>> if (!zram_devices) {
>> ret = -ENOMEM;
>> goto unregister;
>> }
>>
>> + /* Initialise zram{0..num_devices} */
>> for (dev_id = 0; dev_id < num_devices; dev_id++) {
>> ret = create_device(&zram_devices[dev_id], dev_id);
>> if (ret)
>> @@ -1025,7 +1061,7 @@ module_init(zram_init);
>> module_exit(zram_exit);
>>
>> module_param(num_devices, uint, 0);
>> -MODULE_PARM_DESC(num_devices, "Number of zram devices");
>> +MODULE_PARM_DESC(num_devices, "Number of pre created zram devices");
>>
>> MODULE_LICENSE("Dual BSD/GPL");
>> MODULE_AUTHOR("Nitin Gupta <ngupta@xxxxxxxxxx>");
>> --
>> To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
>> the body of a message to majordomo@xxxxxxxxxxxxxxx
>> More majordomo info at http://vger.kernel.org/majordomo-info.html
>> Please read the FAQ at http://www.tux.org/lkml/
>
> --
> Kind regards,
> Minchan Kim



--
Best regards,
Timofey.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/