Re: [RFC 1/3] /dev/low_mem_notify

From: Pekka Enberg
Date: Wed Jan 18 2012 - 04:15:48 EST

Next message: Viresh Kumar: "[PATCH V2 02/12] dmaengine: Add flow controller information to dma_slave_config"
Previous message: Viresh Kumar: "[PATCH V2 08/12] dmaengine/dw_dmac: Unmap all memory buffers after completion of slave transfers"
In reply to: leonid.moiseichuk: "RE: [RFC 1/3] /dev/low_mem_notify"
Next in thread: leonid.moiseichuk: "RE: [RFC 1/3] /dev/low_mem_notify"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

On Wed, Jan 18, 2012 at 11:06 AM, <leonid.moiseichuk@xxxxxxxxx> wrote:
> Would be possible to not use percents for thesholds? Accounting in pages even
> not so difficult to user-space.

How does that work with memory hotplug?

On Wed, Jan 18, 2012 at 11:06 AM, <leonid.moiseichuk@xxxxxxxxx> wrote:
> Also, looking on vmnotify_match I understand that events propagated to
> user-space only in case threshold trigger change state from 0 to 1 but not
> back, 1-> 0 is very useful event as well.
>
> Would be possible to use for threshold pointed value(s) e.g. according to
> enum zone_state_item, because kinds of memory to track could be different?
> E.g. to tracking paging activity NR_ACTIVE_ANON and NR_ACTIVE_FILE could be
> interesting, not only free.

I don't think there's anything in the ABI that would prevent that.

>> +struct vmnotify_event {
>> + /* Size of the struct for ABI extensibility. */
>> + __u32 size;
>> +
>> + __u64 nr_avail_pages;
>> +
>> + __u64 nr_swap_pages;
>> +
>> + __u64 nr_free_pages;
>> +};
>
> Two fields here most likely session-constant, (nr_avail_pages and
> nr_swap_pages), seems not much sense to report them in every event. If we
> have memory/swap hotplug user-space can use sysinfo() call.

I actually changed the ABI to look like this:

struct vmnotify_event {
/*
* Size of the struct for ABI extensibility.
*/
__u32 size;

__u64 attrs;

__u64 attr_values[];
};

So userspace can decide which fields to include in notifications.

On Wed, Jan 18, 2012 at 11:06 AM, <leonid.moiseichuk@xxxxxxxxx> wrote:
>> +static void vmnotify_sample(struct vmnotify_watch *watch) {
> ...
>> + si_meminfo(&si);
>> + event.nr_avail_pages = si.totalram;
>> +
>> +#ifdef CONFIG_SWAP
>> + si_swapinfo(&si);
>> + event.nr_swap_pages = si.totalswap;
>> +#endif
>> +
>
> Why not to use global_page_state() directly? si_meminfo() and especial
> si_swapinfo are quite expensive call.

Sure, we can do that. Feel free to send a patch :-).

>> +static void vmnotify_start_timer(struct vmnotify_watch *watch) {
>> + u64 sample_period = watch->config.sample_period_ns;
>> +
>> + hrtimer_init(&watch->timer, CLOCK_MONOTONIC,
>> HRTIMER_MODE_REL);
>> + watch->timer.function = vmnotify_timer_fn;
>> +
>> + hrtimer_start(&watch->timer, ns_to_ktime(sample_period),
>> +HRTIMER_MODE_REL_PINNED); }
>
> Do I understand correct you allocate timer for every user-space client and
> propagate events every pointed interval? What will happened with system if
> we have a timer but need to turn CPU off? The timer must not be a reason to
> wakeup if user-space is sleeping.

No idea what happens. The sampling code is just a proof of concept thing and I
expect it to be buggy as hell. :-)

Pekka
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/

Next message: Viresh Kumar: "[PATCH V2 02/12] dmaengine: Add flow controller information to dma_slave_config"
Previous message: Viresh Kumar: "[PATCH V2 08/12] dmaengine/dw_dmac: Unmap all memory buffers after completion of slave transfers"
In reply to: leonid.moiseichuk: "RE: [RFC 1/3] /dev/low_mem_notify"
Next in thread: leonid.moiseichuk: "RE: [RFC 1/3] /dev/low_mem_notify"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]