Re: [PATCH 1/1] fanotify: handling of errors when reading events
From: Jan Kara
Date: Mon May 19 2014 - 11:53:15 EST
On Sun 18-05-14 10:37:16, Heinrich Schuchardt wrote:
> When reading from the fanotify file descriptor read() calls
> fanotify_read().
> fanotify_read will copies events available into the user buffer
> until either the fanotify event queue is empty or the user buffer
> is full.
>
> If an error occurs for the first event copied, fanotify_read
> returns an error code.
> If an error code occurs for the second or any further event copied
> fanotify_read returns the concatenated length of all events
> successfully copied.
>
> In the second case the userland has no chance to determine that an
> error has occured. This error has been described in the fanotify.7
> manpage.
>
> Jan Kara suggested in
> https://lkml.org/lkml/2014/4/24/182
> to change the logic to a peek-check-pop sequence.
>
> This logic is implementd with the appended patch:
>
> If an event is successfully copied to user space, it is removed from the queue.
> If an error occurs for the first event read, the event is removed form the
> queue and and the error code is returned.
> If an error occurs for the second or a later event read, the event is left on
> the queue and the length of the prior events is returned to the user.
> A later call to fanotify_read will see this event as first
> event and will provide the error to the user.
Thanks for the patch! Have you tried testing this patch with lockdep
enabled? I'm slightly nervous about running copy_event_to_user() with
notification_mutex held - that does a lot of things including opening a
file so holding notification_mutex while doing that might have interesting
locking implications...
Honza
>
> Signed-off-by: Heinrich Schuchardt <xypron.glpk@xxxxxx>
> ---
> fs/notify/fanotify/fanotify_user.c | 56 +++++++++++++++++++++++++++++++++++---
> 1 file changed, 52 insertions(+), 4 deletions(-)
>
> diff --git a/fs/notify/fanotify/fanotify_user.c b/fs/notify/fanotify/fanotify_user.c
> index 9163a6e..da8e235 100644
> --- a/fs/notify/fanotify/fanotify_user.c
> +++ b/fs/notify/fanotify/fanotify_user.c
> @@ -38,7 +38,7 @@ struct kmem_cache *fanotify_perm_event_cachep __read_mostly;
> *
> * Called with the group->notification_mutex held.
> */
> -static struct fsnotify_event *get_one_event(struct fsnotify_group *group,
> +static struct fsnotify_event *peek_one_event(struct fsnotify_group *group,
> size_t count)
> {
> BUG_ON(!mutex_is_locked(&group->notification_mutex));
> @@ -51,9 +51,28 @@ static struct fsnotify_event *get_one_event(struct fsnotify_group *group,
> if (FAN_EVENT_METADATA_LEN > count)
> return ERR_PTR(-EINVAL);
>
> + return fsnotify_peek_notify_event(group);
> +}
> +
> +/*
> + * Remove fsnotify notification event from queue.
> + * Returns -EINVAL if queue is empty.
> + *
> + * Called with the group->notification_mutex held.
> + */
> +static int pop_one_event(struct fsnotify_group *group)
> +{
> + BUG_ON(!mutex_is_locked(&group->notification_mutex));
> +
> + pr_debug("%s: group=%p\n", __func__, group);
> +
> + if (fsnotify_notify_queue_is_empty(group))
> + return -EINVAL;
> +
> /* held the notification_mutex the whole time, so this is the
> * same event we peeked above */
> - return fsnotify_remove_notify_event(group);
> + fsnotify_remove_notify_event(group);
> + return 0;
> }
>
> static int create_fd(struct fsnotify_group *group,
> @@ -257,15 +276,24 @@ static ssize_t fanotify_read(struct file *file, char __user *buf,
> prepare_to_wait(&group->notification_waitq, &wait, TASK_INTERRUPTIBLE);
>
> mutex_lock(&group->notification_mutex);
> - kevent = get_one_event(group, count);
> - mutex_unlock(&group->notification_mutex);
> + kevent = peek_one_event(group, count);
>
> + /*
> + * There is not enough space left in the user buffer.
> + * The event will be left on the queue to be fetched by a later
> + * call to fanotify_read.
> + */
> if (IS_ERR(kevent)) {
> + mutex_unlock(&group->notification_mutex);
> ret = PTR_ERR(kevent);
> break;
> }
>
> + /*
> + * The queue is empty.
> + */
> if (!kevent) {
> + mutex_unlock(&group->notification_mutex);
> ret = -EAGAIN;
> if (file->f_flags & O_NONBLOCK)
> break;
> @@ -280,7 +308,27 @@ static ssize_t fanotify_read(struct file *file, char __user *buf,
> continue;
> }
>
> + /*
> + * An event was successfully peeked from the queue.
> + * Copy it to user space.
> + */
> ret = copy_event_to_user(group, kevent, buf);
> +
> + /*
> + * If an event was successfully copied to user space, it can be
> + * removed from the queue.
> + * If an error occured for the first event read, the error code
> + * can be returned. The event has to be removed from the queue.
> + * If an error occured for the second or a later event read,
> + * the event will be left on the queue and the length of the
> + * prior events will be returned to the user.
> + * A later call to fanotify_read will see this event as first
> + * event and will provide the error to the user.
> + */
> + if (ret >= 0 || start == buf)
> + pop_one_event(group);
> + mutex_unlock(&group->notification_mutex);
> +
> /*
> * Permission events get queued to wait for response. Other
> * events can be destroyed now.
> --
> 2.0.0.rc0
>
--
Jan Kara <jack@xxxxxxx>
SUSE Labs, CR
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/