Re: [PATCH V7 01/16] rv: Add Runtime Verification (RV) interface

From: Steven Rostedt
Date: Tue Jul 26 2022 - 16:59:15 EST


On Tue, 26 Jul 2022 22:00:55 +0200
Daniel Bristot de Oliveira <bristot@xxxxxxxxxx> wrote:

> On 7/26/22 1
> >> +static void disable_all_monitors(void)
> >> +{
> >> + struct rv_monitor_def *mdef;
> >> + int enabled = 0;
> >> +
> >> + mutex_lock(&rv_interface_lock);
> >> +
> >> + list_for_each_entry(mdef, &rv_monitors_list, list)
> >> + enabled += __rv_disable_monitor(mdef);
> >> +
> >> + if (enabled) {
> >> + /*
> >> + * Wait for the execution of all current events.

"Wait for the execution of all events to finish."

> >> + */
> >
> > And do we really need to hold the locks when calling the synchronization?
> >
> > I think we only care if the caller sees a synchronized view of changes, the
> > locks will help synchronize the internal code.
>
> I think we need. For instance:
>
> CPU 0: CPU 1:
> disable_monitor(x) enable_monitor(x)
> lock()
> disable()
> unlock() lock()
> wait without the lock enable()
>
> <old preempted events with inconsistent data>
>
> so by holding the lock we avoid this case...

So you are synchronizing against it being re-enabled and not just to have a
consistent state by the caller?

If that's the case, then OK.


>
> no?
>
> >> + tracepoint_synchronize_unregister();
> >> + }
> >> +
> >> + mutex_unlock(&rv_interface_lock);
> >> +}
> >> +
> >
> > [..]
> >
> >> +/**
> >> + * rv_monitoring_on - checks if monitoring is on
> >> + *
> >> + * Returns 1 if on, 0 otherwise.
> >> + */
> >> +bool rv_monitoring_on(void)
> >> +{
> >> + /* monitoring_on */
> >
> > You need a better comment than that.
> >
> > What is this synchronizing?
>
>
> like:
> /* Ensures that concurrent monitors reads a consistent data */
>
> But I wonder if it is needed, given that we now wait for all the events
> to be processed before switching it back again, e.g., having the
> monitoring_on as old data is not a problem.
>
> >> + smp_rmb();
> >> + return READ_ONCE(monitoring_on);
> >> +}
> >> +
> >> +/*
> >> + * monitoring_on general switcher.
> >> + */
> >> +static ssize_t monitoring_on_read_data(struct file *filp, char __user *user_buf,
> >> + size_t count, loff_t *ppos)
> >> +{
> >> + const char *buff;
> >> +
> >> + buff = rv_monitoring_on() ? "1\n" : "0\n";
> >> +
> >> + return simple_read_from_buffer(user_buf, count, ppos, buff, strlen(buff) + 1);
> >> +}
> >> +
> >> +static void turn_monitoring_off(void)
> >> +{
> >> + WRITE_ONCE(monitoring_on, false);
> >> + /* monitoring_on */
> >
> > Same here.
> >
> >> + smp_wmb();
> >> +}
> >> +
> >> +static void reset_all_monitors(void)
> >> +{
> >> + struct rv_monitor_def *mdef;
> >> +
> >> + list_for_each_entry(mdef, &rv_monitors_list, list) {
> >> + if (mdef->monitor->enabled)
> >> + mdef->monitor->reset();
> >> + }
> >> +}
> >> +
> >> +static void turn_monitoring_on(void)
> >> +{
> >> + reset_all_monitors();
> >
> > Why does this reset all monitors but turn_monitoring_off() does not?
>
> When we turn monitoring off, the monitors will stop monitoring while yet
> in sync with the events generated by the system, i.e., all the events after
> the start were processed.
>
> But if we disabled the monitor, and some events get ignored, the monitors
> will be out of sync with the system. Thus, resetting the monitor to
> for a synchronization between the monitors and the system is needed, before
> enabling monitoring back.
>
> > You should keep that out.
>
> did not get :-(

I don't like the way the _on() and _off() are different.

Have the _on() just turn in on *without* the reset.

If you need the reset, then make a separate function called:

static void turn_monitoring_on_with_reset(void)
{
reset_all_monitors();
turn_monitoring_on();
}


>
> >> + WRITE_ONCE(monitoring_on, true);
> >> + /* monitoring_on */
> >> + smp_wmb();
> >> +}
> >> +
> >> +static ssize_t monitoring_on_write_data(struct file *filp, const char __user *user_buf,
> >> + size_t count, loff_t *ppos)
> >> +{
> >> + int retval;
> >> + bool val;
> >> +
> >> + retval = kstrtobool_from_user(user_buf, count, &val);
> >> + if (retval)
> >> + return retval;
> >> +
> >> + mutex_lock(&rv_interface_lock);
> >> +
> >> + if (val)
> >> + turn_monitoring_on();
> >> + else
> >> + turn_monitoring_off();
> >> +
> >> + /*
> >> + * Wait for the execution of all current events to notice
> >> + * the change before returning to user-space.
> >> + */
> >> + tracepoint_synchronize_unregister();
> >
> > Again, I think we want this outside the lock.
>
> I fear a problem similar to the one I said before could happen. For
> instance, turning monitoring off and on could result on monitors with
> old data being back enable with "reseted" data...
>
> >> +
> >> + mutex_unlock(&rv_interface_lock);
> >> +
> >> + return count;
> >> +}
> >> +
> >> +static const struct file_operations monitoring_on_fops = {
> >> + .open = simple_open,
> >> + .llseek = no_llseek,
> >> + .write = monitoring_on_write_data,
> >> + .read = monitoring_on_read_data,
> >> +};
> >> +
> >> +static void destroy_monitor_dir(struct rv_monitor_def *mdef)
> >> +{
> >> + rv_remove(mdef->root_d);
> >> +}
> >> +
> >> +/**
> >> + * rv_register_monitor - register a rv monitor.
> >> + * @monitor: The rv_monitor to be registered.
> >> + *
> >> + * Returns 0 if successful, error otherwise.
> >> + */
> >> +int rv_register_monitor(struct rv_monitor *monitor)
> >> +{
> >> + struct rv_monitor_def *r;
> >> + int retval = 0;
> >> +
> >> + if (strlen(monitor->name) >= MAX_RV_MONITOR_NAME_SIZE) {
> >> + pr_info("Monitor %s has a name longer than %d\n", monitor->name,
> >> + MAX_RV_MONITOR_NAME_SIZE);
> >> + return -1;
> >> + }
> >> +
> >> + mutex_lock(&rv_interface_lock);
> >> +
> >> + list_for_each_entry(r, &rv_monitors_list, list) {
> >> + if (strcmp(monitor->name, r->monitor->name) == 0) {
> >> + pr_info("Monitor %s is already registered\n", monitor->name);
> >> + retval = -1;
> >> + goto out_unlock;
> >> + }
> >> + }
> >> +
> >> + r = kzalloc(sizeof(struct rv_monitor_def), GFP_KERNEL);
> >> + if (!r) {
> >> + retval = -ENOMEM;
> >> + goto out_unlock;
> >> + }
> >> +
> >> + r->monitor = monitor;
> >> +
> >> + retval = create_monitor_dir(r);
> >> + if (retval) {
> >> + kfree(r);
> >> + goto out_unlock;
> >> + }
> >> +
> >> + list_add_tail(&r->list, &rv_monitors_list);
> >> +
> >> +out_unlock:
> >> + mutex_unlock(&rv_interface_lock);
> >> + return retval;
> >> +}
> >> +
> >> +/**
> >> + * rv_unregister_monitor - unregister a rv monitor.
> >> + * @monitor: The rv_monitor to be unregistered.
> >> + *
> >> + * Returns 0 if successful, error otherwise.
> >> + */
> >> +int rv_unregister_monitor(struct rv_monitor *monitor)
> >> +{
> >> + struct rv_monitor_def *ptr, *next;
> >> +
> >> + mutex_lock(&rv_interface_lock);
> >> +
> >> + list_for_each_entry_safe(ptr, next, &rv_monitors_list, list) {
> >> + if (strcmp(monitor->name, ptr->monitor->name) == 0) {
> >> + rv_disable_monitor(ptr);
> >> + list_del(&ptr->list);
> >> + destroy_monitor_dir(ptr);
> >> + }
> >> + }
> >> +
> >> + mutex_unlock(&rv_interface_lock);
> >> + return 0;
> >> +}
> >> +
> >> +int __init rv_init_interface(void)
> >> +{
> >> + struct dentry *tmp;
> >> +
> >> + rv_root.root_dir = rv_create_dir("rv", NULL);
> >> + if (!rv_root.root_dir)
> >> + goto out_err;
> >> +
> >> + rv_root.monitors_dir = rv_create_dir("monitors", rv_root.root_dir);
> >> + if (!rv_root.monitors_dir)
> >> + goto out_err;
> >> +
> >> + tmp = rv_create_file("available_monitors", RV_MODE_READ, rv_root.root_dir, NULL,
> >> + &available_monitors_ops);
> >> + if (!tmp)
> >> + goto out_err;
> >> +
> >> + tmp = rv_create_file("enabled_monitors", RV_MODE_WRITE, rv_root.root_dir, NULL,
> >> + &enabled_monitors_ops);
> >> + if (!tmp)
> >> + goto out_err;
> >> +
> >> + tmp = rv_create_file("monitoring_on", RV_MODE_WRITE, rv_root.root_dir, NULL,
> >> + &monitoring_on_fops);
> >> + if (!tmp)
> >> + goto out_err;
> >> +
> >
> > This should call "turn_monitoriing_on()" instead of open coding it,
> > especially since it includes a memory barrier (another reason to not
> > reset the monitors in that function.
>
> Here we do not need to reset monitors... but monitors would not be enabled anyway as

EXACTLY!!! Which is why the turn_monitoring_on() should only do that. Turn
it on. No reset involved. You are convoluting implementation with the name.

The reason you open coded turning on the monitor here, is because your
turn_monitoring_on() function is broken by design.

-- Steve


> they are not loaded yet.... and so I wonder if we need the barrier here...
>

> >
> >
> >> + WRITE_ONCE(monitoring_on, true);
> >> + /* monitoring_on */
> >> + smp_wmb();
> >> +
> >> + return 0;
> >> +
> >> +out_err:
> >> + rv_remove(rv_root.root_dir);
> >> + printk(KERN_ERR "RV: Error while creating the RV interface\n");
> >> + return 1;
> >> +}