Re: [patch 06/11] Use virtual debug registers in process/threadhandling code

From: Alan Stern
Date: Tue Mar 10 2009 - 12:08:09 EST


On Tue, 10 Mar 2009, Ingo Molnar wrote:

> > @@ -595,6 +596,12 @@ __switch_to(struct task_struct *prev_p,
> >
> > percpu_write(current_task, next_p);
> >
> > + /*
> > + * Handle debug registers. This must be done _after_ current
> > + * is updated.
> > + */
> > + if (unlikely(test_tsk_thread_flag(next_p, TIF_DEBUG)))
> > + switch_to_thread_hw_breakpoint(next_p);
>
> why does this have to be called after 'current' has been
> updated? AFAICS switch_to_thread_hw_breakpoint() does not take a
> look at 'current'.

There was a discussion about this on LKML last October 17, and you were
in the CC list. Here is the reason, extracted from one of those
messages:


There's a problem with moving the switch_to_thread_hw_breakpoint() call
before current is updated. Suppose a kernel breakpoint is triggered in
between the two. The hw-breakpoint handler will see that current is
different from the task pointer stored in the chbi area, so it will
think the task pointer is leftover from an old task (lazy switching)
and will erase it. Then until the next context switch, no
user-breakpoints will be installed.

The real problem is that it's impossible to update both current and
chbi->bp_task at the same instant, so there will always be a window in
which they disagree and a breakpoint might get triggered. Since we use
lazy switching, we are forced to assume that a disagreement means that
current is correct and chbi->bp_task is old. But if you move the code
above then you'll create a window in which current is old and
chbi->bp_task is correct.


> Speaking of switch_to_thread_hw_breakpoint(), i dont like that
> function at all:
>
> - why does it have to do a list of debug registers?

I'm not sure I understand the point of this question. Are you asking
why the hw_breakpoint structures are stored on a list? Because there
can be an arbitrarily large number of them.

> - why does it worry about IPIs arriving when context-switches on
> x86 are always done with interrupts disabled?

The routine gets invoked at times other than during a context switch.
However you may be right that these times are all mutually exclusive.
If so then a good deal of complication can be removed.

> - also, what do the ->installed() and ->uninstalled() callbacks
> do - nothing uses it!

What do you mean? They do what any callback does. And of course
nothing uses them -- the code hasn't been merged yet!

The intention is to let programs (or kernel debuggers) know when the
statistics they are gathering are contaminated because the breakpoint
in question has been uninstalled, and when the statistics are again
valid because the breakpoint has been re-installed.

Alan Stern

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/