[PATCH 2.6.16-rc3] m32r: update sys_tas() routine

From: Hirokazu Takata
Date: Mon Feb 20 2006 - 08:38:20 EST


Hi,

This patch updates and fixes sys_tas() routine for m32r.

In the previous implementation, a lockup rarely caused at sys_tas()
routine in SMP environment.

> > The problem is that touching *addr will generate an oops if that page isn't
> > paged in. If we convert it to use get_user() then that's an improvement,
> > but we must not run get_user() under spinlock or local_irq_disable().

I rewrote sys_tas() routine by using "lock -> unlock" instructions,
and utilizing the m32r's interrupt handling characteristics;
the m32r processor can accept interrupts only at the 32-bit instruction
boundary. So, the "unlock" instruction can be executed continuously
after the "lock" instruction execution without any interruptions.

In addition, to solve such a page_fault problem, I use a fixup code
like get_user().

And, as for the kernel lockup problem, we found that a calling
do_page_fault() routine with disabling interrupts might cause
a lockup at flush_tlb_others(), because we checked a completion
of IPI handler's operations in a spin-locked critical section.

Therefore, by using "lock -> unlock" code, we can implement
the sys_tas() rouitine without disabling interrupts explicitly,
then no lockups would happen at flush_tlb_others(), I hope.

Compile check and some working test in SMP environment have done.

-- Takata


From: Hirokazu Takata <takata@xxxxxxxxxxxxxx>
Subject: Re: [WTF?] sys_tas() on m32r
Date: Tue, 27 Dec 2005 14:27:39 +0900 (JST)
> Hi,
>
> From: Andrew Morton <akpm@xxxxxxxx>
> Date: Fri, 23 Dec 2005 05:55:26 -0800
> > Al Viro <viro@xxxxxxxxxxxxxxxx> wrote:
> > >
> > > asmlinkage int sys_tas(int *addr)
> > > {
> > > int oldval;
> > > unsigned long flags;
> > >
> > > if (!access_ok(VERIFY_WRITE, addr, sizeof (int)))
> > > return -EFAULT;
> > > local_irq_save(flags);
> > > oldval = *addr;
> > > if (!oldval)
> > > *addr = 1;
> > > local_irq_restore(flags);
> > > return oldval;
> > > }
> > > in arch/m32r/kernel/sys_m32r.c. Trivial oops *AND* ability to trigger
> > > IO with interrupts disabled.
> >
> > Yeah. I pointed this out to Takata in October last year and then promptly
> > forgot about it. It's rather amazing that this code (which appears to be in
> > live use in linuxthreads) hasn't generated oopses.
>
> Yes, I remember it.
>
> > The problem is that touching *addr will generate an oops if that page isn't
> > paged in. If we convert it to use get_user() then that's an improvement,
> > but we must not run get_user() under spinlock or local_irq_disable().
>
> Exactly.
> This problem has been bothering me, but I cannot find a good solusion...


Signed-off-by: Hirokazu Takata <takata@xxxxxxxxxxxxxx>
---

arch/m32r/kernel/sys_m32r.c | 61 ++++++++++++++++++++++++--------------------
1 file changed, 34 insertions(+), 27 deletions(-)

Index: linux-2.6.16-rc1/arch/m32r/kernel/sys_m32r.c
===================================================================
--- linux-2.6.16-rc1.orig/arch/m32r/kernel/sys_m32r.c 2006-01-31 21:47:06.918951364 +0900
+++ linux-2.6.16-rc1/arch/m32r/kernel/sys_m32r.c 2006-01-31 21:58:21.733388614 +0900
@@ -29,28 +29,7 @@

/*
* sys_tas() - test-and-set
- * linuxthreads testing version
*/
-#ifndef CONFIG_SMP
-asmlinkage int sys_tas(int *addr)
-{
- int oldval;
- unsigned long flags;
-
- if (!access_ok(VERIFY_WRITE, addr, sizeof (int)))
- return -EFAULT;
- local_irq_save(flags);
- oldval = *addr;
- if (!oldval)
- *addr = 1;
- local_irq_restore(flags);
- return oldval;
-}
-#else /* CONFIG_SMP */
-#include <linux/spinlock.h>
-
-static DEFINE_SPINLOCK(tas_lock);
-
asmlinkage int sys_tas(int *addr)
{
int oldval;
@@ -58,15 +37,43 @@ asmlinkage int sys_tas(int *addr)
if (!access_ok(VERIFY_WRITE, addr, sizeof (int)))
return -EFAULT;

- _raw_spin_lock(&tas_lock);
- oldval = *addr;
- if (!oldval)
- *addr = 1;
- _raw_spin_unlock(&tas_lock);
+ /* atomic operation:
+ * oldval = *addr; *addr = 1;
+ */
+ __asm__ __volatile__ (
+ DCACHE_CLEAR("%0", "r4", "%1")
+ " .fillinsn\n"
+ "1:\n"
+ " lock %0, @%1 -> unlock %2, @%1\n"
+ "2:\n"
+ /* NOTE:
+ * The m32r processor can accept interrupts only
+ * at the 32-bit instruction boundary.
+ * So, in the above code, the "unlock" instruction
+ * can be executed continuously after the "lock"
+ * instruction execution without any interruptions.
+ */
+ ".section .fixup,\"ax\"\n"
+ " .balign 4\n"
+ "3: ldi %0, #%3\n"
+ " seth r14, #high(2b)\n"
+ " or3 r14, r14, #low(2b)\n"
+ " jmp r14\n"
+ ".previous\n"
+ ".section __ex_table,\"a\"\n"
+ " .balign 4\n"
+ " .long 1b,3b\n"
+ ".previous\n"
+ : "=&r" (oldval)
+ : "r" (addr), "r" (1), "i"(-EFAULT)
+ : "r14", "memory"
+#ifdef CONFIG_CHIP_M32700_TS1
+ , "r4"
+#endif /* CONFIG_CHIP_M32700_TS1 */
+ );

return oldval;
}
-#endif /* CONFIG_SMP */

/*
* sys_pipe() is the normal C calling standard for creating

--
Hirokazu Takata <takata@xxxxxxxxxxxxxx>
Linux/M32R Project: http://www.linux-m32r.org/

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/