Re: [PATCH 2.6.28-rc4]lock_stat: Add "con-hungry" to show that howmany person-time fight for the ticket spinlock

From: Ingo Molnar
Date: Fri Dec 26 2008 - 03:34:56 EST



* Yang Xi <yangxilkm@xxxxxxxxx> wrote:

> Because the lock-stat and lock-dep is so heavy, this statistic result is
> not very accurate :(

hm, lockdep indeed has to do some non-trivial work - but is that really
true of pure lockstat too? If you have a workload where you can see that
it's heavy, you could do an NMI profile on x86 by running kerneltop on
tip/master:

http://redhat.com/~mingo/perfcounters/kerneltop.c

You should get a top-alike list of the highest-cost functions. If lockstat
is heavy, its activities should show up there.

> @@ -34,6 +34,7 @@ config X86
> select HAVE_ARCH_TRACEHOOK
> select HAVE_GENERIC_DMA_COHERENT if X86_32
> select HAVE_EFFICIENT_UNALIGNED_ACCESS
> + select HAVE_TICKET_SPINLOCK

no fundamental objections against your patch, but i think it needs a
couple of cleanups first.

For example, this HAVE_TICKET_SPINLOCK distinction is unnecessarily
exposed to the core kernel, why not just allow architectures to define
spin_nr_contended():

> +#ifdef CONFIG_HAVE_TICKET_SPINLOCK
> +#define spin_nr_contended(lock) __ticket_spin_nr_contended(&(lock)->raw_lock)
> +#else
> +#define spin_nr_contended(lock) (spin_is_contended(lock) ? 1 : 0)
> +#endif

and do something like this in spinlock.h to give a default definition:

#ifndef spin_nr_contended
# define spin_nr_contended(lock) (spin_is_contended(lock) ? 1 : 0)
#endif

> @@ -2588,7 +2594,6 @@ static int __lock_acquire(struct lockdep_map
> *lock, unsigned int subclass,
>
> if (check == 2 && !mark_irqflags(curr, hlock))
> return 0;
> -
> /* mark it as used: */
> if (!mark_lock(curr, hlock, LOCK_USED))
> return 0;
> @@ -2623,7 +2628,6 @@ static int __lock_acquire(struct lockdep_map
> *lock, unsigned int subclass,
>
> if (!validate_chain(curr, lock, hlock, chain_head, chain_key))
> return 0;
> -
> curr->curr_chain_key = chain_key;
> curr->lockdep_depth++;
> check_chain_key(curr);
> @@ -3000,6 +3004,13 @@ __lock_contended(struct lockdep_map *lock,

those newlines you removed were there for a reason - they delimit blocks
of code from each other and make return statements more visible.


> unsigned long ip)
> struct lock_class_stats *stats;
> unsigned int depth;
> int i, point;
> + spinlock_t *lock_ptr;
> + unsigned long hungry = 0;

please keep the local variable definitions in their original style, i.e. a
reverse christmas tree:

> struct lock_class_stats *stats;
> + unsigned long hungry = 0;
> + spinlock_t *lock_ptr;
> unsigned int depth;
> int i, point;

> +
> + if (lock->isticketspinlock) {
> + lock_ptr = container_of(lock, spinlock_t, dep_map);
> + hungry = spin_nr_contended(lock_ptr);
> + }

do we need the ->isticketspinlock distinction? Cannot we call
spin_nr_contended() for all spinlocks? (it's just that for ticket
spinlocks we get a real value out of it - for normal spinlocks we only get
0/1 out of it. But that is not a problem really.)

also, please rename 'hungry' to something more descriptive: for example
'nr_contended' fits pretty well?

> @@ -3030,9 +3041,16 @@ found_it:
> stats->contention_point[point]++;
> if (lock->cpu != smp_processor_id())
> stats->bounces[bounce_contended + !!hlock->read]++;
> + stats->bounces[bounce_hungry] += hungry;
> + if (lock->isticketspinlock) {
> + if (stats->bounces[bounce_max_hungry] < hungry)
> + stats->bounces[bounce_max_hungry] = hungry;
> + }
> +
> put_lock_stats(stats);
> }
>
> +
> static void

spurious newline.

Ingo
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/