Re: [PATCH 00/21] perf, c2c: Add new tool to analyze cachelinecontention on NUMA systems

From: Don Zickus
Date: Mon Feb 10 2014 - 17:12:42 EST


On Mon, Feb 10, 2014 at 10:18:25PM +0100, Peter Zijlstra wrote:
> On Mon, Feb 10, 2014 at 12:28:55PM -0500, Don Zickus wrote:
> > With the introduction of NUMA systems, came the possibility of remote memory accesses.
> > Combine those remote memory accesses with contention on the remote node (ie a modified
> > cacheline) and you have a possibility for very long latencies. These latencies can
> > bottleneck a program.
> >
> > The program added by these patches, helps detect the situation where two nodes are
> > 'tugging' on the same _data_ cacheline. The term used through out this program and
> > the various changelogs is called a HITM. This means nodeX went to read a cacheline
> > and it was discovered to be loaded in nodeY's LLC cache (hence the cacheHIT). The
> > remote cacheline was also in a 'M'odified state thus creating a 'HIT M' for hit in
> > a modified state. HITMs can happen locally and remotely. This program's interest
> > is mainly in remote HITMs as they cause the longest latencies.
>
> All of that is true of the traditional SMP system too. Just use lower
> level caches.

Yup. We just focused on the longer latencies which is the remote case. I
think the idea was overflowing an L1 and L2 wasn't that hard, so the gain
on solving local LLC HITMs wouldn't be that much. Maybe we are wrong.

Anyway, if this tool can help solve any bottlenecks, NUMA or non-NUMA,
that would be great. :-)

Cheers,
Don

--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/