Re: [RFC][PATCH 00/21] tracing: Inter-event (e.g. latency) support

From: Frank Ch. Eigler
Date: Thu Feb 09 2017 - 11:03:04 EST



Hi, Tom -


tom.zanussi wrote:

> [...]
>> Hmm, this looks a bit hard to understand, I guess that onmatch() means
>> "if there is an event which has ts0 variable and the event's key matches
>> this key, take some action".
>
> Yes, that's pretty much it. It's essentially shorthand for this kind of
> common idiom, where timestamp[] is an associative array, which in our
> case is the tracing_map of the histogram:
>
> event sched_wakeup()
> {
> ts0[wakeup_pid] = now()
> }
> event sched_switch()
> {
> if (ts0[next_pid])
> latency = now() - ts0[next_pid] /* next_pid == wakeup_pid */
> }

By the way, here is a working systemtap version of this demo:

# cat foo.stp
global ts0%, latency%
function now() { return gettimeofday_us() }

probe kernel.trace("sched_wakeup") { ts0[$p->pid] = now() }

probe kernel.trace("sched_switch") {
if (ts0[$next->pid])
latency[$next->pid,$next->prio] <<< now() - ts0[$next->pid];
}

probe timer.s(5) {
foreach ([pid+,x] in latency) {
println("pid:", pid, " prio:", x)
print(@hist_log(latency[pid,x]))
}
delete latency
}


# stap foo.stp
[...]
pid:20183 prio:109
value |-------------------------------------------------- count
2 | 0
4 | 0
8 |@ 1
16 | 0
32 | 0

pid:29095 prio:120
value |-------------------------------------------------- count
0 | 1
1 |@@@@ 8
2 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 76
4 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 60
8 |@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@@ 68
16 |@@@@@@@@ 16
32 | 0
64 | 0
[...]




> ts0 is basically a per-table-entry variable - there's one for each
> entry in the table, and it can only be accessed by events with
> matching keys. [...] So, that's a long-winded way of saying that the
> name ts0 is global across all tables (histograms) but an instance of
> ts0 is local to each entry in the table that owns the name.

In systemtap, one of the things we take care of is automatic concurrency
control over such shared variables. Even if many CPUs run these same
functions and try to access the same ts0/latency hash tables at the same
time, things will work correctly. I'm curious how your code deals with
this.


- FChE