Re: [RFC][PATCH] perf: sysfs type id

From: Peter Zijlstra
Date: Wed Nov 17 2010 - 06:25:30 EST


On Tue, 2010-11-16 at 18:35 -0800, Corey Ashford wrote:

> I don't understand the /sys/devices tree much (I will read up on it),
> but this idea looks good to me.

Yeah, me too.. I talked to Kay a bit earlier on and /sys/devices/system
is 'special'..

> To clarify my understanding a bit and taking the gfx example, in the
> path /sys/class/pmu/radeon0, is the '0' here denoting the 0'th radeon
> chip in the system, or the radeon model number? I would assume the 0'th
> chip.

Chip indeed.

> So if I assume that now points to a unique radeon chip in the system,
> underneath /sys/class/pmu/radeon0 will be a structure something like:
>
> radeon0/
> event/
> evt0
> ..
> evtn
>
> And if there is a second radeon chip, there would be a nearly identical
> tree:
>
> radeon1/
> event/
> evt0
> ..
> evtn
>
> Is that correct?

Yes.

> Some of these events may need modifiers / attributes / umasks...
> whatever you want to call them. And they may need more than one each,
> and they may vary from event to event. So to add to the hierarchy,
> we'd have:
>
> radeon0/
> type (for attr.type)
> event/
> evt0/
> id (a base number for attr.config)
> description (text file - but could be CONFIG_*'d out)
> modifiers/
> mod0/
> formula (some ascii syntax for describing how
> to set .config and/or .config_extra
> with this modifer's value)
> description (text - can configure out)
> constraints (some ascii syntax for describing
> the values mod0 can take on)
> ..
> modn/
> ..
> evtn/
>
> And this would be replicated for radeon1..n

The idea of the events dir is to provide a few frequently used/common
events, not to be an exhaustive list.

What we can do is provide a break-down of the config in the top-level
directory and refer people to the hardware documentation (they need to
read that anyway if they want to make use special events anyway).

> Maybe all of the "event" directories could be soft links to a common
> radeon<model_number> event directory.

Possibly, but I don't expect this to be a common thing, and we can
always do it later.

> When you fully specify an event, you have something like:
>
> /sys/devices/pci0000:00/0000:00:1e.0/0000:0b:01.0/drm/card0/pmu/<event>[:<modifier>=nnn:...]
>
> So it wouldn't end up being strictly a sysfs path anymore, and perf
> would have a bit of parsing work to do, to evaluate the modifiers, using
> the info from constraints, and construct the .type, .config, and
> .config_extra fields using formula.
>
> Or maybe you have some other structure in mind?

I wouldn't bother with modifiers and all that:
perf record -e radeon0:r0123456789ABCDEF

is there for people who know what they're doing, possibly we can parse
the config format and use some of that to enable things like:

[ using the x86-intel format because I actually know that, as opposed to
the radeon case which I know absolutely nothing about. ]

# cat cpu/config_format
event_selector:8
unit_mask:8
NULL:7
invert:1
counter_mask:8

perf record -e radeon0:event_selector=0xf;unit_mask=0x5;invert;counter_mask=1

To make it slightly easier, we could maybe event do something like:

perf record -e radeon0:instructions;invert;counter_mask=1

To take the base of the 'instructions' event and modify that with the
invert and counter_mask details.

A further attribute we can give the top level node is something like
'name', which would describe the actual PMU, ARM people want something
like that because its near impossible to figure out what PMU lives on
your chip from userspace.

# cat cpu/name
x86,Core2,PEBS

or somesuch.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/