Re: [PATCH v4 3/4] thermal/core: Build ascending ordered indexes for the trip points

From: Rafael J. Wysocki
Date: Fri Jul 22 2022 - 13:40:24 EST


On Thu, Jul 21, 2022 at 11:15 PM Daniel Lezcano
<daniel.lezcano@xxxxxxxxxx> wrote:
>
> On 21/07/2022 13:25, Rafael J. Wysocki wrote:
> > On Thu, Jul 21, 2022 at 12:59 PM Daniel Lezcano
> > <daniel.lezcano@xxxxxxxxxx> wrote:
> >>
> >> On 19/07/2022 20:56, Rafael J. Wysocki wrote:
> >>> On Mon, Jul 18, 2022 at 4:50 PM Daniel Lezcano
> >>> <daniel.lezcano@xxxxxxxxxx> wrote:
> >>>>
> >>>> By convention the trips points are declared in the ascending
> >>>> temperature order. However, no specification for the device tree, ACPI
> >>>> or documentation tells the trip points must be ordered this way.
> >>>>
> >>>> In the other hand, we need those to be ordered to browse them at the
> >>>
> >>> s/In/On/
> >>>
> >>>> thermal events.
> >>>
> >>> What if they are all inspected every time?
> >>
> >> My bad, my sentence is confusing. The trip point are browsed every time
> >> and we need to have them ordered to detect correctly the thermal events.
> >
> > I see.
> >
> > So this mostly is a preparation for patch 4, isn't it?
>
> Yes, it is correct
>
> >>>> But if we assume they are ordered and change the code
> >>>> based on this assumption, any platform with shuffled trip points
> >>>> description will be broken (if they exist).
> >>>>
> >>>> Instead of taking the risk of breaking the existing platforms, use an
> >>>> array of temperature ordered trip identifiers and make it available
> >>>> for the code needing to browse the trip points in an ordered way.
> >>>
> >>> Well, having ops->get_trip_temp() suggests that the trip temperatures
> >>> can be dynamic. Is the ordering guaranteed to be preserved in that
> >>> case?
> >>
> >> The number of trips can not be changed. It is fixed when the thermal
> >> zone is created AFAICT.
> >
> > The current code appears to assume that and I think that this is a
> > reasonable expectation.
> >
> >> The get_trip_temp() is just a way to let the
> >> different driver declare their own trip structure which is actually
> >> something I'm trying to fix by moving the structure thermal_trip inside
> >> the thermal zone. But that is a longer and separate work.
> >
> > Well, I'm not sure.
> >
> > Trip point temperatures can be set via trip_point_temp_store() at
> > least in principle. How is it guaranteed that this won't affect the
> > ordering?
>
> Right, that is a good point. I don't see a logical use case where a trip
> point will be set below or above the previous or the next one, so the
> order should be kept.

Good.

> However, strictly speaking, nothing prevents that
> so I guess we need to reorder the trips when one is changed. It should
> be a one line call.

Or modify trip_point_temp_store() to preserve the ordering of trip
points (which would be way simpler if trip point temperatures were
stored as numbers in a sorted array - see below).

>
> >>> Anyway, if they need to be sorted, why don't we just sort them
> >>> properly instead of adding this extra array?
> >>
> >> We can not because ATM the trip points array is private to the different
> >> sensors.
> >
> > Well, the core could create an array or list of trip points for the
> > thermal zone during registration and populate it from the
> > driver-provided data. Then, it could be sorted at the creation time.
>
>
> There won't be any benefit ATM.

What about less overhead? And the code being easier to understand?
And no hidden assumptions? Hmm??

> The get_trip_temp/type/hyst ops are
> called all over the place. If we create a second sorted trip point array
> and use it in the core code, then all those ops should be replaced to
> use the sorted array instead of addressing the private trip structure.

Right.

> A big deal in terms of changes.

It is not overwhelming in my view.

> If we don't do the ops changes, then it is simpler to have an array of
> index->trip id and the impact is small.
>
> But I agree we should have a sorted trip array per thermal zone and stop
> using the ops->get_trip_temp|type|hyst. That is part of the work I'm
> doing in parallel to cleanup the thermal-of and I've the plan to migrate
> all the sensors to use the struct thermal_trip instead of private data.
> From there we will be able to get rid of the get_trip[*] and the sorted
> trip indexes array.

Well, what about making the core use struct thermal_trip internally in
the first place?

Except that I don't like the np member of that one.

> All these changes are not feasible in the short term.

They can be made in the opposite direction, starting at the core
level. Then, it would be clear where you were going.

> I would like to
> keep the indexes trip array approach to fix the trip cross events which
> is broken right now and then go forward with the struct thermal_trip
> changes and the thermal-of cleanups I've sent last week.
>
> Does it sound reasonable ?

I'm not convinced about the need to make the code more complicated if
the overall direction is to simplify it.

I understand that you want to avoid regressing things, but you want to
make these changes eventually anyway, so why to you think that the
risk of regressing things would be smaller in the future, after making
the code more complicated than it is now? Sounds counter-intuitive to
me.