Re: [RFC PATCH v2 0/7] DAMON based 2-tier memory management for CXL memory

From: SeongJae Park
Date: Mon Mar 18 2024 - 15:07:34 EST


On Mon, 18 Mar 2024 22:27:45 +0900 Honggyu Kim <honggyu.kim@xxxxxx> wrote:

> Hi SeongJae,
>
> On Sun, 17 Mar 2024 08:31:44 -0700 SeongJae Park <sj@xxxxxxxxxx> wrote:
> > Hi Honggyu,
> >
> > On Sun, 17 Mar 2024 17:36:29 +0900 Honggyu Kim <honggyu.kim@xxxxxx> wrote:
> >
> > > Hi SeongJae,
> > >
> > > Thanks for the confirmation. I have a few comments on young filter so
> > > please read the inline comments again.
> > >
> > > On Wed, 12 Mar 2024 08:53:00 -0800 SeongJae Park <sj@xxxxxxxxxx> wrote:
> > > > Hi Honggyu,
> > > >
> > > > > > -----Original Message-----
> > > > > > From: SeongJae Park <sj@xxxxxxxxxx>
> > > > > > Sent: Tuesday, March 12, 2024 3:33 AM
> > > > > > To: Honggyu Kim <honggyu.kim@xxxxxx>
> > > > > > Cc: SeongJae Park <sj@xxxxxxxxxx>; kernel_team <kernel_team@xxxxxxxxxxx>
> > > > > > Subject: RE: Re: [RFC PATCH v2 0/7] DAMON based 2-tier memory management for CXL memory
> > > > > >
> > > > > > Hi Honggyu,
> > > > > >
> > > > > > On Mon, 11 Mar 2024 12:51:12 +0000 "honggyu.kim@xxxxxx" <honggyu.kim@xxxxxx> wrote:
> > > > > >
> > > > > > > Hi SeongJae,
> > > > > > >
> > > > > > > I've tested it again and found that "young" filter has to be set
> > > > > > > differently as follows.
> > > > > > > - demote action: set "young" filter with "matching" true
> > > > > > > - promote action: set "young" filter with "matching" false
>
> Thinking it again, I feel like "matching" true or false looks quite
> vague to me as a general user.
>
> Instead, I would like to have more meaningful names for "matching" as
> follows.
>
> - matching "true" can be either (filter) "out" or "skip".
> - matching "false" can be either (filter) "in" or "apply".

I agree the naming could be done much better. And thank you for the nice
suggestions. I have a few concerns, though.

Firstly, increasing the number of behavioral concepts. DAMOS filter feature
has only single behavior: excluding some types of memory from DAMOS action
target. The "matching" is to provide a flexible way for further specifying the
target to exclude in a bit detail. Without it, we would need non-variant for
each filter type. Comapred to the current terms, the new terms feel like
implying there are two types of behaviors. I think one behavior is easier to
understand than two behaviors, and better match what internal code is doing.

Secondly, ambiguity in "in" and "apply". To me, the terms sound like _adding_
something more than _excluding_. I think that might confuse people in some
cases. Actually, I have used the term "filter-out" and "filter-in" on
this and several threads. When saying about "filter-in" scenario, I had to
emphasize the fact that it is not adding something but excluding others. I now
think that was not a good approach.

Finally, "apply" sounds a bit deterministic. I think it could be a bit
confusing in some cases such as when using multiple filters in a combined way.
For example, if we have two filters for 1) "apply" a memcg and 2) skip anon
pages, the given DAMOS action will not be applied to anon pages of the memcg.
I think this might be a bit confusing.

>
> Internally, the type of "matching" can be boolean, but it'd be better
> for general users have another ways to set it such as "out"/"in" or
> "skip"/"apply" via sysfs interface. I prefer "skip" and "apply" looks
> more intuitive, but I don't have strong objection on "out" and "in" as
> well.

Unfortunately, DAMON sysfs interface is an ABI that we want to keep stable. Of
course we could make some changes on it if really required. But I'm unsure if
the problem of current naming and benefit of the sugegsted change are big
enough to outweighs the stability risk and additional efforts.

Also, DAMON sysfs interface is arguably not for _very_ general users. DAMON
user-space tool is the one for _more_ general users. To quote DAMON usage
document,

- *DAMON user space tool.*
`This <https://github.com/awslabs/damo>`_ is for privileged people such as
system administrators who want a just-working human-friendly interface.
[...]
- *sysfs interface.*
:ref:`This <sysfs_interface>` is for privileged user space programmers who
want more optimized use of DAMON. [...]

If the concept is that confused, I think we could improve the documentation, or
the user space tool. But for DAMON sysfs interface, I think we need more
discussions for getting clear pros/cons that justifies the risk and the effort.

>
> I also feel the filter name "young" is more for developers not for
> general users. I think this can be changed to "accessed" filter
> instead.

In my humble opinion, "accessed" might be confusing with the term that being
used by DAMON, specifically, the concept of "nr_accesses". I also thought
about using more specific term such as "pg-accessed" or something else, but I
felt it is still unclear or making it too verbose.

I agree "young" sounds more for developers. But, again, DAMON sysfs is for not
_very_ general users. And the term is used commonly on relcaimation part and
LRU, so I think this is not too bad as long as we provide nice documentation or
abstraction from user-space tool.

>
> The demote and promote filters can be set as follows using these.
>
> - demote action: set "accessed" filter with "matching" to "skip"
> - promote action: set "accessed" filter with "matching" to "apply"
>
> I also think that you can feel this is more complicated so I would like
> to hear how you think about this.

To my humble brain, this looks intuitive for this use case. But as I also
mentioned above, I worry if this could keep simple and intuitive in complicated
filter usages.

>
> > > > > >
> > > > > > DAMOS filter is basically for filtering "out" memory regions that matches to
> > > > > > the condition.
>
> Right. In other tools, I see filters are more used as filtering "in"
> rather than filtering "out". I feel this makes me more confused.

I understand that the word, "filtering", can be used for both, and therefore
can be confused. I was also spending no small times at naming since I was
thinking about both coffee filters and color filters (of photoshop or glasses).
But it turned out that I'm more familiar with coffee filters, and might be same
for DAMON community, since the community is having beer/coffee/tea chat series
;) (https://lore.kernel.org/damon/20220810225102.124459-1-sj@xxxxxxxxxx/)

That said, I think we may be able to make this better documented, or add a
layer of abstraction on DAMON user-space tool.

[...]
> > > > > Yes, I understand "promote non-non-young pages" means "promote young pages".
> > > > > This might be understood as "young pages are NOT filtered out" for promotion
> > > > > but it doesn't mean that "old pages are filtered out" instead.
> > > > > And we just rely hot detection only on DAMOS logics such as access frequency
> > > > > and age. Am I correct?
> > > >
> > > > You're correct.
> > >
> > > Ack. But if it doesn't mean that "old pages are filtered out" instead,
> >
> > It does mean that. Here, filtering is exclusive. Hence, "filter-in a type of
> > pages" means "filter-out pages of other types". At least that's the intention.
> > To quote the documentation
> > (https://docs.kernel.org/mm/damon/design.html#filters),
> >
> > Each filter specifies the type of target memory, and whether it should
> > exclude the memory of the type (filter-out), or all except the memory of
> > the type (filter-in).
>
> Thanks for the correction.
>
> > > then do we really need this filter for promotion? If not, maybe should
> > > we create another "old" filter for promotion? As of now, the promotion
> > > is mostly done inaccurately, but the accurate migration is done at
> > > demotion level.
> >
> > Is this based on your theory? Or, a real behavior that you're seeing from your
> > setup? If this is a real behavior, I think that should be a bug that need to
> > be fixed.
>
> I have observed this in the hot_cold example, but I also found that the
> promotion is done very quickly because the age for promote action is set
> to 0 to max in my json setup. It makes most pages of the region are
> young because there is not enough time for those pages being old. That
> means I was wrong.
[...]
> > > I understand the function name of internal implementation is
> > > "damos_pa_filter_out" so the basic action is filtering out, but the
> > > cgroup filter works in the opposite way for now.
> >
> > Does memcg filter works in the opposite way? I don't think so because
> > __damos_pa_filter_out() sets 'matches' as 'true' only if the the given folio is
> > contained in the given memcg. 'young' filter also simply sets 'matches' as
> > 'true' only if the given folio is young.
> >
> > If it works in the opposite way, it's a bug that need to be fixed. Please let
> > me know if I'm missing something.
>
> No, it was also my misunderstanding. I used to set the matching false
> using my script.

Thank you for confirming. I understand we found no bug at the moment.


To summarize my opinion again,

1. I agree the concept and names of DAMOS filters are confusing and not very
intuitive.
2. However, it's unclear if the problem and the benefit from the suggested new
names are huge enough to take the risk and effort on changing ABI.
3. We could improve documentation and/or user-space tool.

Thank you again for the suggestion and confirmations to my questions.


Thanks,
SJ

[...]