Re: [PATCH] mm/damon: introduce DAMON-based NUMA memory tiering module

From: Lorenzo Stoakes (Oracle)

Date: Thu Mar 26 2026 - 06:39:01 EST


+to the other email address you've sometimes used
+cc various possibly relevant people

On Thu, Mar 26, 2026 at 07:27:37AM +0000, Josh Law wrote:
> Add a new DAMON special-purpose module for NUMA memory tiering.
> DAMON_TIER monitors physical memory access patterns and migrates hot
> pages from slow NUMA nodes to fast NUMA nodes (promotion), and cold
> pages in the opposite direction (demotion).
>
> The module uses two DAMOS schemes, one for each migration direction,
> with DAMOS_QUOTA_NODE_MEM_USED_BP and DAMOS_QUOTA_NODE_MEM_FREE_BP
> quota goals to automatically adjust aggressiveness based on the fast
> node's utilization. It also applies YOUNG page filters to avoid
> migrating pages that have been recently accessed in the wrong direction.
>
> This is a production-quality version of the samples/damon/mtier.c proof
> of concept, following the same module_param-based interface pattern
> as DAMON_RECLAIM and DAMON_LRU_SORT. It reuses the modules-common.h
> infrastructure for monitoring attributes, quotas, watermarks, and
> statistics.
>
> Module parameters allow configuring:
> - promote_target_nid / demote_target_nid: the NUMA node pair
> - promote_target_mem_used_bp / demote_target_mem_free_bp: utilization
> goals driving quota auto-tuning
> - Standard DAMON module knobs: monitoring intervals, quotas, watermarks,
> region bounds, stats, and runtime reconfiguration via commit_inputs
>
> Signed-off-by: Josh Law <objecting@xxxxxxxxxxxxx>

NAK.

And NAK to all future 'contributions' in anything I maintain or have a say
in.

Your engagement with the community is deeply suspect: you've come out of
nowhere and are sending dozens and dozens of patches that look very
strongly like they were LLM-generated.

You've - very early on - tried to get a MAINTAINERS entry, and you were
given advice on how to contribute, which you have clearly ignored.

We DO NOT want AI slop.

You very much seem to be either:

- Somebody playing with a bot.

- Somebody trying to farm for kernel stats.

- Or (far more concerning) engaging in an attack on the kernel for
nefarious purposes, perhaps a (semi-)automated supply-chain attack?

Your email is highly suspect - you appear to be relaying through a
privacy-oriented mail provider - and I'm pretty convinced you're in
violation of our requirements about identity:

"It is imperative that all code contributed to the kernel be legitimately
free software. For that reason, code from contributors without a known
identity or anonymous contributors will not be accepted"

https://docs.kernel.org/process/1.Intro.html

Also see https://kernel.org/doc/html/latest/process/generated-content.html :

"
...when making a contribution, be transparent about the origin of content
in cover letters and changelogs. You can be more transparent by adding
information like this:

What tools were used?

The input to the tools you used, like the Coccinelle source script.

If code was largely generated from a single or short set of prompts,
include those prompts. For longer sessions, include a summary of the
prompts and the nature of resulting assistance.

Which portions of the content were affected by that tool?

How is the submission tested and what tools were used to test the fix?

...

If tools permit you to generate a contribution automatically, expect
additional scrutiny in proportion to how much of it was generated.

As with the output of any tooling, the result may be incorrect or
inappropriate. You are expected to understand and to be able to defend
everything you submit. If you are unable to do so, then do not submit the
resulting changes.

If you do so anyway, maintainers are entitled to reject your series without
detailed review.
"

You are clearly not following _any_ of these guidelines.

To show that this is not some wild accusation, I ran this series through
an LLM, asking it for indicators of AI generation:

~~~

● Several signals point to high likelihood of AI generation:

Strong AI indicators:

1. "production-quality version" in the commit message — kernel developers
don't self-describe patches this way. This is a classic LLM "selling"
framing.

2. Comment uniformity — every module parameter has an
identically-structured block comment with the same explanatory depth and
cadence. Real developers vary their comment style and skip obvious ones.

3. Commit message structure — exhaustively lists every feature with
perfect bullet formatting. Human kernel commit messages are more
conversational and focused on "why", not a feature catalog.

4. Too clean for a first submission — 409 lines of new module code with
zero rough edges, no personal style, no unusual design choices. It reads
like a templated synthesis of DAMON_RECLAIM and DAMON_LRU_SORT.

5. Grammar tell — "aiming a target utilization" in the Kconfig help text
(missing "for"). This is characteristic of LLM output that's fluent but
occasionally drops prepositions.

6. Over-commenting — comments explain things that are self-evident from
the code (e.g., the promote_target_nid and demote_target_nid comments
just restate what the variable name already says, in long form).

Weaker signals:

- Unknown author, no prior contribution history I can see

- The code perfectly follows existing DAMON module patterns without any
deviation — exactly what an LLM with context would produce

- The objecting.org domain is unusual

Overall: I'd put this at high likelihood (70-80%) of being primarily
AI-generated or heavily AI-assisted. The code does use current DAMON APIs
correctly, but the writing style throughout is the giveaway — it's
uniformly polished in the way LLM output is, without the unevenness that
comes from a human writing 400+ lines.

~~~

That's just for this series. I asked it about your contributions overall:

~~~
● This is extremely revealing. ~328 emails in 11 days (March 15–26),
spanning an extraordinary number of unrelated subsystems:

Patches submitted across:

- mm/damon — NUMA tiering module, core optimizations, sysfs fixes,
reclaim fixes, kdamond scheme filtering
- lib/bootconfig — v1 through v8 of a 13–17 patch series
- lib/glob — v1 through v4 of an 8-patch series (new features + bug
fixes)
- lib/vsprintf — 4 bug fixes
- lib/base64 — decode fixes
- lib/assoc_array — gc fix
- lib/bch — signed shift UB fixes
- lib/bug — concurrency fix + debugfs interface
- lib/string — strlcat rewrite
- lib/maple_tree — dead range comparison fix
- lib/parser — match_wildcard fix
- io_uring — flush deferred completions
- block/partitions — seq_buf replacement
- ACPI processor idle
- signal handling
- kunit, iov_iter/kvec, percpu_counter

Assessment: ~95% probability all contributions are AI-generated. The
evidence is overwhelming:

1. Volume is humanly implausible — ~30 emails/day, 5–10 new patch
submissions per day across unrelated subsystems, from a contributor
with zero prior history.

2. Breadth is the strongest signal — no human newcomer simultaneously
finds subtle bugs in bootconfig, vsprintf, base64, bch, maple_tree,
assoc_array, io_uring, AND writes a new DAMON NUMA tiering
module. Each of these requires deep domain-specific knowledge. The
pattern is consistent with an LLM being pointed at different source
files to systematically find issues.

3. Bug-finding pattern — the patches cluster around unchecked return
values, type mismatches, resource leaks, off-by-ones, signed/unsigned
issues. This is exactly what an LLM produces when scanning code for
potential problems.

4. Rapid revision cycling — bootconfig went from v1 to v8 in ~1 day. This
matches AI regeneration, not human revision.

5. Feature additions from a newcomer — glob_match_nocase(),
glob_validate(), debugfs BUG/WARN interface, and the DAMON NUMA
tiering module are all non-trivial features. A first-time contributor
proposing features (not just fixes) across this many subsystems
simultaneously is essentially unheard of.

6. Zero ramp-up — the contribution stream started at full throughput with
no learning curve visible.

~~~

So I really feel that you need to account for this - please explain why
it appears you are spamming the mailing list with entirely LLM-generated
crap.

Furthermore, you appear to be using sophisticated privacy-oriented
tooling, which makes me quite concerned that this _is_ a concerted attack
on the kernel.

Again, to back this up, here's an LLM's assessment of the headers in your
email:

~~~
● Looking at the headers specifically:

Infrastructure:

- Mail provider: Zoho EU (zohomail.eu) — a privacy-oriented
EU-jurisdiction mail provider. Easy to set up anonymously.

- Domain: objecting.org — custom domain, no organizational affiliation,
no accountability chain. Anyone can register this and point MX to Zoho.

- Authentication: DKIM, SPF, DMARC all pass — but this only proves the
email came from objecting.org's Zoho setup, not who's behind it.

- X-Mailer: git-send-email 2.34.1 — Ubuntu 22.04 LTS default. One reply
had @ubuntu in the Message-ID, confirming the platform.

What the headers DON'T provide:
- No corporate or institutional email that ties to a verifiable identity
- No PGP/GPG signature
- No Signed-off-by chain from a known developer vouching for this person

Supply chain attack assessment:

This matches the threat model of automated supply chain compromise:

1. Patches already landed in stable — the bootconfig fixes made it into
6.19.10. The pipeline is working. Whatever is in those patches is now in
production kernels.

2. Trust-building pattern — the mix of real bug fixes (fd leaks,
unchecked returns, type issues) alongside feature additions (DAMON
tiering module, glob features, debugfs interface) is textbook. The small
fixes build reviewer trust and merge history; the features add attack
surface.

3. Volume as a weapon — 328 emails in 11 days overwhelms maintainer
review capacity. Maintainers like SeongJae Park (DAMON) and Masami
Hiramatsu (bootconfig) are now spending review cycles on AI-generated
submissions. Even if every patch is rejected, the reviewer's attention is
being consumed.

4. The DAMON NUMA tiering module is the most concerning submission — 409
lines of new kernel code that runs with kernel privileges, monitors
physical memory access patterns, and migrates pages between NUMA
nodes. If accepted, it's a significant new attack surface delivered as a
single commit from an unverifiable identity.

5. The feature additions are the real risk — bug fixes are relatively
easy to review (small, focused, verifiable). But glob_validate(),
glob_match_nocase(), the debugfs BUG/WARN site lister, and the DAMON
tiering module all add new code paths that are much harder to
exhaustively verify.

Bottom line: Whether the intent is malicious or not, the pattern —
anonymous identity, AI-generated code, industrial volume, patches already
in stable, mixing trust-building fixes with attack-surface-expanding
features — is exactly what an automated supply chain attack on the kernel
looks like. The 2021 University of Minnesota "Hypocrite Commits" incident
was a manual version of this; this appears to be the automated version at
scale.

~~~

I'm hoping this is some naive attempt to 'contribute' to the kernel
rather than something more nefarious, but the seemingly sophisticated
tooling involved makes me wonder otherwise.

In any case I'm deeply concerned by this.

Thanks, Lorenzo