Re: [REGRESSION] Re: AMDGPU 6.11.0 crash, 6.10.0 git bisect log

From: Alex Hung
Date: Fri Sep 20 2024 - 19:34:56 EST




On 2024-09-20 17:00, Dr. David Alan Gilbert wrote:
* Bob Gill (gillb5@xxxxxxxxx) wrote:
Hello.  Kernel 6.11.0 crashes.  6.10.0 builds.  Al Viro and Dr. David Alan
Gilbert have been helpful, and asked that I post a git bisect log.  The last
log step seems odd, but the second last step "Remove useless function call"
might be what broke.

Thank you for doing this!

My reading is that that commit is fine; I think the next one:

tree: git bisect bad
[a171cce57792b0a6206d532050179a381ad74f8f] drm/amd/display: Check and log for
function error codes

or the one after it is the culprit?

Adding the two Alex's from AMD back onto the thread.
(Also added the [REGRESSION] marker the notes tell us to add)

The commit triggers the debugger in case of errors.

Is CONFIG_DEBUG_KERNEL_DC (Enable kgdb break in DC) enabled in your .config? You can check with "grep CONFIG_DEBUG_KERNEL_DC .config".

If so, can you also try disabling it and check whether the error message "Failed to execute VBIOS command table" appears in the kernel log?
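
For reference, the two checks requested above could be scripted roughly as follows. This is only a sketch: the helper names config_enabled and log_has_message are made up for illustration, and it assumes the kernel log has already been saved to a file (for a boot that hung, that usually means a persistent journal or a serial/netconsole capture).

```shell
#!/bin/sh
# Hypothetical helpers, not part of the kernel tree or of any distribution.

# config_enabled FILE SYMBOL
# Succeeds only if SYMBOL is built in (=y) according to the config FILE.
# A line such as "# CONFIG_FOO is not set" does not count as enabled.
config_enabled() {
    grep -q "^$2=y\$" "$1"
}

# log_has_message FILE TEXT
# Succeeds if the literal string TEXT occurs anywhere in the saved log FILE.
log_has_message() {
    grep -qF -- "$2" "$1"
}

# Example use (file names are assumptions):
#   config_enabled .config CONFIG_DEBUG_KERNEL_DC && echo "kgdb break in DC is on"
#   log_has_message boot.log "Failed to execute VBIOS command table" && echo "found"
```

If the option turns out to be enabled, one way to switch it off without opening menuconfig is "scripts/config --disable DEBUG_KERNEL_DC" run from the top of the kernel tree, followed by "make olddefconfig" and a rebuild.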


My hardware is an old Core i7 quad-core/8-thread Tylersburg/Nehalem system
with an AMD RX 6500 XT.  That's the odd combination.

Thanks in advance,

Thanks again for the bisect.

Dave

Bob

Config:  (.config)
/data/kernel/bobtest6.10-64

Build line: (last command tells me the job is finished)
make menuconfig && make -j $(nproc) && make modules && make modules_install
&& make install && /data/music/pl.sh

Rule 1: Do not modify ANYTHING in the source tree

git bisect start
git bisect bad
git bisect good v6.10

Bisecting: 11273 revisions left to test after this (roughly 14 steps)
[2c9b3512402ed192d1f43f4531fb5da947e72bd0] Merge tag 'for-linus' of
git://git.kernel.org/pub/scm/virt/kvm/kvm

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0+
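
(Editorial sketch, not part of the original notes: the "latest kernel" pipeline used throughout this log picks the second line of "ls -alt" output and cuts out the ninth field, which depends on the long-listing column layout. A shorter variant that should give the same answer under the same assumption, namely that the freshly installed module directory has the newest modification time, is shown below. The function name newest_entry is hypothetical.)

```shell
#!/bin/sh
# newest_entry DIR
# Prints the name of the most recently modified entry in DIR.
# "ls -1t" lists one name per line, newest first; without -a the "." and ".."
# entries are not listed, so the first line is the answer.
newest_entry() {
    ls -1t "$1" | head -n 1
}

# Example use:
#   newest_entry /lib/modules
```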

RESULT:  boot 6.10.0+ fails
         screen black for more than 2 minutes,
         (caps lock key unresponsive, reset, power
         buttons on computer case do nothing).  Reset with power bar.

tree: git bisect bad
Bisecting: 5677 revisions left to test after this (roughly 13 steps)
[280e36f0d5b997173d014c07484c03a7f7750668] nsfs: use cleanup guard

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0+

RESULT: boot 6.10.0+ successful
tree: git bisect good
Bisecting: 2855 revisions left to test after this (roughly 12 steps)
[dde1a0e1625c08cf4f958348a83434b2ddecf449] Merge tag 'x86-percpu-2024-07-17'
of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0+

RESULT: boot 6.10.0+ fails
        screen black for more than 2 minutes,
        (caps lock key unresponsive, reset, power
        buttons on computer case do nothing).  Reset with power bar.

tree: git bisect bad
Bisecting: 1478 revisions left to test after this (roughly 11 steps)
[32a120f52a4c0121bca8f2328d4680d283693d60] drm/i915/mtl: Skip PLL state
verification in TBT mode

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ successful
tree: git bisect good
Bisecting: 739 revisions left to test after this (roughly 10 steps)
[b6a343df46d69070a7073405e470e6348180ea34] drm/amdgpu: initialize GC IP
v11.5.2

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ fails
        screen black for more than 2 minutes,
        (caps lock key unresponsive, reset, power
        buttons on computer case do nothing).  Reset with power bar.

tree: git bisect bad
Bisecting: 369 revisions left to test after this (roughly 9 steps)
[cf1d06ac53a15b83c0a63225606cfe175e33a8a0] accel/ivpu: Increase autosuspend
delay to 100ms on 40xx

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc1+

RESULT: boot 6.10.0-rc1+ successful
tree: git bisect good
Bisecting: 184 revisions left to test after this (roughly 8 steps)
[0ca9f757a0e27a076395ec1b2002661bcf5c25e8] drm/amd/pm: powerplay: Add
`__counted_by` attribute for flexible arrays

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.9.0-rc5+

RESULT: boot 6.9.0-rc5+ successful
tree: git bisect good
Bisecting: 92 revisions left to test after this (roughly 7 steps)
[9862ef7bae47b9292a38a0a1b30bff7f56d7815b] drm/amd/display: Use periodic
detection for ipx/headless

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ fails
        screen black for more than 2 minutes,
        (caps lock key unresponsive, reset, power
        buttons on computer case do nothing).  Reset with power bar.

tree: git bisect bad
Bisecting: 44 revisions left to test after this (roughly 6 steps)
[a78313bb206e0c456a989f380c4cbd8af8af7c76] Merge tag
'drm-intel-gt-next-2024-06-12' of
https://gitlab.freedesktop.org/drm/i915/kernel into drm-next

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ successful
tree: git bisect good
Bisecting: 22 revisions left to test after this (roughly 5 steps)
[51dbe0239b1fc7c435867ce28e5eb4394b6641e1] drm/amd/display: Fix cursor size
issues

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ successful
tree: git bisect good
Bisecting: 11 revisions left to test after this (roughly 4 steps)
[871512e36f9c1c2cb4e62eb860ca0438800e4d63] drm/amd/display: Add workaround
to restrict max frac urgent for DPM0

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ fails
        screen black for more than 2 minutes,
        (caps lock key unresponsive, reset, power
        buttons on computer case do nothing).  Reset with power bar.

tree: git bisect bad
Bisecting: 5 revisions left to test after this (roughly 3 steps)
[5d93060d430b359e16e7c555c8f151ead1ac614b] drm/amd/display: Check HDCP
returned status

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ fails
        screen black for more than 2 minutes,
        (caps lock key unresponsive, reset, power
        buttons on computer case do nothing).  Reset with power bar.

tree: git bisect bad
Bisecting: 2 revisions left to test after this (roughly 1 step)
[e094992bd1caa1fbd42221c7c305fc3b54172b5c] drm/amd/display: Remove useless
function call

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ successful

tree: git bisect good
[2c2ee1d1329881d8e6bb23c3b9f3b41df8a8055c] drm/amd/display: Check and log
for function error codes

latest kernel:
ls -alt /lib/modules | head -2 | tail -1 | tr -s " " | cut -d' ' -f9
6.10.0-rc3+

RESULT: boot 6.10.0-rc3+ fails
        screen black for more than 2 minutes,
        (caps lock key unresponsive, reset, power
        buttons on computer case do nothing).  Reset with power bar.

tree: git bisect bad
[a171cce57792b0a6206d532050179a381ad74f8f] drm/amd/display: Check and log
for function error codes
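
(Editorial sketch, not part of the original notes: the log above interleaves git's output with boot results, so each "tree: git bisect good/bad" verdict applies to the commit hash printed just before it. Assuming that layout, the notes could be condensed into plain "git bisect good/bad <hash>" lines, the same line format that "git bisect log" prints. The function name notes_to_bisect_log is hypothetical.)

```shell
#!/bin/sh
# notes_to_bisect_log
# Reads hand-written bisect notes on stdin and prints one
# "git bisect <verdict> <hash>" line per tested commit.
notes_to_bisect_log() {
    awk '
        # A line starting with "[<hex hash>]" names the commit git checked out.
        match($0, /^\[[0-9a-f]+\]/) {
            current = substr($0, 2, RLENGTH - 2)
            next
        }
        # "tree: git bisect good|bad" is the verdict for that commit.
        /^tree: git bisect (good|bad)$/ {
            if (current != "")
                print "git bisect " $4 " " current
        }
    '
}

# Example use (notes.txt is an assumed file name):
#   notes_to_bisect_log < notes.txt > verdicts.txt
```

The opening "git bisect start", "git bisect bad" and "git bisect good v6.10" lines are not matched by this sketch and would have to be added by hand before the result could be fed to "git bisect replay". Running "git bisect log" in the tree itself gives the authoritative record.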