Re: [ANNOUNCE] 3.14-rt1

From: Fernando Lopez-Lezcano
Date: Thu May 15 2014 - 16:09:55 EST


On 05/15/2014 10:51 AM, Fernando Lopez-Lezcano wrote:
On 05/02/2014 04:37 AM, Sebastian Andrzej Siewior wrote:
* Fernando Lopez-Lezcano | 2014-04-26 11:29:04 [-0700]:

Saw this a moment ago (3.14.1 + rt1, Fedora 19 laptop - I think I
have seen something similar in 3.12.x-r):

Yes, you did: https://lkml.org/lkml/2014/3/7/163
You did not test I've sent. Care to do so?
...
I re-patched 3.14.3-rt5 with a slightly tweaked version of you patch.
Will see what happens and report back.

I got another freeze a moment ago. I'm attaching a text file with the oops, the patch I added to rt5, and the kernel configuration...

-- Fernando
May 15 12:51:27 localhost kernel: [ 9282.171060] ------------[ cut here ]------------
May 15 12:51:27 localhost kernel: [ 9282.171079] WARNING: CPU: 0 PID: 2049 at lib/list_debug.c:59 __list_del_entry+0xa1/0xd0()
May 15 12:51:27 localhost kernel: [ 9282.171082] list_del corruption. prev->next should be ffff880211174c60, but was dead000000100100
May 15 12:51:27 localhost kernel: [ 9282.171083] Modules linked in: fuse ipt_MASQUERADE xt_CHECKSUM tun ip6t_rpfilter ip6t_REJECT xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat rfcomm nf_conntrack iptable_mangle bnep iptable_security iptable_raw iTCO_wdt iTCO_vendor_support coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel microcode uvcvideo serio_raw videobuf2_vmalloc videobuf2_memops snd_hda_codec_hdmi videobuf2_core videodev intel_ips i2c_i801 media btusb bluetooth 6lowpan_iphc arc4 iwldvm snd_hda_codec_conexant snd_hda_codec_generic mac80211 iwlwifi sdhci_pci sdhci cfg80211 mmc_core lpc_ich mfd_core snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer e1000e mei_me ptp mei thinkpad_acpi pps_core shpchpnouveau E[ PGRAPH][0000:01:00.0] TRAP_PROP - TP 0 - ppdev snd soundcore rfkill parport_pc parport acpi_cpufreq uinput nouveau i2c_algo_bit drm_kms_helper firewire_ohci mxm_wmi firewire_core ttm
May 15 12:51:27 localhost kernel: [ 9282.171136] DST2D_FAULT
May 15 12:51:27 localhost kernel: [ 9282.171139] crc_itu_t drm i2c_core
May 15 12:51:27 localhost kernel: [ 9282.171139] - Address 0020a92000
May 15 12:51:27 localhost kernel: [ 9282.171141] wmi video
May 15 12:51:27 localhost kernel: [ 9282.171144] CPU: 0 PID: 2049 Comm: cinnamon Tainted: G W 3.14.3-200.rt5.1.fc19.ccrma.x86_64+rt #1
May 15 12:51:27 localhost kernel: [ 9282.171145] nouveau E[ PGRAPH][0000:01:00.0] TRAP_PROP - TP 0 - e0c: 00000000, e18: 00000000, e1c: 00000000, e20: 00000011, e24: 0c030000
May 15 12:51:27 localhost kernel: [ 9282.171146] Hardware name: LENOVO 4313CTO/4313CTO, BIOS 6MET64WW (1.27 ) 07/15/2010
May 15 12:51:27 localhost kernel: [ 9282.171149] 0000000000000000 000000006a3e8229 ffff8800ada91a88 ffffffff81703b57
May 15 12:51:27 localhost kernel: [ 9282.171151] ffff8800ada91ad0 ffff8800ada91ac0
May 15 12:51:27 localhost kernel: [ 9282.171151] nouveau E[ PGRAPH][0000:01:00.0] ch 3 [0x001fb14000 Xorg[1396]] subc 2 class 0x502d mthd 0x060c data 0x00000022
May 15 12:51:27 localhost kernel: [ 9282.171153] ffffffff8108bc2d ffff880211174c60
May 15 12:51:27 localhost kernel: [ 9282.171156] ffff880211174d50 ffff880211174d50 ffff880211174d40 ffff88021b73be48
May 15 12:51:27 localhost kernel: [ 9282.171156] Call Trace:
May 15 12:51:27 localhost kernel: [ 9282.171164] [<ffffffff81703b57>] dump_stack+0x4d/0x6f
May 15 12:51:27 localhost kernel: [ 9282.171170] [<ffffffff8108bc2d>] warn_slowpath_common+0x7d/0xc0
May 15 12:51:27 localhost kernel: [ 9282.171171] nouveau E[ PFB][0000:01:00.0] trapped write at 0x0020a92000 on channel 0x0001fb14 [Xorg[1396]]
May 15 12:51:27 localhost kernel: [ 9282.171173] [<ffffffff8108bccc>] warn_slowpath_fmt+0x5c/0x80
May 15 12:51:27 localhost kernel: [ 9282.171176] [<ffffffff81376581>] __list_del_entry+0xa1/0xd0
May 15 12:51:27 localhost kernel: [ 9282.171178] [<ffffffff813765bd>] list_del+0xd/0x30
May 15 12:51:27 localhost kernel: [ 9282.171181] PGRAPH/
May 15 12:51:27 localhost kernel: [ 9282.171181] [<ffffffffa0145593>] nouveau_fence_signal+0x53/0x80 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.171219] PROP/ [<ffffffffa0145678>] nouveau_fence_update+0x48/0xa0 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.171260] [<ffffffffa0145fb5>] nouveau_fence_sync+0x45/0x80 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.171282] DST2D
May 15 12:51:27 localhost kernel: [ 9282.171282] [<ffffffffa014aed8>] validate_list+0xd8/0x2e0 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.171304] [<ffffffffa014c403>] nouveau_gem_ioctl_pushbuf+0xaa3/0x13e0 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.171305] reason: PAGE_NOT_PRESENT
May 15 12:51:27 localhost kernel: [ 9282.171318] [<ffffffffa002ad02>] drm_ioctl+0x4f2/0x620 [drm]
May 15 12:51:27 localhost kernel: [ 9282.171323] [<ffffffff810bcf44>] ? migrate_enable+0x94/0x1a0
May 15 12:51:27 localhost kernel: [ 9282.171344] [<ffffffffa0142cfe>] nouveau_drm_ioctl+0x4e/0x90 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.171347] [<ffffffff811feb50>] do_vfs_ioctl+0x2e0/0x4c0
May 15 12:51:27 localhost kernel: [ 9282.171349] [<ffffffff812f7da6>] ? file_has_perm+0xa6/0xb0
May 15 12:51:27 localhost kernel: [ 9282.171351] [<ffffffff810bb701>] ? __sched_fork+0x171/0x220
May 15 12:51:27 localhost kernel: [ 9282.171352] [<ffffffff811fedb1>] SyS_ioctl+0x81/0xa0
May 15 12:51:27 localhost kernel: [ 9282.171355] [<ffffffff81712069>] system_call_fastpath+0x16/0x1b
May 15 12:51:27 localhost kernel: [ 9282.171356] ---[ end trace 0000000000000006 ]---
May 15 12:51:27 localhost kernel: [ 9282.172554] nouveau E[ PGRAPH][0000:01:00.0] TRAP_PROP - TP 0 - RT_FAULT - Address 0020a92000
May 15 12:51:27 localhost kernel: [ 9282.172561] nouveau E[ PGRAPH][0000:01:00.0] TRAP_PROP - TP 0 - e0c: 00000000, e18: 00000000, e1c: 00000000, e20: 00002a00, e24: 00030000
May 15 12:51:27 localhost kernel: [ 9282.172566] nouveau E[ PGRAPH][0000:01:00.0] ch 3 [0x001fb14000 Xorg[1396]] subc 7 class 0x8597 mthd 0x15e0 data 0x00000000
May 15 12:51:27 localhost kernel: [ 9282.172575] nouveau E[ PFB][0000:01:00.0] trapped write at 0x0020a92000 on channel 0x0001fb14 [Xorg[1396]] PGRAPH/PROP/RT0 reason: PAGE_NOT_PRESENT
May 15 12:51:27 localhost kernel: [ 9282.172616] nouveau E[ PGRAPH][0000:01:00.0] TRAP_PROP - TP 0 - RT_FAULT - Address 0020a92000
May 15 12:51:27 localhost kernel: [ 9282.172621] nouveau E[ PGRAPH][0000:01:00.0] TRAP_PROP - TP 0 - e0c: 00000000, e18: 00000000, e1c: 00000000, e20: 00002a00, e24: 00030000
May 15 12:51:27 localhost kernel: [ 9282.172625] nouveau E[ PGRAPH][0000:01:00.0] ch 3 [0x001fb14000 Xorg[1396]] subc 7 class 0x8597 mthd 0x15e0 data 0x00000000
May 15 12:51:27 localhost kernel: [ 9282.172635] nouveau E[ PFB][0000:01:00.0] trapped write at 0x0020a92000 on channel 0x0001fb14 [Xorg[1396]] PGRAPH/PROP/RT0 reason: PAGE_NOT_PRESENT
May 15 12:51:27 localhost kernel: [ 9282.172656] nouveau E[ PGRAPH][0000:01:00.0] magic set 0:
May 15 12:51:27 localhost kernel: [ 9282.172659] nouveau E[ PGRAPH][0000:01:00.0] 0x00408604: 0x20095203
May 15 12:51:27 localhost kernel: [ 9282.172662] nouveau E[ PGRAPH][0000:01:00.0] 0x00408608: 0x0020a958
May 15 12:51:27 localhost kernel: [ 9282.172665] nouveau E[ PGRAPH][0000:01:00.0] 0x0040860c: 0x80000432
May 15 12:51:27 localhost kernel: [ 9282.172668] nouveau E[ PGRAPH][0000:01:00.0] 0x00408610: 0xa9200000
May 15 12:51:27 localhost kernel: [ 9282.172671] nouveau E[ PGRAPH][0000:01:00.0] TRAP_TEXTURE - TP0: Unhandled ustatus 0x00000003
May 15 12:51:27 localhost kernel: [ 9282.172674] nouveau E[ PGRAPH][0000:01:00.0] ch 3 [0x001fb14000 Xorg[1396]] subc 2 class 0x502d mthd 0x08dc data 0x00000000
May 15 12:51:27 localhost kernel: [ 9282.172685] nouveau E[ PFB][0000:01:00.0] trapped read at 0x0020a92200 on channel 0x0001fb14 [Xorg[1396]] PGRAPH/TEXTURE/00 reason: PAGE_NOT_PRESENT
May 15 12:51:27 localhost kernel: [ 9282.240139] BUG: unable to handle kernel NULL pointer dereference at 0000000000000009
May 15 12:51:27 localhost kernel: [ 9282.240149] IP: [<ffffffff81376511>] __list_del_entry+0x31/0xd0
May 15 12:51:27 localhost kernel: [ 9282.240152] PGD 227e4e067 PUD 227e4d067 PMD 0
May 15 12:51:27 localhost kernel: [ 9282.240154] Oops: 0000 [#1] PREEMPT SMP
May 15 12:51:27 localhost kernel: [ 9282.240188] Modules linked in: fuse ipt_MASQUERADE xt_CHECKSUM tun ip6t_rpfilter ip6t_REJECT xt_conntrack ebtable_nat ebtable_broute bridge stp llc ebtable_filter ebtables ip6table_nat nf_conntrack_ipv6 nf_defrag_ipv6 nf_nat_ipv6 ip6table_mangle ip6table_security ip6table_raw ip6table_filter ip6_tables iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_nat rfcomm nf_conntrack iptable_mangle bnep iptable_security iptable_raw iTCO_wdt iTCO_vendor_support coretemp kvm_intel kvm crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel microcode uvcvideo serio_raw videobuf2_vmalloc videobuf2_memops snd_hda_codec_hdmi videobuf2_core videodev intel_ips i2c_i801 media btusb bluetooth 6lowpan_iphc arc4 iwldvm snd_hda_codec_conexant snd_hda_codec_generic mac80211 iwlwifi sdhci_pci sdhci cfg80211 mmc_core lpc_ich mfd_core snd_hda_intel snd_hda_codec snd_hwdep snd_seq snd_seq_device snd_pcm snd_timer e1000e mei_me ptp mei thinkpad_acpi pps_core shpchp ppdev snd soundcore rfkill parport_pc parport acpi_cpufreq uinput nouveau i2c_algo_bit drm_kms_helper firewire_ohci mxm_wmi firewire_core ttm crc_itu_t drm i2c_core wmi video
May 15 12:51:27 localhost kernel: [ 9282.240206] CPU: 0 PID: 2049 Comm: cinnamon Tainted: G W 3.14.3-200.rt5.1.fc19.ccrma.x86_64+rt #1
May 15 12:51:27 localhost kernel: [ 9282.240207] Hardware name: LENOVO 4313CTO/4313CTO, BIOS 6MET64WW (1.27 ) 07/15/2010
May 15 12:51:27 localhost kernel: [ 9282.240208] task: ffff880227cfc750 ti: ffff8800ada90000 task.ti: ffff8800ada90000
May 15 12:51:27 localhost kernel: [ 9282.240211] RIP: 0010:[<ffffffff81376511>] [<ffffffff81376511>] __list_del_entry+0x31/0xd0
May 15 12:51:27 localhost kernel: [ 9282.240212] RSP: 0018:ffff8800ada91b38 EFLAGS: 00010246
May 15 12:51:27 localhost kernel: [ 9282.240213] RAX: ffff8800a79fcf50 RBX: ffff8800a79fc3a0 RCX: dead000000200200
May 15 12:51:27 localhost kernel: [ 9282.240214] RDX: 0000000000000001 RSI: 0000000000000286 RDI: ffff8800a79fc3a0
May 15 12:51:27 localhost kernel: [ 9282.240214] RBP: ffff8800ada91b38 R08: ffff8800a79fc3a0 R09: 0000000000000000
May 15 12:51:27 localhost kernel: [ 9282.240215] R10: ffffea00029e2d40 R11: ffffffffa01456c8 R12: ffff8800a79fc820
May 15 12:51:27 localhost kernel: [ 9282.240216] R13: ffff8800a79fcf50 R14: ffff8800a79fcf40 R15: ffff88021b73be48
May 15 12:51:27 localhost kernel: [ 9282.240217] FS: 00007fc079062a40(0000) GS:ffff88023bc00000(0000) knlGS:0000000000000000
May 15 12:51:27 localhost kernel: [ 9282.240218] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 15 12:51:27 localhost kernel: [ 9282.240219] CR2: 0000000000000009 CR3: 000000022834f000 CR4: 00000000000007f0
May 15 12:51:27 localhost kernel: [ 9282.240220] Stack:
May 15 12:51:27 localhost kernel: [ 9282.240222] ffff8800ada91b50 ffffffff813765bd ffff8800a79fc800 ffff8800ada91b80
May 15 12:51:27 localhost kernel: [ 9282.240223] ffffffffa0145593 ffff8800a79fcf40 ffff8800a79fc340 ffff88021b73be00
May 15 12:51:27 localhost kernel: [ 9282.240225] ffff88021742ab40 ffff8800ada91bb8 ffffffffa0145678 ffff8800a79fc340
May 15 12:51:27 localhost kernel: [ 9282.240225] Call Trace:
May 15 12:51:27 localhost kernel: [ 9282.240229] [<ffffffff813765bd>] list_del+0xd/0x30
May 15 12:51:27 localhost kernel: [ 9282.240268] [<ffffffffa0145593>] nouveau_fence_signal+0x53/0x80 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.240295] [<ffffffffa0145678>] nouveau_fence_update+0x48/0xa0 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.240321] [<ffffffffa0145fb5>] nouveau_fence_sync+0x45/0x80 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.240347] [<ffffffffa014aed8>] validate_list+0xd8/0x2e0 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.240374] [<ffffffffa014c403>] nouveau_gem_ioctl_pushbuf+0xaa3/0x13e0 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.240386] [<ffffffffa002ad02>] drm_ioctl+0x4f2/0x620 [drm]
May 15 12:51:27 localhost kernel: [ 9282.240393] [<ffffffff810bcf44>] ? migrate_enable+0x94/0x1a0
May 15 12:51:27 localhost kernel: [ 9282.240418] [<ffffffffa0142cfe>] nouveau_drm_ioctl+0x4e/0x90 [nouveau]
May 15 12:51:27 localhost kernel: [ 9282.240421] [<ffffffff811feb50>] do_vfs_ioctl+0x2e0/0x4c0
May 15 12:51:27 localhost kernel: [ 9282.240424] [<ffffffff812f7da6>] ? file_has_perm+0xa6/0xb0
May 15 12:51:27 localhost kernel: [ 9282.240426] [<ffffffff811fedb1>] SyS_ioctl+0x81/0xa0
May 15 12:51:27 localhost kernel: [ 9282.240429] [<ffffffff81129566>] ? __audit_syscall_exit+0x1f6/0x2a0
May 15 12:51:27 localhost kernel: [ 9282.240432] [<ffffffff81712069>] system_call_fastpath+0x16/0x1b
May 15 12:51:27 localhost kernel: [ 9282.240451] Code: 00 01 10 00 00 00 ad de 48 8b 47 08 48 89 e5 48 39 ca 74 29 48 b9 00 02 20 00 00 00 ad de 48 39 c8 74 7a 4c 8b 00 4c 39 c7 75 53 <4c> 8b 42 08 4c 39 c7 75 2b 48 89 42 08 48 89 10 5d c3 49 89 d0
May 15 12:51:27 localhost kernel: [ 9282.240453] RIP [<ffffffff81376511>] __list_del_entry+0x31/0xd0
May 15 12:51:27 localhost kernel: [ 9282.240453] RSP <ffff8800ada91b38>
May 15 12:51:27 localhost kernel: [ 9282.240454] CR2: 0000000000000009
May 15 12:51:27 localhost kernel: [ 9282.247071] ---[ end trace 0000000000000007 ]---
--- a/drivers/gpu/drm/nouveau/nouveau_fence.c~ 2014-03-30 20:40:15.000000000 -0700
+++ b/drivers/gpu/drm/nouveau/nouveau_fence.c 2014-05-14 13:39:02.115359923 -0700
@@ -178,11 +178,17 @@

{
struct nouveau_channel *chan = fence->channel;
- struct nouveau_fifo *pfifo = nouveau_fifo(chan->drm->device);
- struct nouveau_fence_priv *priv = chan->drm->fence;
+ struct nouveau_fifo *pfifo;
+ struct nouveau_fence_priv *priv;
struct nouveau_eventh *handler;
int ret = 0;

+ if (WARN_ON_ONCE(!chan))
+ return 0;
+
+ pfifo = nouveau_fifo(chan->drm->device);
+ priv = chan->drm->fence;
+
ret = nouveau_event_new(pfifo->uevent, 0,
nouveau_fence_wait_uevent_handler,
priv, &handler);

Attachment: config-3.14.3-200.rt5.1.fc19.ccrma.x86_64+rt.bz2
Description: application/bzip