Re: [PATCH] drm/i915: Record error batch buffers using iomem

From: Jaswinder Singh Rajput
Date: Tue May 11 2010 - 17:46:00 EST


Hello Chris,

On Wed, May 12, 2010 at 1:35 AM, Jaswinder Singh Rajput
<jaswinderlinux@xxxxxxxxx> wrote:
> Hello Chris,
>
> On Wed, May 12, 2010 at 1:23 AM, Chris Wilson <chris@xxxxxxxxxxxxxxxxxx> wrote:
>> On Wed, 12 May 2010 01:08:23 +0530, Jaswinder Singh Rajput <jaswinderlinux@xxxxxxxxx> wrote:
>>> Hello Chris and Andrew,
>>>
>>> I did further testing and noticed that this patch fixes the boot
>>> errors and warnings and I get the XWindows.
>>>
>>> But XWindows freezes after some time.
>>
>> The BUG you were hitting before is on the error collection path which
>> presumably is still being triggered during boot by a GPU error.
>
> No, I am not getting any bug with your patch.
>
> dmesg with your patch :
> http://userweb.kernel.org/~jaswinder/acer_netbook/dmesg_2634-rc7-chris.txt
>

I did more testing. And test pass 80% of time. I get the bugs with cold boot :

[ 40.090295] [drm:i915_hangcheck_elapsed] *ERROR* Hangcheck timer
elapsed... GPU hung
[ 40.090318] ------------[ cut here ]------------
[ 40.090338] WARNING: at mm/highmem.c:453 debug_kmap_atomic+0xa9/0x11e()
[ 40.090345] Hardware name: Aspire one
[ 40.090351] Modules linked in: nf_conntrack_ftp ath9k ath9k_common
battery ath9k_hw [last unloaded: scsi_wait_scan]
[ 40.090378] Pid: 0, comm: swapper Not tainted 2.6.34-rc7-netbook #8
[ 40.090385] Call Trace:
[ 40.090402] [<c1030ecb>] warn_slowpath_common+0x65/0x7c
[ 40.090415] [<c108ce5d>] ? debug_kmap_atomic+0xa9/0x11e
[ 40.090428] [<c1030eef>] warn_slowpath_null+0xd/0x10
[ 40.090440] [<c108ce5d>] debug_kmap_atomic+0xa9/0x11e
[ 40.090454] [<c1020611>] kmap_atomic_prot_pfn+0x1d/0x5e
[ 40.090465] [<c1020675>] iomap_atomic_prot_pfn+0x23/0x26
[ 40.090479] [<c11f7d8a>] i915_error_object_create+0x110/0x17c
[ 40.090492] [<c11f8298>] i915_handle_error+0x4a2/0x9ba
[ 40.090506] [<c11f884f>] i915_hangcheck_elapsed+0x9f/0xdf
[ 40.090518] [<c103ab6e>] run_timer_softirq+0x1c9/0x269
[ 40.090531] [<c11f87b0>] ? i915_hangcheck_elapsed+0x0/0xdf
[ 40.090543] [<c1035b7b>] __do_softirq+0xc6/0x186
[ 40.090553] [<c1035c61>] do_softirq+0x26/0x2b
[ 40.090564] [<c1035dd2>] irq_exit+0x29/0x66
[ 40.090576] [<c101681f>] smp_apic_timer_interrupt+0x6e/0x7c
[ 40.090591] [<c141f996>] apic_timer_interrupt+0x2a/0x30
[ 40.090605] [<c104007b>] ? ftrace_raw_event_signal_generate+0x6d/0xd4
[ 40.090618] [<c11bed9d>] ? acpi_idle_enter_simple+0x13b/0x168
[ 40.090633] [<c12dd435>] cpuidle_idle_call+0x6b/0xda
[ 40.090645] [<c1001a3c>] cpu_idle+0x44/0x74
[ 40.090657] [<c141a1b1>] start_secondary+0x1b2/0x1b7
[ 40.090666] ---[ end trace 5e47c395a6f397dc ]---
[ 40.090862] ------------[ cut here ]------------

dmesg with this patch with cold boot :
http://userweb.kernel.org/~jaswinder/acer_netbook/dmesg_2634-rc7-chris-cold.txt

Thanks,
--
Jaswinder Singh.
--
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/