Re: GART Error 11

From: Arthur Perry
Date: Wed Jun 02 2004 - 13:45:08 EST


Or actually, I should say, you "most likely" have this as well, since I asked you to gather the information through the more qurky interface.
The bits for this error case match perfectly, so I'd say it's probably a good bet.

Arthur Perry


On Wed, 2 Jun 2004, Arthur Perry wrote:

> Hi Saurabh,
>
> Thanks. It looks like you also have true GART errors as reported by hardware, on CPU0.
> So our common failure mode here is actual GART errors and not something else being reported as a GART error because of erroneous kernel translation.
>
> It's possible that we are using a device driver somewhere that is misbehaving, which is using the GART or IOMMU improperly somehow, or my guess is that is may be the actual AGP device driver used by RedHat.
> ie, they may have not patched in the most recent version that may contain a lot of fixes.
>
> Thanks for your feedback.
>
> As of making your messages go away, I would tell you to disable the GartTableWalk in MCE, but that does not seem to work on my machine.
> I'll let you know what does work without turning off Northbridge MC* entirely once I discover it.
>
> -Arthur Perry
>
>
>
> On Wed, 2 Jun 2004, Saurabh Barve wrote:
>
> > Sorry about the delay in my reply. Just got in to work!
> > Here is the output:
> >
> > > pcitweak -r 0:18:3 0x48
> >
> > 0x0005001B
> >
> > > and
> > > pcitweak -r 0:19:3 0x48
> >
> > 0x00000000
> >
> > > While you are at it, can you send us status high as well?
> > >
> > > pcitweak -r 0:18:3 0x4c
> >
> > 0xA4000000
> >
> > > and
> > > pcitweak -r 0:19:3 0x4c
> >
> > 0x00000000
> >
> > I don't know if this would help, but below is a part of my cronwatch log:
> >
> > --------------------- Init Begin ------------------------
> >
> > **Unmatched Entries**
> > Trying to re-exec init
> > Trying to re-exec init
> >
> > ---------------------- Init End -------------------------
> >
> >
> > --------------------- Kernel Begin ------------------------
> >
> >
> > WARNING: Kernel Errors Present
> > uteval-0098: *** Error: Method executio...: 4Time(s)
> > psparse-1121: *** Error: Method executio...: 8Time(s)
> > Error uncorrected...: 538Time(s)
> > GART error 11...: 538Time(s)
> > Lost an northbridge error...: 538Time(s)
> > NB error address 00000000...: 538Time(s)
> >
> > ---------------------- Kernel End -------------------------
> >
> >
> > --------------------- ModProbe Begin ------------------------
> >
> >
> > Can't locate these modules:
> > char-major-10-134: 4 Time(s)
> > sound-service-0-3: 6 Time(s)
> > xp0: 3 Time(s)
> > sound-slot-0: 6 Time(s)
> > char-major-188: 15 Time(s)
> >
> > ---------------------- ModProbe End -------------------------
> >
> >
> > Thanks,
> > Saurabh.
> >
> > --
> > ===============================================================================
> > Saurabh Barve Phone:
> > System Administrator/Data Specialist 970-491-7714 (voice)
> > Montgomery Research Group, 970-491-8449 (Fax)
> > Atmospheric Sciences Department,
> > Fort Collins, Colorado
> > Colorado State University
> >
> > Mail : sa@xxxxxxxxxxxxxxxxxxx
> > Web : http://fjortoft.atmos.colostate.edu/~sa
> >
> > -
> > To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
> > the body of a message to majordomo@xxxxxxxxxxxxxxx
> > More majordomo info at http://vger.kernel.org/majordomo-info.html
> > Please read the FAQ at http://www.tux.org/lkml/
> >
>
>
> --
> amd64-list mailing list
> amd64-list@xxxxxxxxxx
> https://www.redhat.com/mailman/listinfo/amd64-list
>
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/