Re: Possible dcache BUG

From: Rafael J. Wysocki
Date: Sun Aug 29 2004 - 10:13:20 EST


On Sunday 29 of August 2004 15:48, Gene Heskett wrote:
> On Wednesday 25 August 2004 02:13, Denis Vlasenko wrote:
> >On Wednesday 25 August 2004 04:49, Tom Vier wrote:
> >> On Mon, Aug 23, 2004 at 11:08:41PM -0400, Gene Heskett wrote:
> >> > >are you translating virt->phys?
> >> >
> >> > No, this is straight out of the memburn output (after I'd fixed
> >> > the
> >>
> >> that's weird that you're finding that pattern in virtual
> >> addresses. i wouldn't expect that. even if you're booting to
> >> single user, certain variables might change during boot and cause
> >> different physical pages to be mapped. maybe single user is more
> >> deterministic than i think, though.
> >
> >On x86, pages are aligned at 4k. Lower 12 bits of virtual address
> >match lower 12 bits of corresponding real address.
> >
> >So, yes, if you hit bad RAM cell, you see random virtual address,
> > but three last digits of it (in hex) must be the same.
>
> I think, based on the last 25 hours of running both memburn and
> setiathome at a -nice 19, and there have been no errors, that I might
> have stumbled onto a fix.
>
> It seems the dram is marked DDR400, so I was trying to run it that
> way. Unforch, on checking the invoice for the umpteenth time, it
> finally dawned on me that this particular AMD 2800XP is supposedly a
> 333mhz FSB chip, and not rated for use with DDR400 memory. Switching
> the bios setting for the memory to 'auto' from 'spd' seems to effect
> this particular item, and the memory now signs in as DDR333 Dual
> Channel.
>
> And after 25 hours, no errors, nothing unusual in the logs.
>
> I guess I should go paint my face with egg or something...

Not necessarily. :-) Some mobos based on the nforce2 chipsets should be able
to clock FSB and memory asynchronously. The very fact that you can set the
memory clock separately in the BIOS indicates that your mobo is one of these.
So, if it runs well at synchronous FSB and memory clock rates, but causes
problems otherwise, the northbridge is probably fishy. Or the memory is not
up to the spec. Anyway, the symptoms are quite "interesting" and it's good
to know what they are.

Regards,
RJW

--
For a successful technology, reality must take precedence over public
relations, for nature cannot be fooled.
-- Richard P. Feynman
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/