Re: Re: question about ioremap_cache and PAT

From: Andreas Werner
Date: Fri Aug 23 2013 - 12:59:26 EST




Hi,
Thank you for your answer.

So that makes two of us who need WT, at least for now :-)

I'm currently working on an Ethernet driver for our own ETH core.
One requirement is that DMA must not be used to transmit or receive data.
This means that the Ethernet buffers are not located in main memory; they are located in
the FPGA.

To transmit or receive a frame, I have to read from or write to MMIO to get at the data.

Intel provides the "clflush" instruction, which flushes a cache line.
I wanted to enable caching for this MMIO region (the Ethernet buffers) to speed up transmit and receive.
With caching enabled, the transfers over PCIe use burst reads/writes.

The problem was that if I set the buffer to Write-Back and called clflush on those MMIO addresses, the system crashed without any output.
I found this article: http://software.intel.com/en-us/forums/topic/393070

After that I configured the transmit buffer as Write-Combining (those addresses are only written) using ioremap_wc, and
the receive buffer as Write-Through (ioremap_cache + MTRR Write-Through + my kernel hack :-)), and everything worked fine.
The other configuration registers on the FPGA are simply mapped with ioremap.

With a PCIe tracer I can see the burst reads/writes on my device.

Is it possible to get this into the kernel?

My modification in arch/x86/mm/pat.c:
--- pat.c.orig	2013-02-03 01:18:49.491879407 +0100
+++ pat.c	2013-02-03 01:19:19.053509836 +0100
@@ -149,10 +149,16 @@ static unsigned long pat_x_mtrr_type(u64
 		u8 mtrr_type;

 		mtrr_type = mtrr_type_lookup(start, end);
-		if (mtrr_type != MTRR_TYPE_WRBACK)
+
+		if (mtrr_type == MTRR_TYPE_WRTHROUGH) {
+			return _PAGE_CACHE_WB;
+		}
+		else if( mtrr_type == MTRR_TYPE_WRBACK )
+			return _PAGE_CACHE_WB;
+		else
 			return _PAGE_CACHE_UC_MINUS;

-		return _PAGE_CACHE_WB;
+
 	}

 	return req_type;

Best regards.
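P.S. In case it helps the discussion, the decision that the patch changes can be modelled in plain userspace C. This is only a sketch: the enum names and values below are hypothetical stand-ins, not the kernel's MTRR_TYPE_* or _PAGE_CACHE_* definitions, and nothing here touches real MTRRs or page tables. It just compares what a Write-Back request resolves to before and after my change.

```c
#include <stdio.h>

/* Hypothetical stand-ins for the kernel's MTRR types and PAT cache modes.
 * The numeric values are arbitrary and only meaningful inside this sketch. */
enum mtrr_type { MTRR_UNCACHABLE, MTRR_WRCOMB, MTRR_WRTHROUGH, MTRR_WRPROT, MTRR_WRBACK };
enum cache_mode { CACHE_WB, CACHE_WC, CACHE_UC_MINUS, CACHE_UC };

/* Unpatched logic: a WB request is kept only if the MTRR is WB as well. */
enum cache_mode resolve_orig(enum cache_mode req, enum mtrr_type mtrr)
{
	if (req == CACHE_WB)
		return mtrr == MTRR_WRBACK ? CACHE_WB : CACHE_UC_MINUS;
	return req;
}

/* Patched logic: a WT MTRR keeps the WB request too, so the page is no
 * longer downgraded to UC- and the MTRR's WT setting can take effect. */
enum cache_mode resolve_patched(enum cache_mode req, enum mtrr_type mtrr)
{
	if (req == CACHE_WB) {
		if (mtrr == MTRR_WRTHROUGH || mtrr == MTRR_WRBACK)
			return CACHE_WB;
		return CACHE_UC_MINUS;
	}
	return req;
}

const char *mode_name(enum cache_mode m)
{
	switch (m) {
	case CACHE_WB:       return "WB";
	case CACHE_WC:       return "WC";
	case CACHE_UC_MINUS: return "UC-";
	case CACHE_UC:       return "UC";
	default:             return "?";
	}
}

/* Print, for every MTRR type, what a WB request resolves to in both variants. */
void print_comparison(void)
{
	static const struct { const char *name; enum mtrr_type type; } mtrrs[] = {
		{ "UC", MTRR_UNCACHABLE }, { "WC", MTRR_WRCOMB },
		{ "WT", MTRR_WRTHROUGH },  { "WP", MTRR_WRPROT },
		{ "WB", MTRR_WRBACK },
	};
	size_t i;

	printf("MTRR  orig  patched\n");
	for (i = 0; i < sizeof(mtrrs) / sizeof(mtrrs[0]); i++)
		printf("%-4s  %-4s  %s\n", mtrrs[i].name,
		       mode_name(resolve_orig(CACHE_WB, mtrrs[i].type)),
		       mode_name(resolve_patched(CACHE_WB, mtrrs[i].type)));
}
```

Calling print_comparison() from a small main() shows that the two variants differ only in the WT row. Requests other than WB pass through unchanged in both.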

Sent: Monday, 12 August 2013 at 19:53
From: "Andy Lutomirski" <luto@xxxxxxxxxxxxxx>
To: "Andreas Werner" <wernerandy@xxxxxx>
Cc: linux-kernel@xxxxxxxxxxxxxxx
Subject: Re: question about ioremap_cache and PAT
On 08/11/2013 09:50 AM, Andreas Werner wrote:
> Hi, I have a question about ioremap_cache and the resulting PAT attribute on x86 systems. If I configure the MTRR to Write-Through for an address range and call ioremap_cache to map the MMIO, the resulting PAT attribute is set to UC.
> If I check the Intel document IA-32 SDM vol 3a, the resulting PAT attribute should be WB.
>
> I found the function pat_x_mtrr_type in arch/x86/mm/pat.c, where the resulting attribute is returned. It always returns UC, except when the MTRR is set to WB.
>
> Why is only WB or UC returned? In the Intel document a lot of combinations are "allowed".
>
> I need an attribute of WT, so what I did was modify the pat_x_mtrr_type function to also return WB if the MTRR is set to WT.
>
> Is this a valid solution, or what is the reason why the kernel doesn't support this combination?

The kernel doesn't support it because I'm apparently the only person who
ever wanted it and I haven't implemented it yet.

This stuff is handled in hardware, so modifying the kernel's idea of
what hardware does won't do much. Also, the kernel using MTRRs is on
its (very slow) way out. You could probably hack something up, but I
can almost guarantee that hpa, etc won't accept the patches.

That being said, I'm planning to support WT directly using PAT in the
near future. This will work on most recent cpus (there are errata that
will prevent use of the high PAT entries on some cpus).

What do you need WT for? I want it for NVDIMMs, and all I need to get
started now is a heatsink*, so I'll hopefully start implementing this
stuff in the next week or so.

--Andy

* Damnit, Intel, it's not 2003 any more. You already figured out that
heatsinks want screw holes. But why couldn't you make sure that all
so-called "LGA 2011" sockets have the screw holes in the same place?


>
> Best regards
>