Re: [PATCH v2] ARC: io.h: Implement reads{x}()/writes{x}()

From: Vineet Gupta
Date: Fri Nov 30 2018 - 14:00:09 EST

Next message: Russell King - ARM Linux: "Re: [PATCH v2 2/8] phy: mvebu-cp110-comphy: fix port check in ->xlate()"
Previous message: Russell King - ARM Linux: "Re: remove the ->mapping_error method from dma_map_ops V3"
In reply to: David Laight: "RE: [PATCH v2] ARC: io.h: Implement reads{x}()/writes{x}()"
Next in thread: Vineet Gupta: "Re: [PATCH v2] ARC: io.h: Implement reads{x}()/writes{x}()"

On 11/30/18 5:57 AM, David Laight wrote:
> There're even identical opcodes...
> The barrier() (etc) in the asm output probably stopped the optimisation.
>
> It also seems to have used a different type of loop to the
> other example, probably less efficient.
> (Not that I'm an expert on ARC opcodes.)

The difference comes from the ISA and the corresponding ARC gcc backends. ARCompact
based cores don't support unaligned access, and the loop there was a ZOL
(zero-overhead loop). For ARCv2 based cores, the gcc backend has been tweaked to
generate fewer ZOLs, hence you see the more canonical tst and branch style loop.
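
For reference, the kind of loop being compared can be sketched in plain C as
below. This is only a user-space illustration, not the code from the patch:
fake_fifo_read32() and sketch_readsl() are hypothetical names, and the fake
FIFO stands in for a raw MMIO read. The destination is written with word
stores only when it is 4-byte aligned; otherwise memcpy() is used so that no
unaligned store is required of the core.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Hypothetical stand-in for a 32-bit FIFO register: each read returns the
 * next counter value. Real code would use a raw MMIO accessor instead. */
static uint32_t fifo_next;

static uint32_t fake_fifo_read32(void)
{
	return fifo_next++;
}

/* Read 'count' 32-bit words from the fake FIFO into 'buf'. */
static void sketch_readsl(void *buf, size_t count)
{
	assert(buf != NULL || count == 0);

	if (((uintptr_t)buf & 3) == 0) {
		/* Aligned destination: plain word stores are safe. */
		uint32_t *p = buf;

		while (count--)
			*p++ = fake_fifo_read32();
	} else {
		/* Unaligned destination: copy each word bytewise via memcpy(). */
		unsigned char *dst = buf;

		while (count--) {
			uint32_t v = fake_fifo_read32();

			memcpy(dst, &v, sizeof(v));
			dst += sizeof(v);
		}
	}
}
```

Whether either loop ends up as a ZOL or as a tst and branch sequence is
decided by the compiler backend and its options, not by the C source.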

-Vineet
