On Thu, 3 Mar 2022, Jiri Slaby wrote:
> The real problem is that using char (or short) for a function parameter
> or result is very likely to require the compile add code to mask
> the value to 8 (or 16) bits.
>
> Remember that almost every time you do anything with a signed or unsigned
> char/short variable the compiler has to use the integer promotion rules
> to convert the value to int.
>
> You'll almost certainly get better code if the value is left in an
> int (or unsigned int) variable until the low 8 bits get written to
> a buffer (or hardware register).
So should we use int/uint instead of more appropriate shorter types everywhere
now? The answer is: definitely not. The assembly on x86 looks good (it uses
movz, no ands), RISC architectures have to do what they chose to.
We do have an issue, because we still have this:
void uart_console_write(struct uart_port *port, const char *s,
unsigned int count,
void (*putchar)(struct uart_port *, int))
and then:
putchar(port, *s);
there. Consequently on targets where plain `char' type is signed the
value retrieved from `*s' has to be truncated in the call to `putchar'.
And indeed it happens with the MIPS target:
803ae47c: 82050000 lb a1,0(s0)
803ae480: 26100001 addiu s0,s0,1
803ae484: 02402025 move a0,s2
803ae488: 0220f809 jalr s1
803ae48c: 30a500ff andi a1,a1,0xff
vs current code:
803ae47c: 82050000 lb a1,0(s0)
803ae480: 26100001 addiu s0,s0,1
803ae484: 0220f809 jalr s1
803ae488: 02402025 move a0,s2
(NB the last instruction shown after the call instruction, JALR, is in the
delay slot that is executed before the PC gets updated). Now arguably the
compiler might notice that and use an unsigned LBU load instruction rather
than the signed LB load instruction, which would make the ANDI instruction
redundant, but still I think we ought to avoid gratuitous type signedness
changes.
So I'd recommend changing `s' here to `const unsigned char *' or, as I
previously suggested, maybe to `const u8 *' even.