Re: [PATCH] update checkpatch.pl to version 0.03

From: Adrian Bunk
Date: Thu Jun 07 2007 - 19:49:18 EST


On Fri, Jun 08, 2007 at 01:21:52AM +0200, Adrian Bunk wrote:
>...
> I added a MODULE_AUTHOR("J. Ørsted <jorsted@xxxxxxxxxx>") into the "raw"
> module:
>
> # echo $LANG
> C
> # modinfo --version
> module-init-tools version 3.3-pre11
> # modinfo raw
> filename: /lib/modules/2.6.21.2/kernel/drivers/char/raw.ko
> author: J. Ã
> ^ the cursor hangs here
>...

If anyone's wondering what's happening:

The UTF-8 representation of the character Ø consists of the two bytes
0xC3 0x98.

In the ISO/IEC 8859 encodings, where every character is represented by
one byte, these two bytes correspond to two characters:

In ISO/IEC 8859-1 the byte 0xC3 represents the character à resulting in
the (harmless) display of this wrong character.

But in all the ISO/IEC 8859 encodings, the byte 0x98 is the
_control code_ "Start of String".
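
For anyone who wants to reproduce the byte-level mix-up, here is a
minimal Python sketch. It assumes the character in question is U+00D8
(Ø), and it uses Python's "latin-1" codec as a stand-in for an
ISO/IEC 8859-1 terminal; the variable names are illustrative only.

```python
import unicodedata

# U+00D8 is LATIN CAPITAL LETTER O WITH STROKE (assumed to be the
# character from the author name discussed above).
utf8_bytes = "\u00d8".encode("utf-8")

# Read the same two bytes as ISO/IEC 8859-1: one character per byte.
misread = utf8_bytes.decode("latin-1")

# First byte: a printable but wrong letter.
first_name = unicodedata.name(misread[0])

# Second byte: general category "Cc" means a control character.
second_category = unicodedata.category(misread[1])

print(utf8_bytes.hex())                 # the two bytes in hex
print(first_name)                       # name of the wrong letter
print(hex(ord(misread[1])), second_category)
```

The second character is deliberately never printed raw, since sending
it to a non-UTF-8 terminal is exactly what triggers the hang.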

Therefore, if we want to start using UTF-8 anywhere in the kernel, we
must ensure that all applications correctly convert all characters
when running in a non-UTF-8 environment.
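
As a rough illustration of what such a conversion could look like in
userspace, here is a Python sketch. The helper to_terminal_bytes() is
hypothetical and is not how modinfo or any existing tool works; it
only shows the idea of decoding UTF-8 and re-encoding for the
terminal's charset, replacing anything that cannot be represented.

```python
def to_terminal_bytes(raw: bytes, target_encoding: str) -> bytes:
    """Hypothetical helper: re-encode UTF-8 input for a given terminal charset."""
    # Decode as UTF-8; malformed sequences become U+FFFD instead of failing.
    text = raw.decode("utf-8", errors="replace")
    # Encode for the target charset; unrepresentable characters become "?".
    return text.encode(target_encoding, errors="replace")


# The author string as it would be stored in UTF-8 (name assumed from above).
author = "J. \u00d8rsted".encode("utf-8")

as_latin1 = to_terminal_bytes(author, "iso-8859-1")  # single byte 0xD8 for the letter
as_ascii = to_terminal_bytes(author, "ascii")        # letter replaced by "?"

print(as_latin1)
print(as_ascii)
```

Either way no stray 0x98 byte reaches the terminal, but every program
that prints such strings would need an equivalent step.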

I'm not sure that's worth the hassle.

cu
Adrian

--

"Is there not promise of rain?" Ling Tan asked suddenly out
of the darkness. There had been need of rain for many days.
"Only a promise," Lao Er said.
Pearl S. Buck - Dragon Seed
