OT: character encodings (was: Linux 2.6.20-rc4)

From: Tilman Schmidt
Date: Sun Jan 07 2007 - 08:07:20 EST


Russell King schrieb:
[Leonard NorrgÃÂrd (1):]
> That is an à if you look at the raw message in UTF-8. However, Linus
> sends mail in with a charset of ISO-8859-1, and if you place UTF-8
> encoded text in such a message body, you will see AÂ.

Only if the mechanism used for placing it there ignores the different
encodings.

> Welcome to the mess which the UTF-8 charset creates.

The problem of different character encodings coexisting on the same
platform, and the resulting occasional messing-up, far predates Unicode.
I distinctly remember one case of being bitten by this myself in 1977
when Unicode wasn't even on the horizon yet, and I don't think that was
the first time.

Tilman

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/