Re: [Patch] Support UTF-8 scripts

From: H. Peter Anvin
Date: Fri Sep 16 2005 - 14:25:49 EST


Bodo Eggert wrote:

It should, but as long as old programs are still around, we'll have both and need a marker to distinguish them. Otherwise we'll be stuck with
legacy scripts for a long time.


You don't have markers (although they're defined, see ISO 2022) for your 8-bit encodings, and *THEY'RE THE ONES THAT NEED TO BE DISTINGUISHED.* Flagging UTF-8, especially with the BOM (as opposed to the ISO 2022 signature, <ESC>%G) is pointless in the context, since you still can't distinguish your arbitrary number of legacy encodings.

Oh, yes, and try to stick ISO 2022 signatures in scripts or whatnot, and you can see what current software does with a signature standard that dates back to the 1970's.

-hpa
-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/