Re: A Great Idea (tm) about reimplementing NLS.

From: Bernd Petrovitsch
Date: Fri Aug 26 2005 - 03:35:54 EST


On Thu, 2005-08-25 at 20:00 -0400, Daniel B. wrote:
> Alan Cox wrote:
> > On Sul, 2005-06-19 at 18:55, Pavel Machek wrote:
[...]
> > > If we are serious about utf-8 support in ext3, we should return
> > > -EINVAL if someone passes non-canonical utf-8 string.
> >
> > That would ironically not be standards compliant
>
> Which standards?

Probably POSIX, SuSv3 and similiar.

> The standards I've read (mostly XML- and web-related specs)
> do say that non-standard UTF-8 octet sequences should be rejected.

There you have basically text files with some structure in it and the
definiton/requirement that is is UTF-8.
At kernel level these are also just byte streams and the kernel doesn't
know or care which charset, encoding, file format, font etc. the data is
used with or interpreted several layers higher, e.g. for presenting to
the user. And the same holds for filenames.

Bernd
--
Firmix Software GmbH http://www.firmix.at/
mobil: +43 664 4416156 fax: +43 1 7890849-55
Embedded Linux Development and Services

-
To unsubscribe from this list: send the line "unsubscribe linux-kernel" in
the body of a message to majordomo@xxxxxxxxxxxxxxx
More majordomo info at http://vger.kernel.org/majordomo-info.html
Please read the FAQ at http://www.tux.org/lkml/