[OFFTOPIC] Re: unicode (char as abstract data type)

Pavel Machek (pavel@elf.ucw.cz)
Tue, 21 Apr 1998 23:12:24 +0200


Hi!

> Helvetica is a font. Russian is a language. koi8 is a charset. Fonts are
> irrelevant, but languages must be preserved for any sane
> internationalization. And since labeling of languages automatically
> provides a way to label charsets, there is no need in messy Unicode.

I sometimes use two languages in same document.

Pozdrav vsem fanouskum z ceske republiky! :-)

Really, you can not determine charset from language - because language
of *my* emails is sometimes something between czech and english. And
now imagine, me wanting to write russian word sabaka (or how is dog
written). I of course want to write it in azbuka. And I do not want to
tell my text editor origin of each word I use.

So it is hard to impossible to gather info about language. User just
will not want to tell you. User may even want to write Geek with 'G'
in azbuka. Why not? I believe that language labeling cannot handle
that.

Pavel

-- 
I'm really pavel@atrey.karlin.mff.cuni.cz. 	   Pavel
Look at http://atrey.karlin.mff.cuni.cz/~pavel/ ;-).

- To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to majordomo@vger.rutgers.edu