NetNews Usenet Archive 1992 #31

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1992 #31 / NN_1992_31.iso / spool / comp / std / internat / 966 < prev next >

Wrap

Internet Message Format | 1993-01-01 | 1.6 KB

Path: sparky!uunet!mcsun!sun4nl!cwi.nl!dik From: dik@cwi.nl (Dik T. Winter) Newsgroups: comp.std.internat Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST) Keywords: Han Kanji Katakana Hirugana ISO10646 Unicode Codepages Message-ID: <8494@charon.cwi.nl> Date: 1 Jan 93 11:09:47 GMT References: <1hvu79INN4qf@rodan.UU.NET> <8492@charon.cwi.nl> <1i0vnmINN352@rodan.UU.NET> Sender: news@cwi.nl Organization: CWI, Amsterdam Lines: 23 In article <1i0vnmINN352@rodan.UU.NET> avg@rodan.UU.NET (Vadim Antonov) writes: > In article <8492@charon.cwi.nl> dik@cwi.nl (Dik T. Winter) writes: > > > My point was that there obviously are identifyable meta-alpahbets > > > covering several languages. > >I do think that the number in several is very small. > > With a trivial trick of including several codes for identical glyphs > for letters from different languages you can put all of them in ONE > meta-alphabet. > As somebody else mentioned already: should a spelling-checker allow a German A-umlaut in a Swedish word? If so, what are the consequences, if not, why not? Moreover, one question: how would you encode the German A-umlaut such that it sorts properly (i.e. as if it is the letter combination AE)? Even in a single language (German) you can not come up with a coding that gives proper sorting. And also how to code the German eszet such that when uppercased it becomes the *two* letters 'SS' (and how do you lowercase that again? Things are not as simple as you appear to think. -- dik t. winter, cwi, kruislaan 413, 1098 sj amsterdam, nederland home: bovenover 215, 1025 jn amsterdam, nederland; e-mail: dik@cwi.nl