- Path: sparky!uunet!mcsun!sun4nl!cwi.nl!dik
- From: dik@cwi.nl (Dik T. Winter)
- Newsgroups: comp.std.internat
- Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST)
- Keywords: Han Kanji Katakana Hiragana ISO10646 Unicode Codepages
- Message-ID: <8494@charon.cwi.nl>
- Date: 1 Jan 93 11:09:47 GMT
- References: <1hvu79INN4qf@rodan.UU.NET> <8492@charon.cwi.nl> <1i0vnmINN352@rodan.UU.NET>
- Sender: news@cwi.nl
- Organization: CWI, Amsterdam
- Lines: 23
-
- In article <1i0vnmINN352@rodan.UU.NET> avg@rodan.UU.NET (Vadim Antonov) writes:
- > In article <8492@charon.cwi.nl> dik@cwi.nl (Dik T. Winter) writes:
- > > > My point was that there obviously are identifiable meta-alphabets
- > > > covering several languages.
- > >I do think that the number in several is very small.
- >
- > With a trivial trick of including several codes for identical glyphs
- > for letters from different languages you can put all of them in ONE
- > meta-alphabet.
- >
- As somebody else mentioned already: should a spelling checker allow a
- German A-umlaut in a Swedish word? If so, what are the consequences?
- If not, why not?
-
- Moreover, one question: how would you encode the German A-umlaut such that
- it sorts properly (i.e. as if it were the letter combination AE)? Even in
- a single language (German) you cannot come up with a coding that gives
- proper sorting. And how would you code the German eszett such that, when
- uppercased, it becomes the *two* letters 'SS' (and how do you lowercase
- that again)? Things are not as simple as you appear to think.
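
To make the two problems concrete, here is a minimal Python sketch. It is only an illustration under stated assumptions: the helper `phonebook_key` and its expansion table are hypothetical names invented for this example, and the sketch handles just the "umlaut sorts as vowel + E" convention described above, not a complete German collation. It shows that sorting by code value puts the A-umlaut in the wrong place, that a separate sort key is needed to fix this, and that uppercasing the eszett cannot be undone by lowercasing.

```python
import unicodedata

# Hypothetical expansion table: treat each uppercase umlaut as vowel + E.
EXPANSIONS = {"Ä": "AE", "Ö": "OE", "Ü": "UE"}


def phonebook_key(word):
    """Illustrative sort key: umlauts collate as AE/OE/UE, eszett as SS."""
    # Normalize so that a decomposed "a + combining diaeresis" matches too.
    composed = unicodedata.normalize("NFC", word)
    # str.upper() applies full case mapping, so "ß" becomes "SS" here.
    upper = composed.upper()
    return "".join(EXPANSIONS.get(ch, ch) for ch in upper)


words = ["Bäcker", "Bad", "Bagger", "Baum"]

# Sorting by raw code value pushes the umlaut past every ASCII letter.
by_code_value = sorted(words)

# Sorting by the expanded key places "Bäcker" as if it were "Baecker".
by_phonebook = sorted(words, key=phonebook_key)

# Case mapping is not reversible: one letter becomes two, and the
# information needed to go back is lost.
upper_street = "straße".upper()      # "STRASSE"
lower_again = upper_street.lower()   # "strasse", not "straße"

# Two distinct words collapse to the same uppercase form.
collide = "Maße".upper() == "Masse".upper()

if __name__ == "__main__":
    print(by_code_value)
    print(by_phonebook)
    print(upper_street, lower_again, collide)
```

Note that the ordering comes from a key computed separately from the stored characters, not from the code values themselves, which is the point being made: no assignment of a single code to the A-umlaut yields this order, and no single-code case table can turn 'SS' back into an eszett.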
- --
- dik t. winter, cwi, kruislaan 413, 1098 sj amsterdam, nederland
- home: bovenover 215, 1025 jn amsterdam, nederland; e-mail: dik@cwi.nl
-