NetNews Usenet Archive 1993 #3

home *** CD-ROM | disk | FTP | other *** search

/ NetNews Usenet Archive 1993 #3 / NN_1993_3.iso / spool / comp / std / internat / 1267 < prev next >

Wrap

Internet Message Format | 1993-01-21 | 2.4 KB

Path: sparky!uunet!cs.utexas.edu!sun-barr!sh.wide!wnoc-tyo-news!cs.titech!titccy.cc.titech!necom830!mohta From: mohta@necom830.cc.titech.ac.jp (Masataka Ohta) Newsgroups: comp.std.internat Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST) Keywords: Han Kanji Katakana Hirugana ISO10646 Unicode Codepages Message-ID: <2770@titccy.cc.titech.ac.jp> Date: 21 Jan 93 17:20:13 GMT References: <1993Jan9.031217.27425@fcom.cc.utah.edu> <1in56mINNnhq@life.ai.mit.edu> <2676@titccy.cc.titech.ac.jp> <1ippgmINN7af@life.ai.mit.edu> Sender: news@titccy.cc.titech.ac.jp Organization: Tokyo Institute of Technology Lines: 55 In article <1ippgmINN7af@life.ai.mit.edu> glenn@wheat-chex.ai.mit.edu (Glenn A. Adams) writes: >>Different Characters with different stroke counts are mapped to a >>single code point in 10646/Unicode. > >For those in the listening audience who don't read Hanzi or Kanji, I should >say that Ohta-san is correct, but in a completely frivolous way. Stroke >counts are a bit subjective. Some people count some elements as distinct >strokes, whereas others do not. Wrong. That's not what I'm referring. >However, there is no systematic difference >in the way that Japanese count strokes versus the way that Chinese count >strokes versus the way that Koreans count strokes. The ways that CJK >standards bodies count strokes are extremely uniform; True, but it is misleading. Consider, for example, a very common radical used for plant-related characters, such as grass and flowers (character code from 827C to 864F of DIS 10646-1.2). Its shape in Japanese or mainland China is * * **************** * * whose stroke count is 3, while its shape in Taiwan and Korea is * * ******* ******* * * whose stroke count is 4. But, these different characters with differently shaped radicals are unified in Unicode. >therefore, the >unification of CJK characters into single Han code elements does not >remove discriminating information; Therefore, the stroke count information, which is "THE EXTREMELY UNIFORM" property of characters, are destroyed by Han unificaiton. This is the reality. >This is a bit like saying "Japanese (or English or another language) is >context dependent. Don't use it." Come on Ohta-san, return to reality. >Try making a rational argument. I have come again. I'll be happy if you stop dreaming and shut up. Masataka Ohta