home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.unix.bsd
- Path: sparky!uunet!spool.mu.edu!agate!dog.ee.lbl.gov!hellgate.utah.edu!fcom.cc.utah.edu!cs.weber.edu!terry
- From: terry@cs.weber.edu (A Wizard of Earth C)
- Subject: Re: INTERNATIONALIZATION: JAPAN, FAR EAST
- Message-ID: <1992Dec28.064029.24421@fcom.cc.utah.edu>
- Sender: news@fcom.cc.utah.edu
- Organization: University of Utah Computer Center
- References: <1992Dec18.235809.15484@midway.uchicago.edu> <agp22+#@rpi.edu> <1gvpt0INN8s0@hrd769.brooks.af.mil> <BzIwvu.3BE@demon.co.uk> <CARLTON.92Dec21163548@scws8.harvard.edu> <2565@titccy.cc.titech.ac.jp>
- Date: Mon, 28 Dec 92 06:40:29 GMT
- Lines: 56
-
- In article <2565@titccy.cc.titech.ac.jp>, mohta@necom830.cc.titech.ac.jp (Masataka Ohta) writes:
- |> In article <CARLTON.92Dec21163548@scws8.harvard.edu>
- |> carlton@scws8.harvard.edu (david carlton) writes:
- |>
- |> >I certainly wouldn't call it an insignificantly larger market. Huge
- |> >numbers of people use Chinese, Japanese, and other languages whose
- |> >scripts don't fit in 8 bits, after all.
- |>
- |> True. But, it should be noted that they don't fit even in 16 bits.
-
- Work is already under way to adapt Unicode to 32 bits. I would be interested
- in any similar work you know of in progress for XPG4/JIS. I am *not*
- interested in proposing or attempting to provide yet another standard, if
- that is what you believe is necessary.
-
- |> Even character sets used in a single language can not be represented
- |> with 16 bits.
- |>
- |> While not all characters are used in modern Japanese, Taiwan already have
- |> a character encoding standard which contains more than 65536 Han
- |> characters.
-
- 0-65535 is 65536 characters, and is capable of being contained in an unsigned
- short (16 bits). I agree that this would leave no room for other characters
- from other languages, but I doubt that all of these characters are in modern
- use in a single dialect of Chinese. In any case, the 32-bit Unicode should
- have no problem handling the symbols. I am more concerned with ligatures,
- and I suspect there isn't a mechanism for doing this (short of an Arabic
- or other ligature-bound language terminal). Certainly our proposed mechanism
- of X can not handle this within the bounds of its current font mechanisms,
- nor can, I believe, Display Postscript, as on the NeXT. Postscript is not
- an alternative, given the nature of the 386BSD project and the licensing
- restrictions glued to Postscript.
-
- |> >Whether it is worth the
- |> >effort to you is another matter, though;
- |>
- |> Sure. But if you do something, do it throughly, so that you don't have
- |> to do it twice.
-
- Again, I want to stress that we are about the identification and adoption
- of an existing standard rather than the specification and ratification of
- a new one. We may invoke "tricks" to reduce storage requirements or to
- retrofit existing input mechanisms, but we are not attempting a new standard.
-
-
- Terry Lambert
- terry@icarus.weber.edu
- terry_lambert@novell.com
- ---
- Any opinions in this posting are my own and not those of my present
- or previous employers.
- -------------------------------------------------------------------------------
- "I have an 8 user poetic license" - me
- Get the 386bsd FAQ from agate.berkeley.edu:/pub/386BSD/386bsd-0.1/unofficial
- -------------------------------------------------------------------------------
-