home *** CD-ROM | disk | FTP | other *** search
- Newsgroups: comp.std.internat
- Path: sparky!uunet!psinntp!ficc!peter
- From: peter@ferranti.com (peter da silva)
- Subject: Re: Dumb Americans (was INTERNATIONALIZATION: JAPAN, FAR EAST)
- Message-ID: <id.68CW.A16@ferranti.com>
- Keywords: ISO10646 Unicode
- Organization: Xenix Support, FICC
- References: <1hvu79INN4qf@rodan.UU.NET> <1i0oj2INNp4v@life.ai.mit.edu> <1i13rrINNars@rodan.UU.NET>
- Date: Fri, 1 Jan 1993 23:19:06 GMT
- Lines: 41
-
- In article <1i13rrINNars@rodan.UU.NET> avg@rodan.UU.NET (Vadim Antonov) writes:
- > We were talking about lexicographical sorting, not abouth phonetics.
-
- But lexicographic sorting (actually, lexicograhic ordering) is a minor part of
- this. Most sorting computers do is algorithmic ordering, to optimise some
- combination of operations on data structures (searching, for example). The
- character set is irrelevant there.
-
- > Then you KNOW that it is compressed graphical format -- which is
- > essentially useless in anything except for storing and then reproduction
- > of the text.
-
- Yes.
-
- > What makes encoded text useful is that its encoding extracts
- > some SEMANTIC allowing for mechanical processing (particularly sorting).
-
- OK, I want a character set that differentiates a word (if) between a C language
- keyword (if(...)), command line options (dd if=...), and English text (if you
- pass this way again...).
-
- I want a character set that differentiates between parts of speech.
-
- I want a character set that differentiates between running text, "quoted
- running text", EMPHASISED RUNNING TEXT, references(1), Proper Nouns, and
- <courier>computer text</courier>.
-
- You don't want a character set. You want an SGML DTD.
-
- > The semantic in ASCII is hard-coded -- it is the order of letters
- > and the trivial upper-case to lower-case convertion.
-
- <para><sentence><phrase>The semantic in <acronym>ASCII</> is <jargon>
- hard-coded</></><dash><phrase>it is the <phrase>order of letters</>
- and <phrase>the trivial <jargon>upper-case</> to <jargon>lower-case</>
- conversion</></></sentence></para>
- --
- Peter da Silva `-_-'
- Ferranti International Controls Corporation 'U`
- Sugar Land, TX 77487-5012 USA
- +1 713 274 5180 "Zure otsoa besarkatu al duzu gaur?"
-