xml spy
Previous  Top  Next
Windows 95/98/ME

These Windows versions do not include full Unicode capabilities on the operating system layer. Instead support for non-roman writing-systems is provided through so-called code-pages that contain all the required characters mapped to either the available byte-values in the range of 0x80 to 0xFF (in case of single-byte systems, such as Cyrillic or Hebrew) or as double-byte values, where the first most significant bit of the first byte is typically used to indicate that this is a so-called "wide character" (in case of CJK writing-systems).

We therefore provider a special Windows 95/98/ME version of XML Spy that is automatically installed whenever you are using the Setup program on these operating systems. This version of XML Spy supports the following code-pages for viewing and editing XML documents (for excellent background information about code-pages please refer to http://czyborra.com/charsets/codepages.html):

Code-page   Equivalent XML Encoding  
1252   ISO-8859-1 (Western, Latin-1)  
1250   ISO-8859-2 (Eastern Europe, Latin-2)  
1251   ISO-8859-5 (Cyrillic)  
1253   ISO-8859-7 (Greek)  
1254   ISO-8859-9 (Turkish)  
1255   ISO-8859-8 (Hebrew)  
1256   ISO-8859-6 (Arabic)  
874   ISO-8859-11/TIS-620 (Thai)  
932   Shift-JIS (Japanese)  
936   GB2312 (Chinese)  
949   EUC-KR (Korean)  
950   Big5 (Taiwanese)  

Whenever you open an XML file, XML Spy detects the character-set encoding used in that file, expands the file to an internal full Unicode representation and then transforms the document to a code-page supported by Windows 95/98 in order to enable viewing and editing of the document.

In most cases this process will be entirely automatic, as the available ISO-8895-x encodings as well as some of the CJK encodings often correspond with a certain code-page. However, if you open a Unicode encoded file (e.g. UTF-8 or UTF-16) XML Spy will be unable to determine which code-page to use and will thus bring up a dialog box that asks you to specify a code-page to be used for editing.

In order to correctly view and edit a Unicode file under Windows 95/98/ME it is extremely important that you use this dialog box to:

a)   choose the correct code-page that includes all characters contained in the file and  
b)   later select a font and script from the settings dialog box that also supports the same code-page.  

If the file contains any characters that are not available in the selected code-page, the user will receive an error message including a detailed list of offending characters before they will be replaced by a '_' (underscore).

encoding_error


We therefore highly recommend using only the Windows NT/2000 version of XML Spy for editing XML files that make full use of Unicode!


Previous  Top  Next

⌐ 2002 Altova