Solaris Internationalization Guide For Developers

Unicode Locale: en_US.UTF-8

The en_US.UTF-8 locale is a multiscript locale that can input and output text in multiple scripts, including single-byte and multi-byte scripts. This locale is part of the developer cluster. This is the first locale with this capability in the Solaris operating environment.

This locale uses UTF-8 (Universal Character Set Transformation Format for 8 bits) encoding, which was developed by the X/Open-Uniforum Joint Internationalization Working Group (XoJIG). This standard has been adopted by the Unicode Consortium, the International Standards Organization, and the International Electrotechnical Commission as a part of Unicode 2.0 and ISO/IEC 10646-1.

en_US.UTF-8 supports computation for every code point value, which is defined in Unicode 2.0 and ISO/IEC 10646-1. In Solaris 7, language script support is not limited to pan-European locales, but also includes Asian scripts such as Korean, Traditional Chinese, Simplified Chinese, and Japanese. Input method support has been enabled for the following language scripts only. Due to limited font resources, Solaris 7 software includes only character glyphs from the following codesets: