This chapter describes the Simplified Chinese Solaris input modes for typing Simplified Chinese characters with Simplified Chinese Solaris software. You can type any of the following kinds of characters:
ASCII/English characters
Simplified Chinese ideograms (characters or symbols representing ideas/actions/things)
Special symbol characters
You can type all of these characters in the input areas of the following application windows:
In terminal emulation (TTY) windows such as Terminal
In text entry subwindows such as used by Text Editor or Mailer
In control panel subwindows such as used by File Manager for typing a file name
In other special use (sometimes pop-up) subwindows
For information about creating your own input method, see Chapter 5, Code Table Input Method Interface.
Three separate areas of an application subwindow are involved in entering characters. These areas are typically displayed, named, and used as follows:
Preedit area--Highlighted (such as inverse video or underlined) entry display area
Status area--Area indicating the current input/conversion mode
Lookup choice area--Area displaying multiple character choices
Auxiliary Windows-- Utilities for input method management
The highlighted (for example, reverse video and underlined) preedit area displays characters as they are typed or converted. It holds formations of text before they are converted to Simplified Chinese characters or symbols and put in the text block being assembled for the application.
The status area is shows what input conversion mode is in effect. In the above example, it is located in the lower left corner of the window margin.
The lookup choice area displays multiple Simplified Chinese or special character choices available for conversion of the character(s)/radical(s) in the preedit area. In the above example, it is a pop-up.
The auxiliary window provides tools and utilities to manage the input methods or to make the input simpler.
Solaris 9 provides graphics interface tools and utilities to manage input methods, set the properties of input methods, and to facilitate the input of special characters.
The following tools are supported:
Input method selection
Input method option setting
Lookup table .
User define character
Input method help
Virtual Keyboard.
Select one of the input method tools from the menu.
The input method selection tool allows you to select a list of input method. You can also set the default input method and the sequence of the input methods.
The input method selection panel appears as below:
After selecting an input method, click "OK" or "Apply", the setting will be activated. The first input method selected becomes the default input method.
Press "CTRL+Space" in the application window to activate Chinese input, the default input method will be selected. Press "F2" to switch to the first input method selected. Press "F3" to switch to the second one, and so on.
The properties of Simplified Chinese input methods can be set from the input method options setting screen, which appears as follows:
With this options setting tool, user can set the options of input methods. After setting the options in this panel, then click "OK" or "Apply", the setting will be activated.
For the input methods that based on code table, there are 4 options can be set by user:
"Display candidates key by key
if TRUE: when a valid key is entered for this input method, IM will search the dictionary table and display the candidates in Lookup window.
if FALSE: when a valid key is entered for this input method, IM does not search the dictionary table, but displays the key in the preedit area. The "SPACE" key must be pressed for IM to begin to search the dictionary table, and display the candidates.
"Display external codes":
if TRUE: in every Lookup window, the external codes for every candidate will follow after the candidate.
if FALSE: the external codes for every candidate will not follow after that candidate.
This option provides a way to study the input method and view the external code of a Chinese character in that input method.
"Automatically commit if only one candidate":
if TRUE: if there is only one candidate for the external code, IM will automatically commit it.
if FALSE: IM will display it in the Lookup window.
"Display keymap character for every external code"
if TRUE: when a valid key is entered, the corresponding keymap character of the key will display in Preedit area,
if FALSE: only the key is displayed without the keymap character,
User can use the the lookup table tools to search a Chinese characters and input it. There are three kinds of lookup table provided:
Lookup table with native encoding set. such as: in zh_CN.EUC/zh_CN/zh locale, lookup table with EUC_CN encoding is provided, and in zh_CN.GBK/zh.GBK, lookup table with GBK encoding is provided, and in zh_CN.GB18030 locale, GB18030 encoding lookup table is provided.
The lookup table panel with native encoding appears as below:
Lookup table with UNICODE encoding.
The lookup table panel with UNICODE encoding appears as below:
The Lookup table for special characters, such as Greek characters, Mathmatic symbols, etc. is shown below:
Virtual Keyboard tools can be used as a Lookup utilities to simplify the input of some special symbols.
There are several kinds of virtual keyboard for Simplified Chinese environment. They are shown below:
The PC Virtual Keyboard appears as below:
The Greek virtual keyboard appears as below:
The Russian Virtual Keyboard appears as below:
The ZhuYin Virtual Keyboard appears as below:
The Chinese Punctuation Characters Keyboard appears as below:
The Number Symbol virtual keyboard appears as below
The Mathmatic Symbol virtual keyboard appears as below:
The Special Symbol virtual keyboard appears as below:
The Table Symbol virtual keyboard appears as below:
The UDC editor tool allows you to draw and save new characters. After ascribing the character to an input method, it can be displayed in an application.
Chapter 7, Fonts provides more information about user defined characters.
Help pages are displayed in a default browser such as Netscape or Hotjava.
The following input methods and conversion modes are available for entering ASCII/English, Simplified Chinese and other text:
In In zh/zh_CN/zh_CN.EUC locale:
NewShuangPin input method (F3).
QuanPin input method (F4).
ShuangPin input method (F5).
GB2312 NeiMa input method (F6).
English_Chinese Input method (F7).
In zh.GBK/zh_CN.GBK locales:
NewShuangPin input method (F3).
QuanPin input method (F4).
ShuangPin input method (F5).
English_Chinese Input method (F7).
In zh_CN.GB18030/zh.UTF-8/zh_CN.UTF-8 locales:
NewQuanPin input method, the default method (F2).
NewShuangPin input method (F3).
QuanPin input method (F4).
ShuangPin input method (F5).
GB18030 NeiMa input method (F6).
English_Chinese Input method (F7).
Press Control-spacebar to toggle on or off the Simplified Chinese input conversion. The function keys listed above (for example: F2, F3) turn on the corresponding input methods.
Each tool first starts with all Simplified Chinese input modes off, the window's status area blank. This mode is for typing ASCII text:
Simplified Chinese input conversion mode is toggled on and off by pressing Control-spacebar. After Simplified Chinese input conversion has been turned on once and input conversion is then turned off, the status area is no longer blank, but instead shows that conversion is off.
Type "CTRL+SPACE".
An auxiliary window appears, as shown below:
In the Chinese status window, type Function key "Fn". For example, type F2 to switch to the first input method. F3 to switch to the second input nethod, and so on. OR click the input mehtod selection button on the auxiliary window.
The input method selection menu appears as below:
This procedure allows you to toggle between 6 imput methods. Text is entered in the Chinese status window.
This method is entered in the Chinese status window.
Type "SHIFT+SPACE" to switch between Half_width Character Mode and Full_width Character Mode. OR click the Half_width/Full_width button of auxiliary window to toggle between modes.
The indicates the input method system is in Half_width Character Mode.
The indicates the input method system is in Full_width Character Mode.
When in Full_width mode, the Full_width character of the input key will be commit to system. For example: if you input 'a' when in Full_width mode, the fullwidth character of 'a' is commited to the application as in the figure below:
This method is entered in the Chinese status window.
Type "CTRL+." to switch between Chinese Punctuation Mode and English Punctuation Mode. Or click the Chinese/English Punctuation Button on the auxiliary window to toggle between modes.
The icon indicates the input method system is in Chinese Punctuation Mode.
The icon indicates the input method system is in English Punctuation Mode.
When the punctuation key is selected in Chinese Punctuation mode, the corresponding Chinese punctuation character is commited to the application. For example: when in Chinese Punctuation mode and the "$" symbol is selected, the Chinese currency symbol character is commited to the application as in the figure below:
The punctuation keys include the following: , . / <> :;'"\$!^&_-
The correspondence between English and Chinese punctuation is mapped below:
Solaris 9 supports the following input methods for the Simplified Chinese environment:
NewQuanPin and NewShuangPin Input Method
GB2312 Code
GBK Code
GB18030 Code
ShuangPin
QuanPin
English_Chinese
This section describes the features in the New QuanPin and New ShuangPin
input methods, and how to use some of the features in the zh_CN.EUC
and zh_CN.GBK locales.
PinYin is a popular input method in PRC, and there are various PinYin-based input methods. Two of them, New QuanPin and New ShuangPin, contain the following features:
Supports user-defined phrases for later use.
Dynamically adjusts the frequency of candidates.
Accepts PinYin strings up to 222 characters long.
Supports input by ShengMu.
Supports GBK Chinese.
These features are described in detail in the following sections.
The following example shows how to define the phrase "ke lin dun" and store it for later use.
Type the phrase kelindun without spaces.
The New QuanPin and New ShuangPin input methods will insert spaces for you automatically.
Type the number representing the first character you want to select.
The following example shows the second character selected.
Select Chinese for the second and third parts of the phrase.
The new phrase is defined and added to the user dictionary file. The next time you type ke lin dun, you will see the phrase you defined in the candidate area.
In these input methods, candidates that have been selected are moved to the start of the list to facilitate repeated use.
Type sh yi.
Notice the order of the five available candidates.
Select the fifth candidate.
Type sh yi again.
Notice that the fifth candidate has moved to the first position because you previously selected it. Frequently-used candidates are promoted for faster selection.
The New QuanPin input methods accepts PinYin strings up to 222 characters long. The following illustrations use the string below:
>>meiguozhongtongkelindunzhengzaitaolunhaiwanjushiwenti<< |
The result is the following Chinese string:
The New ShuangPin input method supports up to 30-character strings.
You can also type ShengMu only. Candidates are supplied for ShengMu, as shown in the following illustration:
The zh_CN.GBK locale supports GBK by default, as shown in the following illustration:
The second Chinese character in the following illustration is defined only in the GBK standard.
Single GBK candidates are placed at the end of the list of candidates. Press Return to scroll to the GBK area. For easier selection next time, you can define the GBK candidate as a phrase (for more information, see "Defining Phrases for Later Use"). Once a phrase is defined, you can insert it easily.
Both New QuanPin and New ShuangPin support GBK Hanzi by default in the zh.GBK locale. However, because several Hanzi have the same ShengMu (the first part of Pinyin), New QuanPin and New ShuangPin do not display GBK candidates if you provide only the ShengMu.
For example, typing the string rong will display GBK candidates because it is a complete Pinyin string. However, typing r alone will not display any GBK candidates because it is only a ShengMu.
The following table shows the definitions of the edit keys.
The preedit line is a normal X text field.
Key |
Definition |
---|---|
[a-z] |
PinYin character. |
Home |
Moves to the start of the preedit line. |
End |
Moves to the end of the preedit line. |
Left |
Moves the caret in the preedit line to the left. If left is Hanzi, the original PinYin is recovered. |
Right |
Moves the caret in the preedit line to the right. |
Delete |
Deletes the PinYin character following the caret on the preedit line. |
Backspace |
Deletes the PinYin character preceding the caret on the preedit line. |
The candidates of a Pinyin string belong to the following groups:
G1 - Highest frequency Hanzi + Long (3 or more) Cizu + Double Chinese Cizu
G2 - GB Single Hanzi
G3 - GBK Single Hanzi (in the zh_CN.GBK locale)
Some Pinyin strings may have more candidates than can be displayed in the same window. In that case, use the keys described in the following table to scroll through the candidates.
Table 4-2 Page Scroll Key Definitions
Key |
Definition |
---|---|
- = |
Scrolls to previous/next candidate(s) |
[ ] |
Scrolls to previous/next candidate(s) |
, . |
Scrolls to previous/next candidate(s) |
Return |
Quickly scrolls through all candidates |
New QuanPin and New ShuangPin use the numeric selection keys.
In accord with the national Pinyin standard, the separator (') is supported to avoid ambiguous interpretations of Pinyin strings. For example, the Pinyin string [jiang] can be interpreted as [jiang] or [ji][ang]; both are valid. In New QuanPin, however, [jiang] is interpreted only as [jiang]. You must use the separator and enter [ji'ang] for it to be interpreted as [ji] and [ang]. New ShuangPin does not require the use of separators.
New QuanPin and New ShuangPin share two dictionary files: PyCiku.dat and Ud.Ciku.dat. In the zh_CN.EUC and zh_CN.GBKlocale, the default path names are /usr/lib/im/locale/zh_CN/data/PyCiku.dat and /usr/lib/im/locale/zh_CN/data/UdCiku.dat.
Users cannot normally write to these files. However, since users can affect the way New QuanPin and New ShuangPin work through features such as frequency adjustment and user-defined phrases, it is necessary to update the dictionary files frequently.
A user's dictionary is normally located in ~/.Xlocale/PyCiku.dat or ~/.Xlocale/UdCiku.dat (~ indicates the home directory of the user who starts the htt command). When New QuanPin and New ShuangPin are started, they locate and read the dictionary files in the user's home directory. If a dictionary file is not found, the system default path is used (that is, /usr/lib/im/locale/zh_CN/...).
ShuangPin is an abbreviated form of QuanPin. It is faster but more difficult to use than QuanPin. New ShuangPin supports all of the features, keyboard definitions, and dictionary files of New QuanPin.
There are various ShuangPin keyboard mapping designs in PRC. The most popular three are ZiRanMa, Chinese Star, and Intelligent_ABC. The New ShuangPin input method supports all three of these keyboard mappings.
The following tables contain keyboard mappings for the ZiRanMa, Chinese Star, and Intelligent_ABC keyboards.
Table 4-3 ZiRanMa Keyboard Mapping
Key |
Definition |
---|---|
i |
ch |
u |
sh |
v |
zh |
a |
a |
b |
ou |
c |
iao |
d |
uang, iang |
e |
e |
f |
en |
g |
eng |
h |
ang |
i |
i |
j |
an |
k |
ao |
l |
ai |
m |
ian |
n |
in |
o |
o, uo |
p |
un |
q |
iu |
r |
uan, er |
s |
iong, ong |
t |
ue |
u |
u |
v |
v, ui |
w |
ua, ia |
x |
ie |
y |
uai, ing |
z |
ei |
Table 4-4 CStar2.97 Keyboard Mapping
Key |
Definition |
---|---|
u |
ch |
i |
sh |
v |
zh |
a |
a |
b |
ia, ua |
c |
uan |
d |
ao |
e |
e |
f |
an |
g |
ang |
h |
iang, uang |
i |
i |
j |
ian |
k |
iao |
l |
in |
m |
ie |
n |
iu |
o |
o, uo |
p |
ou |
q |
er, ing |
r |
en |
s |
ai |
t |
eng |
u |
u |
v |
v, ui |
w |
ei |
x |
uai, ue |
y |
iong, ong |
z |
un |
Table 4-5 Intelligent ABC Keyboard Mapping
Key |
Definition |
---|---|
i |
ch |
u |
sh |
v |
zh |
a |
a |
b |
ou |
c |
in, uai |
d |
ua, ia |
e |
e |
f |
en |
g |
eng |
h |
ang |
i |
i |
j |
an |
k |
ao |
l |
ai |
m |
ue, ui |
n |
un |
o |
o, uo |
p |
uan |
q |
ei |
r |
iu, er |
s |
ong, iong |
t |
uang, iang |
u |
u |
v |
v |
w |
ian |
x |
ie |
y |
ing |
z |
iao |
This method uses the GBK code defined by the Chinese Internal Code Specification. It includes all of the Chinese characters and symbols in GB2312-80, and other CJK Chinese characters in GB 13000-1. Each Chinese character or symbol is identified by a four hexadecimal digital internal code defined in the Chinese Internal Code Specification.
This section contains instructions on how to use the GBK codes to type Chinese characters and symbols.
In a new Terminal, turn Chinese input conversion on by pressing Control-spacebar.
Press F4 to turn on the GBK code input method.
The status area shows that GBK code input mode is on.
Press the first three of the four keys that represent the character to display (in this example, b0a1).
The key remains visible in the preedit area.
Type the fourth key.
The character automatically replaces the preedit area.
This method uses the GBK code defined by the Chinese Internal Code Specification. It includes all of the Chinese characters and symbols in GB2312-80, and other CJK Chinese characters in GB 13000-1. Each Chinese character or symbol is identified by a four hexadecimal digital internal code defined in the Chinese Internal Code Specification.
This section contains instructions on how to use the GB2312 codes to type Chinese characters and symbols.
In a new Terminal, turn Chinese input conversion on by pressing Control-spacebar.
Click the Input method selection button on the auxiliary window and select GB2312 input method.
The status area shows that GB2312 code input mode is on.
Press the first three of the four keys that represent the character to display (in this example, b0a1).
The key remains visible in the preedit area.
Type the fourth key.
The character automatically replaces the preedit area.
This method uses the GB18030 code defined by the Chinese Internal Code Specification. It includes all of the Chinese characters and symbols in GB2312-80, and other CJK Chinese characters in GB 18030. Each Chinese character or symbol is identified by a four or eight hexadecimal digital internal code defined in the Chinese Internal Code Specification.
This section contains instructions on how to use the GB18030 codes to type Chinese characters and symbols.
In a new Terminal, turn Chinese input conversion on by pressing Control-spacebar.
Click the Input method selection button on the auxiliary window and select GB18030 input method.
The status area shows that GB18030 code input mode is on.
For example: To input Chinese GB18030 character with code 0xb0a1, press the first three of the four keys that represent the character to display (in this example, b0a).
The key remains visible in the preedit area.
Type the fourth key.
The character automatically replaces the preedit area.
For example: To input Chinese GB18030 character with code 0x82358538, press the first three of the four keys that represent the character to display (in this example, 8235853).
The key remains visible in the preedit area.
Type the last key. The character is automatically commited to the window.
The QuanPin input method requires up to six keystrokes to type each Chinese Pinyin character. Quanpy maps Pinyin phonetics to single lowercase Roman letters. You can use the QuanPin input method to type individual Chinese characters in both zh_CN.EUC and zh_CN.GBK.
A lookup area showing the characters that match the QuanPin input is displayed with each keystroke. If more than one option is available, you can type a period (.) to display the next page moving forward through the lookup choices and typing a comma (,) to display the next page moving backward. You can select the character you want by typing the label letter corresponding to the character in the lookup area.
This section describes how to create QuanPin text.
The following figure shows how to use this input method to type the character representing the Full Pinyin word fang. The word requires four keystrokes. Type them and select the text as follows:
Type the four keystrokes fang.
Type 1 to select the corresponding GBK Chinese character in the lookup choice list.
Your choice is substituted for the Full Pinyin string in the preedit area.
The English_Chinese input method requires up to fifteen keystrokes to type each Chinese word. English_Chinese maps the English word to a Chinese phrase. You can use the English_Chinese input method to type a Chinese phrase in both zh_CN.EUC and zh_CN.GBK locales.
A lookup area showing the characters that match the QuanPin input is displayed with each keystroke. If more than one option is available, you can type a period (.) to display the next page. Moving forward through the lookup choices and typing a comma (,) displays the next page moving backward. You can select the character you want by typing the label letter corresponding to the character in the lookup area.
This section describes how to create English_Chinese text.
The following figure shows how to use this input method to type the character representing the Engilsh word "world". The word requires five keystrokes. Type them and select the text as follows:
Type the five keystrokes world.
Type 3 to select the corresponding Chinese phrase in the lookup choice list.
Your choice is substituted for the English string in the preedit area.
Wild characters "*"or "?" can be used to search in the dictionary," * " stands for one or several letters, and ? represents only one letter. For example, to search all English words which end with lution, you can input "*lution" and the lookup choice window appears as below.
Or to search all English words which begin with "c", and only three letters, you can input "c??", the lookup choice window appears as below: