Oracle9i Globalization Support Guide
Release 1 (9.0.1)

Part Number A90236-02

A Locale Data

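This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It includes these topics: Languages, Translated Messages, Territories, Character Sets, Linguistic Sorting, Calendar Systems, and Obsolete Locale Data.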

You can also obtain information about supported character sets, languages, territories, and sorting orders by querying the dynamic performance view V$NLS_VALID_VALUES.

See Also:

Oracle9i Database Reference for more information about the data that can be returned by this view 

Languages

Table A-1 lists the languages supported by the Oracle server.

Table A-1 Oracle Supported Languages  
Name  Abbreviation
AMERICAN  us
ARABIC  ar
ASSAMESE  as
BANGLA  bn
BENGALI  bn
BRAZILIAN PORTUGUESE  ptb
BULGARIAN  bg
CANADIAN FRENCH  frc
CATALAN  ca
CROATIAN  hr
CZECH  cs
DANISH  dk
DUTCH  nl
EGYPTIAN  eg
ENGLISH  gb
ESTONIAN  et
FINNISH  sf
FRENCH  f
GERMAN DIN  din
GERMAN  d
GREEK  el
GUJARATI  gu
HEBREW  iw
HINDI  hi
HUNGARIAN  hu
ICELANDIC  is
INDONESIAN  in
ITALIAN  i
JAPANESE  ja
KANNADA  kn
KOREAN  ko
LATIN AMERICAN SPANISH  esa
LATVIAN  lv
LITHUANIAN  lt
MALAY  ms
MALAYALAM  ml
MARATHI  mr
MEXICAN SPANISH  esm
NORWEGIAN  n
ORIYA  or
POLISH  pl
PORTUGUESE  pt
PUNJABI  pa
ROMANIAN  ro
RUSSIAN  ru
SIMPLIFIED CHINESE  zhs
SLOVAK  sk
SLOVENIAN  sl
SPANISH  e
SWEDISH  s
TAMIL  ta
TELUGU  te
THAI  th
TRADITIONAL CHINESE  zht
TURKISH  tr
UKRAINIAN  uk
VIETNAMESE  vn

Translated Messages

Oracle error messages have been translated into the languages listed in Table A-2.

Table A-2 Oracle Supported Messages  
Name  Abbreviation
ARABIC  ar
BRAZILIAN PORTUGUESE  ptb
CATALAN  ca
CZECH  cs
DANISH  dk
DUTCH  nl
FINNISH  sf
FRENCH  f
GERMAN  d
GREEK  el
HEBREW  iw
HUNGARIAN  hu
ITALIAN  i
JAPANESE  ja
KOREAN  ko
LATIN AMERICAN SPANISH  esa
NORWEGIAN  n
POLISH  pl
PORTUGUESE  pt
ROMANIAN  ro
RUSSIAN  ru
SIMPLIFIED CHINESE  zhs
SLOVAK  sk
SPANISH  e
SWEDISH  s
TRADITIONAL CHINESE  zht
TURKISH  tr

Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories  
Name
ALGERIA
AMERICA
AUSTRALIA
AUSTRIA
BAHRAIN
BANGLADESH
BELGIUM
BRAZIL
BULGARIA
CANADA
CATALONIA
CHILE
CHINA
CIS
COLOMBIA
COSTA RICA
CROATIA
CYPRUS
CZECH REPUBLIC
DENMARK
DJIBOUTI
EGYPT
EL SALVADOR
ESTONIA
FINLAND
FRANCE
GUATEMALA
GERMANY
GREECE
HONG KONG
HUNGARY
ICELAND
INDIA
INDONESIA
IRAQ
IRELAND
ISRAEL
ITALY
JAPAN
JORDAN
KAZAKHSTAN
KOREA
KUWAIT
LATVIA
LEBANON
LIBYA
LITHUANIA
LUXEMBOURG
MACEDONIA
MALAYSIA
MAURITANIA
MEXICO
MOROCCO
NEW ZEALAND
NICARAGUA
NORWAY
OMAN
PANAMA
PERU
POLAND
PORTUGAL
PUERTO RICO
QATAR
ROMANIA
SAUDI ARABIA
SINGAPORE
SLOVAKIA
SLOVENIA
SOMALIA
SOUTH AFRICA
SPAIN
SUDAN
SWEDEN
SWITZERLAND
SYRIA
TAIWAN
THAILAND
THE NETHERLANDS
TUNISIA
TURKEY
UKRAINE
UNITED ARAB EMIRATES
UNITED KINGDOM
UZBEKISTAN
VENEZUELA
VIETNAM
YEMEN
YUGOSLAVIA

Character Sets

Oracle-supported character sets are listed below, for easy reference, according to three broad language groups. In addition, common subset/superset combinations are listed.

Note that some character sets may be listed under multiple language groups because they provide multilingual support. For instance, Unicode spans the Asian, European, and Middle Eastern language groups because it supports most of the major scripts of the world.

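The Comments column indicates the type of encoding used: SB for single-byte encoding, MB for multibyte encoding, and FIXED for fixed-width multibyte encoding.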

As mentioned in Chapter 3, "Setting Up a Globalization Support Environment", the type of encoding affects performance, so use the most efficient encoding that meets your language needs. Also, some encoding types can only be used with certain data types. For instance, the AL16UTF16 character set can only be used as an NCHAR character set, and not as a database character set.

The Comments column also documents other features of the character set that may be important to users or your database administrator: whether the character set supports the Euro currency symbol (EURO), whether user-defined characters are supported for character set customization (UDC), and whether the character set is a strict superset of ASCII (ASCII), which allows you to use the ALTER DATABASE [NATIONAL] CHARACTER SET statement in case of migration.
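The difference between single-byte, variable-width multibyte, and fixed-width encodings can be seen by counting bytes per character. The following is a minimal Python sketch, not Oracle code: the Python codec names are stand-ins chosen for illustration (an assumption), and the sample characters are all in the Basic Multilingual Plane.

```python
# Python codec names are illustrative stand-ins for Oracle character sets
# (an assumption of this sketch, not an official mapping).
SAMPLES = ["A", "é", "€"]

CODECS = {
    "single-byte, like WE8ISO8859P15": "iso8859-15",
    "variable-width multibyte, like AL32UTF8": "utf-8",
    "fixed-width multibyte, like AL16UTF16": "utf-16-be",
}


def bytes_per_char(codec):
    """Return the encoded length in bytes of each sample character."""
    return [len(ch.encode(codec)) for ch in SAMPLES]


def widths_by_codec():
    """Map each descriptive label to the byte widths of the samples."""
    return {label: bytes_per_char(codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, widths in widths_by_codec().items():
        print(label, widths)
```

The single-byte codec always uses one byte, the UTF-8 codec uses one to three bytes for these samples, and the UTF-16 codec uses two bytes for each of them.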

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you should refer to the actual national, international, or vendor-specific standards.

Asian Language Character Sets

Table A-4 lists the Oracle character sets that can support Asian languages.

Table A-4 Asian Language Character Sets  
Name  Description  Comments
BN8BSCII  Bangladesh National Code 8-bit BSCII  SB, ASCII
ZHT16BIG5  BIG5 16-bit Traditional Chinese  MB, ASCII
ZHT16HKSCS  MS Windows Code Page 950 with Hong Kong Supplementary Character Set  MB, ASCII, EURO
ZHS16CGB231280  CGB2312-80 16-bit Simplified Chinese  MB, ASCII
ZHS32GB18030  GB18030-2000  MB, ASCII, EURO
JA16EUC  EUC 24-bit Japanese  MB, ASCII
JA16EUCYEN  EUC 24-bit Japanese with '\' mapped to the Japanese yen character  MB
ZHT32EUC  EUC 32-bit Traditional Chinese  MB, ASCII
ZHS16GBK  GBK 16-bit Simplified Chinese  MB, ASCII, UDC
ZHT16CCDC  HP CCDC 16-bit Traditional Chinese  MB, ASCII
JA16DBCS  IBM EBCDIC 16-bit Japanese  MB, UDC
JA16EBCDIC930  IBM DBCS Code Page 290 16-bit Japanese  MB, UDC
KO16DBCS  IBM EBCDIC 16-bit Korean  MB, UDC
ZHS16DBCS  IBM EBCDIC 16-bit Simplified Chinese  MB, UDC
ZHT16DBCS  IBM EBCDIC 16-bit Traditional Chinese  MB, UDC
KO16KSC5601  KSC5601 16-bit Korean  MB, ASCII
KO16KSCCS  KSCCS 16-bit Korean  MB, ASCII
JA16VMS  JVMS 16-bit Japanese  MB, ASCII
ZHS16MACCGB231280  Mac client CGB2312-80 16-bit Simplified Chinese  MB
JA16MACSJIS  Mac client Shift-JIS 16-bit Japanese  MB
TH8MACTHAI  Mac Client 8-bit Latin/Thai  SB
TH8MACTHAIS  Mac Server 8-bit Latin/Thai  SB, ASCII
TH8TISEBCDICS  Thai Industrial Standard 620-2533-EBCDIC Server 8-bit  SB
ZHT16MSWIN950  MS Windows Code Page 950 Traditional Chinese  MB, ASCII, UDC
KO16MSWIN949  MS Windows Code Page 949 Korean  MB, ASCII, UDC
VN8MSWIN1258  MS Windows Code Page 1258 8-bit Vietnamese  SB, ASCII, EURO
IN8ISCII  Multiple-Script Indian Standard 8-bit Latin/Indian Languages  SB, ASCII
JA16SJIS  Shift-JIS 16-bit Japanese  MB, ASCII, UDC
JA16SJISYEN  Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character  MB, UDC
ZHT32SOPS  SOPS 32-bit Traditional Chinese  MB, ASCII
ZHT16DBT  Taiwan Taxation 16-bit Traditional Chinese  MB, ASCII
TH8TISASCII  Thai Industrial Standard 620-2533 - ASCII 8-bit  SB, ASCII, EURO
TH8TISEBCDIC  Thai Industrial Standard 620-2533 - EBCDIC 8-bit  SB
ZHT32TRIS  TRIS 32-bit Traditional Chinese  MB, ASCII
AL16UTF16  See "Universal Character Sets" for details  MB, EURO, FIXED
AL32UTF8  See "Universal Character Sets" for details  MB, ASCII, EURO
UTF8  See "Universal Character Sets" for details  MB, ASCII, EURO
UTFE  See "Universal Character Sets" for details  MB, EURO
VN8VN3  VN3 8-bit Vietnamese  SB, ASCII

European Language Character Sets

Table A-5 lists the Oracle character sets that can support European languages.

Table A-5 European Language Character Sets  
Name  Description  Comments
US7ASCII  ASCII 7-bit American  SB, ASCII
SF7ASCII  ASCII 7-bit Finnish  SB
YUG7ASCII  ASCII 7-bit Yugoslavian  SB
RU8BESTA  BESTA 8-bit Latin/Cyrillic  SB, ASCII
EL8GCOS7  Bull EBCDIC GCOS7 8-bit Greek  SB
WE8GCOS7  Bull EBCDIC GCOS7 8-bit West European  SB
EL8DEC  DEC 8-bit Latin/Greek  SB
TR7DEC  DEC VT100 7-bit Turkish  SB
TR8DEC  DEC 8-bit Turkish  SB, ASCII
TR8EBCDIC1026  EBCDIC Code Page 1026 8-bit Turkish  SB
TR8EBCDIC1026S  EBCDIC Code Page 1026 Server 8-bit Turkish  SB
TR8PC857  IBM-PC Code Page 857 8-bit Turkish  SB, ASCII
TR8MACTURKISH  MAC Client 8-bit Turkish  SB
TR8MACTURKISHS  MAC Server 8-bit Turkish  SB, ASCII
TR8MSWIN1254  MS Windows Code Page 1254 8-bit Turkish  SB, ASCII, EURO
WE8BS2000L5  Siemens EBCDIC.DF.L5 8-bit West European/Turkish  SB
WE8DEC  DEC 8-bit West European  SB, ASCII
D7DEC  DEC VT100 7-bit German  SB
F7DEC  DEC VT100 7-bit French  SB
S7DEC  DEC VT100 7-bit Swedish  SB
E7DEC  DEC VT100 7-bit Spanish  SB
NDK7DEC  DEC VT100 7-bit Norwegian/Danish  SB
I7DEC  DEC VT100 7-bit Italian  SB
NL7DEC  DEC VT100 7-bit Dutch  SB
CH7DEC  DEC VT100 7-bit Swiss (German/French)  SB
SF7DEC  DEC VT100 7-bit Finnish  SB
WE8DG  DG 8-bit West European  SB, ASCII
WE8EBCDIC37C  EBCDIC Code Page 37 8-bit Oracle/c  SB
WE8EBCDIC37  EBCDIC Code Page 37 8-bit West European  SB
D8EBCDIC273  EBCDIC Code Page 273/1 8-bit Austrian German  SB
DK8EBCDIC277  EBCDIC Code Page 277/1 8-bit Danish  SB
S8EBCDIC278  EBCDIC Code Page 278/1 8-bit Swedish  SB
I8EBCDIC280  EBCDIC Code Page 280/1 8-bit Italian  SB
WE8EBCDIC284  EBCDIC Code Page 284 8-bit Latin American/Spanish  SB
WE8EBCDIC285  EBCDIC Code Page 285 8-bit West European  SB
WE8EBCDIC924  Latin 9 EBCDIC 924  SB, EBCDIC
WE8EBCDIC1047  EBCDIC Code Page 1047 8-bit West European  SB
WE8EBCDIC1047E  Latin 1/Open Systems 1047  SB, EBCDIC, EURO
WE8EBCDIC1140  EBCDIC Code Page 1140 8-bit West European  SB, EURO
WE8EBCDIC1140C  EBCDIC Code Page 1140 Client 8-bit West European  SB, EURO
WE8EBCDIC1145  EBCDIC Code Page 1145 8-bit West European  SB, EURO
WE8EBCDIC1146  EBCDIC Code Page 1146 8-bit West European  SB, EURO
WE8EBCDIC1148  EBCDIC Code Page 1148 8-bit West European  SB, EURO
WE8EBCDIC1148C  EBCDIC Code Page 1148 Client 8-bit West European  SB, EURO
F8EBCDIC297  EBCDIC Code Page 297 8-bit French  SB
WE8EBCDIC500C  EBCDIC Code Page 500 8-bit Oracle/c  SB
WE8EBCDIC500  EBCDIC Code Page 500 8-bit West European  SB
EE8EBCDIC870  EBCDIC Code Page 870 8-bit East European  SB
EE8EBCDIC870C  EBCDIC Code Page 870 Client 8-bit East European  SB
EE8EBCDIC870S  EBCDIC Code Page 870 Server 8-bit East European  SB
WE8EBCDIC871  EBCDIC Code Page 871 8-bit Icelandic  SB
EL8EBCDIC875  EBCDIC Code Page 875 8-bit Greek  SB
EL8EBCDIC875R  EBCDIC Code Page 875 Server 8-bit Greek  SB
CL8EBCDIC1025  EBCDIC Code Page 1025 8-bit Cyrillic  SB
CL8EBCDIC1025C  EBCDIC Code Page 1025 Client 8-bit Cyrillic  SB
CL8EBCDIC1025R  EBCDIC Code Page 1025 Server 8-bit Cyrillic  SB
CL8EBCDIC1025S  EBCDIC Code Page 1025 Server 8-bit Cyrillic  SB
CL8EBCDIC1025X  EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic  SB
BLT8EBCDIC1112  EBCDIC Code Page 1112 8-bit Baltic Multilingual  SB
BLT8EBCDIC1112S  EBCDIC Code Page 1112 8-bit Server Baltic Multilingual  SB
D8EBCDIC1141  EBCDIC Code Page 1141 8-bit Austrian German  SB, EURO
DK8EBCDIC1142  EBCDIC Code Page 1142 8-bit Danish  SB, EURO
S8EBCDIC1143  EBCDIC Code Page 1143 8-bit Swedish  SB, EURO
I8EBCDIC1144  EBCDIC Code Page 1144 8-bit Italian  SB, EURO
F8EBCDIC1147  EBCDIC Code Page 1147 8-bit French  SB, EURO
EEC8EUROASCI  EEC Targon 35 ASCI West European/Greek  SB
EEC8EUROPA3  EEC EUROPA3 8-bit West European/Greek  SB
LA8PASSPORT  German Government Printer 8-bit All-European Latin  SB, ASCII
WE8HP  HP LaserJet 8-bit West European  SB
WE8ROMAN8  HP Roman8 8-bit West European  SB, ASCII
HU8CWI2  Hungarian 8-bit CWI-2  SB, ASCII
HU8ABMOD  Hungarian 8-bit Special AB Mod  SB, ASCII
LV8RST104090  IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic)  SB, ASCII
US8PC437  IBM-PC Code Page 437 8-bit American  SB, ASCII
BG8PC437S  IBM-PC Code Page 437 8-bit (Bulgarian Modification)  SB, ASCII
EL8PC437S  IBM-PC Code Page 437 8-bit (Greek modification)  SB, ASCII
EL8PC737  IBM-PC Code Page 737 8-bit Greek/Latin  SB
LT8PC772  IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic)  SB, ASCII
LT8PC774  IBM-PC Code Page 774 8-bit Lithuanian (Latin)  SB, ASCII
BLT8PC775  IBM-PC Code Page 775 8-bit Baltic  SB, ASCII
WE8PC850  IBM-PC Code Page 850 8-bit West European  SB, ASCII
EL8PC851  IBM-PC Code Page 851 8-bit Greek/Latin  SB, ASCII
EE8PC852  IBM-PC Code Page 852 8-bit East European  SB, ASCII
RU8PC855  IBM-PC Code Page 855 8-bit Latin/Cyrillic  SB, ASCII
WE8PC858  IBM-PC Code Page 858 8-bit West European  SB, ASCII, EURO
WE8PC860  IBM-PC Code Page 860 8-bit West European  SB, ASCII
IS8PC861  IBM-PC Code Page 861 8-bit Icelandic  SB, ASCII
CDN8PC863  IBM-PC Code Page 863 8-bit Canadian French  SB, ASCII
N8PC865  IBM-PC Code Page 865 8-bit Norwegian  SB, ASCII
RU8PC866  IBM-PC Code Page 866 8-bit Latin/Cyrillic  SB, ASCII
EL8PC869  IBM-PC Code Page 869 8-bit Greek/Latin  SB, ASCII
LV8PC1117  IBM-PC Code Page 1117 8-bit Latvian  SB, ASCII
US8ICL  ICL EBCDIC 8-bit American  SB
WE8ICL  ICL EBCDIC 8-bit West European  SB
WE8ISOICLUK  ICL special version ISO8859-1  SB
WE8ISO8859P1  ISO 8859-1 West European  SB, ASCII
EE8ISO8859P2  ISO 8859-2 East European  SB, ASCII
SE8ISO8859P3  ISO 8859-3 South European  SB, ASCII
NEE8ISO8859P4  ISO 8859-4 North and North-East European  SB, ASCII
CL8ISO8859P5  ISO 8859-5 Latin/Cyrillic  SB, ASCII
AR8ISO8859P6  ISO 8859-6 Latin/Arabic  SB, ASCII
EL8ISO8859P7  ISO 8859-7 Latin/Greek  SB, ASCII, EURO
IW8ISO8859P8  ISO 8859-8 Latin/Hebrew  SB, ASCII
NE8ISO8859P10  ISO 8859-10 North European  SB, ASCII
BLT8ISO8859P13  ISO 8859-13 Baltic  SB, ASCII
CEL8ISO8859P14  ISO 8859-14 Celtic  SB, ASCII
WE8ISO8859P15  ISO 8859-15 West European  SB, ASCII, EURO
LA8ISO6937  ISO 6937 8-bit Coded Character Set for Text Communication  SB, ASCII
IW7IS960  Israeli Standard 960 7-bit Latin/Hebrew  SB
AR8ARABICMAC  Mac Client 8-bit Latin/Arabic  SB
EE8MACCE  Mac Client 8-bit Central European  SB
EE8MACCROATIAN  Mac Client 8-bit Croatian  SB
WE8MACROMAN8  Mac Client 8-bit Extended Roman8 West European  SB
EL8MACGREEK  Mac Client 8-bit Greek  SB
IS8MACICELANDIC  Mac Client 8-bit Icelandic  SB
CL8MACCYRILLIC  Mac Client 8-bit Latin/Cyrillic  SB
AR8ARABICMACS  Mac Server 8-bit Latin/Arabic  SB, ASCII
EE8MACCES  Mac Server 8-bit Central European  SB, ASCII
EE8MACCROATIANS  Mac Server 8-bit Croatian  SB, ASCII
WE8MACROMAN8S  Mac Server 8-bit Extended Roman8 West European  SB, ASCII
CL8MACCYRILLICS  Mac Server 8-bit Latin/Cyrillic  SB, ASCII
EL8MACGREEKS  Mac Server 8-bit Greek  SB, ASCII
IS8MACICELANDICS  Mac Server 8-bit Icelandic  SB
BG8MSWIN  MS Windows 8-bit Bulgarian Cyrillic  SB, ASCII
LT8MSWIN921  MS Windows Code Page 921 8-bit Lithuanian  SB, ASCII
ET8MSWIN923  MS Windows Code Page 923 8-bit Estonian  SB, ASCII
EE8MSWIN1250  MS Windows Code Page 1250 8-bit East European  SB, ASCII, EURO
CL8MSWIN1251  MS Windows Code Page 1251 8-bit Latin/Cyrillic  SB, ASCII, EURO
WE8MSWIN1252  MS Windows Code Page 1252 8-bit West European  SB, ASCII, EURO
EL8MSWIN1253  MS Windows Code Page 1253 8-bit Latin/Greek  SB, ASCII, EURO
BLT8MSWIN1257  MS Windows Code Page 1257 8-bit Baltic  SB, ASCII, EURO
BLT8CP921  Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic  SB, ASCII
LV8PC8LR  Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic  SB, ASCII
WE8NCR4970  NCR 4970 8-bit West European  SB, ASCII
WE8NEXTSTEP  NeXTSTEP PostScript 8-bit West European  SB, ASCII
CL8ISOIR111  ISOIR111 Cyrillic  SB
CL8KOI8R  RELCOM Internet Standard 8-bit Latin/Cyrillic  SB, ASCII
CL8KOI8U  KOI8 Ukrainian Cyrillic  SB
US8BS2000  Siemens 9750-62 EBCDIC 8-bit American  SB
DK8BS2000  Siemens 9750-62 EBCDIC 8-bit Danish  SB
F8BS2000  Siemens 9750-62 EBCDIC 8-bit French  SB
D8BS2000  Siemens 9750-62 EBCDIC 8-bit German  SB
E8BS2000  Siemens 9750-62 EBCDIC 8-bit Spanish  SB
S8BS2000  Siemens 9750-62 EBCDIC 8-bit Swedish  SB
DK7SIEMENS9780X  Siemens 97801/97808 7-bit Danish  SB
F7SIEMENS9780X  Siemens 97801/97808 7-bit French  SB
D7SIEMENS9780X  Siemens 97801/97808 7-bit German  SB
I7SIEMENS9780X  Siemens 97801/97808 7-bit Italian  SB
N7SIEMENS9780X  Siemens 97801/97808 7-bit Norwegian  SB
E7SIEMENS9780X  Siemens 97801/97808 7-bit Spanish  SB
S7SIEMENS9780X  Siemens 97801/97808 7-bit Swedish  SB
EE8BS2000  Siemens EBCDIC.DF.04 8-bit East European  SB
WE8BS2000  Siemens EBCDIC.DF.04 8-bit West European  SB
WE8BS2000E  Siemens EBCDIC.DF.04 8-bit West European  SB, EURO
CL8BS2000  Siemens EBCDIC.EHC.LC 8-bit Cyrillic  SB
AL16UTF16  See "Universal Character Sets" for details  MB, EURO, FIXED
AL32UTF8  See "Universal Character Sets" for details  MB, ASCII, EURO
UTF8  See "Universal Character Sets" for details  MB, ASCII, EURO
UTFE  See "Universal Character Sets" for details  MB, EURO

Middle Eastern Language Character Sets

Table A-6 lists the Oracle character sets that can support Middle Eastern languages.

Table A-6 Middle Eastern Character Sets  
Name  Description  Comments
AR8APTEC715  APTEC 715 Server 8-bit Latin/Arabic  SB, ASCII
AR8APTEC715T  APTEC 715 8-bit Latin/Arabic  SB
AR8ASMO708PLUS  ASMO 708 Plus 8-bit Latin/Arabic  SB, ASCII
AR8ASMO8X  ASMO Extended 708 8-bit Latin/Arabic  SB, ASCII
AR8ADOS710  Arabic MS-DOS 710 Server 8-bit Latin/Arabic  SB, ASCII
AR8ADOS710T  Arabic MS-DOS 710 8-bit Latin/Arabic  SB
AR8ADOS720  Arabic MS-DOS 720 Server 8-bit Latin/Arabic  SB, ASCII
AR8ADOS720T  Arabic MS-DOS 720 8-bit Latin/Arabic  SB
TR7DEC  DEC VT100 7-bit Turkish  SB
TR8DEC  DEC 8-bit Turkish  SB
WE8EBCDIC37C  EBCDIC Code Page 37 8-bit Oracle/c  SB
IW8EBCDIC424  EBCDIC Code Page 424 8-bit Latin/Hebrew  SB
IW8EBCDIC424S  EBCDIC Code Page 424 Server 8-bit Latin/Hebrew  SB
WE8EBCDIC500C  EBCDIC Code Page 500 8-bit Oracle/c  SB
IW8EBCDIC1086  EBCDIC Code Page 1086 8-bit Hebrew  SB
AR8EBCDIC420S  EBCDIC Code Page 420 Server 8-bit Latin/Arabic  SB
AR8EBCDICX  EBCDIC XBASIC Server 8-bit Latin/Arabic  SB
TR8EBCDIC1026  EBCDIC Code Page 1026 8-bit Turkish  SB
TR8EBCDIC1026S  EBCDIC Code Page 1026 Server 8-bit Turkish  SB
AR8HPARABIC8T  HP 8-bit Latin/Arabic  SB
TR8PC857  IBM-PC Code Page 857 8-bit Turkish  SB, ASCII
IW8PC1507  IBM-PC Code Page 1507/862 8-bit Latin/Hebrew  SB, ASCII
AR8ISO8859P6  ISO 8859-6 Latin/Arabic  SB, ASCII
IW8ISO8859P8  ISO 8859-8 Latin/Hebrew  SB, ASCII
WE8ISO8859P9  ISO 8859-9 West European & Turkish  SB, ASCII
LA8ISO6937  ISO 6937 8-bit Coded Character Set for Text Communication  SB, ASCII
IW7IS960  Israeli Standard 960 7-bit Latin/Hebrew  SB
IW8MACHEBREW  Mac Client 8-bit Hebrew  SB
AR8ARABICMAC  Mac Client 8-bit Latin/Arabic  SB
AR8ARABICMACT  Mac 8-bit Latin/Arabic  SB
TR8MACTURKISH  Mac Client 8-bit Turkish  SB
IW8MACHEBREWS  Mac Server 8-bit Hebrew  SB, ASCII
AR8ARABICMACS  Mac Server 8-bit Latin/Arabic  SB, ASCII
TR8MACTURKISHS  Mac Server 8-bit Turkish  SB, ASCII
TR8MSWIN1254  MS Windows Code Page 1254 8-bit Turkish  SB, ASCII, EURO
IW8MSWIN1255  MS Windows Code Page 1255 8-bit Latin/Hebrew  SB, ASCII, EURO
AR8MSWIN1256  MS Windows Code Page 1256 8-bit Latin/Arabic  SB, ASCII, EURO
IN8ISCII  Multiple-Script Indian Standard 8-bit Latin/Indian Languages  SB
AR8MUSSAD768  Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic  SB, ASCII
AR8MUSSAD768T  Mussa'd Alarabi/2 768 8-bit Latin/Arabic  SB
AR8NAFITHA711  Nafitha Enhanced 711 Server 8-bit Latin/Arabic  SB, ASCII
AR8NAFITHA711T  Nafitha Enhanced 711 8-bit Latin/Arabic  SB
AR8NAFITHA721  Nafitha International 721 Server 8-bit Latin/Arabic  SB, ASCII
AR8NAFITHA721T  Nafitha International 721 8-bit Latin/Arabic  SB
AR8SAKHR706  SAKHR 706 Server 8-bit Latin/Arabic  SB, ASCII
AR8SAKHR707  SAKHR 707 Server 8-bit Latin/Arabic  SB, ASCII
AR8SAKHR707T  SAKHR 707 8-bit Latin/Arabic  SB
AR8XBASIC  XBASIC 8-bit Latin/Arabic  SB
WE8BS2000L5  Siemens EBCDIC.DF.04.L5 8-bit West European/Turkish  SB
AL16UTF16  See "Universal Character Sets" for details  MB, EURO, FIXED
AL32UTF8  See "Universal Character Sets" for details  MB, ASCII, EURO
UTF8  See "Universal Character Sets" for details  MB, ASCII, EURO
UTFE  See "Universal Character Sets" for details  MB, EURO

Universal Character Sets

Table A-7 lists the Oracle character sets that provide universal language support, that is, they attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-7 Universal Character Sets
Name  Description  Comments
AL16UTF16  Unicode 3.0 UTF-16 Universal character set  MB, EURO, FIXED
AL32UTF8  Unicode 3.0 UTF-8 Universal character set  MB, ASCII, EURO
UTF8  Unicode 3.0 UTF-8 Universal character set  MB, ASCII, EURO
UTFE  EBCDIC form of Unicode 3.0 UTF-8 Universal character set  MB, EURO

See Also:

Chapter 5, "Supporting Multilingual Databases with Unicode" 

Character Set Conversion Support

The following character set encodings are supported for conversion only, so they cannot be used as the database or national character set:

You can use these character sets as the source_char_set or dest_char_set in the CONVERT function.

See Also:

 

Subsets and Supersets

Table A-8 lists common subset/superset relationships.

Table A-8 Subset-Superset Pairs  
Subset  Superset
AR8ADOS710  AR8ADOS710T
AR8ADOS720  AR8ADOS720T
AR8ADOS720T  AR8ADOS720
AR8APTEC715  AR8APTEC715T
AR8ARABICMACT  AR8ARABICMAC
AR8ISO8859P6  AR8ASMO708PLUS
AR8ISO8859P6  AR8ASMO8X
AR8MUSSAD768  AR8MUSSAD768T
AR8MUSSAD768T  AR8MUSSAD768
AR8NAFITHA711  AR8NAFITHA711T
AR8NAFITHA721  AR8NAFITHA721T
AR8SAKHR707  AR8SAKHR707T
AR8SAKHR707T  AR8SAKHR707
BLT8CP921  BLT8ISO8859P13
D7DEC  D7SIEMENS9780X
D7SIEMENS9780X  D7DEC
DK7SIEMENS9780X  N7SIEMENS9780X
I7DEC  I7SIEMENS9780X
I7SIEMENS9780X  IW8EBCDIC424
IW8EBCDIC424  IW8EBCDIC1086
KO16KSC5601  KO16MSWIN949
LV8PC8LR  LV8RST104090
N7SIEMENS9780X  DK7SIEMENS9780X
US7ASCII  See Table A-9 for a complete list
WE16DECTST  WE16DECTST2
WE16DECTST2  WE16DECTST
WE8DEC  TR8DEC
WE8DEC  WE8NCR4970
WE8ISO8859P1  WE8MSWIN1252
WE8NCR4970  TR8DEC
WE8NCR4970  WE8DEC
WE8PC850  WE8PC858

US7ASCII is a special case because so many other character sets are supersets of it. Table A-9 lists supersets for US7ASCII.

Table A-9 US7ASCII Supersets  
Supersets
AL24UTFFSS
AL32UTF8
AR8ADOS710
AR8ADOS710T
AR8ADOS720
AR8ADOS720T
AR8APTEC715
AR8APTEC715T
AR8ARABICMACS
AR8ASMO708PLUS
AR8ASMO8X
AR8HPARABIC8T
AR8ISO8859P6
AR8MSAWIN
AR8MUSSAD768
AR8MUSSAD768T
AR8NAFITHA711
AR8NAFITHA711T
AR8NAFITHA721
AR8NAFITHA721T
AR8SAKHR706
AR8SAKHR707
AR8SAKHR707T
BG8MSWIN
BG8PC437S
BLT8CP921
BLT8ISO8859P13
BLT8MSWIN1257
BLT8PC775
BN8BSCII
CDN8PC863
CEL8ISO8859P14
CL8ISO8859P5
CL8KOI8R
CL8KOI8U
CL8ISOIR111
CL8MACCYRILLICS
CL8MSWIN1251
EE8ISO8859P2
EE8MACCES
EE8MACCROATIANS
EE8MSWIN1250
EE8PC852
EL8DEC
EL8ISO8859P7
EL8MACGREEKS
EL8MSWIN1253
EL8PC437S
EL8PC851
EL8PC869
ET8MSWIN923
HU8ABMOD
HU8CWI2
IN8ISCII
IS8PC861
IW8ISO8859P8
IW8MACHEBREWS
IW8MSWIN1255
IW8PC1507
JA16EUC
JA16SJIS
JA16TSTSET
JA16TSTSET2
JA16VMS
KO16KSC5601
KO16KSCCS
KO16MSWIN949
KO16TSTSET
LA8ISO6937
LA8PASSPORT
LT8MSWIN921
LT8PC772
LT8PC774
LV8PC1117
LV8PC8LR
LV8RST104090
N8PC865
NE8ISO8859P10
NEE8ISO8859P4
RU8BESTA
RU8PC855
RU8PC866
SE8ISO8859P3
TH8MACTHAIS
TH8TISASCII
TR8DEC
TR8MACTURKISHS
TR8MSWIN1254
TR8PC857
US8PC437
UTF8
VN8MSWIN1258
VN8VN3
WE8DEC
WE8DG
WE8ISO8859P1
WE8ISO8859P15
WE8ISO8859P9
WE8MACROMAN8S
WE8MSWIN1252
WE8NCR4970
WE8NEXTSTEP
WE8PC850
WE8PC858
WE8PC860
WE8ROMAN8
ZHS16CGB231280
ZHS16GBK
ZHT16BIG5
ZHT16CCDC
ZHT16DBT
ZHT16HKSCS
ZHT16MSWIN950
ZHT32EUC
ZHT32SOPS
ZHT32TRIS
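What it means for a character set to be a superset of US7ASCII can be illustrated outside the database: every 7-bit ASCII character must encode to the same single byte. The sketch below uses Python codecs as stand-ins for Oracle character sets (an assumption made for illustration only); it is not how Oracle itself determines subset/superset relationships.

```python
def is_ascii_compatible(codec):
    """Return True if all 128 ASCII characters encode to their ASCII byte values."""
    ascii_bytes = bytes(range(128))
    text = ascii_bytes.decode("ascii")
    return text.encode(codec) == ascii_bytes


if __name__ == "__main__":
    # Codec names are Python's, used here only as rough analogues.
    for codec in ["utf-8", "cp1252", "iso8859-1", "iso8859-15", "cp037", "utf-16-be"]:
        print(codec, is_ascii_compatible(codec))
```

EBCDIC-based and fixed-width two-byte encodings fail this check, which matches their absence from the list of US7ASCII supersets.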

 

Linguistic Sorting

Oracle offers two kinds of linguistic sorts, monolingual and multilingual. In addition, monolingual sorts can be extended to handle special cases. These special cases (represented with a prefix X) typically mean that the characters will be sorted differently from their ASCII values. For example, ch and ll are each treated as a single character in XSPANISH. In other words, the SPANISH sort uses modern collation rules, while XSPANISH uses traditional sorting rules.
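The effect of treating ch and ll as single characters can be sketched in a few lines of Python. This is only an illustration of the collation idea, not Oracle's implementation: the alphabet table, the function names, and the sample words are all assumptions, and accented vowels are not handled.

```python
import re

# Traditional Spanish alphabet: "ch" follows "c" and "ll" follows "l".
TRADITIONAL = ["a", "b", "c", "ch", "d", "e", "f", "g", "h", "i", "j", "k",
               "l", "ll", "m", "n", "ñ", "o", "p", "q", "r", "s", "t", "u",
               "v", "w", "x", "y", "z"]
MODERN = [letter for letter in TRADITIONAL if len(letter) == 1]

TRADITIONAL_RANK = {letter: i for i, letter in enumerate(TRADITIONAL)}
MODERN_RANK = {letter: i for i, letter in enumerate(MODERN)}

# Match the digraphs first, then fall back to any single character.
TOKEN = re.compile("ch|ll|.")


def traditional_key(word):
    """Sort key that treats 'ch' and 'll' as one collation element each."""
    tokens = TOKEN.findall(word.lower())
    return [TRADITIONAL_RANK.get(tok, len(TRADITIONAL_RANK)) for tok in tokens]


def modern_key(word):
    """Sort key that compares letter by letter."""
    return [MODERN_RANK.get(ch, len(MODERN_RANK)) for ch in word.lower()]


if __name__ == "__main__":
    words = ["cuna", "chico", "luz", "llama", "color"]
    print(sorted(words, key=modern_key))
    print(sorted(words, key=traditional_key))
```

With the traditional key, "chico" sorts after every word that begins with a plain c, and "llama" sorts after "luz".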

Table A-10 lists the monolingual linguistic sorts supported by the Oracle server.

Table A-10 Monolingual Linguistic Sorts  
Basic Name  Extended Name  Special Cases
ARABIC  --
ARABIC_MATCH  --
ARABIC_ABJ_SORT  --
ARABIC_ABJ_MATCH  --
ASCII7  --
BENGALI  --
BIG5
BINARY
BULGARIAN  --
CANADIAN FRENCH  --
CATALAN  XCATALAN  æ, AE, ß
CROATIAN  XCROATIAN  D, L, N, d, l, n, ß
CZECH  XCZECH  ch, CH, Ch, ß
CZECH_PUNCTUATION  XCZECH_PUNCTUATION  ch, CH, Ch, ß
DANISH  XDANISH  A, ß, Å, å
DUTCH  XDUTCH  ij, IJ
EBCDIC
EEC_EURO  --
EEC_EUROPA3  --
ESTONIAN  --
FINNISH  --
FRENCH  XFRENCH
GERMAN  XGERMAN  ß
GERMAN_DIN  XGERMAN_DIN  ß, ä, ö, ü, Ä, Ö, Ü
GBK
GREEK  --
HEBREW  --
HKSCS
HUNGARIAN  XHUNGARIAN  cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs
ICELANDIC  --
INDONESIAN  --
ITALIAN  --
JAPANESE  --
LATIN  --
LATVIAN  --
LITHUANIAN  --
MALAY  --
NORWEGIAN  --
POLISH  --
PUNCTUATION  XPUNCTUATION
ROMANIAN  --
RUSSIAN  --
SLOVAK  XSLOVAK  dz, DZ, Dz, ß (caron
SLOVENIAN  XSLOVENIAN  ß
SPANISH  XSPANISH  ch, ll, CH, Ch, LL, Ll
SWEDISH  --
SWISS  XSWISS  ß
THAI_DICTIONARY  --
THAI_TELEPHONE  --
TURKISH  XTURKISH  æ, AE, ß
UKRAINIAN  --
UNICODE_BINARY
VIETNAMESE  --
WEST_EUROPEAN  XWEST_EUROPEAN  ß

Table A-11 lists the multilingual linguistic sorts available in Oracle. All of them include GENERIC_M (an ISO standard for sorting Latin-based characters) as a base. Multilingual linguistic sorts are for the collation of a given primary language together with Latin-based characters. For example, KOREAN_M will sort Korean and Latin-based characters, but it will not collate Chinese, Thai, or Japanese characters.

Table A-11 Multilingual Linguistic Sorts  
Basic Name  Explanation
CANADIAN_M  Canadian French sort supports reverse secondary, special expanding characters
DANISH_M  Danish sort supports sorting lower case characters before upper case characters
FRENCH_M  French sort supports reverse sort for secondary
GENERIC_M  Generic sorting order which is based on ISO14651 and Unicode canonical equivalence rules but excluding compatible equivalence rules
JAPANESE_M  Japanese sort supports SJIS character set order and EUC characters which are not included in SJIS
KOREAN_M  Korean sort: Hangul characters are based on Unicode binary order. Hanja characters based on pronunciation order. All Hangul characters are before Hanja characters
SPANISH_M  Traditional Spanish sort supports special contracting characters
THAI_M  Thai sort supports swap characters for some vowels and consonants
SCHINESE_RADICAL_M  Simplified Chinese sort based on radical as primary order and number of strokes order as secondary order
SCHINESE_STROKE_M  Simplified Chinese sort uses number of strokes as primary order and radical as secondary order
SCHINESE_PINYIN_M  Simplified Chinese PinYin sorting order
TCHINESE_RADICAL_M  Traditional Chinese sort based on radical as primary order and number of strokes order as secondary order
TCHINESE_STROKE_M  Traditional Chinese sort uses number of strokes as primary order and radical as secondary order

Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-12 lists the other calendar systems supported by the Oracle server.

Table A-12 NLS Supported Calendars
Name  Default Format  Character Set Used For Default Format
Japanese Imperial  EEYYMMDD  JA16EUC
ROC Official  EEyymmdd  ZHT32EUC
Thai Buddha  dd month EE yyyy  TH8TISASCII
Persian  DD Month YYYY  AR8ASMO8X
Arabic Hijrah  DD Month YYYY  AR8ISO8859P6
English Hijrah  DD Month YYYY  AR8ISO8859P6

Figure A-1 shows how March 20, 1998 appears in ROC Official:

Figure A-1 ROC Official Example



Figure A-2 shows how March 27, 1998 appears in Japanese Imperial:

Figure A-2 Japanese Imperial Example


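The year arithmetic behind the two example dates above can be sketched in Python. This is an illustration only, under stated assumptions: the function names and the era table are hypothetical, only three Japanese eras are listed, and the output is not formatted the way the Oracle date format elements would render it.

```python
from datetime import date

# (first day of era, era name), newest first. Only three eras are listed here.
JAPANESE_ERAS = [
    (date(2019, 5, 1), "Reiwa"),
    (date(1989, 1, 8), "Heisei"),
    (date(1926, 12, 25), "Showa"),
]


def roc_year(d):
    """Year in the ROC Official calendar, where 1912 is year 1."""
    return d.year - 1911


def japanese_imperial_year(d):
    """Return (era name, year within era) for a Gregorian date."""
    for start, name in JAPANESE_ERAS:
        if d >= start:
            return name, d.year - start.year + 1
    raise ValueError("date is earlier than the eras listed in this sketch")


if __name__ == "__main__":
    print(roc_year(date(1998, 3, 20)))
    print(japanese_imperial_year(date(1998, 3, 27)))
```

Under these rules, March 20, 1998 falls in ROC year 87, and March 27, 1998 falls in year 10 of the Heisei era.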

Obsolete Locale Data

Before Oracle server release 7.2, when a character set was renamed, the old name was usually supported along with the new name for several releases after the change. Beginning with release 7.2, the old names are no longer supported.

Table A-13 lists the affected character sets. If you reference any of these character sets in your code, replace them with their new name:

Table A-13 New Names for Obsolete NLS Data Character Sets  
Old Name  New Name
AL24UTFFSS  UTF8, AL32UTF8
AR8MSAWIN  AR8MSWIN1256
CL8EBCDIC875S  CL8EBCDIC875R
EL8EBCDIC875S  EL8EBCDIC875R
JVMS  JA16VMS
JEUC  JA16EUC
SJIS  JA16SJIS
JDBCS  JA16DBCS
KSC5601  KO16KSC5601
KDBCS  KO16DBCS
CGB2312-80  ZHS16CGB231280
CNS 11643-86  ZHT32EUC
JA16EUCFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS32EUCFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16GBKFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
JA16DBCSFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
KO16DBCSFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16DBCSFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16CGB231280FIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT16DBCSFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
KO16KSC5601FIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
JA16SJISFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT16BIG5FIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT32TRISFIXED  None. Replaced by the new national character sets, UTF8 and AL16UTF16.

Character set CL8MSWINDOW31 has been desupported. The newer character set CL8MSWIN1251 duplicates CL8MSWINDOW31 and also includes some characters omitted from the earlier version. Replace any use of CL8MSWINDOW31 with CL8MSWIN1251.
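If character set names appear in scripts or configuration files, the one-to-one renames listed above can be applied with a small lookup. The following Python sketch is an illustration, not an Oracle utility: the dictionary and function names are hypothetical, and it covers only renames with a single replacement (entries with no direct replacement, or with more than one candidate, are left out and need a manual decision).

```python
# One-to-one renames taken from the table and paragraph above.
RENAMED_CHARACTER_SETS = {
    "AR8MSAWIN": "AR8MSWIN1256",
    "JVMS": "JA16VMS",
    "JEUC": "JA16EUC",
    "SJIS": "JA16SJIS",
    "JDBCS": "JA16DBCS",
    "KSC5601": "KO16KSC5601",
    "KDBCS": "KO16DBCS",
    "CGB2312-80": "ZHS16CGB231280",
    "CNS 11643-86": "ZHT32EUC",
    "CL8MSWINDOW31": "CL8MSWIN1251",
}


def current_name(name):
    """Return the current character set name for a possibly obsolete one."""
    normalized = name.strip().upper()
    return RENAMED_CHARACTER_SETS.get(normalized, normalized)


if __name__ == "__main__":
    for old in ["sjis", "KDBCS", "WE8ISO8859P1"]:
        print(old, "->", current_name(old))
```

Names that are not in the lookup are returned unchanged (apart from being upper-cased), so the function is safe to apply to every name in a file.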

AL24UTFFSS Character Set Desupport

The Unicode character set AL24UTFFSS has been desupported in Oracle9i. AL24UTFFSS was introduced with Oracle7 as the Unicode character set supporting the UTF-8 encoding scheme based on the Unicode standard 1.1, which is now obsolete. Oracle9i offers the Unicode database character sets AL32UTF8 and UTF8, which include the Unicode enhancements based on the Unicode standard 3.0.

The migration path for an existing AL24UTFFSS database is to upgrade to UTF8 prior to upgrading to Oracle9i. As with all migrations to a new database character set, Oracle Corporation recommends that you use the Character Set Scanner for data analysis before attempting to migrate your existing database character set to UTF8.

See Also:

Chapter 10, "Character Set Scanner Utility"  


Copyright © 1996-2001, Oracle Corporation.

All Rights Reserved.