Oracle8i National Language Support Guide
Release 2 (8.1.6)

Part Number A76966-01


A
Locale Data

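This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It covers the following topics: Languages, Translated Messages, Territories, Character Sets, Linguistic Definitions, Calendar Systems, Character Sets that Support the Euro Symbol, and Default Values for NLS Parameters.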

You can also obtain information about supported character sets, languages, territories, and sorting orders by querying the dynamic performance view V$NLS_VALID_VALUES. For more information on the data that this view returns, see Oracle8i Reference.

Languages

Table A-1 lists the languages supported by the Oracle server.

Table A-1 Oracle Supported Languages
Name  Abbreviation
AMERICAN  us
ARABIC  ar
BENGALI  bn
BRAZILIAN PORTUGUESE  ptb
BULGARIAN  bg
CANADIAN FRENCH  frc
CATALAN  ca
CROATIAN  hr
CZECH  cs
DANISH  dk
DUTCH  nl
EGYPTIAN  eg
ENGLISH  gb
ESTONIAN  et
FINNISH  sf
FRENCH  f
GERMAN DIN  din
GERMAN  d
GREEK  el
HEBREW  iw
HINDI  hi
HUNGARIAN  hu
ICELANDIC  is
INDONESIAN  in
ITALIAN  i
JAPANESE  ja
KOREAN  ko
LATIN AMERICAN SPANISH  esa
LATVIAN  lv
LITHUANIAN  lt
MALAY  ms
MEXICAN SPANISH  esm
NORWEGIAN  n
POLISH  pl
PORTUGUESE  pt
ROMANIAN  ro
RUSSIAN  ru
SIMPLIFIED CHINESE  zhs
SLOVAK  sk
SLOVENIAN  sl
SPANISH  e
SWEDISH  s
TAMIL  ta
THAI  th
TRADITIONAL CHINESE  zht
TURKISH  tr
UKRAINIAN  uk
VIETNAMESE  vn

Translated Messages

Oracle error messages have been translated into the languages listed in Table A-2.

Table A-2 Oracle Supported Messages
Name  Abbreviation
ARABIC  ar
BRAZILIAN PORTUGUESE  ptb
CATALAN  ca
CZECH  cs
DANISH  dk
DUTCH  nl
FINNISH  sf
FRENCH  f
GERMAN  d
GREEK  el
HEBREW  iw
HUNGARIAN  hu
ITALIAN  i
JAPANESE  ja
KOREAN  ko
LATIN AMERICAN SPANISH  esa
NORWEGIAN  n
POLISH  pl
PORTUGUESE  pt
ROMANIAN  ro
RUSSIAN  ru
SIMPLIFIED CHINESE  zhs
SLOVAK  sk
SPANISH  e
SWEDISH  s
TRADITIONAL CHINESE  zht
TURKISH  tr

Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories
Name
ALGERIA  ICELAND  QATAR
AMERICA  INDIA  ROMANIA
AUSTRALIA  INDONESIA  SAUDI ARABIA
AUSTRIA  IRAQ  SINGAPORE
BAHRAIN  IRELAND  SLOVAKIA
BANGLADESH  ISRAEL  SLOVENIA
BELGIUM  ITALY  SOMALIA
BRAZIL  JAPAN  SOUTH AFRICA
BULGARIA  JORDAN  SPAIN
CANADA  KAZAKHSTAN  SUDAN
CATALONIA  KOREA  SWEDEN
CHINA  KUWAIT  SWITZERLAND
CIS  LATVIA  SYRIA
CROATIA  LEBANON  TAIWAN
CYPRUS  LIBYA  THAILAND
CZECH REPUBLIC  LITHUANIA  THE NETHERLANDS
DENMARK  LUXEMBOURG  TUNISIA
DJIBOUTI  MALAYSIA  TURKEY
EGYPT  MAURITANIA  UKRAINE
ESTONIA  MEXICO  UNITED ARAB EMIRATES
FINLAND  MOROCCO  UNITED KINGDOM
FRANCE  NEW ZEALAND  UZBEKISTAN
GERMANY  NORWAY  VIETNAM
GREECE  OMAN  YEMEN
HONG KONG  POLAND
HUNGARY  PORTUGAL

Character Sets

Oracle-supported character sets are listed below, for easy reference, according to three broad language groups: Asian, European, and Middle Eastern.

Note that some character sets may be listed under multiple language groups because they provide multilingual support. For instance, Unicode spans the Asian, European, and Middle Eastern language groups because it supports most of the major scripts of the world.

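The comment section indicates the type of encoding used: SB for single-byte encoding, MB for multibyte encoding, and FIXED for fixed-width multibyte encoding.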

As mentioned in Chapter 3, "Choosing a Character Set", the type of encoding will affect performance, so you should use the most efficient encoding that meets your language needs. Also, some encoding types can only be used with certain data types. For instance, fixed-width multibyte encoded character sets can only be used as an NCHAR character set, and not as a database character set.

The comment section also documents other features of the character set that may be important to users or to your database administrator: EURO indicates that the character set supports the Euro currency symbol, UDC indicates that user-defined characters are supported for character set customization, and ASCII indicates that the character set is a strict superset of ASCII (which allows you to use the ALTER DATABASE [NATIONAL] CHARACTER SET statement in case of migration).
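As a rough illustration of how the comment flags can be read, the following Python sketch splits a Comments cell into flags and applies the two rules stated above (fixed-width sets are NCHAR-only; ASCII marks a strict superset of ASCII). The function names are hypothetical helpers written for this example, not part of any Oracle API, and the sample rows are copied from Table A-4.

```python
def parse_flags(comments):
    """Split a Comments cell such as 'MB, ASCII, UDC' into a set of flags."""
    return {flag.strip().upper() for flag in comments.split(",") if flag.strip()}


def usable_as_database_charset(comments):
    """Fixed-width multibyte sets can only be NCHAR character sets (per the text above)."""
    return "FIXED" not in parse_flags(comments)


def is_ascii_superset(comments):
    """True when the Comments cell carries the ASCII flag."""
    return "ASCII" in parse_flags(comments)


# Sample rows taken from Table A-4 (name -> Comments cell).
SAMPLES = {
    "JA16EUC": "MB, ASCII",
    "JA16EUCFIXED": "FIXED",
    "ZHS16GBK": "MB, ASCII, UDC",
}

for name, comments in SAMPLES.items():
    print(name, usable_as_database_charset(comments), is_ascii_superset(comments))
```

The sketch only interprets the flags as they are printed in this appendix; it does not query the server.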

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you should refer to the actual national, international, or vendor-specific standards.

Asian Language Character Sets

Table A-4 lists the Oracle character sets that can support Asian languages.

Table A-4 Asian Language Character Sets
Name  Description  Comments
BN8BSCII  Bangladesh National Code 8-bit BSCII  SB, ASCII
ZHT16BIG5  BIG5 16-bit Traditional Chinese  MB, ASCII
ZHS16CGB231280  CGB2312-80 16-bit Simplified Chinese  MB, ASCII
JA16EUC  EUC 24-bit Japanese  MB, ASCII
JA16EUCYEN  EUC 24-bit Japanese with '\' mapped to the Japanese yen character  MB
JA16EUCFIXED  EUC 16-bit Japanese. A fixed-width subset of JA16EUC (contains only the 2-byte characters of JA16EUC). Contains no 7- or 8-bit ASCII characters  FIXED
ZHT32EUC  EUC 32-bit Traditional Chinese  MB, ASCII
ZHT32EUCFIXED  EUC 32-bit Traditional Chinese (32-bit fixed-width, no single byte)  FIXED
ZHS16GBK  GBK 16-bit Simplified Chinese  MB, ASCII, UDC
ZHS16GBKFIXED  GBK 16-bit Simplified Chinese (16-bit fixed-width, no single byte)  FIXED, UDC
ZHT16CCDC  HP CCDC 16-bit Traditional Chinese  MB, ASCII
JA16DBCS  IBM EBCDIC 16-bit Japanese  MB, UDC
JA16EBCDIC930  IBM DBCS Code Page 290 16-bit Japanese  MB, UDC
JA16DBCSFIXED  IBM EBCDIC 16-bit Japanese (16-bit fixed-width, no single byte)  FIXED, UDC
KO16DBCS  IBM EBCDIC 16-bit Korean  MB, UDC
KO16DBCSFIXED  IBM EBCDIC 16-bit Korean (16-bit fixed-width, no single byte)  FIXED, UDC
ZHS16DBCS  IBM EBCDIC 16-bit Simplified Chinese  MB, UDC
ZHS16CGB231280FIXED  CGB2312-80 16-bit Simplified Chinese (16-bit fixed-width, no single byte)  FIXED
ZHS16DBCSFIXED  IBM EBCDIC 16-bit Simplified Chinese (16-bit fixed-width, no single byte)  FIXED, UDC
ZHT16DBCS  IBM EBCDIC 16-bit Traditional Chinese  MB, UDC
ZHT16DBCSFIXED  IBM EBCDIC 16-bit Traditional Chinese (16-bit fixed-width, no single byte)  FIXED
KO16KSC5601  KSC5601 16-bit Korean  MB, ASCII
KO16KSCCS  KSCCS 16-bit Korean  MB, ASCII
KO16KSC5601FIXED  KSC5601 (16-bit fixed-width, no single byte)  FIXED
JA16VMS  JVMS 16-bit Japanese  MB, ASCII
ZHS16MACCGB231280  Mac client CGB2312-80 16-bit Simplified Chinese  MB
JA16MACSJIS  Mac client Shift-JIS 16-bit Japanese  MB
TH8MACTHAI  Mac Client 8-bit Latin/Thai  SB
TH8MACTHAIS  Mac Server 8-bit Latin/Thai  SB, ASCII
TH8TISEBCDICS  Thai Industrial Standard 620-2533-EBCDIC Server 8-bit  SB
ZHT16MSWIN950  MS Windows Code Page 950 Traditional Chinese  MB, ASCII, UDC
KO16MSWIN949  MS Windows Code Page 949 Korean  MB, ASCII, UDC
VN8MSWIN1258  MS Windows Code Page 1258 8-bit Vietnamese  SB, ASCII, EURO
IN8ISCII  Multiple-Script Indian Standard 8-bit Latin/Indian Languages  SB, ASCII
JA16SJIS  Shift-JIS 16-bit Japanese  MB, ASCII, UDC
JA16SJISFIXED  Shift-JIS 16-bit Japanese. A fixed-width subset of JA16SJIS (contains only the 2-byte characters of JA16SJIS). Contains no 7- or 8-bit ASCII characters  FIXED, UDC
JA16SJISYEN  Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character  MB, UDC
ZHT32SOPS  SOPS 32-bit Traditional Chinese  MB, ASCII
ZHT16DBT  Taiwan Taxation 16-bit Traditional Chinese  MB, ASCII
ZHT16BIG5FIXED  BIG5 16-bit Traditional Chinese (16-bit fixed-width, no single byte)  FIXED
TH8TISASCII  Thai Industrial Standard 620-2533 - ASCII 8-bit  SB, ASCII, EURO
TH8TISEBCDIC  Thai Industrial Standard 620-2533 - EBCDIC 8-bit  SB
ZHT32TRIS  TRIS 32-bit Traditional Chinese  MB, ASCII
ZHT32TRISFIXED  TRIS 32-bit Fixed-width Traditional Chinese  FIXED
AL24UTFFSS  See "Universal Character Sets" for details
UTF8  See "Universal Character Sets" for details
UTFE  See "Universal Character Sets" for details
VN8VN3  VN3 8-bit Vietnamese  SB, ASCII

European Language Character Sets

Table A-5 lists the Oracle character sets that can support European languages.

Table A-5 European Language Character Sets
Name  Description  Comments
US7ASCII  ASCII 7-bit American  SB, ASCII
SF7ASCII  ASCII 7-bit Finnish  SB
YUG7ASCII  ASCII 7-bit Yugoslavian  SB
RU8BESTA  BESTA 8-bit Latin/Cyrillic  SB, ASCII
EL8GCOS7  Bull EBCDIC GCOS7 8-bit Greek  SB
WE8GCOS7  Bull EBCDIC GCOS7 8-bit West European  SB
EL8DEC  DEC 8-bit Latin/Greek  SB
TR7DEC  DEC VT100 7-bit Turkish  SB
TR8DEC  DEC 8-bit Turkish  SB, ASCII
TR8EBCDIC1026  EBCDIC Code Page 1026 8-bit Turkish  SB
TR8EBCDIC1026S  EBCDIC Code Page 1026 Server 8-bit Turkish  SB
TR8PC857  IBM-PC Code Page 857 8-bit Turkish  SB, ASCII
TR8MACTURKISH  Mac Client 8-bit Turkish  SB
TR8MACTURKISHS  Mac Server 8-bit Turkish  SB, ASCII
TR8MSWIN1254  MS Windows Code Page 1254 8-bit Turkish  SB, ASCII, EURO
WE8BS2000L5  Siemens EBCDIC.DF.L5 8-bit West European/Turkish  SB
WE8DEC  DEC 8-bit West European  SB, ASCII
D7DEC  DEC VT100 7-bit German  SB
F7DEC  DEC VT100 7-bit French  SB
S7DEC  DEC VT100 7-bit Swedish  SB
E7DEC  DEC VT100 7-bit Spanish  SB
NDK7DEC  DEC VT100 7-bit Norwegian/Danish  SB
I7DEC  DEC VT100 7-bit Italian  SB
NL7DEC  DEC VT100 7-bit Dutch  SB
CH7DEC  DEC VT100 7-bit Swiss (German/French)  SB
SF7DEC  DEC VT100 7-bit Finnish  SB
WE8DG  DG 8-bit West European  SB, ASCII
WE8EBCDIC37C  EBCDIC Code Page 37 8-bit Oracle/c  SB
WE8EBCDIC37  EBCDIC Code Page 37 8-bit West European  SB
D8EBCDIC273  EBCDIC Code Page 273/1 8-bit Austrian German  SB
DK8EBCDIC277  EBCDIC Code Page 277/1 8-bit Danish  SB
S8EBCDIC278  EBCDIC Code Page 278/1 8-bit Swedish  SB
I8EBCDIC280  EBCDIC Code Page 280/1 8-bit Italian  SB
WE8EBCDIC284  EBCDIC Code Page 284 8-bit Latin American/Spanish  SB
WE8EBCDIC285  EBCDIC Code Page 285 8-bit West European  SB
WE8EBCDIC1047  EBCDIC Code Page 1047 8-bit West European  SB
WE8EBCDIC1140  EBCDIC Code Page 1140 8-bit West European  SB, EURO
WE8EBCDIC1140C  EBCDIC Code Page 1140 Client 8-bit West European  SB, EURO
WE8EBCDIC1145  EBCDIC Code Page 1145 8-bit West European  SB, EURO
WE8EBCDIC1146  EBCDIC Code Page 1146 8-bit West European  SB, EURO
WE8EBCDIC1148  EBCDIC Code Page 1148 8-bit West European  SB, EURO
WE8EBCDIC1148C  EBCDIC Code Page 1148 Client 8-bit West European  SB, EURO
F8EBCDIC297  EBCDIC Code Page 297 8-bit French  SB
WE8EBCDIC500C  EBCDIC Code Page 500 8-bit Oracle/c  SB
WE8EBCDIC500  EBCDIC Code Page 500 8-bit West European  SB
EE8EBCDIC870  EBCDIC Code Page 870 8-bit East European  SB
EE8EBCDIC870C  EBCDIC Code Page 870 Client 8-bit East European  SB
EE8EBCDIC870S  EBCDIC Code Page 870 Server 8-bit East European  SB
WE8EBCDIC871  EBCDIC Code Page 871 8-bit Icelandic  SB
EL8EBCDIC875  EBCDIC Code Page 875 8-bit Greek  SB
EL8EBCDIC875S  EBCDIC Code Page 875 Server 8-bit Greek  SB
CL8EBCDIC1025  EBCDIC Code Page 1025 8-bit Cyrillic  SB
CL8EBCDIC1025C  EBCDIC Code Page 1025 Client 8-bit Cyrillic  SB
CL8EBCDIC1025S  EBCDIC Code Page 1025 Server 8-bit Cyrillic  SB
CL8EBCDIC1025X  EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic  SB
BLT8EBCDIC1112  EBCDIC Code Page 1112 8-bit Baltic Multilingual  SB
BLT8EBCDIC1112S  EBCDIC Code Page 1112 8-bit Server Baltic Multilingual  SB
D8EBCDIC1141  EBCDIC Code Page 1141 8-bit Austrian German  SB, EURO
DK8EBCDIC1142  EBCDIC Code Page 1142 8-bit Danish  SB, EURO
S8EBCDIC1143  EBCDIC Code Page 1143 8-bit Swedish  SB, EURO
I8EBCDIC1144  EBCDIC Code Page 1144 8-bit Italian  SB, EURO
F8EBCDIC1147  EBCDIC Code Page 1147 8-bit French  SB, EURO
EEC8EUROASCI  EEC Targon 35 ASCI West European/Greek  SB
EEC8EUROPA3  EEC EUROPA3 8-bit West European/Greek  SB
LA8PASSPORT  German Government Printer 8-bit All-European Latin  SB, ASCII
WE8HP  HP LaserJet 8-bit West European  SB
WE8ROMAN8  HP Roman8 8-bit West European  SB, ASCII
HU8CWI2  Hungarian 8-bit CWI-2  SB, ASCII
HU8ABMOD  Hungarian 8-bit Special AB Mod  SB, ASCII
LV8RST104090  IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic)  SB, ASCII
US8PC437  IBM-PC Code Page 437 8-bit American  SB, ASCII
BG8PC437S  IBM-PC Code Page 437 8-bit (Bulgarian Modification)  SB, ASCII
EL8PC437S  IBM-PC Code Page 437 8-bit (Greek modification)  SB, ASCII
EL8PC737  IBM-PC Code Page 737 8-bit Greek/Latin  SB
LT8PC772  IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic)  SB, ASCII
LT8PC774  IBM-PC Code Page 774 8-bit Lithuanian (Latin)  SB, ASCII
BLT8PC775  IBM-PC Code Page 775 8-bit Baltic  SB, ASCII
WE8PC850  IBM-PC Code Page 850 8-bit West European  SB, ASCII
EL8PC851  IBM-PC Code Page 851 8-bit Greek/Latin  SB, ASCII
EE8PC852  IBM-PC Code Page 852 8-bit East European  SB, ASCII
RU8PC855  IBM-PC Code Page 855 8-bit Latin/Cyrillic  SB, ASCII
WE8PC858  IBM-PC Code Page 858 8-bit West European  SB, ASCII, EURO
WE8PC860  IBM-PC Code Page 860 8-bit West European  SB, ASCII
IS8PC861  IBM-PC Code Page 861 8-bit Icelandic  SB, ASCII
CDN8PC863  IBM-PC Code Page 863 8-bit Canadian French  SB, ASCII
N8PC865  IBM-PC Code Page 865 8-bit Norwegian  SB, ASCII
RU8PC866  IBM-PC Code Page 866 8-bit Latin/Cyrillic  SB, ASCII
EL8PC869  IBM-PC Code Page 869 8-bit Greek/Latin  SB, ASCII
LV8PC1117  IBM-PC Code Page 1117 8-bit Latvian  SB, ASCII
US8ICL  ICL EBCDIC 8-bit American  SB
WE8ICL  ICL EBCDIC 8-bit West European  SB
WE8ISOICLUK  ICL special version ISO8859-1  SB
WE8ISO8859P1  ISO 8859-1 West European  SB, ASCII
EE8ISO8859P2  ISO 8859-2 East European  SB, ASCII
SE8ISO8859P3  ISO 8859-3 South European  SB, ASCII
NEE8ISO8859P4  ISO 8859-4 North and North-East European  SB, ASCII
CL8ISO8859P5  ISO 8859-5 Latin/Cyrillic  SB, ASCII
AR8ISO8859P6  ISO 8859-6 Latin/Arabic  SB, ASCII
EL8ISO8859P7  ISO 8859-7 Latin/Greek  SB, ASCII, EURO
IW8ISO8859P8  ISO 8859-8 Latin/Hebrew  SB, ASCII
NE8ISO8859P10  ISO 8859-10 North European  SB, ASCII
WE8ISO8859P15  ISO 8859-15 West European  SB, ASCII, EURO
LA8ISO6937  ISO 6937 8-bit Coded Character Set for Text Communication  SB, ASCII
IW7IS960  Israeli Standard 960 7-bit Latin/Hebrew  SB
AR8ARABICMAC  Mac Client 8-bit Latin/Arabic  SB
EE8MACCE  Mac Client 8-bit Central European  SB
EE8MACCROATIAN  Mac Client 8-bit Croatian  SB
WE8MACROMAN8  Mac Client 8-bit Extended Roman8 West European  SB
EL8MACGREEK  Mac Client 8-bit Greek  SB
IS8MACICELANDIC  Mac Client 8-bit Icelandic  SB
CL8MACCYRILLIC  Mac Client 8-bit Latin/Cyrillic  SB
AR8ARABICMACS  Mac Server 8-bit Latin/Arabic  SB, ASCII
EE8MACCES  Mac Server 8-bit Central European  SB, ASCII
EE8MACCROATIANS  Mac Server 8-bit Croatian  SB, ASCII
WE8MACROMAN8S  Mac Server 8-bit Extended Roman8 West European  SB, ASCII
CL8MACCYRILLICS  Mac Server 8-bit Latin/Cyrillic  SB, ASCII
EL8MACGREEKS  Mac Server 8-bit Greek  SB, ASCII
IS8MACICELANDICS  Mac Server 8-bit Icelandic  SB
BG8MSWIN  MS Windows 8-bit Bulgarian Cyrillic  SB, ASCII
LT8MSWIN921  MS Windows Code Page 921 8-bit Lithuanian  SB, ASCII
ET8MSWIN923  MS Windows Code Page 923 8-bit Estonian  SB, ASCII
EE8MSWIN1250  MS Windows Code Page 1250 8-bit East European  SB, ASCII, EURO
CL8MSWIN1251  MS Windows Code Page 1251 8-bit Latin/Cyrillic  SB, ASCII, EURO
WE8MSWIN1252  MS Windows Code Page 1252 8-bit West European  SB, ASCII, EURO
EL8MSWIN1253  MS Windows Code Page 1253 8-bit Latin/Greek  SB, ASCII, EURO
BLT8MSWIN1257  MS Windows Code Page 1257 8-bit Baltic  SB, ASCII, EURO
BLT8CP921  Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic  SB, ASCII
LV8PC8LR  Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic  SB, ASCII
WE8NCR4970  NCR 4970 8-bit West European  SB, ASCII
WE8NEXTSTEP  NeXTSTEP PostScript 8-bit West European  SB, ASCII
CL8KOI8R  RELCOM Internet Standard 8-bit Latin/Cyrillic  SB, ASCII
US8BS2000  Siemens 9750-62 EBCDIC 8-bit American  SB
DK8BS2000  Siemens 9750-62 EBCDIC 8-bit Danish  SB
F8BS2000  Siemens 9750-62 EBCDIC 8-bit French  SB
D8BS2000  Siemens 9750-62 EBCDIC 8-bit German  SB
E8BS2000  Siemens 9750-62 EBCDIC 8-bit Spanish  SB
S8BS2000  Siemens 9750-62 EBCDIC 8-bit Swedish  SB
DK7SIEMENS9780X  Siemens 97801/97808 7-bit Danish  SB
F7SIEMENS9780X  Siemens 97801/97808 7-bit French  SB
D7SIEMENS9780X  Siemens 97801/97808 7-bit German  SB
I7SIEMENS9780X  Siemens 97801/97808 7-bit Italian  SB
N7SIEMENS9780X  Siemens 97801/97808 7-bit Norwegian  SB
E7SIEMENS9780X  Siemens 97801/97808 7-bit Spanish  SB
S7SIEMENS9780X  Siemens 97801/97808 7-bit Swedish  SB
WE8BS2000  Siemens EBCDIC.DF.04 8-bit West European  SB
CL8BS2000  Siemens EBCDIC.EHC.LC 8-bit Cyrillic  SB
AL24UTFFSS  See "Universal Character Sets" for details
UTF8  See "Universal Character Sets" for details
UTFE  See "Universal Character Sets" for details

Middle Eastern Language Character Sets

Table A-6 lists the Oracle character sets that can support Middle Eastern languages.

Table A-6 Middle Eastern Character Sets
Name  Description  Comments
AR8APTEC715  APTEC 715 Server 8-bit Latin/Arabic  SB, ASCII
AR8APTEC715T  APTEC 715 8-bit Latin/Arabic  SB
AR8ASMO708PLUS  ASMO 708 Plus 8-bit Latin/Arabic  SB, ASCII
AR8ASMO8X  ASMO Extended 708 8-bit Latin/Arabic  SB, ASCII
AR8ADOS710  Arabic MS-DOS 710 Server 8-bit Latin/Arabic  SB, ASCII
AR8ADOS710T  Arabic MS-DOS 710 8-bit Latin/Arabic  SB
AR8ADOS720  Arabic MS-DOS 720 Server 8-bit Latin/Arabic  SB, ASCII
AR8ADOS720T  Arabic MS-DOS 720 8-bit Latin/Arabic  SB
TR7DEC  DEC VT100 7-bit Turkish  SB
TR8DEC  DEC 8-bit Turkish  SB
WE8EBCDIC37C  EBCDIC Code Page 37 8-bit Oracle/c  SB
IW8EBCDIC424  EBCDIC Code Page 424 8-bit Latin/Hebrew  SB
IW8EBCDIC424S  EBCDIC Code Page 424 Server 8-bit Latin/Hebrew  SB
WE8EBCDIC500C  EBCDIC Code Page 500 8-bit Oracle/c  SB
IW8EBCDIC1086  EBCDIC Code Page 1086 8-bit Hebrew  SB
AR8EBCDIC420S  EBCDIC Code Page 420 Server 8-bit Latin/Arabic  SB
AR8EBCDICX  EBCDIC XBASIC Server 8-bit Latin/Arabic  SB
TR8EBCDIC1026  EBCDIC Code Page 1026 8-bit Turkish  SB
TR8EBCDIC1026S  EBCDIC Code Page 1026 Server 8-bit Turkish  SB
AR8HPARABIC8T  HP 8-bit Latin/Arabic  SB
TR8PC857  IBM-PC Code Page 857 8-bit Turkish  SB, ASCII
IW8PC1507  IBM-PC Code Page 1507/862 8-bit Latin/Hebrew  SB, ASCII
AR8ISO8859P6  ISO 8859-6 Latin/Arabic  SB, ASCII
IW8ISO8859P8  ISO 8859-8 Latin/Hebrew  SB, ASCII
WE8ISO8859P9  ISO 8859-9 West European & Turkish  SB, ASCII
LA8ISO6937  ISO 6937 8-bit Coded Character Set for Text Communication  SB, ASCII
IW7IS960  Israeli Standard 960 7-bit Latin/Hebrew  SB
IW8MACHEBREW  Mac Client 8-bit Hebrew  SB
AR8ARABICMAC  Mac Client 8-bit Latin/Arabic  SB
AR8ARABICMACT  Mac 8-bit Latin/Arabic  SB
TR8MACTURKISH  Mac Client 8-bit Turkish  SB
IW8MACHEBREWS  Mac Server 8-bit Hebrew  SB, ASCII
AR8ARABICMACS  Mac Server 8-bit Latin/Arabic  SB, ASCII
TR8MACTURKISHS  Mac Server 8-bit Turkish  SB, ASCII
TR8MSWIN1254  MS Windows Code Page 1254 8-bit Turkish  SB, ASCII, EURO
IW8MSWIN1255  MS Windows Code Page 1255 8-bit Latin/Hebrew  SB, ASCII, EURO
AR8MSWIN1256  MS Windows Code Page 1256 8-bit Latin/Arabic  SB, ASCII, EURO
IN8ISCII  Multiple-Script Indian Standard 8-bit Latin/Indian Languages  SB
AR8MUSSAD768  Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic  SB, ASCII
AR8MUSSAD768T  Mussa'd Alarabi/2 768 8-bit Latin/Arabic  SB
AR8NAFITHA711  Nafitha Enhanced 711 Server 8-bit Latin/Arabic  SB, ASCII
AR8NAFITHA711T  Nafitha Enhanced 711 8-bit Latin/Arabic  SB
AR8NAFITHA721  Nafitha International 721 Server 8-bit Latin/Arabic  SB, ASCII
AR8NAFITHA721T  Nafitha International 721 8-bit Latin/Arabic  SB
AR8SAKHR706  SAKHR 706 Server 8-bit Latin/Arabic  SB, ASCII
AR8SAKHR707  SAKHR 707 Server 8-bit Latin/Arabic  SB, ASCII
AR8SAKHR707T  SAKHR 707 8-bit Latin/Arabic  SB
AR8XBASIC  XBASIC 8-bit Latin/Arabic  SB
WE8BS2000L5  Siemens EBCDIC.DF.04.L5 8-bit West European/Turkish  SB
AL24UTFFSS  See "Universal Character Sets" for details
UTF8  See "Universal Character Sets" for details
UTFE  See "Universal Character Sets" for details

Universal Character Sets

Table A-7 lists the Oracle character sets that provide universal language support, that is, they attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-7 Universal Character Sets
Name  Description  Comments
AL24UTFFSS  Unicode 1.1 UTF-8 Universal character set  MB, ASCII, EURO
UTF8  Unicode 2.1 UTF-8 Universal character set  MB, ASCII, EURO
UTFE  UTF-EBCDIC character set (EBCDIC-friendly UTF encoding of Unicode 2.1). UTFE works only on EBCDIC-based platforms such as IBM mainframe as compared to UTF8, which works only on ASCII-based platforms such as UNIX and Win32. The maximum length of one character in Oracle's current UTFE character set is 4 bytes (the maximum in UTF8 is 3 bytes). Both UTF8 and UTFE include exactly the same set of characters, but the character codes are different.

Note: The Unicode 1.1 character set has been superseded by Unicode 2.1. One of the major differences between version 1.1 and 2.1 is the redefinition and addition of 11,172 Korean characters. Whenever possible, you should use the latest version of the Unicode standard. The primary scripts currently supported by Unicode 2.1 are:

Arabic  Gujarati  Latin
Armenian  Gurmukhi  Lao
Bengali  Han  Malayalam
Bopomofo  Hangul  Oriya
Cyrillic  Hebrew  Tamil
Devanagari  Hiragana  Telugu
Georgian  Kannada  Thai
Greek  Katakana  Tibetan

For details on the Unicode standard, see http://www.unicode.org or refer to the Unicode Standard, defined by the Unicode Consortium.

UTF8 Support

Oracle's UTF8 character set currently supports the following characters.

These are 2-byte characters in UTF8 that have character codes 0xc0WW through 0xdfWW inclusive, where WW can be 0x80 through 0xbf inclusive.

These can represent characters of most European (including Greek and Russian), Arabic, Hebrew and some other languages.

Oracle's UTF8 character set currently does not support the following characters. If you use these characters in Oracle's current UTF8 character set, the result is not guaranteed, and the behavior may change in future releases of Oracle.

Therefore, unless you need more than 6,400 User-Defined Characters, Oracle's current UTF8 character set can represent all characters of Unicode 2.1.
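To make the 2-byte range described above concrete, the following Python sketch encodes a few sample characters and checks the lead and trailing byte ranges. It assumes that Python's standard `utf-8` codec is an acceptable stand-in for Oracle's UTF8 character set for these Basic Multilingual Plane characters; the helper name `is_two_byte` is hypothetical.

```python
def is_two_byte(ch):
    """True if ch encodes to a 2-byte UTF-8 sequence of the form 0xc0WW-0xdfWW."""
    encoded = ch.encode("utf-8")
    return (
        len(encoded) == 2
        and 0xC0 <= encoded[0] <= 0xDF   # lead byte range quoted in the text
        and 0x80 <= encoded[1] <= 0xBF   # trailing byte WW
    )


# Latin letter, accented Latin, Cyrillic, Hebrew, and the Euro sign.
for ch in ["A", "\u00e9", "\u0416", "\u05d0", "\u20ac"]:
    encoded = ch.encode("utf-8")
    print(f"U+{ord(ch):04X} -> {encoded.hex()} ({len(encoded)} bytes)")
```

In this sample, the accented Latin, Cyrillic, and Hebrew characters fall in the 2-byte range, while the Euro sign needs 3 bytes, which is consistent with the 3-byte maximum stated for UTF8 in Table A-7.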

Linguistic Definitions

Linguistic definitions define linguistic cases for particular languages. Extended linguistic definitions include some special linguistic cases for the language. Typically, using the extended definition means that characters will be sorted differently from their ASCII values. For example, ch and ll are treated as only one character in XSPANISH. Table A-8 lists the linguistic definitions supported by the Oracle server.

Table A-8 Linguistic Definitions
Basic Name  Extended Name  Special Cases
ARABIC  --
ARABIC_MATCH  --
ARABIC_ABJ_SORT  --
ARABIC_ABJ_MATCH  --
ASCII7  --
BENGALI  --
BULGARIAN  --
CANADIAN FRENCH  --
CATALAN  XCATALAN  æ, AE, ß
CROATIAN  XCROATIAN  D, L, N, d, l, n, ß
CZECH  XCZECH  ch, CH, Ch, ß
DANISH  XDANISH  A, ß, Å, å
DUTCH  XDUTCH  ij, IJ
EEC_EURO  --
EEC_EUROPA3  --
ESTONIAN  --
FINNISH  --
FRENCH  XFRENCH
GERMAN  XGERMAN  ß
GERMAN_DIN  XGERMAN_DIN  ß, ä, ö, ü, Ä, Ö, Ü
GREEK  --
HEBREW  --
HUNGARIAN  XHUNGARIAN  cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs
ICELANDIC  --
INDONESIAN  --
ITALIAN  --
JAPANESE  --
LATIN  --
LATVIAN  --
LITHUANIAN  --
MALAY  --
NORWEGIAN  --
POLISH  --
PUNCTUATION  XPUNCTUATION
ROMANIAN  --
RUSSIAN  --
SLOVAK  XSLOVAK  dz, DZ, Dz, ß (caron
SLOVENIAN  XSLOVENIAN  ß
SPANISH  XSPANISH  ch, ll, CH, Ch, LL, Ll
SWEDISH  --
SWISS  XSWISS  ß
THAI_DICTIONARY  --
THAI_TELEPHONE  --
TURKISH  XTURKISH  æ, AE, ß
UKRAINIAN  --
UNICODE_BINARY
VIETNAMESE  --
WEST_EUROPEAN  XWEST_EUROPEAN  ß
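The XSPANISH special cases (ch and ll treated as single characters) can be illustrated with a small sort key. The Python sketch below is only an approximation of the idea, not Oracle's sorting implementation: the alphabet order, the `xspanish_key` helper, and the sample words are all assumptions made for this example.

```python
# Traditional Spanish ordering in which "ch" follows "c" and "ll" follows "l".
ALPHABET = [
    "a", "b", "c", "ch", "d", "e", "f", "g", "h", "i", "j", "k", "l", "ll",
    "m", "n", "ñ", "o", "p", "q", "r", "s", "t", "u", "v", "w", "x", "y", "z",
]
RANK = {element: index for index, element in enumerate(ALPHABET)}


def xspanish_key(word):
    """Build a sort key in which the digraphs ch and ll count as one element."""
    text = word.lower()
    key = []
    i = 0
    while i < len(text):
        pair = text[i:i + 2]
        if pair in ("ch", "ll"):
            key.append(RANK[pair])
            i += 2
        else:
            # Characters outside the alphabet sort after it, by code point.
            key.append(RANK.get(text[i], len(ALPHABET) + ord(text[i])))
            i += 1
    return key


words = ["cuna", "chico", "cama", "dama", "luz", "llama", "lima"]
print(sorted(words))                    # plain code-point order
print(sorted(words, key=xspanish_key))  # digraph-aware order
```

With the digraph-aware key, "chico" sorts after every other word beginning with c, and "llama" sorts after every other word beginning with l, which differs from a sort on the characters' binary values.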

Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-9 lists the other calendar systems supported by the Oracle server.

Table A-9 NLS Supported Calendars
Name  Default Format  Character Set Used For Default Format
Japanese Imperial  EEYY"\307\257"MM"\267\356"DD"\306\374"  JA16EUC
ROC Official  EEyy"\310\241"mm"\305\314"dd"\305\312"  ZHT32EUC
Thai Buddha  dd month EE yyyy  TH8TISASCII
Persian  DD Month YYYY  AR8ASMO8X
Arabic Hijrah  DD Month YYYY  AR8ISO8859P6
English Hijrah  DD Month YYYY  AR8ISO8859P6
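The quoted octal escapes in the Japanese Imperial default format are byte values in the character set named in the last column. As a hedged illustration, the Python sketch below decodes them, assuming that Python's `euc_jp` codec is a reasonable stand-in for JA16EUC; the variable names are made up for this example.

```python
# Byte pairs copied from the Japanese Imperial default format in Table A-9.
FORMAT_LITERALS = [b"\307\257", b"\267\356", b"\306\374"]

# Decode each 2-byte literal as EUC-JP (assumed equivalent to JA16EUC here).
decoded = [literal.decode("euc_jp") for literal in FORMAT_LITERALS]

for literal, text in zip(FORMAT_LITERALS, decoded):
    print(literal.hex(), "->", text)
```

The three literals decode to the characters for year, month, and day, so the format reads as era and year, month, and day, each followed by its unit. The ROC Official literals are not decoded here because the Python standard library has no EUC-TW codec.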

Figure A-1 shows how March 20, 1998 appears in ROC Official:

Figure A-1 ROC Official Example



Figure A-2 shows how March 27, 1998 appears in Japanese Imperial:

Figure A-2 Japanese Imperial Example


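For orientation, the year numbers used by three of these calendars can be derived from the Gregorian year with simple offsets. The Python sketch below relies on general calendar conventions rather than on anything stated in this guide, ignores era changes that fall in the middle of a Gregorian year, and uses hypothetical function names.

```python
def roc_year(gregorian_year):
    """ROC Official year: counted from 1912 as year 1."""
    return gregorian_year - 1911


def thai_buddhist_year(gregorian_year):
    """Thai Buddha year: Gregorian year plus 543."""
    return gregorian_year + 543


def heisei_year(gregorian_year):
    """Japanese Imperial year within the Heisei era (1989 is year 1)."""
    if not 1989 <= gregorian_year <= 2019:
        raise ValueError("year is outside the Heisei era")
    return gregorian_year - 1988


# The figures in this section use dates in March 1998.
print(roc_year(1998), thai_buddhist_year(1998), heisei_year(1998))
```

For 1998 this gives ROC year 87, Thai Buddhist year 2541, and Heisei year 10.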

Character Sets that Support the Euro Symbol

Table A-10 lists the character sets that support the Euro symbol.

Table A-10 Character Sets with Euro Support
Name  Description  Euro Code Value
WE8EBCDIC1140  EBCDIC Code Page 1140 8-bit West European  0x9F
WE8EBCDIC1140C  EBCDIC Code Page 1140C 8-bit West European  0x9F
D8EBCDIC1141  EBCDIC Code Page 1141 8-bit Austrian German  0x9F
DK8EBCDIC1142  EBCDIC Code Page 1142 8-bit Danish  0x5A
S8EBCDIC1143  EBCDIC Code Page 1143 8-bit Swedish  0x5A
I8EBCDIC1144  EBCDIC Code Page 1144 8-bit Italian  0x9F
WE8EBCDIC1145  EBCDIC Code Page 1145 8-bit West European  0x9F
WE8EBCDIC1146  EBCDIC Code Page 1146 8-bit West European  0x9F
F8EBCDIC1147  EBCDIC Code Page 1147 8-bit French  0x9F
WE8EBCDIC1148  EBCDIC Code Page 1148 8-bit West European  0x9F
WE8EBCDIC1148C  EBCDIC Code Page 1148C 8-bit West European  0x9F
WE8PC858  IBM-PC Code Page 858 8-bit West European  0xDF
EL8ISO8859P7  ISO 8859-7 Latin/Greek  0xA4
WE8ISO8859P15  ISO 8859-15 West European  0xA4
EE8MSWIN1250  MS Windows Code Page 1250 8-bit East European  0x80
CL8MSWIN1251  MS Windows Code Page 1251 8-bit Latin/Cyrillic  0x88
WE8MSWIN1252  MS Windows Code Page 1252 8-bit West European  0x80
EL8MSWIN1253  MS Windows Code Page 1253 8-bit Latin/Greek  0x80
TR8MSWIN1254  MS Windows Code Page 1254 8-bit Turkish  0x80
IW8MSWIN1255  MS Windows Code Page 1255 8-bit Latin/Hebrew  0x80
AR8MSWIN1256  MS Windows Code Page 1256 8-bit Latin/Arabic  0x80
BLT8MSWIN1257  MS Windows Code Page 1257 Baltic  0x80
VN8MSWIN1258  MS Windows Code Page 1258 8-bit Vietnamese  0x80
TH8TISASCII  Thai Industrial Standard 620-2533 - ASCII 8-bit  0x80
AL24UTFFSS  Unicode 1.1 UTF-8 Universal character set  U+20AC
UTF8  Unicode 2.1 UTF-8 Universal character set  U+20AC
UTFE  UTF-EBCDIC encoding of Unicode 2.1  U+20AC
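Several of these code values can be cross-checked outside the database. The Python sketch below encodes the Euro sign with Python codecs that are assumed, for this example only, to correspond to the Oracle character sets named in the comments; it does not consult Oracle's own conversion tables, and the names `EXPECTED` and `euro_code` are hypothetical.

```python
EURO = "\u20ac"

# Python codec name -> Euro code value listed in Table A-10.
EXPECTED = {
    "cp1140": 0x9F,      # assumed stand-in for WE8EBCDIC1140
    "iso8859_15": 0xA4,  # WE8ISO8859P15
    "cp1250": 0x80,      # EE8MSWIN1250
    "cp1251": 0x88,      # CL8MSWIN1251
    "cp1252": 0x80,      # WE8MSWIN1252
    "cp1253": 0x80,      # EL8MSWIN1253
    "cp1254": 0x80,      # TR8MSWIN1254
    "cp1255": 0x80,      # IW8MSWIN1255
    "cp1256": 0x80,      # AR8MSWIN1256
    "cp1257": 0x80,      # BLT8MSWIN1257
    "cp1258": 0x80,      # VN8MSWIN1258
}


def euro_code(codec):
    """Return the single byte that encodes the Euro sign, or None if it is not one byte."""
    encoded = EURO.encode(codec)
    return encoded[0] if len(encoded) == 1 else None


mismatches = {c: euro_code(c) for c, v in EXPECTED.items() if euro_code(c) != v}
print(mismatches or "all checked code values match the table")
```

For the Unicode-based character sets the table gives the code point U+20AC rather than a byte value; in standard UTF-8 that code point is encoded as the three bytes E2 82 AC.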

Default Values for NLS Parameters

Table A-11 lists the default values for NLS parameters.

Table A-11 Default Values for NLS Parameters
Name  Default Value
NLS_CALENDAR  Gregorian
NLS_COMP  Binary
NLS_CREDIT  NLS_TERRITORY
NLS_CURRENCY  NLS_TERRITORY
NLS_DATE_FORMAT  NLS_TERRITORY
NLS_DATE_LANGUAGE  NLS_LANGUAGE
NLS_DEBIT  NLS_TERRITORY
NLS_ISO_CURRENCY  NLS_TERRITORY
NLS_LANG  American_America.US7ASCII
NLS_LANGUAGE  NLS_LANG
NLS_LIST_SEPARATOR  NLS_TERRITORY
NLS_MONETARY_CHARACTERS  NLS_TERRITORY
NLS_NCHAR  NLS_LANG
NLS_NUMERIC_CHARACTERS  NLS_TERRITORY
NLS_SORT  NLS_LANGUAGE
NLS_TERRITORY  NLS_LANG
NLS_DUAL_CURRENCY  NLS_TERRITORY
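Many defaults in Table A-11 are derived from another parameter, and ultimately from NLS_LANG, whose value has the form language_territory.charset. The Python sketch below shows one way to split such a value and to follow the derivation chain for a few parameters. It is an illustration only: `parse_nls_lang`, `resolve_source`, and the partial `DERIVED_FROM` mapping are hypothetical names built from the table, not an Oracle interface.

```python
# A subset of Table A-11: parameter -> parameter its default is derived from.
DERIVED_FROM = {
    "NLS_LANGUAGE": "NLS_LANG",
    "NLS_TERRITORY": "NLS_LANG",
    "NLS_SORT": "NLS_LANGUAGE",
    "NLS_DATE_LANGUAGE": "NLS_LANGUAGE",
    "NLS_DATE_FORMAT": "NLS_TERRITORY",
    "NLS_CURRENCY": "NLS_TERRITORY",
}


def parse_nls_lang(value):
    """Split 'language_territory.charset' into its parts (None when a part is absent)."""
    lang_terr, _, charset = value.partition(".")
    language, _, territory = lang_terr.partition("_")
    return language or None, territory or None, charset or None


def resolve_source(parameter):
    """Follow the Default Value column until a parameter with a literal default is reached."""
    chain = [parameter]
    while chain[-1] in DERIVED_FROM:
        chain.append(DERIVED_FROM[chain[-1]])
    return chain


print(parse_nls_lang("American_America.US7ASCII"))
print(resolve_source("NLS_SORT"))
```

Here NLS_SORT resolves through NLS_LANGUAGE to NLS_LANG, so its default ultimately depends on the language component of the NLS_LANG value.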


Copyright © 1996-2000, Oracle Corporation.

All Rights Reserved.
