Oracle9i Database Globalization Support Guide
Release 2 (9.2)

Part Number A96529-01

A
Locale Data

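This appendix lists the languages, territories, character sets, and other locale data supported by the Oracle server. It includes these topics:

Languages
Translated Messages
Territories
Character Sets
Linguistic Sorting
Calendar Systems
Obsolete Locale Data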

You can also obtain information about supported character sets, languages, territories, and sorting orders by querying the V$NLS_VALID_VALUES dynamic performance view.

See Also:

Oracle9i Database Reference for more information about the data that can be returned by this view

Languages

Table A-1 lists the languages supported by the Oracle server.

Table A-1 Oracle Supported Languages  
Name | Abbreviation
AMERICAN | us
ARABIC | ar
ASSAMESE | as
BANGLA | bn
BRAZILIAN PORTUGUESE | ptb
BULGARIAN | bg
CANADIAN FRENCH | frc
CATALAN | ca
CROATIAN | hr
CZECH | cs
DANISH | dk
DUTCH | nl
EGYPTIAN | eg
ENGLISH | gb
ESTONIAN | et
FINNISH | sf
FRENCH | f
GERMAN DIN | din
GERMAN | d
GREEK | el
GUJARATI | gu
HEBREW | iw
HINDI | hi
HUNGARIAN | hu
ICELANDIC | is
INDONESIAN | in
ITALIAN | i
JAPANESE | ja
KANNADA | kn
KOREAN | ko
LATIN AMERICAN SPANISH | esa
LATVIAN | lv
LITHUANIAN | lt
MALAY | ms
MALAYALAM | ml
MARATHI | mr
MEXICAN SPANISH | esm
NORWEGIAN | n
ORIYA | or
POLISH | pl
PORTUGUESE | pt
PUNJABI | pa
ROMANIAN | ro
RUSSIAN | ru
SIMPLIFIED CHINESE | zhs
SLOVAK | sk
SLOVENIAN | sl
SPANISH | e
SWEDISH | s
TAMIL | ta
TELUGU | te
THAI | th
TRADITIONAL CHINESE | zht
TURKISH | tr
UKRAINIAN | uk
VIETNAMESE | vn

Translated Messages

Oracle error messages have been translated into the languages listed in Table A-2.

Table A-2 Oracle Supported Messages  
Name | Abbreviation
ARABIC | ar
BRAZILIAN PORTUGUESE | ptb
CANADIAN FRENCH | frc
CATALAN | ca
CZECH | cs
DANISH | dk
DUTCH | nl
FINNISH | sf
FRENCH | f
GERMAN | d
GREEK | el
HEBREW | iw
HUNGARIAN | hu
ITALIAN | i
JAPANESE | ja
KOREAN | ko
LATIN AMERICAN SPANISH | esa
NORWEGIAN | n
POLISH | pl
PORTUGUESE | pt
ROMANIAN | ro
RUSSIAN | ru
SIMPLIFIED CHINESE | zhs
SLOVAK | sk
SPANISH | e
SWEDISH | s
THAI | th
TRADITIONAL CHINESE | zht
TURKISH | tr

Territories

Table A-3 lists the territories supported by the Oracle server.

Table A-3 Oracle Supported Territories  
Name | Name | Name
ALGERIA | HONG KONG | PERU
AMERICA | HUNGARY | POLAND
AUSTRALIA | ICELAND | PORTUGAL
AUSTRIA | INDIA | PUERTO RICO
BAHRAIN | INDONESIA | QATAR
BANGLADESH | IRAQ | ROMANIA
BELGIUM | IRELAND | SAUDI ARABIA
BRAZIL | ISRAEL | SINGAPORE
BULGARIA | ITALY | SLOVAKIA
CANADA | JAPAN | SLOVENIA
CATALONIA | JORDAN | SOMALIA
CHILE | KAZAKHSTAN | SOUTH AFRICA
CHINA | KOREA | SPAIN
CIS | KUWAIT | SUDAN
COLOMBIA | LATVIA | SWEDEN
COSTA RICA | LEBANON | SWITZERLAND
CROATIA | LIBYA | SYRIA
CYPRUS | LITHUANIA | TAIWAN
CZECH REPUBLIC | LUXEMBOURG | THAILAND
DENMARK | MACEDONIA | THE NETHERLANDS
DJIBOUTI | MALAYSIA | TUNISIA
EGYPT | MAURITANIA | TURKEY
EL SALVADOR | MEXICO | UKRAINE
ESTONIA | MOROCCO | UNITED ARAB EMIRATES
FINLAND | NEW ZEALAND | UNITED KINGDOM
FRANCE | NICARAGUA | UZBEKISTAN
GUATEMALA | NORWAY | VENEZUELA
GERMANY | OMAN | VIETNAM
GREECE | PANAMA | YEMEN
- | - | YUGOSLAVIA

Character Sets

Oracle-supported character sets are listed in the following sections according to three broad language groups: Asian, European, and Middle Eastern.

In addition, common subset/superset combinations are listed.

Note that some character sets may be listed under multiple language groups because they provide multilingual support. For instance, Unicode spans the Asian, European, and Middle Eastern language groups because it supports most of the major scripts of the world.

The comment section indicates the type of encoding used:

SB = Single-byte encoding

MB = Multibyte encoding

FIXED = Fixed-width multibyte encoding

As mentioned in Chapter 3, "Setting Up a Globalization Support Environment", the type of encoding affects performance, so use the most efficient encoding that meets your language needs. Also, some encoding types can only be used with certain data types. For instance, the AL16UTF16 character set can only be used as an NCHAR character set, and not as a database character set.

The comment section also documents other features of the character set that may be important to users or database administrators: whether the character set supports the Euro currency symbol, whether user-defined characters are supported for character set customization, and whether the character set is a strict superset of ASCII (which allows you to use the ALTER DATABASE [NATIONAL] CHARACTER SET statement during migration).

EURO = Euro symbol supported

UDC = User-defined characters supported

ASCII = Strict superset of ASCII
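
As an illustration of how the codes in the Comments column combine, the following Python sketch parses a Comments entry into a set of codes and checks two of the properties described above. The function names are hypothetical and are not part of any Oracle API; the sample entries are copied from the tables in this appendix.

```python
# Legend for the Comments column, as defined in this appendix.
LEGEND = {
    "SB": "Single-byte encoding",
    "MB": "Multibyte encoding",
    "FIXED": "Fixed-width multibyte encoding",
    "EURO": "Euro symbol supported",
    "UDC": "User-defined characters supported",
    "ASCII": "Strict superset of ASCII",
}


def parse_comments(comments):
    """Split a Comments entry such as 'MB, ASCII, EURO' into a set of codes."""
    return {code.strip() for code in comments.split(",") if code.strip()}


def is_ascii_superset(comments):
    """True if the entry marks the character set as a strict superset of ASCII."""
    return "ASCII" in parse_comments(comments)


def supports_euro(comments):
    """True if the entry marks the character set as supporting the Euro symbol."""
    return "EURO" in parse_comments(comments)


if __name__ == "__main__":
    # Sample rows taken from the character set tables in this appendix.
    samples = {
        "WE8ISO8859P15": "SB, ASCII, EURO",
        "AL16UTF16": "MB, EURO, FIXED",
        "JA16SJISYEN": "MB, UDC",
    }
    for name, comments in samples.items():
        codes = sorted(parse_comments(comments))
        print(name, codes, "ASCII superset:", is_ascii_superset(comments))
```

A check like `is_ascii_superset` only reflects what the table states; it does not replace the analysis that should precede an actual character set migration.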

Oracle does not document individual code page layouts. For specific details about a particular character set, its character repertoire, and code point values, you should refer to the actual national, international, or vendor-specific standards.

Asian Language Character Sets

Table A-4 lists the Oracle character sets that can support Asian languages.

Table A-4 Asian Language Character Sets  
Name | Description | Comments
BN8BSCII | Bangladesh National Code 8-bit BSCII | SB, ASCII
ZHT16BIG5 | BIG5 16-bit Traditional Chinese | MB, ASCII
ZHT16HKSCS | MS Windows Code Page 950 with Hong Kong Supplementary Character Set | MB, ASCII, EURO
ZHS16CGB231280 | CGB2312-80 16-bit Simplified Chinese | MB, ASCII
ZHS32GB18030 | GB18030-2000 | MB, ASCII, EURO
JA16EUC | EUC 24-bit Japanese | MB, ASCII
JA16EUCTILDE | The same as JA16EUC except for the way that the wave dash and the tilde are mapped to and from Unicode. | MB, ASCII
JA16EUCYEN | EUC 24-bit Japanese with '\' mapped to the Japanese yen character | MB
ZHT32EUC | EUC 32-bit Traditional Chinese | MB, ASCII
ZHS16GBK | GBK 16-bit Simplified Chinese | MB, ASCII, UDC
ZHT16CCDC | HP CCDC 16-bit Traditional Chinese | MB, ASCII
JA16DBCS | IBM EBCDIC 16-bit Japanese | MB, UDC
JA16EBCDIC930 | IBM DBCS Code Page 290 16-bit Japanese | MB, UDC
KO16DBCS | IBM EBCDIC 16-bit Korean | MB, UDC
ZHS16DBCS | IBM EBCDIC 16-bit Simplified Chinese | MB, UDC
ZHT16DBCS | IBM EBCDIC 16-bit Traditional Chinese | MB, UDC
KO16KSC5601 | KSC5601 16-bit Korean | MB, ASCII
KO16KSCCS | KSCCS 16-bit Korean | MB, ASCII
JA16VMS | JVMS 16-bit Japanese | MB, ASCII
ZHS16MACCGB231280 | Mac client CGB2312-80 16-bit Simplified Chinese | MB
JA16MACSJIS | Mac client Shift-JIS 16-bit Japanese | MB
TH8MACTHAI | Mac Client 8-bit Latin/Thai | SB
TH8MACTHAIS | Mac Server 8-bit Latin/Thai | SB, ASCII
TH8TISEBCDICS | Thai Industrial Standard 620-2533-EBCDIC Server 8-bit | SB
ZHT16MSWIN950 | MS Windows Code Page 950 Traditional Chinese | MB, ASCII, UDC
KO16MSWIN949 | MS Windows Code Page 949 Korean | MB, ASCII, UDC
VN8MSWIN1258 | MS Windows Code Page 1258 8-bit Vietnamese | SB, ASCII, EURO
IN8ISCII | Multiple-Script Indian Standard 8-bit Latin/Indian Languages | SB, ASCII
JA16SJIS | Shift-JIS 16-bit Japanese | MB, ASCII, UDC
JA16SJISTILDE | The same as JA16SJIS except for the way that the wave dash and the tilde are mapped to and from Unicode. | MB, ASCII, UDC
JA16SJISYEN | Shift-JIS 16-bit Japanese with '\' mapped to the Japanese yen character | MB, UDC
ZHT32SOPS | SOPS 32-bit Traditional Chinese | MB, ASCII
ZHT16DBT | Taiwan Taxation 16-bit Traditional Chinese | MB, ASCII
TH8TISASCII | Thai Industrial Standard 620-2533 - ASCII 8-bit | SB, ASCII, EURO
TH8TISEBCDIC | Thai Industrial Standard 620-2533 - EBCDIC 8-bit | SB
ZHT32TRIS | TRIS 32-bit Traditional Chinese | MB, ASCII
AL16UTF16 | See "Universal Character Sets" for details | MB, EURO, FIXED
AL32UTF8 | See "Universal Character Sets" for details | MB, ASCII, EURO
UTF8 | See "Universal Character Sets" for details | MB, ASCII, EURO
UTFE | See "Universal Character Sets" for details | MB, EURO
VN8VN3 | VN3 8-bit Vietnamese | SB, ASCII

European Language Character Sets

Table A-5 lists the Oracle character sets that can support European languages.

Table A-5 European Language Character Sets  
Name | Description | Comments
US7ASCII | ASCII 7-bit American | SB, ASCII
SF7ASCII | ASCII 7-bit Finnish | SB
YUG7ASCII | ASCII 7-bit Yugoslavian | SB
RU8BESTA | BESTA 8-bit Latin/Cyrillic | SB, ASCII
EL8GCOS7 | Bull EBCDIC GCOS7 8-bit Greek | SB
WE8GCOS7 | Bull EBCDIC GCOS7 8-bit West European | SB
EL8DEC | DEC 8-bit Latin/Greek | SB
TR7DEC | DEC VT100 7-bit Turkish | SB
TR8DEC | DEC 8-bit Turkish | SB, ASCII
TR8EBCDIC1026 | EBCDIC Code Page 1026 8-bit Turkish | SB
TR8EBCDIC1026S | EBCDIC Code Page 1026 Server 8-bit Turkish | SB
TR8PC857 | IBM-PC Code Page 857 8-bit Turkish | SB, ASCII
TR8MACTURKISH | MAC Client 8-bit Turkish | SB
TR8MACTURKISHS | MAC Server 8-bit Turkish | SB, ASCII
TR8MSWIN1254 | MS Windows Code Page 1254 8-bit Turkish | SB, ASCII, EURO
WE8BS2000L5 | Siemens EBCDIC.DF.L5 8-bit West European/Turkish | SB
WE8DEC | DEC 8-bit West European | SB, ASCII
D7DEC | DEC VT100 7-bit German | SB
F7DEC | DEC VT100 7-bit French | SB
S7DEC | DEC VT100 7-bit Swedish | SB
E7DEC | DEC VT100 7-bit Spanish | SB
NDK7DEC | DEC VT100 7-bit Norwegian/Danish | SB
I7DEC | DEC VT100 7-bit Italian | SB
NL7DEC | DEC VT100 7-bit Dutch | SB
CH7DEC | DEC VT100 7-bit Swiss (German/French) | SB
SF7DEC | DEC VT100 7-bit Finnish | SB
WE8DG | DG 8-bit West European | SB, ASCII
WE8EBCDIC37C | EBCDIC Code Page 37 8-bit Oracle/c | SB
WE8EBCDIC37 | EBCDIC Code Page 37 8-bit West European | SB
D8EBCDIC273 | EBCDIC Code Page 273/1 8-bit Austrian German | SB
DK8EBCDIC277 | EBCDIC Code Page 277/1 8-bit Danish | SB
S8EBCDIC278 | EBCDIC Code Page 278/1 8-bit Swedish | SB
I8EBCDIC280 | EBCDIC Code Page 280/1 8-bit Italian | SB
WE8EBCDIC284 | EBCDIC Code Page 284 8-bit Latin American/Spanish | SB
WE8EBCDIC285 | EBCDIC Code Page 285 8-bit West European | SB
WE8EBCDIC924 | Latin 9 EBCDIC 924 | SB, EBCDIC
WE8EBCDIC1047 | EBCDIC Code Page 1047 8-bit West European | SB
WE8EBCDIC1047E | Latin 1/Open Systems 1047 | SB, EBCDIC, EURO
WE8EBCDIC1140 | EBCDIC Code Page 1140 8-bit West European | SB, EURO
WE8EBCDIC1140C | EBCDIC Code Page 1140 Client 8-bit West European | SB, EURO
WE8EBCDIC1145 | EBCDIC Code Page 1145 8-bit West European | SB, EURO
WE8EBCDIC1146 | EBCDIC Code Page 1146 8-bit West European | SB, EURO
WE8EBCDIC1148 | EBCDIC Code Page 1148 8-bit West European | SB, EURO
WE8EBCDIC1148C | EBCDIC Code Page 1148 Client 8-bit West European | SB, EURO
F8EBCDIC297 | EBCDIC Code Page 297 8-bit French | SB
WE8EBCDIC500C | EBCDIC Code Page 500 8-bit Oracle/c | SB
WE8EBCDIC500 | EBCDIC Code Page 500 8-bit West European | SB
EE8EBCDIC870 | EBCDIC Code Page 870 8-bit East European | SB
EE8EBCDIC870C | EBCDIC Code Page 870 Client 8-bit East European | SB
EE8EBCDIC870S | EBCDIC Code Page 870 Server 8-bit East European | SB
WE8EBCDIC871 | EBCDIC Code Page 871 8-bit Icelandic | SB
EL8EBCDIC875 | EBCDIC Code Page 875 8-bit Greek | SB
EL8EBCDIC875R | EBCDIC Code Page 875 Server 8-bit Greek | SB
CL8EBCDIC1025 | EBCDIC Code Page 1025 8-bit Cyrillic | SB
CL8EBCDIC1025C | EBCDIC Code Page 1025 Client 8-bit Cyrillic | SB
CL8EBCDIC1025R | EBCDIC Code Page 1025 Server 8-bit Cyrillic | SB
CL8EBCDIC1025S | EBCDIC Code Page 1025 Server 8-bit Cyrillic | SB
CL8EBCDIC1025X | EBCDIC Code Page 1025 (Modified) 8-bit Cyrillic | SB
BLT8EBCDIC1112 | EBCDIC Code Page 1112 8-bit Baltic Multilingual | SB
BLT8EBCDIC1112S | EBCDIC Code Page 1112 8-bit Server Baltic Multilingual | SB
D8EBCDIC1141 | EBCDIC Code Page 1141 8-bit Austrian German | SB, EURO
DK8EBCDIC1142 | EBCDIC Code Page 1142 8-bit Danish | SB, EURO
S8EBCDIC1143 | EBCDIC Code Page 1143 8-bit Swedish | SB, EURO
I8EBCDIC1144 | EBCDIC Code Page 1144 8-bit Italian | SB, EURO
F8EBCDIC1147 | EBCDIC Code Page 1147 8-bit French | SB, EURO
EEC8EUROASCI | EEC Targon 35 ASCI West European/Greek | SB
EEC8EUROPA3 | EEC EUROPA3 8-bit West European/Greek | SB
LA8PASSPORT | German Government Printer 8-bit All-European Latin | SB, ASCII
WE8HP | HP LaserJet 8-bit West European | SB
WE8ROMAN8 | HP Roman8 8-bit West European | SB, ASCII
HU8CWI2 | Hungarian 8-bit CWI-2 | SB, ASCII
HU8ABMOD | Hungarian 8-bit Special AB Mod | SB, ASCII
LV8RST104090 | IBM-PC Alternative Code Page 8-bit Latvian (Latin/Cyrillic) | SB, ASCII
US8PC437 | IBM-PC Code Page 437 8-bit American | SB, ASCII
BG8PC437S | IBM-PC Code Page 437 8-bit (Bulgarian Modification) | SB, ASCII
EL8PC437S | IBM-PC Code Page 437 8-bit (Greek modification) | SB, ASCII
EL8PC737 | IBM-PC Code Page 737 8-bit Greek/Latin | SB
LT8PC772 | IBM-PC Code Page 772 8-bit Lithuanian (Latin/Cyrillic) | SB, ASCII
LT8PC774 | IBM-PC Code Page 774 8-bit Lithuanian (Latin) | SB, ASCII
BLT8PC775 | IBM-PC Code Page 775 8-bit Baltic | SB, ASCII
WE8PC850 | IBM-PC Code Page 850 8-bit West European | SB, ASCII
EL8PC851 | IBM-PC Code Page 851 8-bit Greek/Latin | SB, ASCII
EE8PC852 | IBM-PC Code Page 852 8-bit East European | SB, ASCII
RU8PC855 | IBM-PC Code Page 855 8-bit Latin/Cyrillic | SB, ASCII
WE8PC858 | IBM-PC Code Page 858 8-bit West European | SB, ASCII, EURO
WE8PC860 | IBM-PC Code Page 860 8-bit West European | SB, ASCII
IS8PC861 | IBM-PC Code Page 861 8-bit Icelandic | SB, ASCII
CDN8PC863 | IBM-PC Code Page 863 8-bit Canadian French | SB, ASCII
N8PC865 | IBM-PC Code Page 865 8-bit Norwegian | SB, ASCII
RU8PC866 | IBM-PC Code Page 866 8-bit Latin/Cyrillic | SB, ASCII
EL8PC869 | IBM-PC Code Page 869 8-bit Greek/Latin | SB, ASCII
LV8PC1117 | IBM-PC Code Page 1117 8-bit Latvian | SB, ASCII
US8ICL | ICL EBCDIC 8-bit American | SB
WE8ICL | ICL EBCDIC 8-bit West European | SB
WE8ISOICLUK | ICL special version ISO8859-1 | SB
WE8ISO8859P1 | ISO 8859-1 West European | SB, ASCII
EE8ISO8859P2 | ISO 8859-2 East European | SB, ASCII
SE8ISO8859P3 | ISO 8859-3 South European | SB, ASCII
NEE8ISO8859P4 | ISO 8859-4 North and North-East European | SB, ASCII
CL8ISO8859P5 | ISO 8859-5 Latin/Cyrillic | SB, ASCII
AR8ISO8859P6 | ISO 8859-6 Latin/Arabic | SB, ASCII
EL8ISO8859P7 | ISO 8859-7 Latin/Greek | SB, ASCII, EURO
IW8ISO8859P8 | ISO 8859-8 Latin/Hebrew | SB, ASCII
NE8ISO8859P10 | ISO 8859-10 North European | SB, ASCII
BLT8ISO8859P13 | ISO 8859-13 Baltic | SB, ASCII
CEL8ISO8859P14 | ISO 8859-14 Celtic | SB, ASCII
WE8ISO8859P15 | ISO 8859-15 West European | SB, ASCII, EURO
LA8ISO6937 | ISO 6937 8-bit Coded Character Set for Text Communication | SB, ASCII
IW7IS960 | Israeli Standard 960 7-bit Latin/Hebrew | SB
AR8ARABICMAC | Mac Client 8-bit Latin/Arabic | SB
EE8MACCE | Mac Client 8-bit Central European | SB
EE8MACCROATIAN | Mac Client 8-bit Croatian | SB
WE8MACROMAN8 | Mac Client 8-bit Extended Roman8 West European | SB
EL8MACGREEK | Mac Client 8-bit Greek | SB
IS8MACICELANDIC | Mac Client 8-bit Icelandic | SB
CL8MACCYRILLIC | Mac Client 8-bit Latin/Cyrillic | SB
AR8ARABICMACS | Mac Server 8-bit Latin/Arabic | SB, ASCII
EE8MACCES | Mac Server 8-bit Central European | SB, ASCII
EE8MACCROATIANS | Mac Server 8-bit Croatian | SB, ASCII
WE8MACROMAN8S | Mac Server 8-bit Extended Roman8 West European | SB, ASCII
CL8MACCYRILLICS | Mac Server 8-bit Latin/Cyrillic | SB, ASCII
EL8MACGREEKS | Mac Server 8-bit Greek | SB, ASCII
IS8MACICELANDICS | Mac Server 8-bit Icelandic | SB
BG8MSWIN | MS Windows 8-bit Bulgarian Cyrillic | SB, ASCII
LT8MSWIN921 | MS Windows Code Page 921 8-bit Lithuanian | SB, ASCII
ET8MSWIN923 | MS Windows Code Page 923 8-bit Estonian | SB, ASCII
EE8MSWIN1250 | MS Windows Code Page 1250 8-bit East European | SB, ASCII, EURO
CL8MSWIN1251 | MS Windows Code Page 1251 8-bit Latin/Cyrillic | SB, ASCII, EURO
WE8MSWIN1252 | MS Windows Code Page 1252 8-bit West European | SB, ASCII, EURO
EL8MSWIN1253 | MS Windows Code Page 1253 8-bit Latin/Greek | SB, ASCII, EURO
BLT8MSWIN1257 | MS Windows Code Page 1257 8-bit Baltic | SB, ASCII, EURO
BLT8CP921 | Latvian Standard LVS8-92(1) Windows/Unix 8-bit Baltic | SB, ASCII
LV8PC8LR | Latvian Version IBM-PC Code Page 866 8-bit Latin/Cyrillic | SB, ASCII
WE8NCR4970 | NCR 4970 8-bit West European | SB, ASCII
WE8NEXTSTEP | NeXTSTEP PostScript 8-bit West European | SB, ASCII
CL8ISOIR111 | ISOIR111 Cyrillic | SB
CL8KOI8R | RELCOM Internet Standard 8-bit Latin/Cyrillic | SB, ASCII
CL8KOI8U | KOI8 Ukrainian Cyrillic | SB
US8BS2000 | Siemens 9750-62 EBCDIC 8-bit American | SB
DK8BS2000 | Siemens 9750-62 EBCDIC 8-bit Danish | SB
F8BS2000 | Siemens 9750-62 EBCDIC 8-bit French | SB
D8BS2000 | Siemens 9750-62 EBCDIC 8-bit German | SB
E8BS2000 | Siemens 9750-62 EBCDIC 8-bit Spanish | SB
S8BS2000 | Siemens 9750-62 EBCDIC 8-bit Swedish | SB
DK7SIEMENS9780X | Siemens 97801/97808 7-bit Danish | SB
F7SIEMENS9780X | Siemens 97801/97808 7-bit French | SB
D7SIEMENS9780X | Siemens 97801/97808 7-bit German | SB
I7SIEMENS9780X | Siemens 97801/97808 7-bit Italian | SB
N7SIEMENS9780X | Siemens 97801/97808 7-bit Norwegian | SB
E7SIEMENS9780X | Siemens 97801/97808 7-bit Spanish | SB
S7SIEMENS9780X | Siemens 97801/97808 7-bit Swedish | SB
EE8BS2000 | Siemens EBCDIC.DF.04 8-bit East European | SB
WE8BS2000 | Siemens EBCDIC.DF.04 8-bit West European | SB
WE8BS2000E | Siemens EBCDIC.DF.04 8-bit West European | SB, EURO
CL8BS2000 | Siemens EBCDIC.EHC.LC 8-bit Cyrillic | SB
AL16UTF16 | See "Universal Character Sets" for details | MB, EURO, FIXED
AL32UTF8 | See "Universal Character Sets" for details | MB, ASCII, EURO
UTF8 | See "Universal Character Sets" for details | MB, ASCII, EURO
UTFE | See "Universal Character Sets" for details | MB, EURO

Middle Eastern Language Character Sets

Table A-6 lists the Oracle character sets that can support Middle Eastern languages.

Table A-6 Middle Eastern Character Sets  
Name | Description | Comments
AR8APTEC715 | APTEC 715 Server 8-bit Latin/Arabic | SB, ASCII
AR8APTEC715T | APTEC 715 8-bit Latin/Arabic | SB
AR8ASMO708PLUS | ASMO 708 Plus 8-bit Latin/Arabic | SB, ASCII
AR8ASMO8X | ASMO Extended 708 8-bit Latin/Arabic | SB, ASCII
AR8ADOS710 | Arabic MS-DOS 710 Server 8-bit Latin/Arabic | SB, ASCII
AR8ADOS710T | Arabic MS-DOS 710 8-bit Latin/Arabic | SB
AR8ADOS720 | Arabic MS-DOS 720 Server 8-bit Latin/Arabic | SB, ASCII
AR8ADOS720T | Arabic MS-DOS 720 8-bit Latin/Arabic | SB
TR7DEC | DEC VT100 7-bit Turkish | SB
TR8DEC | DEC 8-bit Turkish | SB
WE8EBCDIC37C | EBCDIC Code Page 37 8-bit Oracle/c | SB
IW8EBCDIC424 | EBCDIC Code Page 424 8-bit Latin/Hebrew | SB
IW8EBCDIC424S | EBCDIC Code Page 424 Server 8-bit Latin/Hebrew | SB
WE8EBCDIC500C | EBCDIC Code Page 500 8-bit Oracle/c | SB
IW8EBCDIC1086 | EBCDIC Code Page 1086 8-bit Hebrew | SB
AR8EBCDIC420S | EBCDIC Code Page 420 Server 8-bit Latin/Arabic | SB
AR8EBCDICX | EBCDIC XBASIC Server 8-bit Latin/Arabic | SB
TR8EBCDIC1026 | EBCDIC Code Page 1026 8-bit Turkish | SB
TR8EBCDIC1026S | EBCDIC Code Page 1026 Server 8-bit Turkish | SB
AR8HPARABIC8T | HP 8-bit Latin/Arabic | SB
TR8PC857 | IBM-PC Code Page 857 8-bit Turkish | SB, ASCII
IW8PC1507 | IBM-PC Code Page 1507/862 8-bit Latin/Hebrew | SB, ASCII
AR8ISO8859P6 | ISO 8859-6 Latin/Arabic | SB, ASCII
IW8ISO8859P8 | ISO 8859-8 Latin/Hebrew | SB, ASCII
WE8ISO8859P9 | ISO 8859-9 West European & Turkish | SB, ASCII
LA8ISO6937 | ISO 6937 8-bit Coded Character Set for Text Communication | SB, ASCII
IW7IS960 | Israeli Standard 960 7-bit Latin/Hebrew | SB
IW8MACHEBREW | Mac Client 8-bit Hebrew | SB
AR8ARABICMAC | Mac Client 8-bit Latin/Arabic | SB
AR8ARABICMACT | Mac 8-bit Latin/Arabic | SB
TR8MACTURKISH | Mac Client 8-bit Turkish | SB
IW8MACHEBREWS | Mac Server 8-bit Hebrew | SB, ASCII
AR8ARABICMACS | Mac Server 8-bit Latin/Arabic | SB, ASCII
TR8MACTURKISHS | Mac Server 8-bit Turkish | SB, ASCII
TR8MSWIN1254 | MS Windows Code Page 1254 8-bit Turkish | SB, ASCII, EURO
IW8MSWIN1255 | MS Windows Code Page 1255 8-bit Latin/Hebrew | SB, ASCII, EURO
AR8MSWIN1256 | MS Windows Code Page 1256 8-bit Latin/Arabic | SB, ASCII, EURO
IN8ISCII | Multiple-Script Indian Standard 8-bit Latin/Indian Languages | SB
AR8MUSSAD768 | Mussa'd Alarabi/2 768 Server 8-bit Latin/Arabic | SB, ASCII
AR8MUSSAD768T | Mussa'd Alarabi/2 768 8-bit Latin/Arabic | SB
AR8NAFITHA711 | Nafitha Enhanced 711 Server 8-bit Latin/Arabic | SB, ASCII
AR8NAFITHA711T | Nafitha Enhanced 711 8-bit Latin/Arabic | SB
AR8NAFITHA721 | Nafitha International 721 Server 8-bit Latin/Arabic | SB, ASCII
AR8NAFITHA721T | Nafitha International 721 8-bit Latin/Arabic | SB
AR8SAKHR706 | SAKHR 706 Server 8-bit Latin/Arabic | SB, ASCII
AR8SAKHR707 | SAKHR 707 Server 8-bit Latin/Arabic | SB, ASCII
AR8SAKHR707T | SAKHR 707 8-bit Latin/Arabic | SB
AR8XBASIC | XBASIC 8-bit Latin/Arabic | SB
WE8BS2000L5 | Siemens EBCDIC.DF.04.L5 8-bit West European/Turkish | SB
AL16UTF16 | See "Universal Character Sets" for details | MB, EURO, FIXED
AL32UTF8 | See "Universal Character Sets" for details | MB, ASCII, EURO
UTF8 | See "Universal Character Sets" for details | MB, ASCII, EURO
UTFE | See "Universal Character Sets" for details | MB, EURO

Universal Character Sets

Table A-7 lists the Oracle character sets that provide universal language support. They attempt to support all languages of the world, including, but not limited to, Asian, European, and Middle Eastern languages.

Table A-7 Universal Character Sets
Name | Description | Comments
AL16UTF16 | Unicode 3.1 UTF-16 Universal character set | MB, EURO, FIXED
AL32UTF8 | Unicode 3.1 UTF-8 Universal character set | MB, ASCII, EURO
UTF8 | Unicode 3.0 UTF-8 Universal character set, CESU-8 compliant | MB, ASCII, EURO
UTFE | EBCDIC form of Unicode 3.0 UTF-8 Universal character set | MB, EURO


Note:

CESU-8 defines an encoding scheme for Unicode that is identical to UTF-8 except for its representation of supplementary characters. In CESU-8, supplementary characters are represented as six-byte sequences that result from the transformation of each UTF-16 surrogate code unit into an eight-bit form that is similar to the UTF-8 transformation, but without first converting the input surrogate pairs to a scalar value. See Unicode Technical Report #26.
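
The difference described in this note can be sketched in Python. The function below is a minimal illustration written for this appendix, not Oracle's implementation: it splits each supplementary character into its UTF-16 surrogate pair and applies the UTF-8 transformation to each surrogate separately, which yields six bytes instead of the four bytes that standard UTF-8 produces. The function name `cesu8_encode` is hypothetical.

```python
def cesu8_encode(text):
    """Encode text as CESU-8: UTF-8 applied to each UTF-16 code unit."""
    out = bytearray()
    for ch in text:
        cp = ord(ch)
        if cp >= 0x10000:
            # Supplementary character: split into a UTF-16 surrogate pair.
            cp -= 0x10000
            high = 0xD800 + (cp >> 10)
            low = 0xDC00 + (cp & 0x3FF)
            for unit in (high, low):
                # 'surrogatepass' lets Python emit the 3-byte form of a
                # surrogate code unit, which strict UTF-8 would reject.
                out += chr(unit).encode("utf-8", "surrogatepass")
        else:
            # BMP characters are encoded exactly as in UTF-8.
            out += ch.encode("utf-8", "surrogatepass")
    return bytes(out)


if __name__ == "__main__":
    ch = "\U00010400"  # a supplementary character (outside the BMP)
    print("UTF-8: ", ch.encode("utf-8").hex())  # four bytes
    print("CESU-8:", cesu8_encode(ch).hex())    # six bytes
```

For characters in the Basic Multilingual Plane the two encodings produce identical bytes, which is why the distinction only matters for data that contains supplementary characters.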


See Also:

Chapter 5, "Supporting Multilingual Databases with Unicode"

Character Set Conversion Support

The following character set encodings are supported for conversion only. They cannot be used as the database or national character set:

You can use these character sets as the source_char_set or dest_char_set in the CONVERT function.

Subsets and Supersets

Table A-8 lists common subset/superset relationships.

Table A-8 Subset-Superset Pairs  
Subset | Superset
AR8ADOS710 | AR8ADOS710T
AR8ADOS720 | AR8ADOS720T
AR8ADOS720T | AR8ADOS720
AR8APTEC715 | AR8APTEC715T
AR8ARABICMACT | AR8ARABICMAC
AR8ISO8859P6 | AR8ASMO708PLUS
AR8ISO8859P6 | AR8ASMO8X
AR8MUSSAD768 | AR8MUSSAD768T
AR8MUSSAD768T | AR8MUSSAD768
AR8NAFITHA711 | AR8NAFITHA711T
AR8NAFITHA721 | AR8NAFITHA721T
AR8SAKHR707 | AR8SAKHR707T
AR8SAKHR707T | AR8SAKHR707
BLT8CP921 | BLT8ISO8859P13
BLT8CP921 | LT8MSWIN921
D7DEC | D7SIEMENS9780X
D7SIEMENS9780X | D7DEC
DK7SIEMENS9780X | N7SIEMENS9780X
I7DEC | I7SIEMENS9780X
I7SIEMENS9780X | IW8EBCDIC424
IW8EBCDIC424 | IW8EBCDIC1086
KO16KSC5601 | KO16MSWIN949
LT8MSWIN921 | BLT8ISO8859P13
LT8MSWIN921 | BLT8CP921
N7SIEMENS9780X | DK7SIEMENS9780X
US7ASCII | See Table A-9, "US7ASCII Supersets".
WE16DECTST | WE16DECTST2
WE16DECTST2 | WE16DECTST
WE8DEC | TR8DEC
WE8DEC | WE8NCR4970
WE8ISO8859P1 | WE8MSWIN1252
WE8ISO8859P9 | TR8MSWIN1254
WE8NCR4970 | TR8DEC
WE8NCR4970 | WE8DEC
WE8PC850 | WE8PC858
ZHS16GBK | ZHS32GB18030

US7ASCII is a special case because so many other character sets are supersets of it. Table A-9 lists supersets for US7ASCII.

Table A-9 US7ASCII Supersets  
Supersets | Supersets | Supersets
AL24UTFFSS | EE8MACCES | NEE8ISO8859P4
AL32UTF8 | EE8MACCROATIANS | RU8BESTA
AR8ADOS710 | EE8MSWIN1250 | RU8PC855
AR8ADOS710T | EE8PC852 | RU8PC866
AR8ADOS720 | EL8DEC | SE8ISO8859P3
AR8ADOS720T | EL8ISO8859P7 | TH8MACTHAIS
AR8APTEC715 | EL8MACGREEKS | TH8TISASCII
AR8APTEC715T | EL8MSWIN1253 | TR8DEC
AR8ARABICMACS | EL8PC437S | TR8MACTURKISHS
AR8ASMO708PLUS | EL8PC851 | TR8MSWIN1254
AR8ASMO8X | EL8PC869 | TR8PC857
AR8HPARABIC8T | ET8MSWIN923 | US8PC437
AR8ISO8859P6 | HU8ABMOD | UTF8
AR8MSAWIN | HU8CWI2 | VN8MSWIN1258
AR8MUSSAD768 | IN8ISCII | VN8VN3
AR8MUSSAD768T | IS8PC861 | WE8DEC
AR8NAFITHA711 | IW8ISO8859P8 | WE8DG
AR8NAFITHA711T | IW8MACHEBREWS | WE8ISO8859P1
AR8NAFITHA721 | IW8MSWIN1255 | WE8ISO8859P15
AR8NAFITHA721T | IW8PC1507 | WE8ISO8859P9
AR8SAKHR706 | JA16EUC | WE8MACROMAN8S
AR8SAKHR707 | JA16SJIS | WE8MSWIN1252
AR8SAKHR707T | JA16TSTSET | WE8NCR4970
BG8MSWIN | JA16TSTSET2 | WE8NEXTSTEP
BG8PC437S | JA16VMS | WE8PC850
BLT8CP921 | KO16KSC5601 | WE8PC858
BLT8ISO8859P13 | KO16KSCCS | WE8PC860
BLT8MSWIN1257 | KO16MSWIN949 | WE8ROMAN8
BLT8PC775 | KO16TSTSET | ZHS16CGB231280
BN8BSCII | LA8ISO6937 | ZHS16GBK
CDN8PC863 | LA8PASSPORT | ZHT16BIG5
CEL8ISO8859P14 | LT8MSWIN921 | ZHT16CCDC
CL8ISO8859P5 | LT8PC772 | ZHT16DBT
CL8KOI8R | LT8PC774 | ZHT16HKSCS
CL8KOI8U | LV8PC1117 | ZHT16MSWIN950
CL8ISOIR111 | LV8PC8LR | ZHT32EUC
CL8MACCYRILLICS | LV8RST104090 | ZHT32SOPS
CL8MSWIN1251 | N8PC865 | ZHT32TRIS
EE8ISO8859P2 | NE8ISO8859P10 | ZHS32GB18030
ZHT32EUCTST | - | -

Linguistic Sorting

Oracle offers two kinds of linguistic sorts: monolingual and multilingual. In addition, monolingual sorts can be extended to handle special cases. These special cases (represented with a prefix X) typically mean that the characters will be sorted differently from their ASCII values. For example, ch and ll are each treated as a single character in XSPANISH.
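
To make the XSPANISH example concrete, here is a minimal Python sketch of a sort key in which ch and ll each count as a single character, placed after c and l respectively (the traditional Spanish ordering). It does not reproduce Oracle's collation tables: accented letters, ñ, and case distinctions are ignored, and the names `ELEMENTS`, `RANK`, and `spanish_key` are hypothetical.

```python
import string

# Collation elements in traditional Spanish order, restricted to unaccented
# lowercase letters: "ch" follows "c" and "ll" follows "l".
ELEMENTS = []
for letter in string.ascii_lowercase:
    ELEMENTS.append(letter)
    if letter == "c":
        ELEMENTS.append("ch")
    elif letter == "l":
        ELEMENTS.append("ll")
RANK = {element: index for index, element in enumerate(ELEMENTS)}


def spanish_key(word):
    """Return a sort key in which 'ch' and 'll' count as single characters."""
    word = word.lower()
    key = []
    i = 0
    while i < len(word):
        pair = word[i:i + 2]
        if pair in ("ch", "ll"):
            key.append(RANK[pair])
            i += 2
        else:
            # Characters outside a-z sort after all known elements.
            key.append(RANK.get(word[i], len(ELEMENTS)))
            i += 1
    return key


words = ["cuna", "chico", "luna", "llama", "dama", "cola"]
print(sorted(words))                   # plain code-point order
print(sorted(words, key=spanish_key))  # "ch" after "c", "ll" after "l"
```

With the plain sort, chico precedes cola and llama precedes luna; with the contracting key, chico follows every other word that begins with c, and llama follows luna.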

Table A-10 lists the monolingual linguistic sorts supported by the Oracle server.

Table A-10 Monolingual Linguistic Sorts  
Basic Name | Extended Name | Special Cases
ARABIC | - | -
ARABIC_MATCH | - | -
ARABIC_ABJ_SORT | - | -
ARABIC_ABJ_MATCH | - | -
ASCII7 | - | -
BENGALI | - | -
BIG5 | - | -
BINARY | - | -
BULGARIAN | - | -
CANADIAN FRENCH | - | -
CATALAN | XCATALAN | æ, AE, ß
CROATIAN | XCROATIAN | D, L, N, d, l, n, ß
CZECH | XCZECH | ch, CH, Ch, ß
CZECH_PUNCTUATION | XCZECH_PUNCTUATION | ch, CH, Ch, ß
DANISH | XDANISH | A, ß, Å, å
DUTCH | XDUTCH | ij, IJ
EBCDIC | - | -
EEC_EURO | - | -
EEC_EUROPA3 | - | -
ESTONIAN | - | -
FINNISH | - | -
FRENCH | XFRENCH | -
GERMAN | XGERMAN | ß
GERMAN_DIN | XGERMAN_DIN | ß, ä, ö, ü, Ä, Ö, Ü
GBK | - | -
GREEK | - | -
HEBREW | - | -
HKSCS | - | -
HUNGARIAN | XHUNGARIAN | cs, gy, ny, sz, ty, zs, ß, CS, Cs, GY, Gy, NY, Ny, SZ, Sz, TY, Ty, ZS, Zs
ICELANDIC | - | -
INDONESIAN | - | -
ITALIAN | - | -
JAPANESE | - | -
LATIN | - | -
LATVIAN | - | -
LITHUANIAN | - | -
MALAY | - | -
NORWEGIAN | - | -
POLISH | - | -
PUNCTUATION | XPUNCTUATION | -
ROMANIAN | - | -
RUSSIAN | - | -
SLOVAK | XSLOVAK | dz, DZ, Dz, ß (caron)
SLOVENIAN | XSLOVENIAN | ß
SPANISH | XSPANISH | ch, ll, CH, Ch, LL, Ll
SWEDISH | - | -
SWISS | XSWISS | ß
THAI_DICTIONARY | - | -
THAI_TELEPHONE | - | -
TURKISH | XTURKISH | æ, AE, ß
UKRAINIAN | - | -
UNICODE_BINARY | - | -
VIETNAMESE | - | -
WEST_EUROPEAN | XWEST_EUROPEAN | ß

Table A-11 lists the multilingual linguistic sorts available in Oracle. All of them include GENERIC_M (an ISO standard for sorting Latin-based characters) as a base. Multilingual linguistic sorts are used for a specific primary language together with Latin-based characters. For example, KOREAN_M will sort Korean and Latin-based characters, but it will not collate Chinese, Thai, or Japanese characters.

Table A-11 Multilingual Linguistic Sorts  
Basic Name | Explanation
CANADIAN_M | Canadian French sort supports reverse secondary, special expanding characters
DANISH_M | Danish sort supports sorting lowercase characters before uppercase characters
FRENCH_M | French sort supports reverse sort for secondary
GENERIC_M | Generic sorting order which is based on ISO14651 and Unicode canonical equivalence rules but excluding compatible equivalence rules
JAPANESE_M | Japanese sort supports SJIS character set order and EUC characters which are not included in SJIS
KOREAN_M | Korean sort: Hangul characters are sorted in Unicode binary order, and Hanja characters in pronunciation order. All Hangul characters sort before Hanja characters
SPANISH_M | Traditional Spanish sort supports special contracting characters
THAI_M | Thai sort supports swap characters for some vowels and consonants
SCHINESE_RADICAL_M | Simplified Chinese sort based on radical as primary order and number of strokes order as secondary order
SCHINESE_STROKE_M | Simplified Chinese sort uses number of strokes as primary order and radical as secondary order
SCHINESE_PINYIN_M | Simplified Chinese PinYin sorting order
TCHINESE_RADICAL_M | Traditional Chinese sort based on radical as primary order and number of strokes order as secondary order
TCHINESE_STROKE_M | Traditional Chinese sort uses number of strokes as primary order and radical as secondary order. It supports supplementary characters.

Calendar Systems

By default, most territory definitions use the Gregorian calendar system. Table A-12 lists the other calendar systems supported by the Oracle server.

Table A-12 Supported Calendar Systems  
Name | Default Date Format | Character Set Used For Default Date Format
Japanese Imperial | EEYYMMDD | JA16EUC
ROC Official | EEyymmdd | ZHT32EUC
Thai Buddha | dd month EE yyyy | TH8TISASCII
Persian | DD Month YYYY | AR8ASMO8X
Arabic Hijrah | DD Month YYYY | AR8ISO8859P6
English Hijrah | DD Month YYYY | AR8ISO8859P6

Figure A-1 shows how March 20, 1998 appears in ROC Official:

Figure A-1 ROC Official Example (illustration roc.gif)

Figure A-2 shows how March 27, 1998 appears in Japanese Imperial:

Figure A-2 Japanese Imperial Example (illustration nls81005.gif)
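
As a rough guide to what the two figures show, the year component of these calendars can be derived from the Gregorian year with simple arithmetic. The sketch below is an illustration only: the offsets and era dates are general calendar facts rather than values taken from this appendix, the function names are hypothetical, and the code does not reproduce the era names or the default date formats listed in Table A-12.

```python
from datetime import date

ROC_OFFSET = 1911        # ROC year 1 corresponds to Gregorian 1912
BUDDHIST_OFFSET = 543    # Thai Buddhist Era year = Gregorian year + 543
HEISEI_START = date(1989, 1, 8)   # first day of the Heisei era
HEISEI_END = date(2019, 4, 30)    # last day of the Heisei era


def roc_year(d):
    """Year number in the ROC Official calendar."""
    return d.year - ROC_OFFSET


def thai_buddhist_year(d):
    """Year number in the Thai Buddha calendar."""
    return d.year + BUDDHIST_OFFSET


def heisei_year(d):
    """Year number within the Heisei era of the Japanese Imperial calendar."""
    if not HEISEI_START <= d <= HEISEI_END:
        raise ValueError("date is outside the Heisei era")
    return d.year - HEISEI_START.year + 1


if __name__ == "__main__":
    print("ROC Official year:", roc_year(date(1998, 3, 20)))
    print("Japanese Imperial (Heisei) year:", heisei_year(date(1998, 3, 27)))
    print("Thai Buddha year:", thai_buddhist_year(date(1998, 3, 20)))
```

Only the Heisei era is handled here because it covers the date used in Figure A-2; other eras would need their own start dates.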

Obsolete Locale Data

Before Oracle server release 7.2, when a character set was renamed, the old name was usually supported along with the new name for several releases after the change. Beginning with release 7.2, the old names are no longer supported.

Table A-13 lists the affected character sets. If you reference any of these character sets in your code, replace them with their new names:

Table A-13 New Names for Obsolete Character Sets  
Old Name | New Name
AL24UTFFSS | UTF8, AL32UTF8
AR8MSAWIN | AR8MSWIN1256
CL8EBCDIC875S | CL8EBCDIC875R
EL8EBCDIC875S | EL8EBCDIC875R
JVMS | JA16VMS
JEUC | JA16EUC
SJIS | JA16SJIS
JDBCS | JA16DBCS
KSC5601 | KO16KSC5601
KDBCS | KO16DBCS
CGB2312-80 | ZHS16CGB231280
CNS 11643-86 | ZHT32EUC
JA16EUCFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS32EUCFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16GBKFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
JA16DBCSFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
KO16DBCSFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16DBCSFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHS16CGB231280FIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT16DBCSFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
KO16KSC5601FIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
JA16SJISFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT16BIG5FIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.
ZHT32TRISFIXED | None. Replaced by the new national character sets, UTF8 and AL16UTF16.

Character set CL8MSWINDOW31 has been desupported. The newer character set CL8MSWIN1251 duplicates CL8MSWINDOW31 and adds some characters that the earlier version omitted. Replace any use of CL8MSWINDOW31 with CL8MSWIN1251.
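
When updating application code or configuration files, the one-to-one renames can be applied with a simple lookup. The following Python sketch assumes a hypothetical helper, `current_name`, and covers only a subset of the renames from Table A-13 plus the CL8MSWINDOW31 replacement described above. Entries with no direct replacement (the FIXED character sets) and AL24UTFFSS, which has two possible targets, are deliberately left out because they require a decision rather than a rename.

```python
# One-to-one renames taken from Table A-13 and the paragraph above.
RENAMED = {
    "AR8MSAWIN": "AR8MSWIN1256",
    "JVMS": "JA16VMS",
    "JEUC": "JA16EUC",
    "SJIS": "JA16SJIS",
    "JDBCS": "JA16DBCS",
    "KSC5601": "KO16KSC5601",
    "KDBCS": "KO16DBCS",
    "CGB2312-80": "ZHS16CGB231280",
    "CNS 11643-86": "ZHT32EUC",
    "CL8MSWINDOW31": "CL8MSWIN1251",
}


def current_name(charset):
    """Return the current name for a character set, or the input if unchanged."""
    key = charset.strip().upper()
    return RENAMED.get(key, key)


if __name__ == "__main__":
    for old in ("SJIS", "cl8mswindow31", "AL32UTF8"):
        print(old, "->", current_name(old))
```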

AL24UTFFSS Character Set Desupported

The Unicode character set AL24UTFFSS has been desupported in Oracle9i. AL24UTFFSS was introduced with Oracle7 as the Unicode character set supporting the UTF-8 encoding scheme based on Unicode standard 1.1, which is now obsolete. Oracle9i offers the Unicode database character sets AL32UTF8 and UTF8, which include the Unicode enhancements based on Unicode standard 3.1.

The migration path for an existing AL24UTFFSS database is to upgrade to UTF8 prior to upgrading to Oracle9i. As with all migrations to a new database character set, Oracle Corporation recommends that you use the Character Set Scanner for data analysis before attempting to migrate your existing database character set to UTF8.

See Also:

Chapter 11, "Character Set Scanner"

Bengali Language Definition Deprecated

The Bengali language definition is not compatible with Unicode standards. Oracle Corporation recommends that customers use the Bangla language definition instead. Bangla was introduced in Oracle9i Database Release 1 (9.0.1).

The Bengali language definition is supported in Oracle9i Database Release 2 (9.2), but it may be desupported in a future release.

Czechoslovakia Territory Definition Deprecated

Oracle Corporation recommends that customers use either Czech Republic or Slovakia territory definitions in Oracle9i Database Release 2 (9.2). The Czechoslovakia territory definition is supported in Oracle9i Database Release 2 (9.2), but it may be desupported in a future release.


Copyright © 1996, 2002 Oracle Corporation.

All Rights Reserved.