|Oracle® Database Globalization Support Guide
10g Release 1 (10.1)
Part Number B10749-01
This section describes new features of globalization support and provides pointers to additional information.
This release supports Unicode 3.2.
Oracle provides linguistic sorts and queries that use information about base letter, accents, and case to sort character strings.This release enables you to specify a sort or query on the base letters only (accent-insensitive) or on the base letters and the accents (case-insensitive).
The Database Character Set Scanner and Converter now supports object types.
LCSD parameter enables the Database Character Set Scanner (
CSSCAN) to perform language and character set detection on the data cells categorized by the
LCSDATA parameter. The Database Character Set Scanner reports have also been enhanced.
CSALTER script is a DBA tool for special character set migration.
The Language and Character Set File Scanner (
LCSSCAN) is a high-performance, statistically based utility for determining the character set and language for unspecified plain file text.
The Globalization Development Kit (GDK) simplifies the development process and reduces the cost of developing Internet applications that will support a global multilingual market. GDK includes APIs, tools, and documentation that address many of the design, development, and deployment issues encountered in the creation of global applications. GDK lets a single program work with text in any language from anywhere in the world. It enables you to build a complete multilingual server application with little more effort than it takes to build a monolingual server application.
This release supports POSIX-compliant regular expressions to enhance search and replace capability in programming environments such as UNIX and Java. In SQL, this new functionality is implemented through new functions that are regular expression extensions to existing SQL functions such as
INSTR. This implementation supports multilingual queries and is locale-sensitive.
Oracle Locale Builder can display code charts for Unicode character sets.
In previous releases, Oracle defined language and territory definitions separately. This resulted in the definition of a territory being independent of the language setting of the user. In this release, some territories can have different date, time, number, and monetary formats based on the language setting of a user. This type of language-dependent territory definition is called a locale variant.
NLB files that are generated on one platform can be transported to another platform by, for example, FTP. The transported NLB files can be used the same way as the NLB files that were generated on the original platform. This is convenient because locale data can be modified on one platform and copied to other platforms.
NLS_LENGTH_SEMANTICS is now supported as an environment variable.
Implicit conversion between
NCLOB datatypes is now supported.
Changes have been made to the content in some of the language and territory definition files in Oracle Database 10g.