Previous Contents Index Next |
iPlanet Web Server: FastTrack Edition Administrator's Guide |
Appendix C Internationalized iPlanet Web Server
The internationalized version of the iPlanet Web Server contains special features tailored for the non-U.S. environment. These features include a choice of user-interface language (Japanese, French, or German) and a choice of search engines that allow you to use text search on a variety of languages.This appendix contains the following sections:
General Information
General Information
The following information covers the international considerations for general server capabilities.This section includes the following topics:
Installing the Server
Installing the Server
When you install the server, you choose what user-interface language to use, as well as what search engines to install.For information on installing the international version of the server, see the iPlanet Web Server, FastTrack Edition 4.1 Release Notes. You can access the Release Notes online via the link provided in the README file.
Entering 8-bit Text
If you want to type 8-bit data into the Server Manager or the administration server forms, you need to be aware of the issues in this section.
File or Directory Names
If a file or directory name is to appear in a URL, it cannot contain 8-bit or multi-byte characters.
LDAP Users and Groups
For email addresses, use only those characters permitted in RFC 822
(ftp://ds.internic.net/rfc/rfc822.txt). User ID and password information must be stored in ASCII.If you use a local database, you can enter 8-bit and multi-byte characters, but you should standardize on one character set. If you use more than one character set in the same database, it can cause display and search problems.
If you must use 8-bit or multi-byte characters in your directory database, you should store them in UTF-8 for future compatibility with the Netscape Directory Server version 4.x. To make sure you enter characters in the correct format, use a UTF-8 form-capable client (such as Netscape Communicator) to input 8-bit or double-byte data.
If you let users access their own user and group information, they will need to use a UTF-8 form-capable client.
Using the Accept Language Header
When clients contact a server using HTTP 1.1, they can send header information that describes the various languages they accept. You can configure your server to parse this language information.For example, suppose this feature is set to on, and a client configured to send the accept language header sends it with the value en,fr. Now suppose that the client requests the following URL:
The server first looks for:
- http://www.someplace.com/somepage.html
If it does not find that, it looks for:
- http://www.someplace.com/en/somepage.html
If that is not available either, and a ClientLanguage (call it xx) is defined in the magnus.conf file, the server tries:
- http://www.someplace.com/fr/somepage.html
If none of these exist, the server tries:
- http://www.someplace.com/xx/somepage.html
- http://www.someplace.com/somepage.html
Language Settings in Configuration Files
The following directives in the magnus.conf file affect languages:
The following directives in the ns-admin.conf file affect languages:
Server-side JavaScript Information
When you use server-side JavaScript with the international version of the server, you have additional things to consider when compiling applications and using databases. For example, you can specify the language of the JavaScript application one of two ways: using the compiler, or using the HTML <META> tag.
Specifying the Character Set for the Compiler
For the international version, the server-side JavaScript compiler (jsac) has a -l option called charSet. This option specifies the character set being used in the input HTML files. The value for charSet is one of the following character set names.
Table C-3 Valid values for charSet
Usage
To use this option, use the following format:jsac [-cdv] [-l charSet] -o binaryFile [-i] inputFile1 [-i] inputFile2 ...
jsac [-cdv] -o binaryFile -f includeFile
Options
The following table shows the options for the compiler.
Table C-4 Options for the jsac compiler
Name of inputFile (use if the input filename starts with a switch character)
Name of includeFile (has input filenames, separated by white space)
The possible filename extensions are summarized in the following table:.
When you specify the language using the compiler option, you can only specify one language. If you want to specify multiple languages, you can use the <META> tag in the individual files.
Specifying the Character Set With the <META> Tag
You can also use the <META> tag to specify the character set information. For example, if you put the following statement into the header (between <HEAD> and </HEAD>) in a JavaScript program, the server-side JavaScript compiler (jsac) considers the file to be written in x-sjis.
If the character set specified in the <META> tag is different from the character set specified by the compiler's charSet option, the character set specified by the compiler option is used.
- <META HTTP-EQUIV="Content-Type" CONTENT="test/html; CHARSET=x-sjis">
Using Server-side Javascript With Oracle's Japanese Database
To use server-side JavaScript with Oracle's Japanese database, you need to install Oracle and set up your environment, verify the connection, and verify the language setup. follow these overall steps. This section discusses these topics:
Installing Oracle and Setting Up Your Environment
Installing Oracle and Setting Up Your Environment
You must first install the Japanese Oracle database. For instructions, see the documentation that came with your database. Next, you must set up your environment variables using the following information. Note that the environment variable syntax assumes C Shell.Server-side JavaScript library:
setenv LD_LIBRARY_PATH server_root/bin/https:$LD_LIBRARY_PATH
Environment variables for Oracle:
setenv ORACLE_HOME oracle_root
Environment variable for NLS (National Language Support) in Oracle:
for example, /usr/oracle7setenv ORACLE_SID oracle_service_ID
for example, WG73setenv TNS_ADMIN path_to_tnsnames.ora
for example, /.../tnsnames.ora
setenv NLS_LANG language_charset_info
Environment variable for the path:
for example, japanese_japan.JA16EUC
- (This example sets up x-euc-jp)
setenv PATH server_root/bin/https:$ORACLE_HOME/bin:$PATH
Restart the web server from the command line.
At the Application Manager, select and run dbadmin.
Unless you see an error indicating otherwise, you are now connected.Click Connect to Database Server.
Enter the following information in the window, and click Connect. If your server identifier, user ID, or password is different from these default values, enter your actual values here.
Verifying the Language Setup
Use the videoapp sample application to verify the language setup.
If your ORACLE installation has a server identifier, user ID, or password that is different from the default values shown in the previous table, be sure to specify the actual values in the start.htm file at the following line:
Run the build script in the directory to recompile the JavaScript code.
- project.sharedConnections.pool =
new DbPool("ORACLE","WG73", "system", "manager", "", 2, false)
At the Application Manager, select and run vidoeapp.
Click Add New Customer and enter data in the character set you specified.
Click Home to go back to the videoapp home page, and then click Save Changes.
Check to see if the data you entered appears in the table. If the data appears in the database in the correct language, you've set up the languages correctly.
Putting the Oracle Client and Database Server On Separate Hosts
To put the Oracle client (with server-side JavaScript database service) and the Oracle database server on separate hosts, follow these steps:
On the client side, define the SERVER SID alias to refer to the server in tnsnames.ora.
Set the TWO_TASK environment variable to the SERVER SID alias defined in the tsnames.ora file. For example:
Set the NLS_LANG environment variable to the correct client language and character set information.
- setenv TWO_TASK SERVER SID alias
Using the sample application videoapp, edit the start.htm file as shown below. (In this example, assume that the SERVER SID alias is remoteDB.)
Click Add New Customer and enter data in the character set you specified.
- project.sharedConnections.pool = new DbPool("ORACLE","remoteDB", "system", "manager", "", 2, false)
Click Home to go back to the videoapp home page, and then click Save Changes.
Check to see if the data you entered appears in the table. If the data appears in the database correctly, you've configured your system properly.
Search Information
Search capabilities are supported for the following languages:
English
You choose which search engines to install when you install the international version of the server.
International Search and Auto Catalog
If your server contains documents in various character set encodings, the search collections and/or auto catalog for the documents will inherit the same encodings as the originals. To view documents in different character set encodings, users must change the character set encoding for their browsers. In addition, since the text search and auto catalog features work with one character set encoding at a time, you might receive inaccurate results when using those features. For best results, use one specific character set for all documents.
Searching in Chinese, Japanese, and Korean
The following information is specific to searching in Japanese, Korean, and Chinese.
Query Operators
This release supports the following query operators for Japanese, Korean and Chinese languages:
Table C-7 Query operators for Japanese
Document Formats
This release supports the following document formats for the Japanese, Korean, and Chinese languages:
Searching in Japanese
The following sections give additional information about searching in the Japanese character set.
Document Codes
This release supports the following document codes for the Japanese language:
Search Words
This release supports the following search words:The search engine translates half-width katakana to full-width katakana, and translates full-width ascii-string to half-width ascii-string. Users can use full-width and half-width as the same characters.
This release also supports phrase and sentence search.
Getting Support for Accented Characters in Filenames
If the filenames on your server contain accented characters, for instance elninõ.html, you can get support for them by specifying the 8859 character set as the internal coding for search collections. To specify 8859, you need to modify the file language.conf in the directory <serverRoot>\plugins\search\admin. This file is used by the Search Engine and the document indexing features of the server.The language.conf file contains the following lines for the English language. These lines direct the server to configuration files that use 8859 as the default character set. The configuration files are located in the directory <serverRoot>\plugins\search\common.
To specify 8859, you need to activate these lines in language.conf by removing the comment characters (#).
- # [en-ns]
# name = English NS 8859 (ISO-8859-1)
# lang = english-ns;8859
# charset = iso-8859-1
# cjk = N
# encode850 = N
If you make this change to the language.conf file after a collection has been created, to support accented characters in filenames for that collection you need to delete the collection, make this change to the file, recreate the collection choosing "English NS 8859 (ISO-8859-1" from the "Documents are in" drop-down list, and reindex all the documents in the collection.
Previous Contents Index Next
Copyright © 2000 Sun Microsystems, Inc. Some preexisting portions Copyright © 2000 Netscape Communications Corp. All rights reserved.
Last Updated July 13, 2000