Configure the Schema Creator Utility for HDFS Schema
Configure the <APP Pack>_SCHEMA_BIGDATA_IN.xml file for HDFS installation.
Add the pack-specific details for the following schemas, which are present in the <APP Pack>_SCHEMA_BIGDATA_IN.xml file.
- CONFIG: This schema holds the entities and other objects required for OFSAA setup configuration information. There can be only one CONFIG schema per OFSAA instance. This schema is created only in RDBMS.
- METADOM: This schema holds only the platform entities that store the metadata details; it does not hold the data model entities. One METADOM schema is attached to one Information Domain. There can be multiple METADOM schemas per OFSAA Instance. This schema is created only in RDBMS.
- DATADOM: This schema holds data model entities. One DATADOM schema is attached to one Information Domain. There can be multiple DATADOM schemas per OFSAA Instance.
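The cardinality rules in the list above can be sketched as a small check. This is a minimal illustration, not part of the schema creator utility: the function name and the `(schema_type, infodom)` tuples are hypothetical, and reading "one schema is attached to one Information Domain" as "at most one per Information Domain" is an assumption.

```python
from collections import Counter


def check_schema_layout(schemas):
    """Return rule violations for a list of (schema_type, infodom) pairs.

    Rules taken from the list above:
      * only one CONFIG schema per OFSAA instance
      * one METADOM and one DATADOM schema per Information Domain
        (assumed here to mean "at most one of each")
    """
    problems = []

    config_count = sum(1 for schema_type, _ in schemas if schema_type == "CONFIG")
    if config_count > 1:
        problems.append(f"found {config_count} CONFIG schemas; only one is allowed")

    # Count METADOM/DATADOM schemas per Information Domain.
    per_infodom = Counter(
        (schema_type, infodom)
        for schema_type, infodom in schemas
        if schema_type in ("METADOM", "DATADOM")
    )
    for (schema_type, infodom), count in per_infodom.items():
        if count > 1:
            problems.append(f"{count} {schema_type} schemas attached to {infodom}")

    return problems
```

An empty list means the layout satisfies both rules; each string in a non-empty list names one violation.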
Configure the <APP Pack>_SCHEMA_BIGDATA_IN.xml file
The primary steps in OFSAA Applications installation involve creating HIVE schemas and objects within the schemas.
The <APP Pack>_SCHEMA_BIGDATA_IN.xml file contains details of the various application schemas. Update the values of the various tags and parameters available in this file before executing the schema creator utility.
Configure this file only if the installation is for HDFS. It is not required for an RDBMS ONLY target installation.
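Before running the schema creator utility, it can help to list the tags and attributes that are still blank. The following is a minimal sketch using Python's standard `xml.etree.ElementTree`. The `SAMPLE` fragment is hypothetical: the root element name and all values are assumptions made for illustration, and only the tag and attribute names are taken from the table that follows. Some attributes are optional, so a blank entry is a prompt for review, not necessarily an error.

```python
import xml.etree.ElementTree as ET

# Hypothetical fragment; the real file layout may differ.
SAMPLE = """\
<APPPACKSCHEMA>
  <APP_PACK_ID>EXAMPLE_PACK</APP_PACK_ID>
  <JDBC_URL></JDBC_URL>
  <HOST>ofsaa-host.example.com</HOST>
  <SETUPINFO NAME=""/>
</APPPACKSCHEMA>
"""


def find_blank_values(xml_text):
    """List tags and attributes whose value is empty or whitespace-only."""
    blanks = []
    for element in ET.fromstring(xml_text).iter():
        is_leaf = len(element) == 0
        # A leaf tag without attributes is expected to carry text.
        if is_leaf and not element.attrib and not (element.text or "").strip():
            blanks.append(f"<{element.tag}>")
        for name, value in element.attrib.items():
            if not value.strip():
                blanks.append(f"<{element.tag}>/{name}")
    return blanks


print(find_blank_values(SAMPLE))  # ['<JDBC_URL>', '<SETUPINFO>/NAME']
```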
Table 7-3 <APP Pack>_SCHEMA_BIGDATA_IN.xml File Parameters
Tag Name/Attribute Name | Details | Default Value/ Permissible Value |
---|---|---|
<APP_PACK_ID> | Seeded unique ID for the OFSAA Application Pack. DO NOT modify this value. | Seeded |
<IS_TCPS> | Specify whether the TCPS configuration is required, and modify the seeded value only if it is. | Seeded, with FALSE as the default value. |
<JDBC_URL> | Enter the JDBC URL. You can enter a RAC or non-RAC enabled database connectivity URL. | For an HDFS ONLY target installation, this URL must be of the RDBMS instance that hosts the Application's METADOM. |
<JDBC_DRIVER> | By default, this driver's name is seeded. Only the JDBC Thin Driver is supported. DO NOT modify this value. | |
<HOST> | Enter the hostname or IP address of the system on which you are installing the OFSAA components. | Host Name/ IP Address |
<SETUPINFO>/PREFIX_SCHEMA_NAME | (Optional). Identifies whether the value specified in the <SETUPINFO>/NAME attribute must be prefixed to the schema name. | YES or NO |
<SETUPINFO>/NAME | Enter the acronym for the type of implementation. This name is displayed on the OFSAA Home Page (Landing Page). When you execute the schema creator utility, this value is prefixed to each schema name. | Accepts strings with a minimum length of two and a maximum length of four characters. |
<PASSWORD>/DEFAULT* | (Optional). Enter the password if you want to set a default password for all schemas. You must also set the APPLYSAMEFORALL attribute. Applies only to the RDBMS type METADOM schema(s). | The maximum length allowed is 30 characters. Special characters are not allowed. |
<PASSWORD>/APPLYSAMEFORALL | Enter Y if you want to apply the password specified in the DEFAULT attribute to all the schemas. Setting this attribute value is mandatory if the DEFAULT attribute is set. Applies only to the RDBMS type METADOM schema(s). | Default: N. Permissible: Y or N. |
<SCHEMAS>/TYPE=RDBMS | Identifies the RDBMS schema details. In an HDFS ONLY target installation, the application's METADOM (which hosts the metadata) is stored in an RDBMS schema, and the data model entities of the application are stored in the DATADOM (which is on Hive). | Default names for schemas within the pack are derived in the absence of any value specified. |
<SCHEMA>/TYPE | The types of schemas supported in this release are ATOMIC, CONFIG, SANDBOX, and ADDON. By default, the schema types are seeded based on the Application Pack. Do not edit this attribute value. Only one CONFIG schema can exist in the file. This schema is identified as the CONFIGURATION schema that holds the OFSAA setup details and other metadata information. Multiple ATOMIC/ SANDBOX/ ADDON schemas can exist in the file. ATOMIC schema refers to the METADOM within the Information Domain schema. SANDBOX schema refers to the SANDBOX schema. ADDON schema refers to another miscellaneous schema (not applicable for this Application Pack). | ATOMIC/CONFIG/SANDBOX/ADDON. The SANDBOX and ADDON schemas do not apply to the OFSAAAI Application Pack. |
<SCHEMA>/NAME | By default, the schema names are seeded based on the Application Pack. You can edit the schema names if required. The schema name is prefixed with the value of the <SETUPINFO>/NAME attribute. | The permissible length is 15 characters, and only alphanumeric characters are allowed. No special characters are allowed except the underscore '_'. |
<SCHEMA>/PASSWORD | (Optional). Enter the password of the schema to be created. If this attribute is left blank, the password specified in the <PASSWORD>/DEFAULT attribute is applied. It is mandatory to enter the password if the default password is not applied to all schemas. | The maximum length allowed is 30 characters. Special characters are not allowed. |
<SCHEMA>/APP_ID | By default, the Application ID is seeded based on the Application Pack. Identifies the Application/ Product for which the schema is being created. Do not edit this attribute value. | Unique Seeded Value |
<SCHEMA>/DEFAULTTABLESPACE | (Optional). Enter the available default tablespace for the DB User. If this attribute is left blank, then USERS is set as the default tablespace. Modify this value to associate any valid tablespace with the schema. | Default: USERS. Permissible: any existing valid tablespace name. |
<SCHEMA>/TEMPTABLESPACE | (Optional). Enter the available temporary tablespace for the DB User. If this attribute is left blank, then TEMP is set as the default tablespace. Modify this value to associate any valid tablespace with the schema. | Default: TEMP. Permissible: any existing valid temporary tablespace name. |
<SCHEMA>/QUOTA | (Optional). Enter the quota to be set on the DEFAULTTABLESPACE attribute for the schema/user. By default, the quota size is set to 500M. Modify this value to grant the specified quota on the mentioned tablespace to the user. | Example: 600M/m |
<SCHEMA>/INFODOM | (Optional). Enter the name of the Information Domain to associate with this schema. The schema creator utility automatically derives an Information Domain Name based on the Application Pack if no value is specified for this attribute. | The permissible length is 16 characters, and only alphanumeric characters are allowed. No special characters are allowed. |
<SCHEMAS>/TYPE=HDFS | Type of schemas being created. Refers to the DATADOM of the Application Pack being installed. | |
<HIVE_SERVER_HOST> | IP address or hostname of the server where HIVE is installed. | |
<HIVE_LIB_PATH> | The directory path where the HIVE-related drivers/JAR files are copied. Manually copy the JAR files mentioned in the Copy_Jars_to_OFSAA_Installation_Folder and Copy_KEYTAB_and_KRB5_Files_in_OFSAAI sections. Select the appropriate versions of the files. | |
<SCHEMA>/NAME | By default, the schema names are seeded based on the Application Pack. You can edit the schema names if required. The schema name is prefixed with the value of the <SETUPINFO>/NAME attribute. The schema name must not be the same as the schema name specified for the ATOMIC schema type. | The permissible length is 20 characters, and only alphanumeric characters are allowed. |
<SCHEMA>/TYPE | Identifies the type of schema where the data model entities reside. DO NOT modify this value. | By default, the TYPE attribute in this tag is set to DATADOM. |
<SCHEMA>/DB TYPE | Identifies the type of driver to be used for the connection. In upcoming releases, the type value may also be other drivers such as IMPALA. | HIVE is the only supported type in this release. |
<SCHEMA>/<PROPERTY>/COMMENT | (Optional). Comments for the HIVE schema. | |
<SCHEMA>/<PROPERTY>/LOCATION | (Optional). Specify a location for the table data. | |
<CONNECTION_PROPERTIES>/<PROPERTY>/JDBC_DRIVER | HIVE JDBC driver details. The default HiveServer 2 driver name. | com.cloudera.hive.jdbc4.HS2Driver |
<CONNECTION_PROPERTIES>/<PROPERTY>/JDBC_URL | Enter the Hive JDBC URL used to connect to the Hive Server. | A valid Hive JDBC URL. |
<CONNECTION_PROPERTIES>/<PROPERTY>/AUTH_TYPE | Authentication type. Only "Kerberos with keytab" based authentication is supported in this release. | Permissible value: KERBEROS_WITH_KEYTAB |
<CONNECTION_PROPERTIES>/<PROPERTY>/AUTH_ALIAS | Alias name for authentication credentials. The alias maps to the principal and password combination specified in the following tags. | |
<CONNECTION_PROPERTIES>/<PROPERTY>/PRINCIPAL | Authentication principal name. The principal name used in authentication to connect to the Hive Server. | |
<CONNECTION_PROPERTIES>/<PROPERTY>/PASSWORD | Authentication password. The password used in authentication to connect to the Hive Server. | |
<CONNECTION_PROPERTIES>/<PROPERTY>/KRB_GSSJAAS_FILE_NAME | A keytab file containing pairs of Kerberos principals and an encrypted copy of each principal's key. This file must be copied to the required location (see the Copy_KEYTAB_and_KRB5_Files_in_OFSAAI section). | |
<CONNECTION_PROPERTIES>/<PROPERTY>/KRB_REALM_FILE_NAME | REALM configuration file. This file must be copied to the required location (see the Copy_KEYTAB_and_KRB5_Files_in_OFSAAI section). | |
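
Several limits in Table 7-3 (name lengths, permitted characters, and the DEFAULT/APPLYSAMEFORALL dependency) can be checked before the schema creator utility is run. The following is a minimal sketch under stated assumptions: the function and the dictionary keys are hypothetical and are not names used by the utility, and "special characters" is assumed to mean anything other than letters and digits.

```python
import re


def find_problems(values):
    """Check attribute values against the limits listed in Table 7-3.

    `values` is a hypothetical dict; its keys are for illustration only.
    """
    problems = []

    # <SETUPINFO>/NAME: minimum length two, maximum four.
    if not 2 <= len(values["setupinfo_name"]) <= 4:
        problems.append("SETUPINFO NAME must be 2 to 4 characters")

    # RDBMS <SCHEMA>/NAME: up to 15 characters, alphanumeric or underscore.
    for name in values["rdbms_schema_names"]:
        if not re.fullmatch(r"[A-Za-z0-9_]{1,15}", name):
            problems.append(f"invalid RDBMS schema name: {name}")

    # HDFS <SCHEMA>/NAME: up to 20 alphanumeric characters, and different
    # from every ATOMIC schema name.
    for name in values["hdfs_schema_names"]:
        if not re.fullmatch(r"[A-Za-z0-9]{1,20}", name):
            problems.append(f"invalid HDFS schema name: {name}")
        if name in values["atomic_schema_names"]:
            problems.append(f"HDFS schema name reuses ATOMIC name: {name}")

    # <PASSWORD>/DEFAULT: up to 30 characters, no special characters.
    # Assumption: "special" means anything other than letters and digits.
    default_password = values.get("default_password", "")
    if not re.fullmatch(r"[A-Za-z0-9]{0,30}", default_password):
        problems.append("DEFAULT password breaks the length or character rule")

    # APPLYSAMEFORALL is mandatory (Y or N) once DEFAULT is set.
    if default_password and values.get("apply_same_for_all") not in ("Y", "N"):
        problems.append("APPLYSAMEFORALL must be Y or N when DEFAULT is set")

    return problems
```

The function returns an empty list when every checked value is within the documented limits. It covers only the constraints that the table states explicitly and does not validate URLs, tablespaces, or Kerberos settings.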