This chapter helps to prepare you for your Oracle Fusion Middleware (EDQ) installation. Various topics are covered that should be reviewed thoroughly to help ensure that you do not encounter any problems either during or after the product installation and domain configuration.
This chapter includes the following sections:
EDQ provides a comprehensive data quality management environment that is used to understand, improve, protect and govern data quality. The software facilitates best practice master data management, data integration, business intelligence, and data migration initiatives. It provides integrated data quality in customer relationship management (CRM) and other applications.
This documentation guides you through the selection, installation and configuration of the components that are needed to support EDQ.
This section (Table 1-1) provides an overview of the EDQ installation and configuration tasks that you will perform, in the order that they should be performed.
Table 1-1 EDQ Product Installation Procedure Tasks
Task | Action to Perform |
---|---|
Understand and select the external software components that support EDQ. |
|
Satisfy EDQ system requirements. |
|
Obtain an EDQ installation file from Oracle Software Delivery Cloud. |
|
Install the JDK and your chosen application server and database components. |
See Chapter 2, "Installing the Required External Software Components." |
Install the EDQ software. |
|
Configure EDQ |
For instructions see, Chapter 4, "Configuring Enterprise Data Quality with Oracle WebLogic Server" or Chapter 5, "Configuring Enterprise Data Quality with Apache Tomcat." |
Set system parameters. |
See Chapter 6, "Setting Server Parameters to Support Enterprise Data Quality." |
Next Steps |
For log in and basic use information, see Section 7, "Next Steps After Configuring Enterprise Data Quality." |
(Optional) Upgrade from a previous release of EDQ. |
If using Oracle WebLogic Server, see Section 8, "Upgrading Enterprise Data Quality On WebLogic Servers." If using Apache Tomcat, see Section 9, "Upgrading Enterprise Data Quality On Tomcat Servers." |
The following sections show you the components that are required to support EDQ and the supported versions of those components and EDQ.
EDQ is a Java Web Application that uses a Java Servlet Engine, a Java Web Start graphical user interface, and a data repository within a database. As such, it requires access to the following components:
a Java Development Kit (JDK)
a Java Application Server to supply web services. Oracle WebLogic Server and Apache Tomcat are supported.
a structured query language (SQL) relational database management system (RDBMS) to store configuration data, working data, and the results of work performed by the processes. Oracle and PostgreSQL are supported.
The following table shows you the combinations of application server and database that are supported for use with EDQ.
Application Server | Database |
---|---|
WebLogic Server |
Oracle |
Tomcat |
Oracle |
Tomcat |
PostgreSQL |
Instructions for installing these components are in Section 2, "Installing the Required External Software Components." See Section 1.3.2 for supported versions of each of these components.
Review the list of certified platforms, JDKs, databases, application servers, and releases for EDQ prior to installation. This information is listed in Oracle Fusion Middleware Certification Matrix at
http://www.oracle.com/technetwork/middleware/ias/downloads/fusion-certification-100350.html
This section describes the hardware and software requirements of EDQ. These requirements represent the server configurations that are certified and supported by Oracle for the EDQ product.
Depending on the tasks that EDQ is required to perform, it can place heavy demands on the hardware used to run it. A recommended minimum hardware specification for an EDQ server is:
8GB physical memory, with 4GB allocated to the EDQ Java Virtual Machine (JVM)
At least 4 logical CPUs
At least 500GB of hard disk space on the database server. The EDQ results schema, which contains the working data that EDQ generates, must have enough space to contain at least 20 times the volume of source data that you expect to process through EDQ. This size may increase if there are many EDQ users working on the same projects at the same time. The configuration schema remains small, normally less than 5GB, unless there is a large amount of user-modified reference data and case management data.
Note:
These recommendations do not represent sizing advice for any specific deployment, but rather a starting point for testing size requirements in your environment. It may be appropriate to deploy a larger machine or many machines, depending on the processing load placed on EDQ.
EDQ uses an installation directory and two configuration directories. You should record the location of these directories in case you need to apply manual updates to any of their contents.
During the installation process, you must specify an installation directory to contain the EDQ installation files. This directory is known as the EDQ Home (EDQ_HOME
) directory and is named as follows:
If you are installing EDQ as part of the Oracle Fusion Middleware product stack, you must install the EDQ Home directory as a subdirectory of the Oracle Fusion Middleware home (installation) directory. A typical default Fusion Middleware home directory is as follows, depending on the platform:
Linux and UNIX:
/opt/Oracle/Middleware/FMW_HOME
Windows:
C:\Oracle\Middleware\FMW_HOME
Note:
The Middleware
home directory is referenced as FMW_HOME
in this guide.
If you install EDQ as a standalone product, or if you use the Apache Tomcat application server, you can install the EDQ Home directory in any directory.
The EDQ Home directory requires approximately 1GB of hard disk space.
EDQ requires two configuration directories, which are separate from the EDQ Home (installation) directory that contains the program files. The configuration directories are:
The base configuration directory: This directory contains default configuration data. Once EDQ is installed, the files in the base configuration directory must not be altered, renamed, or moved.
The local configuration directory: This directory contains overrides to the default configuration. EDQ looks for overrides in this directory first, before looking in the base configuration directory. Files in the local configuration directory can be modified to customize or extend EDQ.
The names and locations of the configuration directories are as follows:
If you are using Oracle WebLogic Server, the Oracle installation wizard automatically creates and populates the configuration directories in the EDQ domain with the names of oedq.home
(base configuration directory) and oedq.local.home
(local configuration directory). An example installation path is:
WLS_HOME/user_projects/domains/edq_domain/edq/oedq.home WLS_HOME/user_projects/domains/edq_domain/edq/oedq.local.home
If you are using Apache Tomcat, you create the configuration directories manually in any location, with any names, and the configuration utility will populate them. You are prompted to create the directories during the installation instructions.
On a default EDQ installation, the configuration directories occupy approximately 1MB in total, but they will grow in size with the amount of data and files that need to be processed.
Depending on how the UNIX system that hosts the application server is configured, you may find that the application server cannot create files larger than 1 GB. This restricts your ability to work with large data sets if you intend to use files to transfer data to EDQ for processing.
System resource limits are controlled by the ulimit
command. Default ulimit
values exist, which you can view by using the ulimit -a
command. Look at the settings for file size, process limit, and file handle limits. The hard ulimit
on file size may need to be adjusted upward or removed for your application server account. Consult your System Administrator for assistance, if needed.
You can install EDQ on virtualized systems using a virtualization tool, such as Oracle VM Server. Both the virtual system and the physical system must fulfill the minimum hardware requirements listed in this documentation.
If load balancing software is used to deploy multiple virtual systems onto a single physical system, care must be taken to ensure that the load balancing software is carefully tuned. In general, EDQ imposes a load similar to an extract, transform and load tool or data warehousing software. Between batches, very little load is imposed on the system. When processing a batch of data, EDQ rapidly drives hardware to be CPU or I/O bound. Unless the virtualized load balancing is correctly configured, suboptimal performance results.
An operating system user account is used to install and upgrade EDQ on your servers and install the application server. This user is required on all platforms. This user must have full permissions (read, write and execute) to the EDQ installation (EDQ_HOME
) directory, configuration directories, and all database directories. This user account is referred to as the EDQ installation user in this documentation. For more information about the EDQ directories, see Section 1.4.2, "EDQ Directory Requirements."
Note:
When installing on UNIX or Linux operating systems, do not use the root user as your EDQ installation user account.
To download the EDQ installation and configuration files, obtain the generic package installer from the Oracle Software Delivery Cloud website as follows:
Enter the Oracle Software Delivery Cloud URL into a web browser:
Click Sign-in/Register.
Note:
If you are not already logged in, the Oracle Single Sign-On page appears. Enter your Oracle user id and password and click Sign In.
The Terms & Restrictions page appears.
Select the Oracle Software Delivery Cloud Trial License Agreement and the Export Restrictions check boxes, and then click Continue.
The Media Pack Search page appears.
On the Media Pack Search page, do the following:
From the Select Product Pack drop-down list, select E-Business Suite (if you purchased the product from the Application Price List) or Oracle Fusion Middleware (if you purchased the product from the Technology Price List).
From the Platform drop-down list, select the platform on which you are installing EDQ.
Click Go.
The Results list expands to show all available media packs that include your search criteria.
Locate and select Oracle Enterprise Data Quality 12c Media Pack (12.1.3) (E-Business Suite Product Pack) or Oracle Enterprise Data Quality 12c (12.1.3) Media Pack (Oracle Fusion Middleware Product Pack), and then click Continue.
Click the Download button next to Oracle Fusion Middleware 12.1.3.
Browse to the directory where you want to save the file. Click Save to start the file download. A ZIP file is downloaded.
Extract the ZIP file to a temporary directory.