20.6 Configuring Parallel Graph AnalytiX (PGX) Correlation

This section describes the configuration activities for Parallel Graph AnalytiX (PGX) Correlation.

Overview

PGX is a toolkit for graph analysis - both running algorithms such as PageRank against graphs and performing SQL-like pattern-matching against graphs, using the results of algorithmic analysis. Algorithms are parallelized for extreme performance. The PGX toolkit includes both a single-node in-memory engine and a distributed engine for extremely large graphs. Graphs can be loaded from a variety of sources including flat files, SQL and NoSQL databases, and Apache Spark and Hadoop; incremental updates are supported.

Note:

PGX based correlation is not supported on AIX and Solaris SPARC OS. You can use Java-SQL correlation, which is a functionally equivalent module to PGX based correlation.

Pre-requisites

The following is a list of pre-requisites:

  • Java 8 is mandatory as PGX is the default.
  • Initiate correlation.shshouldbe triggered once before calling batch. This configures the correlation module. This instruction already there as part of the old correlation module.
  • <installed path>/ficdb/lib_PGX/pgxConfig.cfg where k hop should be configured by the user between 2 and 10. The default value is 5.