The Dgidx program indexes the tagged Endeca records that were prepared by Forge, and creates the proprietary indices for the Endeca MDEX Engine.
dgidx [-qv] [--flags] <data export file> <output db_prefix>where <db_prefix> specifies the path to the directory, and the prefix used for the files in your Endeca application.
Dgidx supports the following flags:
Flag | Description |
---|---|
-q | Quiet mode. |
-v | Verbose mode. |
--autogenerate-dval-specs | Specify the auto-generation of dimension value specs. If this flag is specified, then during both baseline and partial updates, any dimension value that does not have a dimension value spec is assigned one. |
--compoundDimSearch | Enable compound dimension search for the application. Use of this option increases indexing time. However, if this option is not enabled at index time, compound dimension results (multiple-dimension-value results) are not returned by the MDEX Engine. |
--cov | Compute and report coverage statistics for dimensions and properties. |
--diacritic-folding | Ignore character accents when indexing text. For details about how characters with diacritical marks are mapped to their ASCII equivalents, see the MDEX Engine Basic Development Guide. |
--equivopt | Deprecated. The MDEX Engine ignores
this flag if it is specified.
Compute dimension value equivalence classes as a space-saving optimization. This adds time to the indexing phase, but reduces the size of the index. The default is to search leaf assignments only. |
--help | Print the help message and exit. |
--lang <lang-id> | Assume all documents are in the specified language. The default for <lang-id> is en. |
--latin1 | Deprecated. Ignore character accents when indexing text. Use ISO Latin 1 character mappings for international characters when performing search indexing. Note that the accents are folded down before indexing, so only a single form is indexed. |
--ngram_min <value> | Deprecated. The MDEX Engine ignores this flag if it is specified. |
--noimplicit |
Disable computation of implicit refinement dimension values. Implicit refinements are dimension values that are assigned to all records in the current result set, and whose selection therefore does not narrow the results. In addition, this flag disables computation of dimension values for disabled refinements. |
--nostrictattrs | Disable strict attribute checking. Allows records to retain property values for properties with no property (or PROP_REF element) defined in the navigation configuration file, and in the Properties view of Developer Studio. |
--noxmlvalidate | Do not perform XML validation while reading the XML export file. This option only makes a difference if the export file is in XML format. |
--numbins <num> | Limit the number of records that Dgidx reads. |
--out <stdout/stderr file> | Specify file path to which stdout/stderr should be remapped (the default is to use default stdout/stderr for the process). |
--sort <spec> |
Specify a default sort specification for the data set. The
format of
<spec> is (including the quotation
marks):
"key|dir"where key is the name of a property or dimension on which to sort and dir is either asc for ascending or desc for descending (if not specified, the order will be ascending). key can also be a geocode property, as in
this example:
"Location(43,73)|desc" You can specify multiple sort keys in the format:
"key_1[|dir_1]||key_2[|dir_2]||...||key_n[|dir_n]" If you specify multiple sort keys, the records are sorted by the first sort key, with ties being resolved by the second sort key, whose ties are resolved by the third sort key, and so on. Note that if you are using the Endeca Application Controller
(EAC) to control your environment, you must omit the quotation marks from the
--sort flag. Instead, use the following
syntax:
--sort key_1|dir_1||key_2|dir_2||...||key_n|dir_n |
--spellmode <mode> | Specify the spelling correction mode for the
application. Supported modes are:
|
--spellnum | In spelling modes that enable the espell module, include non-word terms (numbers, symbols, and so on) in the espell dictionary. By default, such terms are not included. |
--stemming-updates <file> | Specify an optional XML file of stemming updates to apply to a default stemming dictionary. See the MDEX Engine Advanced Development Guide for XML examples and file name requirements. |
--threads <num> |
Specify the number of sorting threads to use for the multi-threaded portion of the indexing process. The default is 1. If this flag is not specified, or if 1 is specified for it, Dgidx uses one sorting thread. If the specified value is greater than 1, Dgidx uses the specified number of threads to sort data. Note that Dgidx runs in multithreaded mode by default. In addition to the number of sorting threads that you can control with the --threads flag, Dgidx may use additional maintenance threads that run in the background by default, and are not used for sorting data. To improve indexing performance, Endeca recommends increasing the number of sorting threads. In deployments where a dedicated server is used for indexing the Endeca application, allocate as many threads as your server allows to the Dgidx sorting process. For best performance, the number of sorting threads specified should correlate with the number of cores on the server. Since sorting is only part of the indexing process, using N sorting threads does not speed up Dgidx by N times. |
--version | Print version information and exit. |
--verbose-language-mapping | Deprecated. The MDEX Engine ignores
this option if it is specified.
Report which record properties are mapped to which languages. |