|
b1
|
Bigram Comparator
|
Compares
two strings using an algorithm based on the Bigram algorithm.
This function compares two strings using all combinations of two consecutive
characters and returns the total number of combinations that are the
same.
|
|
b2
|
Advanced Bigram Comparator
|
Compares
two strings allowing for character transpositions. This function is
similar to the standard Bigram Comparator (b1).
|
|
u
|
Advanced Jaro String Comparator
|
Compares two
strings taking into account uncertainty factors, such as string length,
transpositions, and characters in common. This function is based on
the Jaro algorithm.
|
|
ua
|
Winkler-Jaro String Comparator
|
Compares
two strings similar to the Advanced Jaro String Comparator (u), but increases the agreement weight
if the initial characters of each string are exact matches. This function
takes into account key punch and visual memory errors. It is based
on the Jaro algorithm with variants of Winkler/Lynch
and McLaughlin.
|
|
uf
|
Advanced Jaro Adjusted for First Names
|
Based on the generic string comparator (u), this function is
designed to specifically weight first name values. The string is analyzed
and the weight adjusted based on statistical data.
|
|
ul
|
Advanced Jaro Adjusted for Last Names
|
Based on the generic string comparator (u), this function
is designed to specifically weight last name values. The string is
analyzed and the weight adjusted based on statistical data.
|
|
un
|
Advanced Jaro Adjusted for House Numbers
|
Based on the generic string comparator (u), this function
is designed to specifically weight house number values. The string
is analyzed and the weight adjusted based on statistical data.
|
|
us
|
Condensed String Comparator
|
Compares
two strings similar to the Advanced Jaro String Comparator (u), but this function is a custom
string comparator that compares two strings taking into account such
uncertainty factors as string length, transpositions, key punch errors,
and visual memory errors. Unlike the Advanced Jaro String Comparator, this function handles diacritical
marks. This function also improves processing speed.
|
|
usu
|
Unicode String Comparator
|
Compares
two strings similar to the Condensed String Comparator (us), but this function is based in
Unicode to support multiple languages and alphabets. This comparator
takes one parameter indicating the language to use.
|
|
usus
|
Unicode AlphaNumeric Comparator
|
Compares
two strings similar to the Unicode String Comparator, but this function is designed to
match on unique identifiers such as national IDs. This comparator
takes one parameter indicating the language to use plus any of the
following parameters:
-
Field length
-
Character types
-
Invalid values
|
|
ujs
|
Advanced Jaro AlphaNumeric Comparator
|
Compares two strings similar to the Advanced Jaro String Comparator, but this function
is designed to match on unique identifiers such as national IDs. This
comparator takes any of the following parameters:
-
Field length
-
Character types
-
Invalid values
|
|
c
|
Exact Character-to-Character Comparator
|
Compares string fields character by character. Each character
must match in order for an agreement weight to be assigned.
|
|
nI
|
Integer Comparator
|
Compares integer fields using a relative distance value
to determine the match weight. As the difference between the two fields
increases, the match weight decreases. Once the difference is beyond
the relative distance, a disagreement weight is assigned. This comparator
takes two parameters; the first indicates whether to use a relative
distance or direct string comparison and the second indicates the
relative distance to use.
|
|
nR
|
Real Number Comparator
|
Compares fields containing real numbers using a relative
distance value to determine the match weight. As the difference between
the two fields increases, the match weight decreases. Once the difference
is beyond the relative distance, a disagreement weight is assigned.
This comparator takes two parameters; the first indicates whether
to use a relative distance or direct string comparison, and the second
indicates the relative distance to use.
|
|
nS
|
Condensed AlphaNumeric SSN Comparator
|
Compares
social security numbers or other unique identifiers, taking into account
any of these parameters:
-
Field length
-
Character types
-
Invalid values
|
|
dY
|
Date Comparator With Years as Units
|
Compares year values using relative
distance values prior to and following the given year to determine
the match weight. As the difference between the two fields increases,
the match weight decreases. Once the difference is beyond the relative
distance, a disagreement weight is assigned. The date comparison functions
handle Gregorian years. This comparator takes up to three parameters;
the first indicates whether to use a relative distance or direct string
comparison, and the second and third indicate the relative distance
before and after.
|
|
dM
|
Date Comparator With Months as Units
|
Compares the month and year using a relative distance
as described above for the year comparison function (dY).
|
|
dD
|
Date Comparator With Days as Units
|
Compares the day, month, and year using a relative distance
as described above for the year comparison function (dY).
|
|
dH
|
Date Comparator With Hours as Units
|
Compares the hour, day, month, and year using a relative
distance as described above for the year comparison function (dY).
|
|
dm
|
Date Comparator With Minutes as Units
|
Compares the minute, hour, day, month, and
year using a relative distance as described above for the year comparison
function (dY).
|
|
ds
|
Date Comparator With Seconds as Units
|
Compares the second, minute, hour, day, month,
and year using a relative distance as described above for the year
comparison function (dY).
|
|
p
|
Prorated Comparator
|
Prorates the
disagreement weight for a date or numeric field based on values you
specify. Differences greater than the amount you specify receive the
full disagreement weight. This comparator takes three parameters indicating
the relative distance and the agreement and disagreement ranges.
|