MySQL HeatWave Release Notes
MySQL HeatWave Advisor Auto Encoding, which recommends string column encodings, now provides encoding recommendations that optimize query performance. Recommendations are based on performance models that use query execution data. Previously, string column encoding recommendations were optimized for cluster memory usage only. A performance improvement estimate is provided with string column encoding recommendations. (Bug #34145862)
You can now train MySQL HeatWave AutoML models on tables containing
DATE,
TIME,
DATETIME,
TIMESTAMP, and
YEAR data types.
(Bug #33895503)
MySQL HeatWave AutoML now generates a model explanation when you train a machine learning model. Model explanations help identify the features that are most important to a model. For more information, see The Model Catalog.
The following columns were added to the
MODEL_CATALOG table:
column_names: The feature columns used to
train the model.
last_accessed: The last time the model
was accessed. MySQL HeatWave AutoML routines update this value to the
current timestamp when accessing the model.
model_explanation: The model explanation
generated during training.
model_type: The type of model (algorithm)
selected by ML_TRAIN to build the model.
task: The task type specified in the
ML_TRAIN query
(classification or
regression).
ML_PREDICT_* and
ML_EXPLAIN_* routine performance was
improved, resulting in faster prediction and explanation
processing.
(WL #15088, WL #15014)
The following MySQL HeatWave AutoML enhancements were implemented:
ML_TRAIN options for advanced users.
These options permit users to customize various aspects of
the ML training pipeline including algorithm selection,
feature selection, and hyperparameter optimization.
The model_list option permits
specifying the type of model to be trained.
The exclude_model_list option
specifies models types to exclude from consideration
during model selection.
The optimization_metric option
specifies the scoring metric to optimize for when
training a machine learning model.
The exclude_column_list option
specifies feature columns to exclude from consideration
when training a machine learning model.
For more information, see ML_TRAIN.
Support was added for Support Vector Machine
SVC and LinearSVC
classification and regression models. For a complete list of
supported model types, see
Model Types.
The ML_TRAIN routine now reports a
message if a trained model does not meet expected quality
criteria.
ML_EXPLAIN_ROW and
ML_EXPLAIN_TABLE routines now provide
information to help interpret explanations. The routines
also report a warning when a model quality issue is
detected, enabling users to revisit their data in order to
improve model quality.
(WL #15089)
The amount of heap memory allocated on the MySQL node for each
table loaded into MySQL HeatWave was reduced, increasing the maximum
number of tables that can be loaded. For
MySQL.HeatWave.VM.E3.Standard shapes, the
maximum was raised from 100k tables to 400k tables. For
MySQL.HeatWave.BM.E3.Standard shapes, the
maximum number was raised from 400k tables to 1600k tables. The
actual number of tables that can be loaded is dependent on the
table's data.
(Bug #33951708)
The performance_schema.rpd_column_id table
was modified to remove redundant data. The
NAME, SCHEMA_NAME,
TABLE_NAME columns were removed, and a
TABLE_ID column was added.
(Bug #33899183)
Support was added for the
FROM_DAYS() temporal function,
and GREATEST() and
LEAST() comparison and string
functions which now support DATE,
DATETIME,
TIME, and
TIMESTAMP columns.
(WL #14956)
Support was added for built-in server-side data masking and de-identification to help protect sensitive data from unauthorized uses by hiding and replacing real values with substitutes. Data masking and de-identification operations are performed on the server, and queries involving data masking and de-identification functions are accelerated by MySQL HeatWave. The following data masking and de-identification functions are supported:
See Data Masking and De-Identification Functions. (WL #15143)
Optimizations were implemented to improve performance for
JOIN and GROUP BY queries
with execution plans involving multiple consecutive rounds of
data partitioning.
(WL #15143)
comparisons,
where the expression is a single value and compared values are
constants of the same data type and encoding, have been
optimized. For example, the following expr IN
(value,...)IN()
comparison has been optimized:
SELECT * FROM Customers WHERE Country IN ('Germany', 'France', 'Spain');(WL #14952)