Fairness Demo notebook.
by the Oracle AutoMLx Team
Copyright © 2025, Oracle and/or its affiliates.
Licensed under the Universal Permissive License v 1.0 as shown at https://oss.oracle.com/licenses/upl/
In this notebook, we explore the fairness features of the AutoMLx package. We start by training an AutoML model on the Census Income dataset. Later, we provide examples of how to evaluate the fairness of the model and the dataset. We also explore how the provided explanation techniques can help us gain more insight into the fairness of our model.
%matplotlib inline
%load_ext autoreload
%autoreload 2
Load the required modules.
import pandas as pd
import matplotlib.pyplot as plt
import plotly.express as px
import sklearn
from sklearn.metrics import roc_auc_score
from sklearn.datasets import fetch_openml
from sklearn.model_selection import train_test_split
# Settings for plots
plt.rcParams['figure.figsize'] = [10, 7]
plt.rcParams['font.size'] = 15
import automlx
Here, we give an overview of the key features. We first load the Census Income dataset, train a model on it, evaluate its accuracy and fairness, and compute its fairness feature importance. All of these steps are revisited in more detail throughout the rest of the notebook.
dataset = fetch_openml(name='adult', as_frame=True)
df, y = dataset.data, dataset.target
# Several of the columns are incorrectly labeled as category type in the original dataset
numeric_columns = ['age', 'capitalgain', 'capitalloss', 'hoursperweek']
for col in df.columns:
if col in numeric_columns:
df[col] = df[col].astype(int)
X_train, X_test, y_train, y_test = train_test_split(
df, y.map({">50K": 1, "<=50K": 0}).astype(int), train_size=0.8, random_state=12345
)
X_train, X_val, y_train, y_val = train_test_split(
X_train, y_train, train_size=0.75, random_state=12345
)
X_train.shape, X_val.shape, X_test.shape
((29304, 14), (9769, 14), (9769, 14))
The AutoML API is quite simple to work with. We create an instance of the pipeline. Next, the training data is passed to fit.
model = automlx.Pipeline(task='classification')
model.fit(X_train, y_train)
[2025-04-25 03:08:03,813] [automlx.backend] Overwriting ray session directory to /tmp/ct1h5q1c/ray, which will be deleted at engine shutdown. If you wish to retain ray logs, provide _temp_dir in ray_setup dict of engine_opts when initializing the AutoMLx engine.
[2025-04-25 03:08:07,973] [automlx.interface] Dataset shape: (29304,14)
[2025-04-25 03:08:12,868] [sanerec.autotuning.parameter] Hyperparameter epsilon autotune range is set to its validation range. This could lead to long training times
[2025-04-25 03:08:13,548] [sanerec.autotuning.parameter] Hyperparameter repeat_quality_threshold autotune range is set to its validation range. This could lead to long training times
[2025-04-25 03:08:13,561] [sanerec.autotuning.parameter] Hyperparameter scope autotune range is set to its validation range. This could lead to long training times
[2025-04-25 03:08:13,642] [automlx.data_transform] Running preprocessing. Number of features: 15
[2025-04-25 03:08:14,283] [automlx.data_transform] Preprocessing completed. Took 0.641 secs
[2025-04-25 03:08:14,310] [automlx.process] Running Model Generation
[2025-04-25 03:08:14,361] [automlx.process] KNeighborsClassifier is disabled. The KNeighborsClassifier model is only recommended for datasets with less than 10000 samples and 1000 features.
[2025-04-25 03:08:14,361] [automlx.process] SVC is disabled. The SVC model is only recommended for datasets with less than 10000 samples and 1000 features.
[2025-04-25 03:08:14,363] [automlx.process] Model Generation completed.
[2025-04-25 03:08:14,432] [automlx.model_selection] Running Model Selection
(run pid=2668571) [LightGBM] [Info] Number of positive: 2000, number of negative: 2000
(run pid=2668571) [LightGBM] [Info] Auto-choosing row-wise multi-threading, the overhead of testing was 0.000439 seconds.
(run pid=2668571) You can set `force_row_wise=true` to remove the overhead.
(run pid=2668571) And if memory is not enough, you can set `force_col_wise=true`.
(run pid=2668571) [LightGBM] [Info] Total Bins 391
(run pid=2668571) [LightGBM] [Info] Number of data points in the train set: 4000, number of used features: 15
(run pid=2668571) [LightGBM] [Info] [binary:BoostFromScore]: pavg=0.500000 -> initscore=0.000000
[2025-04-25 03:08:33,792] [automlx.model_selection] Model Selection completed - Took 19.360 sec - Selected models: [['XGBClassifier']]
[2025-04-25 03:08:33,820] [automlx.adaptive_sampling] Running Adaptive Sampling. Dataset shape: (29304,16).
[2025-04-25 03:08:35,976] [automlx.trials] Adaptive Sampling completed - Took 2.1557 sec.
[2025-04-25 03:08:36,067] [automlx.feature_selection] Starting feature ranking for XGBClassifier
[2025-04-25 03:08:43,777] [automlx.feature_selection] Feature Selection completed. Took 7.726 secs.
[2025-04-25 03:08:43,832] [automlx.trials] Running Model Tuning for ['XGBClassifier']
[2025-04-25 03:09:28,601] [automlx.trials] Best parameters for XGBClassifier: {'learning_rate': 0.10242113515453982, 'min_child_weight': 12, 'max_depth': 4, 'reg_alpha': 0, 'booster': 'gbtree', 'reg_lambda': 0.01878279410038923, 'n_estimators': 143, 'use_label_encoder': False}
[2025-04-25 03:09:28,602] [automlx.trials] Model Tuning completed. Took: 44.770 secs
[2025-04-25 03:09:35,259] [automlx.interface] Re-fitting pipeline
[2025-04-25 03:09:35,275] [automlx.final_fit] Skipping updating parameter seed, already fixed by FinalFit_4c481463-9
[2025-04-25 03:09:37,613] [automlx.interface] AutoMLx completed.
<automlx._interface.classifier.AutoClassifier at 0x150a49d219a0>
y_proba = model.predict_proba(X_test)
score_original = roc_auc_score(y_test, y_proba[:, 1])
print(f'Score on test data: {score_original:.2f}')
Score on test data: 0.91
Among the several fairness metrics available in the AutoMLx package, we compute the statistical parity of the model on test data.
from automlx.fairness.metrics import ModelStatisticalParityScorer
fairness_score = ModelStatisticalParityScorer(protected_attributes='sex')
parity_test_model = fairness_score(model, X_test)
print(f'Statistical parity of the model on test data (lower is better): {parity_test_model:.2f}')
Statistical parity of the model on test data (lower is better): 0.18
Using the fairness feature importance, we can gain insight into which features contribute the most to the model's unfairness.
explainer = automlx.MLExplainer(model,
X_train,
y_train,
target_names=["<=50K", ">50K"],
task="classification")
fairness_exp = explainer.explain_model_fairness(protected_attributes='sex',
scoring_metric='statistical_parity')
fairness_exp.show_in_notebook()
AutoMLx provides a bias mitigation tool as well. We first need to initialize a ModelBiasMitigator. It requires a fitted model (the base estimator), the name of the protected attributes to use, a fairness metric, and an accuracy metric.
from automlx.fairness.bias_mitigation import ModelBiasMitigator
bias_mitigated_model = ModelBiasMitigator(
model,
protected_attribute_names="sex",
fairness_metric="equalized_odds",
accuracy_metric="balanced_accuracy",
random_seed=12345,
)
The ModelBiasMitigator can be called with the usual scikit-learn interface, notably being trained with a single call to fit.
bias_mitigated_model.fit(X_val, y_val)
<automlx.fairness.bias_mitigation._sklearn.ModelBiasMitigator at 0x150a8dbe6370>
The model can easily be used for inference.
bias_mitigated_model.predict(X_test)
array([0, 1, 0, ..., 1, 0, 0])
We can also visualize all of the best models that were found by our approach using a single show_tradeoff call.
bias_mitigated_model.show_tradeoff(hide_inadmissible=False)
A summary of these models can be accessed as shown below.
bias_mitigated_model.tradeoff_summary_
| | equalized_odds | balanced_accuracy | multiplier_sex=Female | multiplier_sex=Male |
|---|---|---|---|---|
0 | 0.006916 | 0.608927 | 0.111534 | 0.199131 |
1 | 0.023170 | 0.624593 | 0.193148 | 0.233259 |
2 | 0.028703 | 0.628152 | 0.256520 | 0.233259 |
3 | 0.036227 | 0.642396 | 0.329184 | 0.274094 |
4 | 0.046819 | 0.759193 | 1.441927 | 0.840396 |
5 | 0.052286 | 0.793150 | 3.661628 | 1.334403 |
6 | 0.095597 | 0.795917 | 1.767152 | 1.365998 |
7 | 0.097830 | 0.796030 | 1.700147 | 1.364635 |
8 | 0.129705 | 0.817235 | 8.367842 | 3.129173 |
9 | 0.151287 | 0.819858 | 5.442261 | 2.822292 |
10 | 0.184541 | 0.822028 | 4.626349 | 3.146395 |
11 | 0.216564 | 0.822988 | 3.661628 | 3.447235 |
12 | 0.237449 | 0.824125 | 2.028335 | 3.146395 |
Each of these models can be selected as the final bias-mitigated model, and inference can be performed as before. For example, the model with index=1 can be selected as shown below.
bias_mitigated_model.select_model(1)
dataset = fetch_openml(name='adult', as_frame=True)
df, y = dataset.data, dataset.target
Let's look at a few of the rows in the data.
df.head()
| | age | workclass | fnlwgt | education | education-num | marital-status | occupation | relationship | race | sex | capitalgain | capitalloss | hoursperweek | native-country |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | 2 | State-gov | 77516.0 | Bachelors | 13.0 | Never-married | Adm-clerical | Not-in-family | White | Male | 1 | 0 | 2 | United-States |
1 | 3 | Self-emp-not-inc | 83311.0 | Bachelors | 13.0 | Married-civ-spouse | Exec-managerial | Husband | White | Male | 0 | 0 | 0 | United-States |
2 | 2 | Private | 215646.0 | HS-grad | 9.0 | Divorced | Handlers-cleaners | Not-in-family | White | Male | 0 | 0 | 2 | United-States |
3 | 3 | Private | 234721.0 | 11th | 7.0 | Married-civ-spouse | Handlers-cleaners | Husband | Black | Male | 0 | 0 | 2 | United-States |
4 | 1 | Private | 338409.0 | Bachelors | 13.0 | Married-civ-spouse | Prof-specialty | Wife | Black | Female | 0 | 0 | 2 | Cuba |
The Adult dataset contains a mix of numerical and string data, making it a challenging problem to train machine learning models on.
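For instance, a quick look at the column dtypes (a small optional check, not part of the original walkthrough) confirms this mix of numeric and categorical features.
# Inspect which columns pandas treats as numeric vs. category/object
df.dtypes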
We visualize the distribution of the target variable in the training data.
y_df = pd.DataFrame(y)
y_df.columns = ['income']
fig = px.histogram(y_df, x="income")
fig.show()
We now visualize the distribution of the target variable conditioned on the values of sex. We can already see some biases in this dataset.
df1 = pd.concat([df, y_df], axis=1)
df1 = df1.groupby('sex')['income'].value_counts(normalize=True)
df1 = df1.mul(100).reset_index()
df1.columns = ['sex', 'income', 'percent']
fig = px.bar(df1, x="sex", y="percent", color="income", barmode="group")
fig.show()
We now separate the target labels (y) from the features (X) and split the data into training (60%), validation (20%), and test (20%) sets. The training set will be used to create a Machine Learning model using AutoML, the validation set will be used later for bias mitigation, and the test set will be used to evaluate the model's performance on unseen data.
# Several of the columns are incorrectly labeled as category type in the original dataset
numeric_columns = ['age', 'capitalgain', 'capitalloss', 'hoursperweek']
for col in df.columns:
if col in numeric_columns:
df[col] = df[col].astype(int)
X_train, X_test, y_train, y_test = train_test_split(
df, y.map({">50K": 1, "<=50K": 0}).astype(int), train_size=0.8, random_state=12345
)
X_train, X_val, y_train, y_val = train_test_split(
X_train, y_train, train_size=0.75, random_state=12345
)
X_train.shape, X_val.shape, X_test.shape
((29304, 14), (9769, 14), (9769, 14))
Protected attributes are features that may not be used as the basis for decisions (for example, race, gender, etc.). When machine learning is applied to decision-making processes involving humans, one should not only look for models with good performance, but also for models that do not discriminate against protected population subgroups.
We provide a table summarizing the fairness metrics in the AutoMLx package. Choosing the right fairness metric for a particular application is critical; it requires domain knowledge of the complete sociotechnical system. Moreover, different metrics bring in different perspectives and sometimes the data/model might need to be analyzed for multiple fairness metrics. Therefore, this choice is based on a combination of the domain, task at hand, societal impact of model predictions, policies and regulations, legal considerations, etc. and cannot be fully automated. However, we hope that the table below will help give some insights into which fairness metric is best for your application.
Machine learning models that decide outcomes affecting individuals can either be assistive or punitive. For example, a model that classifies whether or not a job applicant should be interviewed is assistive, because the model is screening for individuals that should receive a positive outcome. In contrast, a model that classifies loan applicants as high risk is punitive, because the model is screening for individuals that should receive a negative outcome. For models used in assistive applications, it is typically important to minimize false negatives (for example, to ensure individuals who deserve to be interviewed are interviewed), whereas in punitive applications, it is usually important to minimize false positives (for example, to avoid denying loans to individuals that have low credit risk). In the spirit of fairness, one should therefore aim to minimize the disparity in false negative rates across protected groups in assistive applications whilst minimizing the disparity in false positive rates for punitive applications. In the following table, we have classified each metric based on whether or not it is most appropriate for models used in assistive or punitive applications (or both). For further explanations, please refer to this book.
| Metric | Dataset | Model | Punitive | Assistive | Perfect score means |
|---|---|---|---|---|---|
| Consistency | ✓ | | NA | NA | Neighbors (k-means) have the same labels |
| Smoothed EDF | ✓ | | NA | NA | Sub-populations have equal probability of positive label (with log scaling of deviation) |
| Statistical Parity | ✓ | ✓ | ✓ | ✓ | Sub-populations have equal probability of positive prediction |
| True Positive Rates | | ✓ | | ✓ | Sub-populations have equal probability of positive prediction when their true label is positive |
| False Positive Rates | | ✓ | ✓ | | Sub-populations have equal probability of positive prediction when their true label is negative |
| False Negative Rates | | ✓ | | ✓ | Sub-populations have equal probability of negative prediction when their true label is positive |
| False Omission Rates | | ✓ | | ✓ | Sub-populations have equal probability of a positive true label when their prediction is negative |
| False Discovery Rates | | ✓ | ✓ | | Sub-populations have equal probability of a negative true label when their prediction is positive |
| Equalized Odds | | ✓ | ✓ | ✓ | Sub-populations have equal true positive rate and equal false positive rate |
| Error Rates | | ✓ | ✓ | ✓ | Sub-populations have equal probability of a false prediction |
| Theil Index | | ✓ | ✓ | ✓ | Error rates are the same for sub-populations and whole population (deviations are measured using entropy) |
The automlx.fairness.metrics module provides metrics dedicated to assessing whether model predictions and/or the true labels in a dataset comply with a particular fairness metric.
For this example, we will take a look at the statistical parity metric. This metric, also known as demographic parity, measures how much a protected group's rate of positive outcomes differs from that of the rest of the population. Such fairness metrics quantify disparities in outcomes across the demographic groups defined by protected attributes in the data, and they are therefore to be minimized in order to decrease discrepancies in model predictions with respect to specific groups of people. Traditional classification metrics such as accuracy, on the other hand, are to be maximized.
In the context of the Adult Census Income dataset, if we want to measure fairness with respect to the sex attribute, statistical parity corresponds to the disparity between the model's rates of predicting a >50K income for men and for women.
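To make the definition concrete, here is a minimal by-hand sketch (not the AutoMLx implementation, and reusing the model trained in the overview above): it computes the positive-prediction rate per sex subgroup and takes their absolute difference, which should roughly match the scorer results reported below.
# By-hand illustration of statistical parity for a single binary protected attribute:
# the absolute difference in positive-prediction rates between the two sex subgroups.
y_pred_test = pd.Series(model.predict(X_test), index=X_test.index)
rates = y_pred_test.groupby(X_test['sex']).mean()
print(rates)
print(f"Hand-computed disparity: {abs(rates.max() - rates.min()):.2f}")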
Model fairness metrics are available as scikit-learn compatible scorers, taking in a list of protected_attributes at creation and then being called with a model, X, and y on which to measure fairness. Note that the fairness features are not limited to AutoML models.
By default, the fairness metric will measure the difference between a subgroup's outcome and that of the rest of the population, returning the mean disparity over all subgroups. These two options can be changed at the creation of the metric, using the distance_measure and reduction arguments, respectively.
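As a sketch of those options, the scorer below passes the difference-based distance measure explicitly and requests a worst-case reduction; 'max' is an assumed option value used only to illustrate where the reduction argument goes, and the model from the overview above is reused.
# Hypothetical configuration: explicit distance measure and a worst-case ('max') reduction
# ('max' is an assumed value; adjust to the options supported by your AutoMLx version).
worst_case_parity = ModelStatisticalParityScorer(
    protected_attributes='sex',
    distance_measure='diff',
    reduction='max',
)
print(f"Worst-case statistical parity: {worst_case_parity(model, X_test):.2f}")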
We first train a simple sklearn random forest and then evaluate its performance and fairness.
from sklearn.ensemble import RandomForestClassifier
from sklearn.preprocessing import OneHotEncoder
sklearn_model = sklearn.pipeline.Pipeline(
steps=[("preprocessor", OneHotEncoder(handle_unknown="ignore")), ("classifier", RandomForestClassifier())]
)
sklearn_model.fit(X_train, y_train)
Pipeline(steps=[('preprocessor', OneHotEncoder(handle_unknown='ignore')), ('classifier', RandomForestClassifier())])
We use the roc_auc_score scoring metric to evaluate the performance of this model on unseen data (X_test).
y_proba = sklearn_model.predict_proba(X_test)
score = roc_auc_score(y_test, y_proba[:, 1])
print(f'Score on test data: {score:.2f}')
Score on test data: 0.90
Now, we can also compute the statistical parity of the model on test data.
fairness_score = ModelStatisticalParityScorer(protected_attributes='sex')
parity_test_sklearn_model = fairness_score(sklearn_model, X_test)
print(f'Statistical parity of the sklearn model on test data (lower is better): {parity_test_sklearn_model:.2f}')
Statistical parity of the sklearn model on test data (lower is better): 0.19
The fairness metrics can also be used to score AutoML models, which are even easier to work with, because the AutoML pipeline handles all of the dataset pre-processing and selects the best learning algorithm.
model = automlx.Pipeline(task='classification')
model.fit(X_train, y_train)
[2025-04-25 03:11:35,769] [automlx.interface] Dataset shape: (29304,14)
[2025-04-25 03:11:35,874] [automlx.data_transform] Running preprocessing. Number of features: 15
[2025-04-25 03:11:36,280] [automlx.data_transform] Preprocessing completed. Took 0.406 secs
[2025-04-25 03:11:36,306] [automlx.process] Running Model Generation
[2025-04-25 03:11:36,357] [automlx.process] KNeighborsClassifier is disabled. The KNeighborsClassifier model is only recommended for datasets with less than 10000 samples and 1000 features.
[2025-04-25 03:11:36,357] [automlx.process] SVC is disabled. The SVC model is only recommended for datasets with less than 10000 samples and 1000 features.
[2025-04-25 03:11:36,358] [automlx.process] Model Generation completed.
[2025-04-25 03:11:36,430] [automlx.model_selection] Running Model Selection
[2025-04-25 03:11:53,842] [automlx.model_selection] Model Selection completed - Took 17.412 sec - Selected models: [['XGBClassifier']]
[2025-04-25 03:11:53,871] [automlx.adaptive_sampling] Running Adaptive Sampling. Dataset shape: (29304,16).
[2025-04-25 03:11:55,656] [automlx.trials] Adaptive Sampling completed - Took 1.7846 sec.
[2025-04-25 03:11:55,771] [automlx.feature_selection] Starting feature ranking for XGBClassifier
[2025-04-25 03:12:03,593] [automlx.feature_selection] Feature Selection completed. Took 7.839 secs.
[2025-04-25 03:12:03,647] [automlx.trials] Running Model Tuning for ['XGBClassifier']
[2025-04-25 03:12:49,388] [automlx.trials] Best parameters for XGBClassifier: {'learning_rate': 0.10242113515453982, 'min_child_weight': 12, 'max_depth': 4, 'reg_alpha': 0, 'booster': 'gbtree', 'reg_lambda': 0.01878279410038923, 'n_estimators': 143, 'use_label_encoder': False}
[2025-04-25 03:12:49,389] [automlx.trials] Model Tuning completed. Took: 45.742 secs
[2025-04-25 03:12:56,342] [automlx.interface] Re-fitting pipeline
[2025-04-25 03:12:56,357] [automlx.final_fit] Skipping updating parameter seed, already fixed by FinalFit_2070da66-2
[2025-04-25 03:12:58,079] [automlx.interface] AutoMLx completed.
<automlx._interface.classifier.AutoClassifier at 0x1501d42f5f40>
Again, we use the roc_auc_score scoring metric to evaluate the performance of this model on unseen data (X_test).
y_proba = model.predict_proba(X_test)
score_original = roc_auc_score(y_test, y_proba[:, 1])
print(f'Score on test data: {score_original:.2f}')
Score on test data: 0.91
We now continue with this model to showcase the rest of the fairness features and metrics; however, everything would also work with a scikit-learn model.
fairness_score = ModelStatisticalParityScorer(protected_attributes='sex')
parity_test_model = fairness_score(model, X_test)
print(f'Statistical parity of the model on test data (lower is better): {parity_test_model:.2f}')
Statistical parity of the model on test data (lower is better): 0.18
Below is another way to visualize statistical parity: the difference between the bars amounts to the statistical disparity.
y_pred = model.predict(X_train)
df_predict = X_train.copy()
df_predict['model prediction'] = y_pred
# predict() returns 0/1 labels, so the per-group mean is the rate of favorable predictions
pred_per_sex = df_predict.groupby('sex')['model prediction'].mean().reset_index()
pred_per_sex = pred_per_sex.rename(columns={'model prediction': 'average prediction'})
fig = px.bar(pred_per_sex, x='sex', y='average prediction')
fig.show()
We can see here that our tuned model has a statistical disparity with respect to sex of 0.33, meaning that among the two values of sex in the dataset, the model predicts the favorable outcome for one sex at a rate 33 percentage points higher than for the other.
Model fairness metrics are also available as functions taking as inputs y_true, y_pred, and subgroups. Note, though, that statistical parity, by definition, does not require the true labels.
from automlx.fairness.metrics import model_statistical_parity
y_pred = model.predict(X_test)
subgroups = X_test[['sex']]
parity_test_model = model_statistical_parity(y_pred=y_pred, subgroups=subgroups)
print(f'Statistical parity of the model on test data (lower is better): {parity_test_model:.2f}')
Statistical parity of the model on test data (lower is better): 0.18
Given a dataset with some ground truth labels, we can check whether those true labels satisfy a particular fairness metric of concern. In this context, statistical parity measures the disparity of positive label rates between subgroups and the rest of the population.
Dataset fairness metrics are available as scikit-learn compatible scorers, taking in a list of protected_attributes at creation and then being called with a model, X, and y on which to measure fairness, with model being an optional argument that is ignored.
from automlx.fairness.metrics import DatasetStatisticalParityScorer
DSPS = DatasetStatisticalParityScorer(protected_attributes='sex')
parity_test_data = DSPS(X=X_test, y_true=y_test)
Dataset fairness metrics are also available as functions taking as inputs y_true and subgroups.
from automlx.fairness.metrics import dataset_statistical_parity
subgroups = X_test[['sex']]
parity_test_data = dataset_statistical_parity(y_test, subgroups)
print(f'Statistical parity of the test data (lower is better): {parity_test_data:.2f}')
Statistical parity of the test data (lower is better): 0.20
We can see here that the test set of the Adult Census Income dataset has a statistical parity with respect to sex of 0.20, meaning that the rate of >50K labels is 20 percentage points higher for men than for women.
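As a quick by-hand check (not the AutoMLx implementation), roughly the same number can be recovered from the rate of positive true labels per subgroup in the test set.
# Rate of >50K (positive) labels per sex subgroup in the test labels
label_rates = y_test.groupby(X_test['sex']).mean()
print(label_rates)
print(f"Hand-computed label-rate disparity: {abs(label_rates.max() - label_rates.min()):.2f}")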
Interestingly, the dataset's statistical disparity (0.20) is less than the tuned model's (0.33), highlighting that a trained model can amplify the unintended bias that is contained in the dataset.
fig = px.bar(
pd.DataFrame({
'Fairness Type': ['Data Fairness', 'Model Fairness'],
'Statistical Parity': [parity_test_data, parity_test_model],
}),
x='Fairness Type',
y='Statistical Parity',
)
fig.show()
Statistical parity is only one of the many supported fairness metrics. As another example, we can compute Equalized Odds, which measures the disparity of a model’s true positive and false positive rates between different subgroups of the data based on demographic information/protected attributes.
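To make that concrete, here is a small by-hand sketch (not the AutoMLx implementation) of the per-group true and false positive rates that equalized odds compares.
# Per-group true positive rate (TPR) and false positive rate (FPR) on the test set
y_pred_test = pd.Series(model.predict(X_test), index=X_test.index)
for group, idx in X_test.groupby('sex').groups.items():
    yt, yp = y_test.loc[idx], y_pred_test.loc[idx]
    tpr = ((yp == 1) & (yt == 1)).sum() / (yt == 1).sum()
    fpr = ((yp == 1) & (yt == 0)).sum() / (yt == 0).sum()
    print(f"{group}: TPR={tpr:.2f}, FPR={fpr:.2f}")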
from automlx.fairness.metrics import EqualizedOddsScorer
fairness_score = EqualizedOddsScorer(protected_attributes='sex', distance_measure='diff')
EO_original = fairness_score(model, X_test, y_test)
print(f'Equalized odds on test data (lower is better): {EO_original:.2f}')
Equalized odds on test data (lower is better): 0.08
We can also easily compute these fairness metrics on more than one protected attribute.
fairness_score = EqualizedOddsScorer(protected_attributes=['sex', 'race'], distance_measure='diff')
EO = fairness_score(model, X_test, y_test)
print(f'Equalized odds on test data (lower is better): {EO:.2f}')
Equalized odds on test data (lower is better): 0.29
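When several protected attributes are passed, the metric is evaluated over the subgroups defined by the combinations of their values. A quick, optional look at the subgroup sizes shows how fine-grained these intersectional groups become.
# Size of each intersectional (sex, race) subgroup in the test set
print(X_test.groupby(['sex', 'race']).size())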
Note that, unlike statistical parity, we cannot compute equalized odds on the dataset alone, since it depends on the model's output. However, we can compute other metrics on the dataset, like Smoothed EDF; it is computed as the minimal exponential deviation of positive target ratios between a subgroup and the rest of the population.
from automlx.fairness.metrics import smoothed_edf
subgroups = X_train[['race', 'sex']]
smoothed_edf_score = smoothed_edf(y_train, subgroups)
print(f'Smoothed EDF score on train data: {smoothed_edf_score:.2f}')
Smoothed EDF score on train data: 1.63
For a variety of decision-making tasks, getting only a prediction as model output is not sufficient. A user may wish to know why the model outputs that prediction, or which data features are relevant for that prediction. For that purpose, the Oracle AutoMLx solution defines the MLExplainer factory function, which allows us to compute a variety of model explanations.
The MLExplainer object takes as arguments the trained model, the training data and ground truth labels, as well as the task.
explainer = automlx.MLExplainer(model,
X_train,
y_train,
target_names=["<=50K", ">50K"],
task="classification")
Global feature importance intuitively measures how much the model's performance (relative to the provided train labels) would change if a given feature were dropped from the dataset, without retraining the model. This notion of feature importance considers each feature independently from all other features.
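As a rough by-hand analogue of this idea (not the AutoMLx implementation), one can shuffle a single feature and measure how much the model's score drops without retraining; marital-status is used below purely as an example column.
# Rough permutation-style estimate: shuffle one feature and compare scores, without retraining
X_perm = X_train.copy()
X_perm['marital-status'] = X_perm['marital-status'].sample(frac=1, random_state=12345).values
score_base = roc_auc_score(y_train, model.predict_proba(X_train)[:, 1])
score_perm = roc_auc_score(y_train, model.predict_proba(X_perm)[:, 1])
print(f"Permutation importance estimate for marital-status: {score_base - score_perm:.3f}")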
global_exp = explainer.explain_model()
There are two options for displaying the explanation's results:
- to_dataframe() returns a dataframe of the results (see the quick peek after this list).
- show_in_notebook() shows the results as a bar plot.
The features are returned in decreasing order of importance.
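For example, the raw attributions can be inspected directly as a dataframe; this is just a quick peek, and the plot below conveys the same information.
# Inspect the raw attributions; rows are already sorted in decreasing order of importance
global_exp.to_dataframe().head()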
global_exp.show_in_notebook()
The Global Feature Importance attributions can be computed for fairness metrics using the explain_model_fairness() method, which provides 95% confidence intervals for each fairness feature importance attribution.
fairness_exp = explainer.explain_model_fairness(protected_attributes='sex',
scoring_metric='statistical_parity')
fairness_exp.show_in_notebook()
Here, we see that marital-status is considered to be the feature that contributes the most to the model's unfairness.
Note that fairness feature importance has to be interpreted slightly differently from the global feature importance above: the most important features are the ones that contribute the most to making the model unfair.
To highlight the difference between the two, we compare these two types of explanations below by plotting each feature's fairness importance against its global importance.
def compare(global_exp, fairness_exp):
dfg = global_exp.to_dataframe()
dff = fairness_exp.to_dataframe()
dfg = dfg.set_index('Feature')
dff = dff.set_index('Feature')
dfg.columns = [f'{col}_score' for col in dfg.columns]
dff.columns = [f'{col}_fairness' for col in dff.columns]
df = pd.concat([dfg, dff], axis=1)
df = df.reset_index()
df.columns = ['Feature', 'Increases Accuracy', 'Upper-bound Accuracy', 'Lower-bound Accuracy',
'Decreases Fairness', 'Upper-bound Fairness', 'Lower-bound Fairness',]
fig = px.scatter(df, x="Increases Accuracy", y="Decreases Fairness", text="Feature", log_x=False, size_max=60)
fig.update_traces(textposition='middle left')
fig.update_layout(
height=800,
title_text='Global vs Fairness Feature Importance'
)
fig.show()
compare(global_exp, fairness_exp)
AutoMLx provides other explainers that can sometimes also reveal unintended biases that the model has learned. For more explainers, please refer to the OracleAutoMLx_Classification notebook.
The AutoMLx package provides a bias mitigation algorithm that fine-tunes decision thresholds across demographic groups to compensate for the bias present in the original model. The approach is called Bias Mitigation.
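The document describes the approach as fine-tuning decision thresholds across demographic groups, and the tradeoff summaries below report per-group multiplying scalars. The snippet that follows is only an illustrative sketch of that idea with made-up multiplier values, not the AutoMLx implementation: per-group multipliers rescale the positive-class probability before thresholding, which effectively shifts each group's decision threshold.
# Illustrative sketch only: apply per-group multipliers to the positive-class
# probability and threshold at 0.5 (the multiplier values below are made up).
import numpy as np

def predict_with_group_multipliers(proba_pos, groups, multipliers, threshold=0.5):
    scaled = proba_pos * np.array([multipliers.get(g, 1.0) for g in groups])
    return (scaled >= threshold).astype(int)

example_preds = predict_with_group_multipliers(
    model.predict_proba(X_test)[:, 1],
    X_test['sex'].to_numpy(),
    multipliers={'Female': 1.5, 'Male': 0.8},  # hypothetical values for illustration
)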
First, we need to initialize a ModelBiasMitigator. It requires a base estimator, the name of the protected attributes to use, a fairness metric, and an accuracy metric. There are many more options that can be configured. For example, let's say you'd like the bias-mitigated model to be constrained to not exceed an absolute value for the fairness metric. This is how that can be configured.
from automlx.fairness.bias_mitigation import ModelBiasMitigator
bias_mitigated_model = ModelBiasMitigator(
model,
protected_attribute_names="sex",
fairness_metric="equalized_odds",
accuracy_metric="balanced_accuracy",
constraint_type="absolute", # indicates a hard constraint
constraint_value=0.1, # The maximum allowed equalized odds score
constraint_target="fairness", # as opposed to accuracy
)
Similarly, let's say you'd like the bias-mitigated model to be constrained to not decrease accuracy by more than 5% relative to the most accurate (but potentially unfair) model. The following is how this would be configured; these are also the default options. Other common options, such as a time limit or a random seed, can also be specified.
bias_mitigated_model = ModelBiasMitigator(
model,
protected_attribute_names="sex",
fairness_metric="equalized_odds",
accuracy_metric="balanced_accuracy",
constraint_type="relative", # default
constraint_value=0.05, # default
constraint_target="accuracy", # default
time_limit=50,
n_trials_per_group=30, # Number of different multiplying scalars to consider
random_seed=12345,
)
The ModelBiasMitigator can be called with the usual scikit-learn interface, notably being trained with a single call to fit.
bias_mitigated_model.fit(X_val, y_val)
<automlx.fairness.bias_mitigation._sklearn.ModelBiasMitigator at 0x1501cf1e32b0>
The fitted model can then be used to collect probabilities and labels like any usual model.
bias_mitigated_model.predict_proba(X_test)
array([[0.9480151 , 0.05198494], [0.00739041, 0.99260956], [0.89531946, 0.1046806 ], ..., [0.5493545 , 0.45064548], [0.6450227 , 0.3549773 ], [0.98485196, 0.01514801]], dtype=float32)
bias_mitigated_model.predict(X_test)
array([0, 1, 0, ..., 0, 0, 0])
We can see a summary of the best models found in the tradeoff_summary_ dataframe.
bias_mitigated_model.tradeoff_summary_
| | equalized_odds | balanced_accuracy | multiplier_sex=Female | multiplier_sex=Male |
|---|---|---|---|---|
0 | 0.023170 | 0.624593 | 0.193148 | 0.233259 |
1 | 0.028703 | 0.628152 | 0.256520 | 0.233259 |
2 | 0.036227 | 0.642396 | 0.329184 | 0.274094 |
3 | 0.054804 | 0.779418 | 2.391568 | 1.075606 |
4 | 0.100502 | 0.794733 | 1.552707 | 1.365998 |
5 | 0.129705 | 0.817235 | 8.367842 | 3.129173 |
6 | 0.157349 | 0.819466 | 7.183583 | 3.448481 |
7 | 0.216564 | 0.822988 | 3.661628 | 3.447235 |
8 | 0.237449 | 0.824125 | 2.028335 | 3.146395 |
We can also visualize all of the best models that were found by our approach using a single show_tradeoff call.
bias_mitigated_model.show_tradeoff(hide_inadmissible=False)
By default, the model retained and used for inference is the most fair model found within a 5% accuracy drop relative to the most accurate model found by our approach. It is highlighted in red in the figure above. Note how the base estimator without bias mitigation is dominated by a number of models available with bias mitigation. With little to no loss of accuracy, we have a model that is more than twice as fair!
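As an optional sanity check (assuming the fairness scorer only needs the model's predict method, which ModelBiasMitigator provides), we can recompute both metrics for the retained model on the held-out test set.
from sklearn.metrics import balanced_accuracy_score

# Fairness and accuracy of the currently selected bias-mitigated model on the test set
EO_mitigated = EqualizedOddsScorer(protected_attributes='sex', distance_measure='diff')(
    bias_mitigated_model, X_test, y_test
)
accuracy_mitigated = balanced_accuracy_score(y_test, bias_mitigated_model.predict(X_test))
print(f'Equalized odds after mitigation (lower is better): {EO_mitigated:.2f}')
print(f'Balanced accuracy after mitigation: {accuracy_mitigated:.2f}')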
If we prefer a model with a different fairness and accuracy tradeoff, we can instead pick another model from the tradeoff plot above. The index needed to select a model can be obtained by hovering over individual points in the plot.
We can also look up a model's index in the tradeoff_summary_ DataFrame. We can then select the model using the select_model method.
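Instead of hovering over the plot, an index can also be chosen programmatically from tradeoff_summary_; for example, the most accurate model whose equalized odds stays below 0.1 (an arbitrary cut-off used only for illustration).
# Pick the most accurate row whose fairness score stays below an arbitrary cut-off
summary = bias_mitigated_model.tradeoff_summary_
admissible = summary[summary['equalized_odds'] < 0.1]
chosen_index = admissible['balanced_accuracy'].idxmax()
print(chosen_index)  # this index could be passed to select_model instead of the hard-coded 3 below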
bias_mitigated_model.select_model(3)
We can run inference with this model, just like the other one.
bias_mitigated_model.predict(X_test)
array([0, 1, 0, ..., 1, 0, 0])