skoot.datasets.load_adult_df

skoot.datasets.load_adult_df(include_tgt=True, tgt_name='target', names=None)

Load and return the adult dataset (classification).

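The adult dataset is a classic binary classification problem: predict whether an individual's annual income exceeds 50k. It mixes numeric and categorical features, so it requires pre-processing before it is model-ready.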

Classes: 2
Samples per class: <=50k: 24720; >50k: 7841
Samples total: 32561
Dimensionality: 15
Features: real, positive, categorical
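
The class counts above imply a noticeable imbalance. As a small worked example (plain arithmetic on the figures in the table, not output from skoot itself), the share of each class and the accuracy of a trivial "always predict the majority class" baseline can be computed as follows:

```python
# Class counts copied from the summary table for the adult dataset.
counts = {"<=50k": 24720, ">50k": 7841}

total = sum(counts.values())  # should match "Samples total"

# Fraction of samples that fall in each class.
shares = {label: n / total for label, n in counts.items()}

# A classifier that always predicts the majority class scores this accuracy.
majority_label = max(counts, key=counts.get)
baseline_accuracy = shares[majority_label]

print(total)
print({label: round(share, 3) for label, share in shares.items()})
print(majority_label, round(baseline_accuracy, 3))
```

Roughly three quarters of the samples belong to the <=50k class, so plain accuracy should be read against that baseline when evaluating a model on this data.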

Read more in the User Guide.

Parameters:

include_tgt : bool, optional (default=True)

Whether to include the target.

tgt_name : str, optional (default="target")

The name of the target feature.

names : iterable or None, optional (default=None)

The column names for the dataframe. If not defined, will default to the canonical feature names.

Returns:

X : pd.DataFrame, shape=(n_samples, n_features)

The loaded adult dataset
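
A minimal usage sketch, based only on the signature documented above. It assumes that with include_tgt=True the target is appended to the returned dataframe as a column named tgt_name; that behavior, the helper name split_features_and_target, and the target name "income" are illustrative assumptions rather than part of the documented API.

```python
def split_features_and_target(tgt_name="income"):
    """Load the adult data and separate the target column from the features.

    Hypothetical helper for illustration; not part of skoot.
    """
    # Imported here so that skoot is only required when the helper is called.
    from skoot.datasets import load_adult_df

    # Assumption: include_tgt=True adds the target as a column named tgt_name.
    df = load_adult_df(include_tgt=True, tgt_name=tgt_name)

    # pop() removes the target column from df in place and returns it.
    y = df.pop(tgt_name)
    return df, y


# Example call (requires skoot and pandas to be installed):
#     X, y = split_features_and_target()
#     print(X.shape)
#     print(y.value_counts())
```

Passing include_tgt=False instead should return the features alone, which avoids the split step when the target is not needed.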

References

[R11] Ronny Kohavi and Barry Becker, “Data Mining and Visualization”, Silicon Graphics. https://archive.ics.uci.edu/ml/datasets/Adult
