skoot.datasets.load_adult_df
skoot.datasets.load_adult_df(include_tgt=True, tgt_name='target', names=None)
Load and return the adult dataset (classification).
The adult dataset is a classic binary classification problem. It mixes numeric and categorical features, so it requires pre-processing before it is model-ready.
Classes             2
Samples per class   <=50k: 24720; >50k: 7841
Samples total       32561
Dimensionality      15
Features            real, positive, categorical
Read more in the User Guide.
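The class counts above imply a noticeable imbalance. A minimal sketch of the arithmetic, using only the counts listed in the summary (the variable names are illustrative and not part of skoot):

```python
# Class counts copied from the dataset summary above.
class_counts = {"<=50k": 24720, ">50k": 7841}

# Total number of samples across both classes.
total = sum(class_counts.values())

# Share of each class in the full dataset.
proportions = {label: count / total for label, count in class_counts.items()}

# Accuracy of a trivial model that always predicts the majority class.
majority_baseline = max(proportions.values())

for label, share in proportions.items():
    print(f"{label}: {share:.3f}")
print(f"majority-class baseline accuracy: {majority_baseline:.3f}")
```

Any classifier trained on this data should be compared against that majority-class baseline rather than against 50%.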
Parameters:
include_tgt : bool, optional (default=True)
    Whether to include the target.
tgt_name : str, optional (default='target')
    The name of the target feature.
names : iterable or None, optional (default=None)
    The column names for the dataframe. If None, defaults to the canonical feature names.
Returns:
X : pd.DataFrame, shape=(n_samples, n_features)
    The loaded adult dataset.
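As a hedged usage sketch, the helpers below show one way the returned dataframe might be split into features and target before pre-processing. `load_and_split` and `summarize_dtypes` are hypothetical helpers written for this example, not part of skoot; the loader is passed in as an argument, and the code assumes it accepts the `include_tgt` and `tgt_name` keywords documented above. It also assumes that categorical columns arrive as non-numeric dtypes, which this page does not state.

```python
import pandas as pd


def load_and_split(loader, tgt_name="target"):
    """Call a dataset loader and separate the target from the features.

    Hypothetical helper: `loader` is assumed to accept the `include_tgt`
    and `tgt_name` keyword arguments, as documented for load_adult_df.
    """
    df = loader(include_tgt=True, tgt_name=tgt_name)
    y = df[tgt_name]                 # target column as a Series
    X = df.drop(columns=[tgt_name])  # remaining feature columns
    return X, y


def summarize_dtypes(X):
    """Return (categorical, numeric) column names.

    Assumption: any non-numeric column is treated as categorical.
    """
    categorical = X.select_dtypes(exclude="number").columns.tolist()
    numeric = X.select_dtypes(include="number").columns.tolist()
    return categorical, numeric
```

With skoot installed, this could be called as `X, y = load_and_split(load_adult_df, tgt_name="income")` after `from skoot.datasets import load_adult_df`; the name "income" is only an example value for `tgt_name`. Knowing which columns are categorical is a natural first step toward the encoding this dataset needs.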
References
[R11] Ronny Kohavi and Barry Becker, “Data Mining and Visualization” Silicon Graphics. https://archive.ics.uci.edu/ml/datasets/Adult