`sklearn.impute`.MissingIndicator¶

class sklearn.impute.MissingIndicator(*, missing_values=nan, features='missing-only', sparse='auto', error_on_new=True)[source]¶

Binary indicators for missing values.

Note that this component typically should not be used in a vanilla Pipeline consisting of transformers and a classifier, but rather could be added using a FeatureUnion or ColumnTransformer.

See also

SimpleImputer: Univariate imputation of missing values.
IterativeImputer: Multivariate imputation of missing values.

Examples

>>> import numpy as np
>>> from sklearn.impute import MissingIndicator
>>> X1 = np.array([[np.nan, 1, 3],
...                [4, 0, np.nan],
...                [8, 1, 0]])
>>> X2 = np.array([[5, 1, np.nan],
...                [np.nan, 2, 3],
...                [2, 4, 0]])
>>> indicator = MissingIndicator()
>>> indicator.fit(X1)
MissingIndicator()
>>> X2_tr = indicator.transform(X2)
>>> X2_tr
array([[False,  True],
       [ True, False],
       [False, False]])

Methods

`fit`(X[, y])	Fit the transformer on `X`.
`fit_transform`(X[, y])	Generate missing values indicator for `X`.
`get_params`([deep])	Get parameters for this estimator.
`set_params`(**params)	Set the parameters of this estimator.
`transform`(X)	Generate missing values indicator for `X`.

fit(X, y=None)[source]¶

Fit the transformer on X.

Parameters

X{array-like, sparse matrix} of shape (n_samples, n_features): Input data, where n_samples is the number of samples and n_features is the number of features.
yIgnored: Not used, present for API consistency by convention.

Returns

selfobject: Fitted estimator.

fit_transform(X, y=None)[source]¶

Generate missing values indicator for X.

Parameters

X{array-like, sparse matrix} of shape (n_samples, n_features): The input data to complete.
yIgnored: Not used, present for API consistency by convention.

Returns

Xt{ndarray, sparse matrix} of shape (n_samples, n_features) or (n_samples, n_features_with_missing): The missing indicator for input data. The data type of Xt will be boolean.

get_params(deep=True)[source]¶

Get parameters for this estimator.

Parameters

deepbool, default=True: If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns

paramsdict: Parameter names mapped to their values.

set_params(**params)[source]¶

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as Pipeline). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.

Parameters

**paramsdict: Estimator parameters.

Returns

selfestimator instance: Estimator instance.

transform(X)[source]¶

Generate missing values indicator for X.

Parameters

X{array-like, sparse matrix} of shape (n_samples, n_features): The input data to complete.

Returns

Xt{ndarray, sparse matrix} of shape (n_samples, n_features) or (n_samples, n_features_with_missing): The missing indicator for input data. The data type of Xt will be boolean.

sklearn.impute.MissingIndicator¶

`sklearn.impute`.MissingIndicator¶