`sklearn.decomposition`.MiniBatchDictionaryLearning

class sklearn.decomposition.MiniBatchDictionaryLearning(n_components=None, alpha=1, n_iter=1000, fit_algorithm='lars', n_jobs=1, batch_size=3, shuffle=True, dict_init=None, transform_algorithm='omp', transform_n_nonzero_coefs=None, transform_alpha=None, verbose=False, split_sign=False, random_state=None)

Mini-batch dictionary learning

Finds a dictionary (a set of atoms) that can best be used to represent data using a sparse code.

Solves the optimization problem:

(U^*,V^*) = argmin 0.5 || Y - U V ||_2^2 + alpha * || U ||_1
             (U,V)
             with || V_k ||_2 = 1 for all  0 <= k < n_components
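As an illustration of the objective above, the following sketch fits a small model on random data and evaluates both terms by hand. It assumes that U corresponds to the sparse code returned by `fit_transform`, that V corresponds to the fitted `components_` attribute, and that the squared norm is the Frobenius norm; the data and sizes are arbitrary. The iteration-count argument is left at its default because its name differs between scikit-learn releases.

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

rng = np.random.RandomState(0)
Y = rng.randn(60, 8)  # toy data: 60 samples, 8 features (arbitrary sizes)
alpha = 1.0

# Use a lasso-based transform with the same alpha so that the code is
# computed with the same L1 penalty as the one used during fitting.
dico = MiniBatchDictionaryLearning(
    n_components=5,
    alpha=alpha,
    transform_algorithm="lasso_lars",
    transform_alpha=alpha,
    random_state=0,
)
U = dico.fit_transform(Y)  # sparse code, shape (n_samples, n_components)
V = dico.components_       # dictionary atoms, shape (n_components, n_features)

# Evaluate the two terms of the objective separately.
reconstruction_error = 0.5 * np.sum((Y - U @ V) ** 2)
sparsity_penalty = alpha * np.sum(np.abs(U))
objective = reconstruction_error + sparsity_penalty
print(reconstruction_error, sparsity_penalty, objective)
```

Increasing `alpha` weights the L1 term more heavily, which typically yields sparser codes at the cost of a larger reconstruction error.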

Notes

References:

J. Mairal, F. Bach, J. Ponce, G. Sapiro, 2009: Online dictionary learning for sparse coding (http://www.di.ens.fr/sierra/pdfs/icml09.pdf)

Methods

`fit`(X[, y])	Fit the model from data in X.
`fit_transform`(X[, y])	Fit to data, then transform it.
`get_params`([deep])	Get parameters for this estimator.
`partial_fit`(X[, y, iter_offset])	Updates the model using the data in X as a mini-batch.
`set_params`(**params)	Set the parameters of this estimator.
`transform`(X)	Encode the data as a sparse combination of the dictionary atoms.

__init__(n_components=None, alpha=1, n_iter=1000, fit_algorithm='lars', n_jobs=1, batch_size=3, shuffle=True, dict_init=None, transform_algorithm='omp', transform_n_nonzero_coefs=None, transform_alpha=None, verbose=False, split_sign=False, random_state=None)

fit(X, y=None)

Fit the model from data in X.

Parameters:

X : array-like, shape (n_samples, n_features)

Training vector, where n_samples is the number of samples and n_features is the number of features.

y : Ignored.

Returns:

self : object

Returns the instance itself.
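A minimal usage sketch of `fit` on random toy data. The array sizes and parameter values are arbitrary, and the `components_` attribute read afterwards is not described in this section; it is the fitted dictionary exposed by the estimator.

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

rng = np.random.RandomState(0)
X = rng.randn(40, 8)  # 40 samples, 8 features (arbitrary sizes)

dico = MiniBatchDictionaryLearning(
    n_components=5, alpha=1, batch_size=10, random_state=0
)
returned = dico.fit(X)  # y is ignored, so it is simply omitted

# fit returns the estimator itself, which allows chained calls.
print(returned is dico)
# One row per dictionary atom, one column per input feature.
print(dico.components_.shape)
```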

fit_transform(X, y=None, **fit_params)

Fit to data, then transform it.

Fits transformer to X and y with optional parameters fit_params and returns a transformed version of X.

Parameters:

X : numpy array of shape [n_samples, n_features]

Training set.

y : numpy array of shape [n_samples]

Target values.

Returns:

X_new : numpy array of shape [n_samples, n_features_new]

Transformed array.
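The sketch below shows the shape of the array returned by `fit_transform` for this estimator, assuming n_features_new equals `n_components`. It uses the `'omp'` transform with `transform_n_nonzero_coefs=2`, so each row of the result should contain at most two non-zero coefficients. Data and sizes are arbitrary.

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

rng = np.random.RandomState(0)
X = rng.randn(40, 8)  # 40 samples, 8 features (arbitrary sizes)

dico = MiniBatchDictionaryLearning(
    n_components=5,
    transform_algorithm="omp",
    transform_n_nonzero_coefs=2,  # at most 2 atoms used per sample
    batch_size=10,
    random_state=0,
)
X_new = dico.fit_transform(X)

# Count how many atoms each sample actually uses.
nonzeros_per_row = np.count_nonzero(X_new, axis=1)
print(X_new.shape)
print(nonzeros_per_row.max())
```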

get_params(deep=True)

Get parameters for this estimator.

Parameters:

deep : boolean, optional

If True, will return the parameters for this estimator and contained subobjects that are estimators.

Returns:

params : mapping of string to any

Parameter names mapped to their values.
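A short sketch of `get_params`: the returned mapping contains the constructor arguments, so it can be passed back to the constructor to build an equivalent, unfitted estimator. The variable name `rebuilt` is purely illustrative.

```python
from sklearn.decomposition import MiniBatchDictionaryLearning

dico = MiniBatchDictionaryLearning(n_components=5, alpha=0.5)
params = dico.get_params()

# The mapping is keyed by constructor argument name.
print(params["n_components"], params["alpha"])

# Rebuild an unfitted estimator with identical settings.
rebuilt = MiniBatchDictionaryLearning(**params)
```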

partial_fit(X, y=None, iter_offset=None)

Updates the model using the data in X as a mini-batch.

Parameters:

X : array-like, shape (n_samples, n_features)

Training vector, where n_samples is the number of samples and n_features is the number of features.

y : Ignored.

iter_offset : integer, optional

The number of iterations on data batches that have been performed before this call to partial_fit. This is optional: if no number is passed, the count stored in the object is used.

Returns:

self : object

Returns the instance itself.
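A sketch of incremental training with `partial_fit`, feeding the data in consecutive chunks as if it were arriving as a stream. The chunk size of 10 and the toy data are arbitrary. `iter_offset` is not passed, so the estimator relies on its own internal count, as described above.

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

rng = np.random.RandomState(0)
X = rng.randn(60, 8)  # 60 samples, 8 features (arbitrary sizes)

dico = MiniBatchDictionaryLearning(n_components=5, alpha=1, random_state=0)

# Update the dictionary one chunk at a time.
chunk_size = 10
for start in range(0, X.shape[0], chunk_size):
    batch = X[start:start + chunk_size]
    dico.partial_fit(batch)

# The incrementally trained model can encode data like a fully fitted one.
codes = dico.transform(X[:5])
print(dico.components_.shape, codes.shape)
```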

set_params(**params)

Set the parameters of this estimator.

The method works on simple estimators as well as on nested objects (such as pipelines). The latter have parameters of the form <component>__<parameter> so that it’s possible to update each component of a nested object.
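The sketch below illustrates both forms: a plain parameter on a standalone estimator, and the <component>__<parameter> form on a pipeline. The step names `scale` and `dico` are chosen for this example only.

```python
from sklearn.decomposition import MiniBatchDictionaryLearning
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Simple estimator: parameters are addressed by their own name.
dico = MiniBatchDictionaryLearning(n_components=5, alpha=1)
dico.set_params(alpha=0.5)

# Nested object: prefix the parameter with the step name and two underscores.
pipe = Pipeline([
    ("scale", StandardScaler()),
    ("dico", MiniBatchDictionaryLearning(n_components=5, alpha=1)),
])
pipe.set_params(dico__alpha=0.25, dico__n_components=3)

print(dico.alpha)
print(pipe.named_steps["dico"].alpha, pipe.named_steps["dico"].n_components)
```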