Blind source separation using FastICA

An example of estimating sources from noisy data.

Independent component analysis (ICA) estimates source signals from noisy measurements. Imagine 3 instruments playing simultaneously and 3 microphones recording the mixed signals. ICA recovers the sources, i.e., what each instrument plays. Importantly, PCA fails to recover the instruments because the underlying signals come from non-Gaussian processes.
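
The non-Gaussianity of the three waveforms can be checked numerically. The following is a minimal sketch and not part of the original example: it assumes excess kurtosis (via `scipy.stats.kurtosis`) as the non-Gaussianity measure, and the dictionary names are chosen here for illustration. Excess kurtosis is zero for a Gaussian, so values far from zero indicate a non-Gaussian signal.

```python
import numpy as np
from scipy import signal
from scipy.stats import kurtosis

rng = np.random.RandomState(0)
time = np.linspace(0, 8, 2000)

# The three noise-free waveforms used in this example, plus a Gaussian
# reference signal of the same length (the reference is an addition).
candidates = {
    "sine": np.sin(2 * time),
    "square": np.sign(np.sin(3 * time)),
    "sawtooth": signal.sawtooth(2 * np.pi * time),
    "gaussian": rng.normal(size=time.shape),
}

# Excess kurtosis (Fisher definition): 0 for a Gaussian distribution.
excess_kurtosis = {name: float(kurtosis(sig)) for name, sig in candidates.items()}

for name, value in excess_kurtosis.items():
    print(f"{name:>8}: {value:+.2f}")
```

The three waveforms should show clearly negative excess kurtosis, while the Gaussian reference stays close to zero. PCA relies only on second-order statistics (the covariance), so it cannot exploit this difference.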

# Authors: The scikit-learn developers
# SPDX-License-Identifier: BSD-3-Clause

Generate sample data

import numpy as np
from scipy import signal

np.random.seed(0)
n_samples = 2000
time = np.linspace(0, 8, n_samples)

s1 = np.sin(2 * time)  # Signal 1: sinusoidal signal
s2 = np.sign(np.sin(3 * time))  # Signal 2: square signal
s3 = signal.sawtooth(2 * np.pi * time)  # Signal 3: sawtooth signal

S = np.c_[s1, s2, s3]
S += 0.2 * np.random.normal(size=S.shape)  # Add noise

S /= S.std(axis=0)  # Standardize data
# Mix data
A = np.array([[1, 1, 1], [0.5, 2, 1.0], [1.5, 1.0, 2.0]])  # Mixing matrix
X = np.dot(S, A.T)  # Generate observations
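
The data generation above follows the linear mixing model: each observation is the mixing matrix applied to the source vector at that time step. The sketch below is an illustration added here, not part of the original example. It rebuilds the same data and shows that, if `A` were known, inverting it would recover the sources exactly. The name `S_oracle` is hypothetical. ICA addresses the harder case where `A` is unknown.

```python
import numpy as np
from scipy import signal

rng = np.random.RandomState(0)
n_samples = 2000
time = np.linspace(0, 8, n_samples)

# Same sources, noise level, and mixing matrix as in the example.
S = np.c_[
    np.sin(2 * time),
    np.sign(np.sin(3 * time)),
    signal.sawtooth(2 * np.pi * time),
]
S += 0.2 * rng.normal(size=S.shape)
S /= S.std(axis=0)

A = np.array([[1, 1, 1], [0.5, 2, 1.0], [1.5, 1.0, 2.0]])
X = S @ A.T

# Row-wise view of the model: x_t = A s_t for every time step t.
assert np.allclose(X[10], A @ S[10])

# With a known, invertible A the mixing can be undone directly
# by solving A s_t = x_t for all time steps at once.
S_oracle = np.linalg.solve(A, X.T).T
print("A has full rank:", np.linalg.matrix_rank(A) == 3)
print("Sources recovered with known A:", np.allclose(S_oracle, S))
```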

Fit ICA and PCA models

from sklearn.decomposition import PCA, FastICA

# Compute ICA
ica = FastICA(n_components=3, whiten="arbitrary-variance")
S_ = ica.fit_transform(X)  # Reconstruct signals
A_ = ica.mixing_  # Get estimated mixing matrix

# Check that the ICA model applies: remixing the recovered signals with the
# estimated mixing matrix and adding back the mean reproduces the observations.
assert np.allclose(X, np.dot(S_, A_.T) + ica.mean_)

# For comparison, compute PCA
pca = PCA(n_components=3)
H = pca.fit_transform(X)  # Reconstruct signals based on orthogonal components
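
The quality of the two decompositions can also be compared numerically. ICA recovers sources only up to permutation, sign, and scale, so a direct column-by-column comparison with `S` is not meaningful. The sketch below is an addition to the original example: it assumes absolute Pearson correlation as the similarity measure and uses `scipy.optimize.linear_sum_assignment` to pair each true source with one estimated component. The helper `matched_abs_corr` and the fixed `random_state=0` are choices made here for illustration.

```python
import numpy as np
from scipy import signal
from scipy.optimize import linear_sum_assignment
from sklearn.decomposition import PCA, FastICA

rng = np.random.RandomState(0)
time = np.linspace(0, 8, 2000)

# Rebuild the data so this block runs on its own.
S = np.c_[
    np.sin(2 * time),
    np.sign(np.sin(3 * time)),
    signal.sawtooth(2 * np.pi * time),
]
S += 0.2 * rng.normal(size=S.shape)
S /= S.std(axis=0)
A = np.array([[1, 1, 1], [0.5, 2, 1.0], [1.5, 1.0, 2.0]])
X = S @ A.T

S_ica = FastICA(
    n_components=3, whiten="arbitrary-variance", random_state=0
).fit_transform(X)
S_pca = PCA(n_components=3).fit_transform(X)


def matched_abs_corr(true_sources, estimates):
    """Absolute correlation of each true source with its best-matching estimate.

    Taking the absolute value removes the sign ambiguity; the assignment
    step removes the permutation ambiguity. Correlation is scale-invariant.
    """
    n = true_sources.shape[1]
    # Cross-correlation block between true sources (rows) and estimates (cols).
    corr = np.abs(np.corrcoef(true_sources.T, estimates.T)[:n, n:])
    # Maximize the total matched correlation (minimize its negative).
    rows, cols = linear_sum_assignment(-corr)
    return corr[rows, cols]


ica_scores = matched_abs_corr(S, S_ica)
pca_scores = matched_abs_corr(S, S_pca)
print("ICA matched |corr|:", np.round(ica_scores, 3))
print("PCA matched |corr|:", np.round(pca_scores, 3))
```

Values near 1 mean a component matches one source closely. The ICA scores are expected to be higher than the PCA scores, since PCA components are orthogonal directions of maximal variance rather than the independent sources.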

Plot results

import matplotlib.pyplot as plt

plt.figure()

models = [X, S, S_, H]
names = [
    "Observations (mixed signal)",
    "True Sources",
    "ICA recovered signals",
    "PCA recovered signals",
]
colors = ["red", "steelblue", "orange"]

for ii, (model, name) in enumerate(zip(models, names), 1):
    plt.subplot(4, 1, ii)
    plt.title(name)
    for sig, color in zip(model.T, colors):
        plt.plot(sig, color=color)

plt.tight_layout()
plt.show()

Figure: four stacked panels titled "Observations (mixed signal)", "True Sources", "ICA recovered signals", and "PCA recovered signals".

Total running time of the script: (0 minutes 0.455 seconds)


© Copyright 2007 - 2025, scikit-learn developers (BSD License).