This is documentation for an old release of Scikit-learn (version 1.4). Try the latest stable release (version 1.6) or development (unstable) versions.
Classification
Identifying which category an object belongs to.
Applications: Spam detection, image recognition.
Algorithms:
Gradient boosting,
nearest neighbors,
random forest,
logistic regression,
and more...
Regression
Predicting a continuous-valued attribute associated with an object.
Applications: Drug response, Stock prices.
Algorithms:
Gradient boosting,
nearest neighbors,
random forest,
ridge,
and more...
Clustering
Automatic grouping of similar objects into sets.
Applications: Customer segmentation, Grouping experiment outcomes
Algorithms:
k-Means,
HDBSCAN,
hierarchical
clustering,
and more...
Dimensionality reduction
Reducing the number of random variables to consider.
Applications: Visualization, Increased efficiency
Algorithms:
PCA,
feature selection,
non-negative matrix factorization,
and more...
Model selection
Comparing, validating and choosing parameters and models.
Applications: Improved accuracy via parameter tuning
Algorithms:
grid search,
cross validation,
metrics,
and more...
Preprocessing
Feature extraction and normalization.
Applications: Transforming input data such as text for use with machine learning algorithms.
Algorithms:
preprocessing,
feature extraction,
and more...