This is documentation for an old release of Scikit-learn (version 1.0). Try the latest stable release (version 1.6) or development (unstable) versions.

`sklearn.feature_selection`.f_regression¶

sklearn.feature_selection.f_regression(X, y, *, center=True)[source]¶

Univariate linear regression tests returning F-statistic and p-values.

Quick linear model for testing the effect of a single regressor, sequentially for many regressors.

This is done in 2 steps:

The cross correlation between each regressor and the target is computed, that is, ((X[:, i] - mean(X[:, i])) * (y - mean_y)) / (std(X[:, i]) * std(y)) using r_regression function.
It is converted to an F score and then to a p-value.

f_regression is derived from r_regression and will rank features in the same order if all the features are positively correlated with the target.

Note however that contrary to f_regression, r_regression values lie in [-1, 1] and can thus be negative. f_regression is therefore recommended as a feature selection criterion to identify potentially predictive feature for a downstream classifier, irrespective of the sign of the association with the target variable.

Furthermore f_regression returns p-values while r_regression does not.

Examples using `sklearn.feature_selection.f_regression`¶

sklearn.feature_selection.f_regression¶

Examples using sklearn.feature_selection.f_regression¶

`sklearn.feature_selection`.f_regression¶

Examples using `sklearn.feature_selection.f_regression`¶