sklearn.discriminant_analysis
.QuadraticDiscriminantAnalysis¶
-
class
sklearn.discriminant_analysis.
QuadraticDiscriminantAnalysis
(priors=None, reg_param=0.0, store_covariances=False, tol=0.0001)[source]¶ Quadratic Discriminant Analysis
A classifier with a quadratic decision boundary, generated by fitting class conditional densities to the data and using Bayes’ rule.
The model fits a Gaussian density to each class.
New in version 0.17: QuadraticDiscriminantAnalysis
Changed in version 0.17: Deprecated
qda.QDA
have been moved to QuadraticDiscriminantAnalysis.Parameters: priors : array, optional, shape = [n_classes]
Priors on classes
reg_param : float, optional
Regularizes the covariance estimate as
(1-reg_param)*Sigma + reg_param*np.eye(n_features)
Attributes: covariances_ : list of array-like, shape = [n_features, n_features]
Covariance matrices of each class.
means_ : array-like, shape = [n_classes, n_features]
Class means.
priors_ : array-like, shape = [n_classes]
Class priors (sum to 1).
rotations_ : list of arrays
For each class k an array of shape [n_features, n_k], with
n_k = min(n_features, number of elements in class k)
It is the rotation of the Gaussian distribution, i.e. its principal axis.scalings_ : list of arrays
For each class k an array of shape [n_k]. It contains the scaling of the Gaussian distributions along its principal axes, i.e. the variance in the rotated coordinate system.
store_covariances : boolean
If True the covariance matrices are computed and stored in the self.covariances_ attribute.
New in version 0.17.
tol : float, optional, default 1.0e-4
Threshold used for rank estimation.
New in version 0.17.
See also
sklearn.discriminant_analysis.LinearDiscriminantAnalysis
- Linear Discriminant Analysis
Examples
>>> from sklearn.discriminant_analysis import QuadraticDiscriminantAnalysis >>> import numpy as np >>> X = np.array([[-1, -1], [-2, -1], [-3, -2], [1, 1], [2, 1], [3, 2]]) >>> y = np.array([1, 1, 1, 2, 2, 2]) >>> clf = QuadraticDiscriminantAnalysis() >>> clf.fit(X, y) ... QuadraticDiscriminantAnalysis(priors=None, reg_param=0.0, store_covariances=False, tol=0.0001) >>> print(clf.predict([[-0.8, -1]])) [1]
Methods
decision_function
(X)Apply decision function to an array of samples. fit
(X, y[, store_covariances, tol])Fit the model according to the given training data and parameters. get_params
([deep])Get parameters for this estimator. predict
(X)Perform classification on an array of test vectors X. predict_log_proba
(X)Return posterior probabilities of classification. predict_proba
(X)Return posterior probabilities of classification. score
(X, y[, sample_weight])Returns the mean accuracy on the given test data and labels. set_params
(**params)Set the parameters of this estimator. -
decision_function
(X)[source]¶ Apply decision function to an array of samples.
Parameters: X : array-like, shape = [n_samples, n_features]
Array of samples (test vectors).
Returns: C : array, shape = [n_samples, n_classes] or [n_samples,]
Decision function values related to each class, per sample. In the two-class case, the shape is [n_samples,], giving the log likelihood ratio of the positive class.
-
fit
(X, y, store_covariances=None, tol=None)[source]¶ Fit the model according to the given training data and parameters.
Changed in version 0.17: Deprecated store_covariance have been moved to main constructor.
Changed in version 0.17: Deprecated tol have been moved to main constructor.
Parameters: X : array-like, shape = [n_samples, n_features]
Training vector, where n_samples in the number of samples and n_features is the number of features.
y : array, shape = [n_samples]
Target values (integers)
-
get_params
(deep=True)[source]¶ Get parameters for this estimator.
Parameters: deep: boolean, optional :
If True, will return the parameters for this estimator and contained subobjects that are estimators.
Returns: params : mapping of string to any
Parameter names mapped to their values.
-
predict
(X)[source]¶ Perform classification on an array of test vectors X.
The predicted class C for each sample in X is returned.
Parameters: X : array-like, shape = [n_samples, n_features] Returns: C : array, shape = [n_samples]
-
predict_log_proba
(X)[source]¶ Return posterior probabilities of classification.
Parameters: X : array-like, shape = [n_samples, n_features]
Array of samples/test vectors.
Returns: C : array, shape = [n_samples, n_classes]
Posterior log-probabilities of classification per class.
-
predict_proba
(X)[source]¶ Return posterior probabilities of classification.
Parameters: X : array-like, shape = [n_samples, n_features]
Array of samples/test vectors.
Returns: C : array, shape = [n_samples, n_classes]
Posterior probabilities of classification per class.
-
score
(X, y, sample_weight=None)[source]¶ Returns the mean accuracy on the given test data and labels.
In multi-label classification, this is the subset accuracy which is a harsh metric since you require for each sample that each label set be correctly predicted.
Parameters: X : array-like, shape = (n_samples, n_features)
Test samples.
y : array-like, shape = (n_samples) or (n_samples, n_outputs)
True labels for X.
sample_weight : array-like, shape = [n_samples], optional
Sample weights.
Returns: score : float
Mean accuracy of self.predict(X) wrt. y.
-
set_params
(**params)[source]¶ Set the parameters of this estimator.
The method works on simple estimators as well as on nested objects (such as pipelines). The former have parameters of the form
<component>__<parameter>
so that it’s possible to update each component of a nested object.Returns: self :