sklearn.metrics.pairwise.cosine_similarity

sklearn.metrics.pairwise.cosine_similarity(X, Y=None, dense_output=True)[source]

Compute cosine similarity between samples in X and Y.

Cosine similarity, or the cosine kernel, computes similarity as the normalized dot product of X and Y:

K(X, Y) = <X, Y> / (||X||*||Y||)

On L2-normalized data, this function is equivalent to linear_kernel.

Read more in the User Guide.

Parameters
Xndarray or sparse array, shape: (n_samples_X, n_features)

Input data.

Yndarray or sparse array, shape: (n_samples_Y, n_features)

Input data. If None, the output will be the pairwise similarities between all samples in X.

dense_outputboolean (optional), default True

Whether to return dense output even when the input is sparse. If False, the output is sparse if both input arrays are sparse.

New in version 0.17: parameter dense_output for dense output.

Returns
kernel matrixarray

An array with shape (n_samples_X, n_samples_Y).