.. _installation-instructions:

=======================
Installing scikit-learn
=======================

There are different ways to install scikit-learn:

* :ref:`Install the latest official release `. This is
  the best approach for most users. It will provide a stable version
  and pre-built packages are available for most platforms.

* Install the version of scikit-learn provided by your
  :ref:`operating system or Python distribution `.
  This is a quick option for those who have operating systems or Python
  distributions that distribute scikit-learn.
  It might not provide the latest release version.

* :ref:`Building the package from source `. This is best for users who want
  the latest-and-greatest features and aren't afraid of running
  brand-new code. This is also needed for users who wish to contribute to the
  project.

.. _install_official_release:

Installing the latest release
=============================

.. This quickstart installation is a hack of the awesome
   https://spacy.io/usage/#quickstart page.
   See the original javascript implementation
   https://github.com/ines/quickstart

Choose the instructions that match your operating system and packager:

* **Windows**: Install the 64bit version of Python 3, for instance from
  https://www.python.org.
* **macOS**: Install Python 3 using homebrew (``brew install python``) or by
  manually installing the package from https://www.python.org.
* **Linux**: Install python3 and python3-pip using the package manager of the
  Linux Distribution.
* **conda** (any operating system): Install conda (no administrator permission
  required).

Then run, when using pip with a virtual environment:

#. Create the environment: ``python3 -m venv sklearn-venv`` on Linux,
   ``python -m venv sklearn-venv`` on Windows and macOS.
#. Activate it: ``source sklearn-venv/bin/activate`` on Linux and macOS,
   ``sklearn-venv\Scripts\activate`` on Windows.
#. Install scikit-learn: ``pip install -U scikit-learn``.

If you skip the virtual environment, run ``pip install -U scikit-learn`` on
Windows and macOS, or ``pip3 install -U scikit-learn`` on Linux.

When using conda:

#. ``conda create -n sklearn-env``
#. ``conda activate sklearn-env``
#. ``conda install scikit-learn``

In order to check your installation you can use, with pip:

* ``python -m pip show scikit-learn`` to see which version and where
  scikit-learn is installed
* ``python -m pip freeze`` to see all packages installed in the active
  virtualenv
* ``python -c "import sklearn; sklearn.show_versions()"``

On Linux without a virtual environment, use ``python3`` in place of ``python``
in the commands above. With conda:

* ``conda list scikit-learn`` to see which scikit-learn version is installed
* ``conda list`` to see all packages installed in the active conda environment
* ``python -c "import sklearn; sklearn.show_versions()"``

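The same checks can also be run from inside Python. The following is a minimal sketch, not part of scikit-learn: the helper names ``installed_version`` and ``in_virtual_environment`` are hypothetical, and it assumes Python 3.8 or newer, where ``importlib.metadata`` is part of the standard library.

```python
import sys
from importlib import metadata  # in the standard library since Python 3.8


def installed_version(dist_name="scikit-learn"):
    """Return the installed version string of a distribution, or None."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


def in_virtual_environment():
    """Return True when the interpreter runs inside a venv-style environment.

    Inside such an environment sys.prefix points at the environment while
    sys.base_prefix still points at the base installation. conda
    environments are NOT detected by this comparison.
    """
    return sys.prefix != getattr(sys, "base_prefix", sys.prefix)


if __name__ == "__main__":
    print("scikit-learn:", installed_version() or "not installed")
    print("virtual environment active:", in_virtual_environment())
```

Unlike ``import sklearn``, this only reads package metadata, so it reports a missing installation without raising ``ImportError``.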
Note that in order to avoid potential conflicts with other packages it is
strongly recommended to use a virtual environment, e.g. python3 ``virtualenv``
(see `python3 virtualenv documentation `_) or `conda environments `_.

Using an isolated environment makes it possible to install a specific version
of scikit-learn and its dependencies independently of any previously installed
Python packages.
In particular under Linux it is discouraged to install pip packages alongside
the packages managed by the package manager of the distribution
(apt, dnf, pacman...).

Note that you should always remember to activate the environment of your choice
prior to running any Python command whenever you start a new terminal session.

If you have not installed NumPy or SciPy yet, you can also install these using
conda or pip. When using pip, please ensure that *binary wheels* are used,
and NumPy and SciPy are not recompiled from source, which can happen when using
particular configurations of operating system and hardware (such as Linux on
a Raspberry Pi).

If you must install scikit-learn and its dependencies with pip, you can install
it as ``scikit-learn[alldeps]``.

Scikit-learn plotting capabilities (i.e., functions whose names start with
"plot\_" and classes whose names end with "Display") require
Matplotlib (>= 1.5.1). For running the examples Matplotlib >= 1.5.1 is
required. A few examples require scikit-image >= 0.12.3, and a few examples
require pandas >= 0.18.0.

.. warning::

    Scikit-learn 0.20 was the last version to support Python 2.7 and Python 3.4.
    Scikit-learn now requires Python 3.5 or newer.

.. note::

   For installing on PyPy, PyPy3-v5.10+, Numpy 1.14.0+, and scipy 1.1.0+
   are required.

.. _install_by_distribution:

Third party distributions of scikit-learn
=========================================

Some third-party distributions provide versions of scikit-learn integrated with
their package-management systems.
These can make installation and upgrading much easier for users since
the integration includes the ability to automatically install
dependencies (numpy, scipy) that scikit-learn requires.

The following is an incomplete list of OS and python distributions
that provide their own version of scikit-learn.

Arch Linux
----------

Arch Linux's package is provided through the `official repositories `_ as
``python-scikit-learn`` for Python.
It can be installed by typing the following command:

.. code-block:: none

     $ sudo pacman -S python-scikit-learn

Debian/Ubuntu
-------------

The Debian/Ubuntu package is split into three different packages called
``python3-sklearn`` (python modules), ``python3-sklearn-lib`` (low-level
implementations and bindings), ``python3-sklearn-doc`` (documentation).
Only the Python 3 version is available in Debian Buster (the most recent
Debian distribution).
Packages can be installed using ``apt-get``::

    $ sudo apt-get install python3-sklearn python3-sklearn-lib python3-sklearn-doc

Fedora
------

The Fedora package is called ``python3-scikit-learn`` for the python 3 version,
the only one available in Fedora 30.
It can be installed using ``dnf``::

    $ sudo dnf install python3-scikit-learn

NetBSD
------

scikit-learn is available via `pkgsrc-wip `_:

    http://pkgsrc.se/math/py-scikit-learn

MacPorts for Mac OSX
--------------------

The MacPorts package is named ``pyXY-scikit-learn``,
where ``XY`` denotes the Python version.
It can be installed by typing the following
command::

    $ sudo port install py36-scikit-learn

Canopy and Anaconda for all supported platforms
-----------------------------------------------

`Canopy `_ and `Anaconda `_ both ship a recent
version of scikit-learn, in addition to a large set of scientific python
libraries for Windows, Mac OSX and Linux.

Anaconda offers scikit-learn as part of its free distribution.


Intel conda channel
-------------------

Intel maintains a dedicated conda channel that ships scikit-learn::

    $ conda install -c intel scikit-learn

This version of scikit-learn comes with alternative solvers for some common
estimators. Those solvers come from the DAAL C++ library and are optimized for
multi-core Intel CPUs.

Note that those solvers are not enabled by default; please refer to the
`daal4py `_ documentation for more details.

Compatibility with the standard scikit-learn solvers is checked by running the
full scikit-learn test suite via automated continuous integration as reported
on https://github.com/IntelPython/daal4py.

WinPython for Windows
-----------------------

The `WinPython `_ project distributes scikit-learn as an additional plugin.

Troubleshooting
===============

.. _windows_longpath:

Error caused by file path length limit on Windows
-------------------------------------------------

pip can fail to install packages when it reaches the default path size limit
of Windows. This can happen if Python is installed in a nested location such
as the ``AppData`` folder structure under the user home directory, for
instance::

    C:\Users\username>C:\Users\username\AppData\Local\Microsoft\WindowsApps\python.exe -m pip install scikit-learn
    Collecting scikit-learn
    ...
    Installing collected packages: scikit-learn
    ERROR: Could not install packages due to an EnvironmentError: [Errno 2] No such file or directory: 'C:\\Users\\username\\AppData\\Local\\Packages\\PythonSoftwareFoundation.Python.3.7_qbz5n2kfra8p0\\LocalCache\\local-packages\\Python37\\site-packages\\sklearn\\datasets\\tests\\data\\openml\\292\\api-v1-json-data-list-data_name-australian-limit-2-data_version-1-status-deactivated.json.gz'

In this case it is possible to lift that limit in the Windows registry by
using the ``regedit`` tool:

#. Type "regedit" in the Windows start menu to launch ``regedit``.

#. Go to the
   ``Computer\HKEY_LOCAL_MACHINE\SYSTEM\CurrentControlSet\Control\FileSystem``
   key.

#. Edit the value of the ``LongPathsEnabled`` property of that key and set
   it to 1.

#. Reinstall scikit-learn (ignoring the previous broken installation)::

       pip install --exists-action=i scikit-learn
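
Before editing the registry, it can help to confirm that path length is really the cause. The sketch below is an illustration rather than an official tool: the helper names ``exceeds_default_limit`` and ``long_paths_enabled`` are hypothetical, and it assumes the classic Windows ``MAX_PATH`` limit of 260 characters (terminator included). It reads the same ``LongPathsEnabled`` value described in the steps above and does not modify it.

```python
import sys

# Assumed default limit: the classic Windows MAX_PATH value, which
# counts the terminating NUL character.
MAX_PATH = 260


def exceeds_default_limit(path, limit=MAX_PATH):
    """Return True if `path` is too long for the default Windows limit."""
    # One character is reserved for the terminator, so the usable
    # length is limit - 1.
    return len(path) > limit - 1


def long_paths_enabled():
    """Return True or False from the registry.

    Returns None when not running on Windows or when the value
    cannot be read.
    """
    if sys.platform != "win32":
        return None
    import winreg  # Windows-only standard library module

    key_path = r"SYSTEM\CurrentControlSet\Control\FileSystem"
    try:
        with winreg.OpenKey(winreg.HKEY_LOCAL_MACHINE, key_path) as key:
            value, _ = winreg.QueryValueEx(key, "LongPathsEnabled")
    except OSError:
        return None
    return value == 1


if __name__ == "__main__":
    print("long paths enabled:", long_paths_enabled())
```

Calling ``exceeds_default_limit`` on the path quoted in the pip error message shows whether that file name is over the assumed limit.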