From Wikipedia, the free encyclopedia
Jump to navigation Jump to search

Scikit learn logo small.svg
Original author(s)David Cournapeau
Initial releaseJune 2007; 11 years ago (2007-06)
Stable release
0.20.2 / 19 December 2018; 3 months ago (2018-12-19)[1][2]
Repository Edit this at Wikidata
Written inPython, Cython, C and C++
Operating systemLinux, macOS, Windows
TypeLibrary for machine learning
LicenseNew BSD License

Scikit-learn (formerly scikits.learn) is a free software machine learning library for the Python programming language.[3] It features various classification, regression and clustering algorithms including support vector machines, random forests, gradient boosting, k-means and DBSCAN, and is designed to interoperate with the Python numerical and scientific libraries NumPy and SciPy.


The scikit-learn project started as scikits.learn, a Google Summer of Code project by David Cournapeau. Its name stems from the notion that it is a "SciKit" (SciPy Toolkit), a separately-developed and distributed third-party extension to SciPy.[4] The original codebase was later rewritten by other developers. In 2010 Fabian Pedregosa, Gael Varoquaux, Alexandre Gramfort and Vincent Michel, all from the French Institute for Research in Computer Science and Automation in Rocquencourt, France, took leadership of the project and made the first public release on February the 1st 2010.[5] Of the various scikits, scikit-learn as well as scikit-image were described as "well-maintained and popular" in November 2012.[6]

As of 2018, scikit-learn is under active development.[7]


Scikit-learn is largely written in Python, with some core algorithms written in Cython to achieve performance. Support vector machines are implemented by a Cython wrapper around LIBSVM; logistic regression and linear support vector machines by a similar wrapper around LIBLINEAR.

Version history[edit]

Scikit-learn was initially developed by David Cournapeau as a Google summer of code project in 2007. Later Matthieu Brucher joined the project and started to use it as a part of his thesis work. In 2010 INRIA, the French Institute for Research in Computer Science and Automation, got involved and the first public release (v0.1 beta) was published in late January 2010.

  • September 2018. scikit-learn 0.20.0 [8]
  • July 2018. scikit-learn 0.19.2
  • July 2017. scikit-learn 0.19.0
  • September 2016. scikit-learn 0.18.0
  • November 2015. scikit-learn 0.17.0[9]
  • March 2015. scikit-learn 0.16.0[9]
  • July 2014. scikit-learn 0.15.0[9]
  • August 2013. scikit-learn 0.14[9]

See also[edit]


  1. ^ "scikit-learn release history".
  2. ^ "scikit-learn 0.20.2". Python Package Index.
  3. ^ Fabian Pedregosa; Gaël Varoquaux; Alexandre Gramfort; Vincent Michel; Bertrand Thirion; Olivier Grisel; Mathieu Blondel; Peter Prettenhofer; Ron Weiss; Vincent Dubourg; Jake Vanderplas; Alexandre Passos; David Cournapeau; Matthieu Perrot; Édouard Duchesnay (2011). "Scikit-learn: Machine Learning in Python". Journal of Machine Learning Research. 12: 2825–2830.
  4. ^ Dreijer, Janto. "scikit-learn".
  5. ^ "About us — scikit-learn 0.20.1 documentation".
  6. ^ Eli Bressert (2012). SciPy and NumPy: an overview for developers. O'Reilly. p. 43.
  7. ^ "About Us". Retrieved 16 April 2018.
  8. ^ "Release History - 0.20.0 documentation". scikit-learn. Retrieved 6 November 2018.
  9. ^ a b c d "Release history — scikit-learn 0.19.dev0 documentation". Retrieved 2017-02-27.

External links[edit]