SciPy

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Not to be confused with ScientificPython.
SciPy
SciPy.org
PSD of ECG using SciPy
Original author(s) Travis Oliphant, Pearu Peterson, Eric Jones
Developer(s) Community library project
Stable release 0.14.0 / 3 May 2014; 5 months ago (2014-05-03)
Written in Python, Fortran, C, C++[1]
Operating system Cross-platform (list)
Type Technical computing
License BSD-new license
Website www.scipy.org

SciPy (pronounced “Sigh Pie”) is an open source Python library used by scientists, analysts, and engineers doing scientific computing and technical computing. SciPy builds on the NumPy array object and is part of the NumPy stack which includes tools like Matplotlib, pandas and SymPy. There is an expanding set of scientific computing libraries that are being added to the NumPy stack everyday. This NumPy stack has similar users to other applications such as MATLAB, GNU Octave, and Scilab. The NumPy stack is also sometimes referred to as the SciPy stack.[2]

SciPy is also a family of conferences for users and developers of these tools: SciPy (in the United States), EuroSciPy (in Europe) and SciPy.in (in India).[3] Enthought originated the SciPy conference in the United States and continues to sponsors many of the international conferences as well as host the SciPy website.

SciPy contains modules for optimization, linear algebra, integration, interpolation, special functions, FFT, signal and image processing, ODE solvers and other tasks common in science and engineering.

The SciPy library is currently distributed under the BSD license, and its development is sponsored and supported by an open community of developers. It is also supported by Numfocus which is a community foundation for supporting reproducible and accessible science.

Python Scientific Computing Environment[edit]

A typical Python Scientific Computing Environment includes many dedicated software tools. For example,

  • Plotting. The currently recommended 2-D plotting package is Matplotlib, however, there are many other plotting packages such as HippoDraw, Chaco, Biggles, and Bokeh. Other popular graphics tools include Python Imaging Library and MayaVi (for 3D visualization).
  • Optimization. While SciPy has its own optimization package, OpenOpt has access to more optimization solvers and can involve Automatic differentiation. CVXOpt is another popular optimization library.
  • Advanced data analysis. Via RPy, Python can interface to the R statistical package for advanced data analysis.
  • Database. NumPy can interface with PyTables, a hierarchical database package designed to efficiently manage large amounts of data using HDF5.
  • Interactive shell. IPython is an interactive environment that offers debugging and coding features similar to that which MATLAB offers.
  • Symbolic mathematics. There are several Python libraries—such as PyDSTool Symbolic and SymPy—that offer symbolic mathematics.
  • Specialized extensions. The "scikits" provide special-purpose add-ons to NumPy and Python. Of these, scikit-image, scikit-learn[4] and statsmodels are mature packages.

The SciPy Library/Package[edit]

The SciPy package of key algorithms and functions core to Python's scientific computing capabilities. Available sub-packages include:

  • constants: physical constants and conversion factors (since version 0.7.0[5])
  • cluster: hierarchical clustering, vector quantization, K-means
  • fftpack: Discrete Fourier Transform algorithms
  • integrate: numerical integration routines
  • interpolate: interpolation tools
  • io: data input and output
  • lib: Python wrappers to external libraries
  • linalg: linear algebra routines
  • misc: miscellaneous utilities (e.g. image reading/writing)
  • ndimage: various functions for multi-dimensional image processing
  • optimize: optimization algorithms including linear programming
  • signal: signal processing tools
  • sparse: sparse matrix and related algorithms
  • spatial: KD-trees, nearest neighbors, distance functions
  • special: special functions
  • stats: statistical functions
  • weave: tool for writing C/C++ code as Python multiline strings
Snapshot showing SciPy ndimage source code

Data structures[edit]

The basic data structure used by SciPy is a multidimensional array provided by the NumPy module. NumPy provides some functions for linear algebra, Fourier transforms and random number generation, but not with the generality of the equivalent functions in SciPy. NumPy can also be used as an efficient multi-dimensional container of data with arbitrary data-types. This allows NumPy to seamlessly and speedily integrate with a wide variety of databases.Older versions of SciPy used Numeric as an array type, which is now deprecated in favor of the newer NumPy array code.[6]

History of SciPy[edit]

In the 1990s, Python was extended to include an array type for numerical computing called Numeric (This package was eventually replaced by Travis Oliphant who wrote NumPy in 2006 as a blending of Numeric and Numarray which had been started in 2001). In 1999, Travis Oliphant created a large collection of extension modules to enable scientific computing with Python and helped Pearu Peterson write f2py which enabled easily extending Python with Fortran code. This effort formed the foundation of SciPy. In 2001, Travis Oliphant and Pearu Peterson merged their efforts with a few modules that Eric Jones had written, and called the resulting package SciPy. The newly created package provided a standard collection of common numerical operations on top of the Numeric array data structure. Shortly thereafter, Fernando Pérez released IPython, an enhanced interactive shell widely used in the technical computing community, and John Hunter released the first version of Matplotlib, the 2D plotting library for technical computing. Since then the SciPy environment has continued to grow with more packages and tools for technical computing.[7][8][9]

See also[edit]

External links[edit]

Notes[edit]