Dot plot (statistics): Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
m Enum 2 author/editor WLs; WP:GenFixes on
Tags: Reverted references removed Visual edit
Line 1: Line 1:
A '''dot chart''' or '''dot plot''' is a [[statistics|statistical]] chart consisting of data points plotted on a fairly simple scale, typically using filled in circles. There are two common, yet very different, versions of the dot chart. The first has been used in hand-drawn (pre-computer era) graphs to depict distributions going back to 1884.<ref>{{cite journal | last=Wilkinson | first=Leland | year=1999 |title=Dot plots | journal=The American Statistician | volume=53 | pages=276&ndash;281 | jstor=2686111 | doi=10.2307/2686111 | issue=3 | publisher=American Statistical Association}}</ref> The other version is described by [[William S. Cleveland]] as an alternative to the bar chart, in which dots are used to depict the quantitative values (e.g. counts) associated with categorical variables.<ref>{{cite book | last=Cleveland | first=William S. | author-link=William S. Cleveland | title=Visualizing Data | publisher=Hobart Press | year=1993 | hdl=2027/mdp.39015026891187 | isbn=0-9634884-0-6 | url-access=registration | url=https://archive.org/details/visualizingdata00will }}</ref>
A '''dot chart''' or '''dot plot''' is a [[statistics|statistical]] chart consisting of data points plotted on a fairly simple scale, typically using filled in circles. There are two common, yet very different, versions of the dot chart. The first has been used in hand-drawn (pre-computer era) graphs to depict distributions going back to 1884.<ref>{{cite journal | last=Wilkinson | first=Leland | year=1999 |title=Dot plots | journal=The American Statistician | volume=53 | pages=276&ndash;281 | jstor=2686111 | doi=10.2307/2686111 | issue=3 | publisher=American Statistical Association}}</ref> The other version is described by [[William S. Cleveland]] as an alternative to the bar chart, in which dots are used to depict the quantitative value


==Dot plots==
[[File:Dotplot of random values 2.png|thumb|A dot plot of 50 random values from 0 to 9.]]
The dot plot as a representation of a distribution consists of group of data points plotted on a simple scale. Dot plots are used for [[continuous function|continuous]], [[quantitative data|quantitative]], [[univariate]] data. Data points may be labelled if there are few of them.


Dot plots are one of the simplest statistical plots, and are suitable for small to moderate sized data sets. They are useful for highlighting clusters and gaps, as well as [[outlier]]s. Their other advantage is the conservation of numerical information. When dealing with larger data sets (around 20–30 or more data points) the related [[stemplot]], [[box plot]] or [[histogram]] may be more efficient, as dot plots may become too cluttered after this point. Dot plots may be distinguished from histograms in that dots are not spaced uniformly along the horizontal axis.


Although the plot appears to be simple, its computation and the statistical theory underlying it are not simple. The algorithm for computing a dot plot is closely related to [[kernel density estimation]]. The size chosen for the dots affects the appearance of the plot. Choice of dot size is equivalent to choosing the bandwidth for a kernel density estimate.
Although the plot appears to be simple, its computation and the statistical theory underlying it are not simple. The algorithm for computing a dot plot is closely related to [[kernel density estimation]]. The size chosen for the dots affect


In the ''[[R (programming language)|R]]'' programming language this type of plot is also referred to as a '''stripchart'''<ref>{{Cite book
| author = Peter Dalgaard
| author-link = Peter Dalgaard
| title = Introductory Statistics with R
| publisher = Springer
| isbn = 0-387-95475-9
}}</ref>
or '''stripplot'''.<ref>{{Cite book
| author = Paul Murrell
| author-link = Paul Murrell
| title = R Graphics
| publisher = Chapman & Hall/CRC
| year = 2005
| url = http://www.stat.auckland.ac.nz/~paul/RGraphics/rgraphics.html
| isbn = 1-58488-486-X
}}</ref>


==Cleveland dot plots==
Dot plot may also refer to plots of points that each belong to one of several categories. They are an alternative to bar charts or pie charts, and look somewhat like a horizontal bar chart where the bars are replaced by a dots at the values associated with each category. Compared to (vertical) bar charts and pie charts, Cleveland argues that dot plots allow more accurate interpretation of the graph by readers by making the labels easier to read, reducing non-data ink (or graph clutter) and supporting table look-up.


==References==
==References==

Revision as of 15:16, 13 January 2021

A dot chart or dot plot is a statistical chart consisting of data points plotted on a fairly simple scale, typically using filled in circles. There are two common, yet very different, versions of the dot chart. The first has been used in hand-drawn (pre-computer era) graphs to depict distributions going back to 1884.[1] The other version is described by William S. Cleveland as an alternative to the bar chart, in which dots are used to depict the quantitative value


Although the plot appears to be simple, its computation and the statistical theory underlying it are not simple. The algorithm for computing a dot plot is closely related to kernel density estimation. The size chosen for the dots affect


References

  1. ^ Wilkinson, Leland (1999). "Dot plots". The American Statistician. 53 (3). American Statistical Association: 276–281. doi:10.2307/2686111. JSTOR 2686111.

Other references

  • Wild, C. and Seber, G. (2000) Chance Encounters: A First Course in Data Analysis and Inference John Wiley and Sons. ISBN 0-471-32936-3

External links