Pie chart

From Wikipedia, the free encyclopedia
Jump to: navigation, search
Not to be confused with circle graph.
Pie chart of populations of English native speakers

A pie chart is a circular chart divided into sectors, illustrating numerical proportion. In a pie chart, the arc length of each sector (and consequently its central angle and area), is proportional to the quantity it represents. While it is named for its resemblance to a pie which has been sliced, there are variations on the way it can be presented. The earliest known pie chart is generally credited to William Playfair's Statistical Breviary of 1801.[1][2]

Pie charts are very widely used in the business world and the mass media.[3] However, they have been criticized,[4] and many experts recommend avoiding them,[5][6][7][8] pointing out that research has shown it is difficult to compare different sections of a given pie chart, or to compare data across different pie charts. Pie charts can be replaced in most cases by other plots such as the bar chart.

Example[edit]

A pie chart for the example data.

The following example chart is based on preliminary results of the election for the European Parliament in 2004. The table lists the number of seats allocated to each party group, along with the derived percentage of the total that they each make up. The values in the last column, the derived central angle of each sector, is found by multiplying the percentage by 360°.

Group Seats Percent (%) Central angle (°)
EUL 39 5.3 19.2
PES 200 27.3 98.4
EFA 42 5.7 20.7
EDD 15 2.0 7.4
ELDR 67 9.2 33.0
EPP 276 37.7 135.7
UEN 27 3.7 13.3
Other 66 9.0 32.5
Total 732 99.9* 360.2*

*Because of rounding, these totals do not add up to 100 and 360.

The size of each central angle is proportional to the size of the corresponding quantity, here the number of seats. Since the sum of the central angles has to be 360°, the central angle for a quantity that is a fraction Q of the total is 360Q degrees. In the example, the central angle for the largest group (European People's Party (EPP)) is 135.7° because 0.377 times 360, rounded to one decimal place, equals 135.7.

Use, effectiveness and visual perception[edit]

An obvious flaw exhibited by pie charts is that they cannot show more than a few values without separating the visual encoding (the “slices”) from the data they represent (typically percentages). When slices become too small, pie charts have to rely on colors, textures or arrows so the reader can understand them. This makes them unsuitable for use with larger amounts of data. Pie charts also take up a larger amount of space on the page compared to the more flexible bar charts, which do not need to have separate legends, and can display other values such as averages or targets at the same time.[7]

An example of a pie chart with 18 values, having to separate the data from its representation. Note that several values are represented with the same color, making interpretation difficult.
Three sets of data plotted using pie charts and bar charts.

Statisticians generally regard pie charts as a poor method of displaying information, and they are uncommon in scientific literature. One reason is that it is more difficult for comparisons to be made between the size of items in a chart when area is used instead of length and when different items are shown as different shapes.

Further, in research performed at AT&T Bell Laboratories, it was shown that comparison by angle was less accurate than comparison by length. This can be illustrated with the diagram to the right, showing three pie charts, and, below each of them, the corresponding bar chart representing the same data. Most subjects have difficulty ordering the slices in the pie chart by size; when the bar chart is used the comparison is much easier.[9] Similarly, comparisons between data sets are easier using the bar chart. However, if the goal is to compare a given category (a slice of the pie) with the total (the whole pie) in a single chart and the multiple is close to 25 or 50 percent, then a pie chart can often be more effective than a bar graph.[10][11]

Variants and similar charts[edit]

Exploded pie chart[edit]

An exploded pie chart for the example data, with the largest party group exploded.

A chart with one or more sectors separated from the rest of the disk is known as an exploded pie chart. This effect is used to either highlight a sector, or to highlight smaller segments of the chart with small proportions.

Polar area diagram[edit]

"Diagram of the causes of mortality in the army in the East" by Florence Nightingale.

The polar area diagram is similar to a usual pie chart, except sectors are equal angles and differ rather in how far each sector extends from the center of the circle. The polar area diagram is used to plot cyclic phenomena (e.g., count of deaths by month). For example, if the count of deaths in each month for a year are to be plotted then there will be 12 sectors (one per month) all with the same angle of 30 degrees each. The radius of each sector would be proportional to the square root of the death count for the month, so the area of a sector represents the number of deaths in a month. If the death count in each month is subdivided by cause of death, it is possible to make multiple comparisons on one diagram, as is seen in the polar area diagram famously developed by Florence Nightingale.

The first known use of polar area diagrams was by André-Michel Guerry, which he called courbes circulaires, in an 1829 paper showing seasonal and daily variation in wind direction over the year and births and deaths by hour of the day.[12] Léon Lalanne later used a polar diagram to show the frequency of wind directions around compass points in 1843. The wind rose is still used by meteorologists. Nightingale published her rose diagram in 1858. The name "coxcomb" is sometimes used erroneously: this was the name Nightingale used to refer to a book containing the diagrams rather than the diagrams themselves.[13] It has been suggested[by whom?] that most of Nightingale's early reputation was built on her ability to give clear and concise presentations of data.

Spie chart[edit]

A useful variant of the polar area chart is the spie chart designed by Feitelson.[14] This superimposes a normal pie chart with a modified polar area chart to permit the comparison of a set of data at two different states. The base pie chart represents the first state in the usual way, with different slice sizes. The second state is represented by the superimposed polar area chart, using the same angles as the base, and adjusting the radii to fit the data. This is useful, among other things, for visualizing hazards to different population groups. For example, the base pie chart can show the distribution of age and gender groups in the general population, and the overlay their representation among road casualties; age and gender groups that are especially susceptible to being involved in accidents then stand out as slices that extend far beyond the original pie chart. The R Graph Gallery provides an example.[15]

Ring chart / Sunburst chart / Multilevel pie chart[edit]

Multi-level pie chart representing disk usage in a Linux file system
See also: Radial tree

A ring chart, also known as a sunburst chart or a multilevel pie chart, is used to visualize hierarchical data, depicted by concentric circles.[16] The circle in the centre represents the root node, with the hierarchy moving outward from the center. A segment of the inner circle bears a hierarchical relationship to those segments of the outer circle which lie within the angular sweep of the parent segment.[17]

3D pie chart / Perspective pie chart[edit]

A 3D pie chart, or perspective pie chart, is used to give the chart a 3D look. Often used for aesthetic reasons, the third dimension does not improve the reading of the data; on the contrary, these plots are difficult to interpret because of the distorted effect of perspective associated with the third dimension. The use of superfluous dimensions not used to display the data of interest is discouraged for charts in general, not only for pie charts.[7][18]

Doughnut chart[edit]

A doughnut chart (also spelled donut) is functionally identical to a pie chart, with the exception of a blank center and the ability to support multiple statistics at once. Doughnut charts provide a better data intensity ratio to standard pie charts since the blank center can be used to display additional, related data as shown in the example.

Example of a doughnut chart

History[edit]

The earliest known pie chart is generally credited to William Playfair's Statistical Breviary of 1801, in which two such graphs are used.[1][2][19] This invention was not widely used at first;[1] the French engineer Charles Joseph Minard was one of the first to use it in 1858, in particular in maps where he needed to add information in a third dimension.[20] It has been said that Florence Nightingale invented it, though in fact she just popularised it and she was later assumed to have created it due to the obscurity of Playfair's creation.[21]

See also[edit]

Notes[edit]

  1. ^ a b c Spence (2005)
  2. ^ a b Tufte, p. 44
  3. ^ Cleveland, p. 262
  4. ^ Wilkinson, p. 23.
  5. ^ Tufte, p. 178.
  6. ^ van Belle, p. 160–162.
  7. ^ a b c Stephen Few. "Save the Pies for Dessert", August 2007, Retrieved 2010-02-02
  8. ^ Steve Fenton "Pie Charts Are Bad"
  9. ^ Cleveland, p. 86–87
  10. ^ Simkin, D., & Hastie, R. (1987). An Information-Processing Analysis of Graph Perception. Journal of the American Statistical Association, 82(398), 454. doi:10.2307/2289447. Kosara, Robert. "In Defense of Pie Charts". Retrieved April 13, 2011. 
  11. ^ Spence, Ian; Lewandowsky, Stephan (1 January 1991). "Displaying proportions and percentages". Applied Cognitive Psychology 5 (1): 61–77. doi:10.1002/acp.2350050106. 
  12. ^ Friendly, p. 509
  13. ^ "Florence Nightingale's Statistical Diagrams". Retrieved 2010-11-22. 
  14. ^ "Feitelson, Dror (2003) Comparing Partitions With Spie Charts". 2003. Retrieved 2010-08-31. 
  15. ^ "R Graph Gallery: Spie chart". Retrieved 2010-08-31. [dead link]
  16. ^ Clark Jeff. (2006). Neoformix. "Multi-level Pie Charts"
  17. ^ Webber Richard, Herbert Ric, Jiangbc Wel. "Space-filling Techniques in Visualizing Output from Computer Based Economic Models"
  18. ^ Good and Hardin, chapter 8.
  19. ^ http://www.datavis.ca/milestones/index.php?group=1800%2B&mid=ms89
  20. ^ Palsky, p. 144–145
  21. ^ Dave article on this information on QI

References[edit]

  • Cleveland, William S. (1985). The Elements of Graphing Data. Pacific Grove, CA: Wadsworth & Advanced Book Program. ISBN 0-534-03730-5. 
  • Friendly, Michael. The Golden Age of Statistical Graphics, Statistical Science, Volume 23, Number 4 (2008), 502-535 [1]
  • Good, Phillip I. and Hardin, James W. Common Errors in Statistics (and How to Avoid Them). Wiley. 2003. ISBN 0-471-46068-0.
  • Guerry, A.-M. (1829). Tableau des variations météorologique comparées aux phénomènes physiologiques, d'aprés les observations faites à l'obervatoire royal, et les recherches statistique les plus récentes. Annales d'Hygiène Publique et de Médecine Légale, 1 :228-.
  • Harris, Robert L. (1999). Information Graphics: A comprehensive Illustrated Reference. Oxford University Press. ISBN 0-19-513532-6. 
  • Palsky Gilles. Des chiffres et des cartes: la cartographie quantitative au XIXè siècle. Paris: Comité des travaux historiques et scientifiques, 1996. ISBN 2-7355-0336-4.
  • Playfair, William, Commercial and Political Atlas and Statistical Breviary, Cambridge University Press (2005) ISBN 0-521-85554-3.
  • Spence, Ian. No Humble Pie: The Origins and Usage of a statistical Chart. Journal of Educational and Behavioral Statistics. Winter 2005, 30 (4), 353–368.
  • Tufte, Edward. The Visual Display of Quantitative Information. Graphics Press, 2001. ISBN 0-9613921-4-2.
  • van Belle, Gerald. Statistical Rules of Thumb. Wiley, 2002. ISBN 0-471-40227-3.
  • Wilkinson, Leland. The Grammar of Graphics, 2nd edition. Springer, 2005. ISBN 0-387-24544-8.
  • Clark Jeff. (2006). ‘’Neoformix’’. Multi-level Pie Charts [2]
  • Webber Richard, Herbert Ric, Jiangbc Wel. Space-filling Techniques in Visualizing Output from Computer Based Economic Models [3]
  • Stasko John. SunBurst [www.cc.gatech.edu/gvu/ii/sunburst/]
  • Woodbury, Henry. Nightingales Rose