# Index (statistics)

In statistics and research design, an index is a composite statistic – a measure of changes in a representative group of individual data points, or in other words, a compound measure that aggregates multiple indicators.[1][2] Indexes – also known as composite indicators – summarize and rank specific observations.[2]

Much data in the field of social sciences and sustainability are represented in various indices such as Gender Gap Index, Human Development Index or the Dow Jones Industrial Average. The ‘Report by the Commission on the Measurement of Economic Performance and Social Progress’, written by Joseph Stiglitz, Amartya Sen, and Jean-Paul Fitoussi in 2009 [3] suggests that these measures have experienced a dramatic growth in recent years due to three concurring factors:

• improvements in the level of literacy (including statistical)
• increased complexity of modern societies and economies, and
• widespread availability of information technology.

According to Earl Babbie, items in indexes are usually weighted equally, unless there are some reasons against it (for example, if two items reflect essentially the same aspect of a variable, they could have a weight of 0.5 each).[4]

According to the same author,[5] constructing the items involves four steps. First, items should be selected based on their content validity, unidimensionality, the degree of specificity in which a dimension is to be measured, and their amount of variance. Items should be empirically related to one another, which leads to the second step of examining their multivariate relationships. Third, indexes scores are designed, which involves determining their score ranges and weights for the items. Finally, indexes should be validated, which involves testing whether they can predict indicators related to the measured variable not used in their construction.[5]

A handbook for the construction of composite indicators was published jointly by the OECD and by the European Commission's Joint Research Centre in 2008.[6] The handbook – officially endorsed by the OECD high level statistical committee, describe ten recursive steps for developing an index:[7]

• Step 1: Theoretical framework
• Step 2: Data selection
• Step 3: Imputation of missing data
• Step 4: Multivariate analysis
• Step 5: Normalisation
• Step 6: Weighting
• Step 7: Aggregating indicators
• Step 8: Sensitivity analysis
• Step 9: Link to other measures
• Step 10: Visualisation

As suggested by the list, many modelling choices are needed to construct a composite indicator, which makes their use controversial.[6] The delicate issue of assigning and validating weights is discussed e.g. in.[8] A sociological reading of the nature of composite indicators is offered by Paul-Marie Boulanger, who sees these measures at the intersection of three movements:[9]

• the democratisation of expertise, the concept that more knowledge is needed to tackle societal and environmental issues that can be provided by the sole experts – this line of thought connects to the concept of extended peer community developed by post-normal science
• the impulse to the creation of a new public through a process of social discovery, which can be reconnected to the work of pragmatists such as John Dewey
• the semiotic of Charles Sanders Peirce; Thus a CI is not just a sign or a number, but suggests an action or a behaviour.

A subsequent work of the same author [10] analyses composite indicators in light of the social system theories of Niklas Luhmann to investigate how different measurements of progress are or are not taken up