Latent variable

From Wikipedia, the free encyclopedia
Jump to: navigation, search

In statistics, latent variables (as opposed to observable variables), are variables that are not directly observed but are rather inferred (through a mathematical model) from other variables that are observed (directly measured). Mathematical models that aim to explain observed variables in terms of latent variables are called latent variable models. Latent variable models are used in many disciplines, including economics, machine learning/artificial intelligence, bioinformatics, natural language processing, psychology, and the social sciences.

Sometimes latent variables correspond to aspects of physical reality, which could in principle be measured, but may not be for practical reasons. In this situation, the term hidden variables is commonly used (reflecting the fact that the variables are "really there", but hidden). Other times, latent variables correspond to abstract concepts, like categories, behavioral or mental states, or data structures. The terms hypothetical variables or hypothetical constructs may be used in these situations.

One advantage of using latent variables is that it reduces the dimensionality of data. A large number of observable variables can be aggregated in a model to represent an underlying concept, making it easier to understand the data. In this sense, they serve a function similar to that of scientific theories. At the same time, latent variables link observable ("sub-symbolic") data in the real world to symbolic data in the modeled world.

Latent variables, as created by factor analytic methods, generally represent 'shared' variance, or the degree to which variables 'move' together. Variables that have no correlation cannot result in a latent construct based on the common factor model.[1]

Contents

[edit] Examples of latent variables

[edit] Economics

Examples of latent variables from the field of economics include quality of life, business confidence, morale, happiness and conservatism: these are all variables which cannot be measured directly. However, given an economic model linking these latent variables to other, observable variables (such as GDP), the values of the latent variables can be inferred from measurements of the observable variables.

[edit] Psychology

[edit] Common methods for inferring latent variables

[edit] Bayesian algorithms and methods

Bayesian statistics is often used for inferring latent variables.

[edit] See also

[edit] References

  1. ^ Tabachnick, B.G.; Fidell, L.S. (2001) Using Multivariate Analysis, Allyn and Bacon, Boston. ISBN 0-321-05677-9[page needed]
  2. ^ a b Borsboom, D.; Mellenbergh, G.J.; van Heerden, J. (2003) "The Theoretical Status of Latent Variables", Psychological Review, 110 (2), 203–219 doi:10.1037/0033-295X.110.2.203
  3. ^ Greene, Jeffrey A.; Brown, Scott C. (2009) "The Wisdom Development Scale: Further Validity Investigations", International Journal of Aging And Human Development, 68(4) 289–320 (at p. 291)
Personal tools
Namespaces

Variants
Actions
Navigation
Interaction
Toolbox
Print/export
Languages