In statistics and causal graphs, a variable is a collider when it is causally influenced by two or more variables. The causal variables influencing the collider are themselves not necessarily associated. The name "collider" reflects the fact that in graphical models, the arrow heads from variables that lead into the collider appear to "collide" on the node that is the collider. They are sometimes also referred to as inverted forks.
The result of having a collider in the path is that the collider blocks the association between the variables that influence it. Thus, the collider does not generate an unconditional association between the variables that determine it.
Conditioning on the collider via regression analysis, stratification, experimental design, or sample selection based on values of the collider create a non-causal association between X and Y (Berkson's paradox). In the terminology of causal graphs, conditioning on the colllider open the path between X and Y. This will introduce bias when estimating the causal association between X and Y, potentially introducing associations where there are none. Colliders can therefore undermine attempts to test causal theories.
Colliders are sometimes confused with confounder variables. Unlike colliders, confounder variables should be controlled for when estimating causal associations.
- Hernan, Miguel A; Robins, James M (2010), Causal inference, Chapman & Hall/CRC monographs on statistics & applied probability, CRC, p. 70, ISBN 978-1-4200-7616-5
- Greenland, Sander; Pearl, Judea; Robins, James M (January 1999), "Causal Diagrams for Epidemiologic Research" (PDF), Epidemiology, 10 (1): 37–48, doi:10.1097/00001648-199901000-00008, ISSN 1044-3983, OCLC 484244020, PMID 9888278
- Pearl, Judea (1986). "Fusion, Propagation and Structuring in Belief Networks". Artificial Intelligence. 29 (3): 241–288. doi:10.1016/0004-3702(86)90072-x.
- Pearl, Judea (1988). Probabilistic reasoning in intelligent systems: networks of plausible inference. Morgan Kaufmann.