Blinded experiment: Difference between revisions

Content deleted Content added

Inline

Revision as of 08:34, 25 May 2019

A blind or blinded-experiment is an experiment in which information is withheld (masked or blinded) from participants to reduce bias.^[1] A blind can be imposed on any participant of an experiment, including subjects, researchers, technicians, data analysts, and evaluators. During the course of an experiment, a participant may become unblinded if they deduce or otherwise obtain information that has been masked to them. Unblinding is common in blind experiments, and participants must be assessed for unblinding to determine how well a blind succeed.

The terms "single-blind", "double-blind" and "triple-blind" are commonly used in medical research, but this terminology is recommended against because it is ambiguous.^[2] For instance, "double-blind" could mean that data analysts and patients were blinded; or patients and outcome assessors were blinded; or patients and people offering the intervention were blinded, etc. Current reporting guidelines recommend against the use of these terms; to describe an experiment's blinding, it is necessary to state who has been blinded to what information, and how well each blind succeeded.^[3]

Blinding is an important tool of the scientific method, and is used in many fields of research. In some fields, such as medicine, it is considered essential.^[4] The opposite of a blinded trial is an open trial.

History

The first blind experiment was conducted by the French Academy of Sciences in 1784 to investigate the claims of mesmerism as proposed by Franz Mesmer. In the experiment, researcher blindfolded mesmerists and asked them to identify objects that the experimenters had previously filled with "vital fluid". The subjects were unable to do so. In 1817, The first blind experiment recorded to have occurred outside of a scientific setting compared the musical quality of a Stradivarius violin to one with a guitar-like design. A violinist played each instrument while a committee of scientists and musicians listened from another room so as to avoid prejudice.^[5]^[6]

One early essay advocating the blinding of researchers came from Claude Bernard in the latter half of the 19th century.^[vague] Bernard recommended that the observer of an experiment should not have knowledge of the hypothesis being tested. This suggestion contrasted starkly with the prevalent Enlightenment-era attitude that scientific observation can only be objectively valid when undertaken by a well-educated, informed scientist.^[7] The first recorded double-blind study was conducted in 1907 by W. H. R. Rivers and H. N. Webber to investigate the effects of caffeine.^[8] Double-blind methods came into prominence in the mid-20th century.^[9]

Single-blind experiments

In a single-blind experiment, subjects are not informed of their treatment allocation. Other participants in the experiment may have access to this information.

Double-blind experiments

In a double-blind experiment, neither the participants nor the researchers know which participants belong to the control group and which belong to the test group. A record of which subjects belong to each group is kept by a third party, and is not revealed to the researchers until the study is over. A properly performed double-blind experiment eliminates both the placebo effect and observer bias. A double-blind experiment cannot be performed when the experimenters must know the full facts (for example, when comparing sham to real surgery). Double-blind experiments achieve a higher standard of scientific rigor than equivalent single-blind or non-blind experiments.

Triple-blind experiments

A triple-blind study is an extension of the double-blind design in which the committee monitoring response variables is also blind to the placement of the test subjects. A triple-blind study has the advantage of eliminating observer bias in the monitoring committee. This study design may not be possible in a trial where the monitoring committee has an ethical responsibility to ensure participant safety, as such a design may interfere with this prerogative.^[10]

Unblinding

Unblinding occurs when information becomes available to one from whom it has been masked. Unblinding that occurs before the conclusion of an experiment is a source of bias. Some degree of premature unblinding is common in blinded experiments.^[11] When a blind is imperfect, its success is judged on a spectrum with no blind (or complete failure of blind) on one end, perfect blinding on the other, and poor or good blinding between. Thus, the common view or studies as blinded or unblinded is an example of a false dichotomy.^[12]

Success of blinding is assessed by questioning study participants about information that has been masked to them (e.g. did you receive the drug or placebo?). In a perfectly blinded experiment, the responses should be consistent with no knowledge of the masked information. However, if unblinding has occurred, the responses will indicate the degree of unblinding. Since unblinding cannot be measured directly, but must be inferred from participants' responses, its measured value will depend on the nature of the questions asked. As a result, it is not possible to measure unblinding in a way that is completely objective. Nonetheless, it is still possible to make informed judgments about the quality of a blind. Poorly blinded studies rank above unblinded studies and below well-blinded studies in the hierarchy of evidence.^[13]

Meta-research has relieved high levels of unblinding in pharmacological trials. In particular, antidepressant trials are poorly blinded. Some researchers believe that the entire observed effect of antidepressant can be explained by failure of blinding, and that they have no genuine value in the treatment of depression.^[14] CONSORT standards and good clinical practice guidelines recommend that all studies assess and report quality of blinding. In practice, this rarely occurs.^[15]

Use

In medicine

Blinding is considered essential in medicine,^[16] but is often difficult to achieve. In drug studies, side effects can alert some patients to the presence of the drug, which effectively unblinds the study. While this shortcoming can be overcome with an active placebo, such controls are rarely used in medicine. It is also difficult to compare surgical and non-surgical interventions in blind trials. In some cases sham surgery is necessary for the blinding process. A good clinical protocol ensures that blinding is as effective as possible within ethical and practical constrains.

In physics

Modern nuclear physics and particle physics experiments often involve large numbers of data analysts working together to extract quantitative data from complex datasets. In particular, the analysts want to report accurate systematic error estimates for all of their measurements; this is difficult or impossible if one of the errors is observer bias. To remove this bias, the experimenters devise blind analysis techniques, where the experimental result is hidden from the analysts until they've agreed—based on properties of the data set other than the final value—that the analysis techniques are fixed.

One example of a blind analysis occurs in neutrino experiments, like the Sudbury Neutrino Observatory, where the experimenters wish to report the total number N of neutrinos seen. The experimenters have preexisting expectations about what this number should be, and these expectations must not be allowed to bias the analysis. Therefore, the experimenters are allowed to see an unknown fraction f of the dataset. They use these data to understand the backgrounds, signal-detection efficiencies, detector resolutions, etc.. However, since no one knows the "blinding fraction" f, no one has preexisting expectations about the meaningless neutrino count N' = N × f in the visible data; therefore, the analysis does not introduce any bias into the final number N which is reported. Another blinding scheme is used in B meson analyses in experiments like BaBar and CDF; here, the crucial experimental parameter is a correlation between certain particle energies and decay times—which require an extremely complex and painstaking analysis—and particle charge signs, which are fairly trivial to measure. Analysts are allowed to work with all the energy and decay data, but are forbidden from seeing the sign of the charge, and thus are unable to see the correlation (if any). At the end of the experiment, the correct charge signs are revealed; the analysis software is run once (with no subjective human intervention), and the resulting numbers are published. Searches for rare events, like electron neutrinos in MiniBooNE or proton decay in Super-Kamiokande, require a different class of blinding schemes.

The "hidden" part of the experiment—the fraction f for SNO, the charge-sign database for CDF—is usually called the "blindness box". At the end of the analysis period, one is allowed to "unblind the data" and "open the box".

In psychology

Single-blind trials are especially prone to bias in psychology and social science research where the experimenter has an idea of what the outcome should be, as the experimenter may transfer bias onto the subject. In some cases, while blind experiments would be useful, they are impractical or unethical: although it would be informative to raise children under arbitrary experimental conditions, such as on a remote island with a fabricated enculturation, it is a violation of ethics and human rights.

In forensics

In a police photo lineup, an officer shows a group of photos to a witness or crime victim and asks him or her to pick out the suspect. This is basically a single-blind test of the witness's memory, and may be subject to subtle or overt influence by the officer. There is a growing movement in law enforcement to move to a double-blind procedure in which the officer who shows the photos to the witness does not know which photo is of the suspect.^[17]^[18]

In music

Auditions for symphony orchestras take place behind a curtain so that the judges cannot see the performer. Blinding the judges to the gender of the performers has been shown to increase the hiring of women.^[19]

References

^ Oxford English Dictionary, 2nd ed.
^ Schulz KF, Chalmers I, Altman DG (February 2002). "The landscape and lexicon of blinding in randomized trials". Annals of Internal Medicine. 136 (3): 254–9. doi:10.7326/0003-4819-136-3-200202050-00022. PMID 11827510.
^ Moher D, Hopewell S, Schulz KF, Montori V, Gøtzsche PC, Devereaux PJ, Elbourne D, Egger M, Altman DG (March 2010). "CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials". BMJ. 340 (mar23 1): c869. doi:10.1136/bmj.c869. PMC 2844943. PMID 20332511.
^ "Oxford Centre for Evidence-based Medicine - Levels of Evidence (March 2009) - CEBM". cebm.net. 11 June 2009. Archived from the original on 26 October 2017. Retrieved 2 May 2018. {{cite web}}: Unknown parameter |deadurl= ignored (|url-status= suggested) (help)
^ Fétis, François-Joseph (1868). Biographie Universelle des Musiciens et Bibliographie Générale de la Musique, Tome 1 (Second ed.). Paris: Firmin Didot Frères, Fils, et Cie. p. 249. Retrieved 2011-07-21. {{cite book}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)
^ Dubourg, George (1852). The Violin: Some Account of That Leading Instrument and its Most Eminent Professors... (Fourth ed.). London: Robert Cocks and Co. pp. 356–357. Retrieved 2011-07-21. {{cite book}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)
^ Daston, Lorraine (2005). "Scientific Error and the Ethos of Belief". Social Research. 72 (1): 18. {{cite journal}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)
^ Rivers WH, Webber HN (August 1907). "The action of caffeine on the capacity for muscular work". The Journal of Physiology. 36 (1): 33–47. doi:10.1113/jphysiol.1907.sp001215. PMC 1533733. PMID 16992882.
^ Alder, Ken (2006). Kramer, Lloyd S.; Maza, Sarah C. (eds.). A Companion to Western Historical Thought. Blackwell Companions to History. Wiley-Blackwell. p. 307. ISBN 978-1-4051-4961-7. Retrieved 2012-02-11. Shortly after the start of the Cold War [...] double-blind reviews became the norm for conducting scientific medical research, as well as the means by which peers evaluated scholarship, both in science and in history. {{cite book}}: |work= ignored (help); Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)
^ Friedman LM, Furberg CD, DeMets DL (2010). Fundamentals of Clinical Trials. New York: Springer. pp. 119–132. ISBN 978-1-4419-1585-6.
^ Bello S, Moustgaard H, Hróbjartsson A (January 2017). "Unreported formal assessment of unblinding occurred in 4 of 10 randomized clinical trials, unreported loss of blinding in 1 of 10 trials". J Clin Epidemiol. 81: 42–50. doi:10.1016/j.jclinepi.2016.08.002. PMID 27555081.
^ Schulz KF, Grimes DA (2002). "Blinding in randomised trials: hiding who got what". Lancet. 359 (9307): 696–700. doi:10.1016/S0140-6736(02)07816-9. PMID 11879884.
^ Kolahi J, Bang H, Park J (2009). "Towards a proposal for assessment of blinding success in clinical trials: up-to-date review". Community Dent Oral Epidemiol. 37 (6): 477–484. doi:10.1111/j.1600-0528.2009.00494.x. PMC 3044082. PMID 19758415.
^ Ioannidis JP (2008). "Effectiveness of antidepressants: an evidence myth constructed from a thousand randomized trials?". Philos Ethics Humanit Med. 3: 14. doi:10.1186/1747-5341-3-14. PMC 2412901. PMID 18505564.{{cite journal}}: CS1 maint: unflagged free DOI (link)
^ Bello S, Moustgaard H, Hróbjartsson A (2014). "The risk of unblinding was infrequently and incompletely reported in 300 randomized clinical trial publications". J Clin Epidemiol. 67 (10): 1059–1069. doi:10.1016/j.jclinepi.2014.05.007. PMID 24973822.
^ "Oxford Centre for Evidence-based Medicine - Levels of Evidence (March 2009) - CEBM". cebm.net. 11 June 2009. Archived from the original on 26 October 2017. Retrieved 2 May 2018. {{cite web}}: Unknown parameter |deadurl= ignored (|url-status= suggested) (help)
^ Dittmann, Melissa (July–August 2004). "Accuracy and the accused: Psychologists work with law enforcement on research-based improvements to crime-suspect identification". Monitor on Psychology. 35 (7). American Psychological Association: 74. {{cite journal}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)
^ Koerner, Brendan I. (July–August 2002). "Under the Microscope". Legal Affairs. Retrieved 2 May 2018. {{cite web}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)
^ Miller, Claire Cain (25 February 2016). "Is Blind Hiring the Best Hiring?". The New York Times. Retrieved 26 April 2019. {{cite web}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[1] Oxford English Dictionary, 2nd ed.

[2] Schulz KF, Chalmers I, Altman DG (February 2002). "The landscape and lexicon of blinding in randomized trials". Annals of Internal Medicine. 136 (3): 254–9. doi:10.7326/0003-4819-136-3-200202050-00022. PMID 11827510.

[3] Moher D, Hopewell S, Schulz KF, Montori V, Gøtzsche PC, Devereaux PJ, Elbourne D, Egger M, Altman DG (March 2010). "CONSORT 2010 explanation and elaboration: updated guidelines for reporting parallel group randomised trials". BMJ. 340 (mar23 1): c869. doi:10.1136/bmj.c869. PMC 2844943. PMID 20332511.

[4] "Oxford Centre for Evidence-based Medicine - Levels of Evidence (March 2009) - CEBM". cebm.net. 11 June 2009. Archived from the original on 26 October 2017. Retrieved 2 May 2018. {{cite web}}: Unknown parameter |deadurl= ignored (|url-status= suggested) (help)

[5] Fétis, François-Joseph (1868). Biographie Universelle des Musiciens et Bibliographie Générale de la Musique, Tome 1 (Second ed.). Paris: Firmin Didot Frères, Fils, et Cie. p. 249. Retrieved 2011-07-21. {{cite book}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[6] Dubourg, George (1852). The Violin: Some Account of That Leading Instrument and its Most Eminent Professors... (Fourth ed.). London: Robert Cocks and Co. pp. 356–357. Retrieved 2011-07-21. {{cite book}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[7] Daston, Lorraine (2005). "Scientific Error and the Ethos of Belief". Social Research. 72 (1): 18. {{cite journal}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[pmid16992882-8] Rivers WH, Webber HN (August 1907). "The action of caffeine on the capacity for muscular work". The Journal of Physiology. 36 (1): 33–47. doi:10.1113/jphysiol.1907.sp001215. PMC 1533733. PMID 16992882.

[9] Alder, Ken (2006). Kramer, Lloyd S.; Maza, Sarah C. (eds.). A Companion to Western Historical Thought. Blackwell Companions to History. Wiley-Blackwell. p. 307. ISBN 978-1-4051-4961-7. Retrieved 2012-02-11. Shortly after the start of the Cold War [...] double-blind reviews became the norm for conducting scientific medical research, as well as the means by which peers evaluated scholarship, both in science and in history. {{cite book}}: |work= ignored (help); Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[Friedman,_Furberg_&_DeMets_(2010)-10] Friedman LM, Furberg CD, DeMets DL (2010). Fundamentals of Clinical Trials. New York: Springer. pp. 119–132. ISBN 978-1-4419-1585-6.

[11] Bello S, Moustgaard H, Hróbjartsson A (January 2017). "Unreported formal assessment of unblinding occurred in 4 of 10 randomized clinical trials, unreported loss of blinding in 1 of 10 trials". J Clin Epidemiol. 81: 42–50. doi:10.1016/j.jclinepi.2016.08.002. PMID 27555081.

[12] Schulz KF, Grimes DA (2002). "Blinding in randomised trials: hiding who got what". Lancet. 359 (9307): 696–700. doi:10.1016/S0140-6736(02)07816-9. PMID 11879884.

[Kolahi2009-13] Kolahi J, Bang H, Park J (2009). "Towards a proposal for assessment of blinding success in clinical trials: up-to-date review". Community Dent Oral Epidemiol. 37 (6): 477–484. doi:10.1111/j.1600-0528.2009.00494.x. PMC 3044082. PMID 19758415.

[Ioannidis2008-14] Ioannidis JP (2008). "Effectiveness of antidepressants: an evidence myth constructed from a thousand randomized trials?". Philos Ethics Humanit Med. 3: 14. doi:10.1186/1747-5341-3-14. PMC 2412901. PMID 18505564.{{cite journal}}: CS1 maint: unflagged free DOI (link)

[Bello2014-15] Bello S, Moustgaard H, Hróbjartsson A (2014). "The risk of unblinding was infrequently and incompletely reported in 300 randomized clinical trial publications". J Clin Epidemiol. 67 (10): 1059–1069. doi:10.1016/j.jclinepi.2014.05.007. PMID 24973822.

[16] "Oxford Centre for Evidence-based Medicine - Levels of Evidence (March 2009) - CEBM". cebm.net. 11 June 2009. Archived from the original on 26 October 2017. Retrieved 2 May 2018. {{cite web}}: Unknown parameter |deadurl= ignored (|url-status= suggested) (help)

[17] Dittmann, Melissa (July–August 2004). "Accuracy and the accused: Psychologists work with law enforcement on research-based improvements to crime-suspect identification". Monitor on Psychology. 35 (7). American Psychological Association: 74. {{cite journal}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[18] Koerner, Brendan I. (July–August 2002). "Under the Microscope". Legal Affairs. Retrieved 2 May 2018. {{cite web}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[19] Miller, Claire Cain (25 February 2016). "Is Blind Hiring the Best Hiring?". The New York Times. Retrieved 26 April 2019. {{cite web}}: Unknown parameter |name-list-format= ignored (|name-list-style= suggested) (help)

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

[14]

[15]

[16]

[17]

[18]

[19]

@@ Line 20: / Line 20: @@
 ==Triple-blind experiments==
 A triple-blind study is an extension of the double-blind design in which the committee monitoring response variables is also blind to the placement of the test subjects. A triple-blind study has the advantage of eliminating observer bias in the monitoring committee. This study design may not be possible in a trial where the monitoring committee has an ethical responsibility to ensure participant safety, as such a design may interfere with this prerogative.<ref name="Friedman, Furberg & DeMets (2010)">{{cite book | vauthors = Friedman LM, Furberg CD, DeMets DL | date = 2010 | title = Fundamentals of Clinical Trials | location = New York | publisher = Springer | pages = 119–132 | isbn = 978-1-4419-1585-6 }}</ref>
+==Unblinding==
+{{main article|Unblinding}}
+'''Unblinding''' occurs when information becomes available to one from whom it has been masked. Unblinding that occurs before the conclusion of an experiment is a source of bias. Some degree of premature unblinding is common in blinded experiments.<ref>{{cite journal|vauthors=Bello S, Moustgaard H, Hróbjartsson A|date=January 2017|title=Unreported formal assessment of unblinding occurred in 4 of 10 randomized clinical trials, unreported loss of blinding in 1 of 10 trials|url=|journal=J Clin Epidemiol|volume=81|issue=|pages=42–50|doi=10.1016/j.jclinepi.2016.08.002|pmid=27555081}}</ref> When a blind is imperfect, its success is judged on a [[spectrum]] with [[Open-label trial|no blind]] (or complete failure of blind) on one end, perfect blinding on the other, and poor or good blinding between. Thus, the common view or studies as blinded or unblinded is an example of a [[Dichotomy|false dichotomy]].<ref>{{cite journal|vauthors=Schulz KF, Grimes DA|date=2002|title=Blinding in randomised trials: hiding who got what|url=|journal=Lancet|volume=359|issue=9307|pages=696–700|doi=10.1016/S0140-6736(02)07816-9|pmid=11879884}}</ref>
+Success of blinding is assessed by questioning study participants about information that has been masked to them (e.g. did you receive the drug or [[placebo]]?). In a perfectly blinded experiment, the responses should be consistent with no knowledge of the masked information. However, if unblinding has occurred, the responses will indicate the degree of unblinding. Since unblinding [[Latent variable|cannot be measured directly]], but must be inferred from participants' responses, its measured value will depend on the [[Choice modeling|nature of the questions asked]]. As a result, it is not possible to measure unblinding in a way that is completely objective. Nonetheless, it is still possible to make informed judgments about the quality of a blind. Poorly blinded studies rank above unblinded studies and below well-blinded studies in the [[hierarchy of evidence]].<ref name="Kolahi2009">{{cite journal|vauthors=Kolahi J, Bang H, Park J|date=2009|title=Towards a proposal for assessment of blinding success in clinical trials: up-to-date review|url=|journal=Community Dent Oral Epidemiol|volume=37|issue=6|pages=477–484|doi=10.1111/j.1600-0528.2009.00494.x|pmc=3044082|pmid=19758415}}</ref>
+[[Metascience (research)|Meta-research]] has relieved high levels of unblinding in pharmacological trials. In particular, [[antidepressant]] trials are poorly blinded. Some researchers believe that the entire observed effect of antidepressant can be explained by failure of blinding, and that they have no genuine value in the treatment of depression.<ref name="Ioannidis2008">{{cite journal|vauthors=Ioannidis JP|date=2008|title=Effectiveness of antidepressants: an evidence myth constructed from a thousand randomized trials?|url=|journal=Philos Ethics Humanit Med|volume=3|issue=|pages=14|doi=10.1186/1747-5341-3-14|pmc=2412901|pmid=18505564}}</ref> [[CONSORT]] standards and [[good clinical practice]] guidelines recommend that all studies assess and report quality of blinding. In practice, this rarely occurs.<ref name="Bello2014">{{cite journal|vauthors=Bello S, Moustgaard H, Hróbjartsson A|date=2014|title=The risk of unblinding was infrequently and incompletely reported in 300 randomized clinical trial publications|url=|journal=J Clin Epidemiol|volume=67|issue=10|pages=1059–1069|doi=10.1016/j.jclinepi.2014.05.007|pmid=24973822}}</ref>
 ==Use==

v t e Design of experiments
Scientific method	Scientific experiment Statistical design Control Internal and external validity Experimental unit Blinding Optimal design: Bayesian Random assignment Randomization Restricted randomization Replication versus subsampling Sample size
Treatment and blocking	Treatment Effect size Contrast Interaction Confounding Orthogonality Blocking Covariate Nuisance variable
Models and inference	Linear regression Ordinary least squares Bayesian Random effect Mixed model Hierarchical model: Bayesian Analysis of variance (Anova) Cochran's theorem Manova (multivariate) Ancova (covariance) Compare means Multiple comparison
Designs Completely randomized	Factorial Fractional factorial Plackett–Burman Taguchi Response surface methodology Polynomial and rational modeling Box–Behnken Central composite Block Generalized randomized block design (GRBD) Latin square Graeco-Latin square Orthogonal array Latin hypercube Repeated measures design Crossover study Randomized controlled trial Sequential analysis Sequential probability ratio test
Glossary Category Mathematics portal Statistical outline Statistical topics