Response bias is a general term for a wide range of tendencies for participants to respond inaccurately or falsely to questions. These biases are prevalent in research involving participant self-report, such as structured interviews or surveys. Response biases can have a large impact on the validity of questionnaires or surveys.
Response bias can be induced or caused by numerous factors, all relating to the idea that human subjects do not respond passively to stimuli, but rather actively integrate multiple sources of information to generate a response in a given situation. Because of this, almost any aspect of an experimental condition may potentially bias a respondent. Examples include the phrasing of questions in surveys, the demeanor of the researcher, the way the experiment is conducted, or the desires of the participant to be a good experimental subject and to provide socially desirable responses may affect the response in some way. All of these "artifacts" of survey and self-report research may have the potential to damage the validity of a measure or study. Compounding this issue is that surveys affected by response bias still often have high reliability, which can lure researchers into a false sense of security about the conclusions they draw.
Because of response bias, it is possible that some study results are due to a systematic response bias rather than the hypothesized effect, which can have a profound effect on psychological and other types of research using questionnaires or surveys. It is therefore important for researchers to be aware of response bias and the effect it can have on their research so that they can attempt to prevent it from impacting their findings in a negative manner.
History of research
Awareness of response bias has been present in psychology and sociology literature for some time because self-reporting features significantly in those fields of research. However, researchers were initially unwilling to admit the degree to which they impact, and potentially invalidate research utilizing these types of measures. Some researchers believed that the biases present in a group of subjects cancel out when the group is large enough. This would mean that the impact of response bias is random noise, which washes out if enough participants are included in the study. However, at the time this argument was proposed, effective methodological tools that could test it were not available. Once newer methodologies were developed, researchers began to investigate the impact of response bias. From this renewed research, two opposing sides arose.
The first group supports Hyman's belief that although response bias exists, it often has minimal effect on participant response, and no large steps need to be taken to mitigate it. These researchers hold that although there is significant literature identifying response bias as influencing the responses of study participants, these studies do not in fact provide empirical evidence that this is the case. They subscribe to the idea that the effects of this bias wash out with large enough samples, and that it is not a systematic problem in mental health research. These studies also call into question earlier research that investigated response bias on the basis of their research methodologies. For example, they mention that many of the studies had very small sample sizes, or that in studies looking at social desirability, a subtype of response bias, the researchers had no way to quantify the desirability of the statements used in the study. Additionally, some have argued that what researchers may believe to be artifacts of response bias, such as differences in responding between men and women, may in fact be actual differences between the two groups. Several other studies also found evidence that response bias is not as big of a problem as it may seem. The first found that when comparing the responses of participants, with and without controls for response bias, their answers to the surveys were not different. Two other studies found that although the bias may be present, the effects are extremely small, having little to no impact towards dramatically changing or altering the responses of participants.
The second group argues against Hyman's point, saying that response bias has a significant effect, and that researchers need to take steps to reduce response bias in order to conduct sound research. They argue that the impact of response bias is a systematic error inherent to this type of research and that it needs to be addressed in order for studies to be able to produce accurate results. In psychology, there are many studies exploring the impact of response bias in many different settings and with many different variables. For example, some studies have found effects of response bias in the reporting of depression in elderly patients. Other researchers have found that there are serious issues when responses to a given survey or questionnaire have responses that may seem desirable or undesirable to report, and that a person's responses to certain questions can be biased by their culture. Additionally, there is support for the idea that simply being part of an experiment can have dramatic effects on how participants act, thus biasing anything that they may do in a research or experimental setting when it comes to self-reporting. One of the most influential studies was one which found that social desirability bias, a type of response bias, can account for as much as 10–70% of the variance in participant response. Essentially, because of several findings that illustrate the dramatic effects response bias has on the outcomes of self-report research, this side supports the idea that steps need to be taken to mitigate the effects of response bias to maintain the accuracy of research.
While both sides have support in the literature, there appears to be greater empirical support for the significance of response bias. To add strength to the claims of those who argue the importance of response bias, many of the studies that reject the significance of response bias report multiple methodological issues in their studies. For example, they have extremely small samples that are not representative of the population as a whole, they only considered a small subset of potential variables that could be affected by response bias, and their measurements were conducted over the phone with poorly worded statements.
Acquiescence bias, which is also referred to as "yea-saying", is a category of response bias in which respondents to a survey have a tendency to agree with all the questions in a measure. This bias in responding may represent a form of dishonest reporting because the participant automatically endorses any statements, even if the result is contradictory responses. For example, a participant could be asked whether they endorse the following statement, "I prefer to spend time with others" but then later on in the survey also endorses "I prefer to spend time alone," which are contradictory statements. This is a distinct problem for self-report research because it does not allow a researcher to understand or gather accurate data from any type of question that asks for a participant to endorse or reject statements. Researchers have approached this issue by thinking about the bias in two different ways. The first deals with the idea that participants are trying to be agreeable, in order to avoid the disapproval of the researcher. A second cause for this type of bias was proposed by Lee Cronbach, when he argued that it is likely due to a problem in the cognitive processes of the participant, instead of the motivation to please the researcher. He argues that it may be due to biases in memory where an individual recalls information that supports endorsement of the statement, and ignores contradicting information.
Researchers have several methods to try and reduce this form of bias. Primarily, they attempt to make balanced response sets in a given measure, meaning that there are a balanced number of positively and negatively worded questions. This means that if a researcher was hoping to examine a certain trait with a given questionnaire, half of the questions would have a "yes" response to identify the trait, and the other half would have a "no" response to identify the trait.
Nay-saying is the opposite form of this bias. It occurs when a participant always chooses to deny or not endorse any statements in a survey or measure. This has a similar effect of invalidating any kinds of endorsements that participants may make over the course of the experiment.
Demand characteristics refer to a type of response bias where participants alter their response or behavior simply because they are part of an experiment. This arises because participants are actively engaged in the experiment, and may try to figure out the purpose, or adopt certain behaviors they believe belong in an experimental setting. Martin Orne was one of the first to identify this type of bias, and has developed several theories to address their cause. His research points to the idea that participants enter a certain type of social interaction when engaging in an experiment, and this special social interaction drives participants to consciously and unconsciously alter their behaviors There are several ways that this bias can influence participants and their responses in an experimental setting. One of the most common relates to the motivations of the participant. Many people choose to volunteer to be in studies because they believe that experiments are important. This drives participants to be "good subjects" and fulfill their role in the experiment properly, because they believe that their proper participation is vital to the success of the study. Thus, in an attempt to productively participate, the subject may try to gain knowledge of the hypothesis being tested in the experiment and alter their behavior in an attempt to support that hypothesis. Orne conceptualized this change by saying that the experiment may appear to a participant as a problem, and it is his or her job to find the solution to that problem, which would be behaving in a way that would lend support to the experimenter's hypothesis. Alternatively, a participant may try to discover the hypothesis simply to provide faulty information and wreck the hypothesis. Both of these results are harmful because they prevent the experimenters from gathering accurate data and making sound conclusions.
Outside of participant motivation, there are other factors that influence the appearance of demand characteristics in a study. Many of these factors relate to the unique nature of the experimental setting itself. For example, participants in studies are more likely to put up with uncomfortable or tedious tasks simply because they are in an experiment. Additionally, the mannerisms of the experimenter, such as the way they greet the participant, or the way they interact with the participant during the course of the experiment may inadvertently bias how the participant responds during the course of the experiment. Also, prior experiences of being in an experiment, or rumors of the experiment that participants may hear can greatly bias the way they respond. Outside of an experiment, these types of past experiences and mannerisms may have significant effects on how patients rank the effectiveness of their therapist. Many of the ways therapists go about collecting client feedback involve self-reporting measures, which can be highly influenced by response bias. Participants may be biased if they fill out these measure in front of their therapist, or somehow feel compelled to answer in an affirmative matter because they believe their therapy should be working. In this case, the therapists would not be able to gain accurate feedback from their clients, and be unable to improve their therapy or accurately tailor further treatment to what the participants need. All of these different examples may have significant effects on the responses of participants, driving them to respond in ways that do not reflect their actual beliefs or actual mindset, which negatively impact conclusions drawn from those surveys.
While demand characteristics cannot be completely removed from an experiment, there are steps that researchers can take to minimize the impact they may have on the results. One way to mitigate response bias is to use deception to prevent the participant from discovering the true hypothesis of the experiment and then debrief the participants. For example, research has demonstrated that repeated deception and debriefing is useful in preventing participants from becoming familiar with the experiment, and that participants do not significantly alter their behaviors after being deceived and debriefed multiple times. Another way that researchers attempt to reduce demand characteristics is by being as neutral as possible, or training those conducting the experiment to be as neutral as possible. For example, studies show that extensive one-on-one contact between the experimenter and the participant makes it more difficult to be neutral, and go on to suggest that this type of interaction should be limited when designing an experiment. Another way to prevent demand characteristics is to use blinded experiments with placebos or control groups. This prevents the experimenter from biasing the participant, because the researcher does not know in which way the participant should respond. Although not perfect, these methods can significantly reduce the effect of demand characteristics on a study, thus making the conclusions drawn from the experiment more likely to accurately reflect what they were intended to measure.
Extreme responding is a form of response bias that drives respondents to only select the most extreme options or answers available. For example, in a survey utilizing a Likert scale with potential responses ranging from one to five, the respondent may only give answers as ones or fives. Another example is if the participant only answered questionnaires with "strongly agree" or "strongly disagree" in a survey with that type of response style. There are several reasons for why this bias may take hold in a group of participants. One example ties the development of this type of bias in respondents to their cultural identity. This explanation states that people from certain cultures are more likely to respond in an extreme manner as compared to others. For example, research has found that those from the Middle East and Latin America are more prone to be affected by extremity response, whereas those from East Asia and Western Europe are less likely to be affected. A second explanation for this type of response bias relates to the education level of the participants. Research has indicated that those with lower intelligence, measured by an analysis of IQ and school achievement, are more likely to be affected by extremity response. Another way that this bias can be introduced is through the wording of questions in a survey or questionnaire. Certain topics or the wording of a question may drive participants to respond in an extreme manner, especially if it relates to the motivations or beliefs of the participant.
The opposite of this bias occurs when participants only select intermediate or mild responses as answers.
Question order bias
Question order bias, or "order effects bias", is a type of response bias where a respondent may react differently to questions based on the order in which questions appear in a survey or interview. Question order bias is different from "response order bias" that addresses specifically the order of the set of responses within a survey question. There are many ways that questionnaire items that appear earlier in a survey can affect responses to later questions. One way is when a question creates a "norm of reciprocity or fairness" as identified in the 1950 work of Herbert Hyman and Paul Sheatsley. In their research they asked two questions. One was asked on whether the United States should allow reporters from communist countries to come to the U.S. and send back news as they saw it; and another question was asked on whether a communist country like Russia should let American newspaper reporters come in and send back news as they saw it to America. In the study, the percentage of “yes” responses to the question allowing communist reporters increased by 37 percentage points depending on the order. Similarly results for the American reporters item increased by 24 percentage points. When either of the items was asked second, the context for the item was changed as a result of the answer to the first, and the responses to the second were more in line with what would be considered fair, based on the previous response. Another way to alter the response towards questions based on order depends on the framing of the question. If a respondent is first asked about their general interest in a subject their response interest may be higher than if they are first posed technical or knowledge based questions about a subject. Part-whole contrast effect is yet another ordering effect. When general and specific questions are asked in different orders, results for the specific item are generally unaffected, whereas those for the general item can change significantly. Question order biases occur primarily in survey or questionnaire settings. Some strategies to limit the effects of question order bias include randomization, grouping questions by topic to unfold in a logical order.
Social desirability bias
Social desirability bias is a type of response bias that influences a participant to deny undesirable traits, and ascribe to themselves traits that are socially desirable. In essence, it is a bias that drives an individual to answer in a way that makes them look more favorable to the experimenter. This bias can take many forms. Some individuals may over-report good behavior, while others may under-report bad, or undesirable behavior. A critical aspect of how this bias can come to affect the responses of participants relates to the norms of the society in which the research is taking place. For example, social desirability bias could play a large role if conducting research about an individual's tendency to use drugs. Those in a community where drug use is seen as acceptable or popular may exaggerate their own drug use, whereas those from a community where drug use is looked down upon may choose to under-report their own use. This type of bias is much more prevalent in questions that draw on a subject's opinion, like when asking a participant to evaluate or rate something, because there generally is not one correct answer, and the respondent has multiple ways they could answer the question. Overall, this bias can be very problematic for self-report researchers, especially if the topic they are looking at is controversial. The distortions created by respondents answering in a socially desirable manner can have profound effects on the validity of self-report research. Without being able to control for or deal with this bias, researchers are unable to determine if the effects they are measuring are due to individual differences, or from a desire to conform to the societal norms present in the population they are studying. Therefore, researchers strive to employ strategies aimed at mitigating social desirability bias so that they can draw valid conclusions from their research.
Several strategies exist to limit the effect of social desirability bias. In 1985, Anton Nederhof compiled a list of techniques and methodological strategies for researchers to use to mitigate the effects of social desirability bias in their studies. Most of these strategies involve deceiving the subject, or are related to the way questions in surveys and questionnaires are presented to those in a study. A condensed list of seven of the strategies are listed below:
- Ballot-box method: This method allows a subject to anonymously self-complete a questionnaire and submit it to a locked "ballot box", thereby concealing their responses from an interviewer and affording the participant an additional layer of assured concealment from perceived social repercussion.
- Forced-choice items: This technique hopes to generate questions that are equal in desirability to prevent a socially desirable response in one direction or another.
- Neutral questions: The goal of this strategy is to use questions that are rated as neutral by a wide range of participants so that socially desirable responding does not apply.
- Randomized response technique: This technique allows participants to answer a question that is randomly selected from a set of questions. The researcher in this technique does not know which question the subject responds to, so subjects are more likely to answer truthfully. Researchers can then use statistics to interpret the anonymous data.
- Self-administered questionnaires: This strategy involves isolating the participant before they begin answering the survey or questionnaire to hopefully remove any social cues the researcher may present to the participant.
- Bogus-pipeline: This technique involves a form of deception, where researchers convince a subject through a series of rigged demonstrations that a machine can accurately determine if a participant is being truthful when responding to certain questions. After the participant completes the survey or questionnaire, they are debriefed. This is a rare technique, and does not see much use because of the cost, time commitment and because it is a one-use only technique for each participant.
- Selection interviewers: This strategy allows participants to select the person or persons who will be conducting the interview or presiding over the experiment. This, in the hope that with a higher degree of rapport, subjects will be more likely to answer honestly.
- Proxy subjects: Instead of asking a person directly, this strategy questions someone who is close to or knows the target individual well. This technique is generally limited to questions about behavior, and is not adequate for asking about attitudes or beliefs.
The degree of effectiveness for each of these techniques or strategies differs depending on the situation and the question asked. In order to be the most successful in reducing social desirability bias in a wide range of situations, it has been suggested that researchers utilize a combination of these techniques to have the best chance at mitigating the effects of social desirability bias. Validations are not made on a "more is better" assumption (higher stated prevalence of the behavior of interest) when selecting the best method for reducing SDB as this is a "weak validation" that does not always guarantee the best results. Instead, ground "truthed" comparisons of observed data to stated data should reveal the most accurate method.
- Non-response bias is not the opposite of response bias and is not a type of cognitive bias: it occurs in a statistical survey if those who respond to the survey differ in the outcome variable.
- Response rate is not a cognitive bias, but rather refers to a ratio of those who complete the survey and those who do not.
Highly vulnerable areas
Some areas or topics that are highly vulnerable to the various types of response bias include:
- Total survey error
- Compound question
- Heckman correction
- Loaded question
- Misinformation effect, similar effect for memory instead of opinion.
- Opinion poll
- List of cognitive biases
- Blair, G., Coppock, A., & Moor, M. (2020). "When to Worry about Sensitivity Bias: A Social Reference Theory and Evidence from 30 Years of List Experiments." American Political Science Review.
- Furnham, Adrian (1986). "Response bias, social desirability and dissimulation". Personality and Individual Differences. 7 (3): 385–400. doi:10.1016/0191-8869(86)90014-0.
- Nederhof, Anton J. (1985). "Methods of coping with social desirability bias: A review". European Journal of Social Psychology. 15 (3): 263–280. doi:10.1002/ejsp.2420150303.
- Orne, Martin T. (1962). "On the social psychology of the psychological experiment: With particular reference to demand characteristics and their implications". American Psychologist. 17 (11): 776–783. doi:10.1037/h0043424.
- Kalton, Graham; Schuman, Howard (1982). "The Effect of the Question on Survey Responses: A Review" (PDF). Journal of the Royal Statistical Society. Series A (General). 145 (1): 42–73. doi:10.2307/2981421. hdl:2027.42/146916. JSTOR 2981421.
- Gove, W. R.; Geerken, M. R. (1977). "Response bias in surveys of mental health: An empirical investigation". American Journal of Sociology. 82 (6): 1289–1317. doi:10.1086/226466. JSTOR 2777936. PMID 889001.
- Hyman, H; 1954. Interviewing in Social Research. Chicago: University of Chicago Press.
- Clancy, Kevin; Gove, Walter (1974). "Sex Differences in Mental Illness: An Analysis of Response Bias in Self-Reports". American Journal of Sociology. 80 (1): 205–216. doi:10.1086/225767. JSTOR 2776967.
- Campbell, A. Converse, P. Rodgers; 1976. The Quality of American Life: Perceptions, Evaluations and Satisfaction. New York: Russell Sage.
- Gove, Walter R.; McCorkel, James; Fain, Terry; Hughes, Michael D. (1976). "Response bias in community surveys of mental health: Systematic bias or random noise?". Social Science & Medicine. 10 (9–10): 497–502. doi:10.1016/0037-7856(76)90118-9. PMID 1006342.
- Knäuper, Bärbel; Wittchen, Hans-Ulrich (1994). "Diagnosing major depression in the elderly: Evidence for response bias in standardized diagnostic interviews?". Journal of Psychiatric Research. 28 (2): 147–164. doi:10.1016/0022-3956(94)90026-4. PMID 7932277.
- Fischer, Ronald (2004). "Standardization to Account for Cross-Cultural Response Bias: A Classification of Score Adjustment Procedures and Review of Research in JCCP". Journal of Cross-Cultural Psychology. 35 (3): 263–282. doi:10.1177/0022022104264122.
- Reese, Robert J.; Gillaspy, J. Arthur; Owen, Jesse J.; Flora, Kevin L.; Cunningham, Linda C.; Archie, Danielle; Marsden, Troymichael (2013). "The Influence of Demand Characteristics and Social Desirability on Clients' Ratings of the Therapeutic Alliance". Journal of Clinical Psychology. 69 (7): 696–709. doi:10.1002/jclp.21946. PMID 23349082.
- Cronbach, L. J. (1942). "Studies of acquiescence as a factor in the true-false test". Journal of Educational Psychology. 33 (6): 401–415. doi:10.1037/h0054677.
- Watson, D. (1992). "Correcting for Acquiescent Response Bias in the Absence of a Balanced Scale: An Application to Class Consciousness". Sociological Methods & Research. 21: 52–88. doi:10.1177/0049124192021001003.
- Moss, Simon. (2008). Acquiescence bias
- Knowles, Eric S.; Nathan, Kobi T. (1997). "Acquiescent Responding in Self-Reports: Cognitive Style or Social Concern?". Journal of Research in Personality. 31 (2): 293–301. doi:10.1006/jrpe.1997.2180.
- Meisenberg, Gerhard; Williams, Amandy (2008). "Are acquiescent and extreme response styles related to low intelligence and education?". Personality and Individual Differences. 44 (7): 1539–1550. doi:10.1016/j.paid.2008.01.010.
- Podsakoff, Philip M.; MacKenzie, Scott B.; Lee, Jeong-Yeon; Podsakoff, Nathan P. (2003). "Common method biases in behavioral research: A critical review of the literature and recommended remedies". Journal of Applied Psychology. 88 (5): 879–903. doi:10.1037/0021-9010.88.5.879. hdl:2027.42/147112. PMID 14516251.
- Orne, Martin T. (2009). "Demand Characteristics and the Concept of Quasi-Controls". In Rosenthal, Robert; Rosnow, Ralph L. (eds.). Artifacts in Behavioral Research. Oxford University Press. pp. 110–137. doi:10.1093/acprof:oso/9780195385540.003.0005. ISBN 978-0-19-538554-0.
- Nichols, Austin Lee; Maner, Jon K. (2008). "The Good-Subject Effect: Investigating Participant Demand Characteristics". The Journal of General Psychology. 135 (2): 151–165. doi:10.3200/GENP.135.2.151-166. PMID 18507315.
- Cook, Thomas D.; et al. (1970). "Demand characteristics and three conceptions of the frequently deceived subject". Journal of Personality and Social Psychology. 14 (3): 185–194. doi:10.1037/h0028849.
- Blankenship, Albert (1942). "Psychological Difficulties in Measuring Consumer Preference". Journal of Marketing. 6 (4, part 2): 66–75. doi:10.1177/002224294200600420.1. JSTOR 1246085.
- Israel, Glenn D.; Taylor, C.L. (1990). "Can response order bias evaluations?". Evaluation and Program Planning. 13 (4): 365–371. doi:10.1016/0149-7189(90)90021-N.
- Hyman, H. H.; Sheatsley, P. B. (1950). "The Current Status of American Public Opinion". In Payne, J. C. (ed.). The Teaching of Contemporary Affairs: Twenty-first Yearbook of the National Council of Social Studies. pp. 11–34. OCLC 773251346.
- Lavrakas, Paul J. (2008). Encyclopedia of Survey Research Methods. Thousand Oaks: SAGE Publications, Inc. pp. 664–665. ISBN 9781412918084.
- "Questionnaire design". Pew Research Center. 2015-01-29. Retrieved 2017-11-18.
- Bova, Christopher S.; Aswani, Shankar; Farthing, Matthew W.; Potts, Warren M. (2018). "Limitations of the random response technique and a call to implement the ballot box method for estimating recreational angler compliance using surveys". Fisheries Research. 208: 34–41. doi:10.1016/j.fishres.2018.06.017.
- Babor, T F; Stephens, R S; Marlatt, G A (1987). "Verbal report methods in clinical research on alcoholism: Response bias and its minimization". Journal of Studies on Alcohol. 48 (5): 410–424. doi:10.15288/jsa.1987.48.410. PMID 3312821.
- Embree, B G; Whitehead, P C (1993). "Validity and reliability of self-reported drinking behavior: Dealing with the problem of response bias". Journal of Studies on Alcohol. 54 (3): 334–344. doi:10.15288/jsa.1993.54.334. PMID 8487543.