Voice stress analysis
Voice risk or voice stress analysis (VSA) technology is said to record psychophysiological stress responses that are present in the human voice when a person suffers psychological stress in response to a stimulus (e.g., a question), and the consequences of the person's answer may be dire. This is based on the belief that non-verbal content of the voice carries, among other things, information about the physiological and psychological state of the speaker. Manufacturers of VSA typically claim it can be employed to detect deception in a variety of settings such as police interviews, insurance claims, and social benefit claims.
Principle and origins
In 1970, and prior to the publishing of Lippold's article in 1971, three military officers retired from the U.S. Army and formed a company they named Dektor Counterintelligence and Security (CIS). The three officers were Alan Bell, Bill Ford and Charles McQuiston. Bell's expertise was in counterintelligence, Ford's was in electronics, and McQuiston's was in polygraphy. Ford had invented an electronic device that utilized the theory of Lippold, Halliday and Redfearn in which he tape-recorded the human voice, slowed it down to one-third or one-fourth its normal rate, and fed it through several low pass filters which then fed the signal into an EKG strip chart recorder. The strip chart recorder then made chart tracings on heat-sensitive paper. They named their device the Psychological Stress Evaluator (PSE). According to Allan Bell Enterprises, "All lie-detection examinations or evaluations are predicated upon the fact that telling a significant lie will produce some degree of psychological stress. Psychological stress, in turn, causes a number of physiological changes."
The use of Voice Stress Analysis (VSA) technology to detect deception is highly controversial and use of these machines has been referred to as 'charlatanry' by academics. Discussions about its application revolve around whether this technology can indeed reliably detect stress, and if so, whether deception can be inferred from this stress. The latter is a logical inference problem referred to as the Othello error. It has even been argued that - even if stress could reliably be measured from the voice - this would be highly similar to measuring stress with for example the polygraph, and that all critiques voiced towards polygraph testing apply to VSA as well.
Several studies published in peer reviewed journals showed VSA to perform at chance level when it comes to detecting deception. Horvath, McCloughan, Weatherman, and Slowik (2013), for example, tested VSA on the recordings of interrogation of 74 suspects. Eighteen of these suspects later confessed, making the deception the most likely ground truth. With 48% accurate classification, VSA performed at chance level. Damhousse and colleagues interviewed over 300 arrestees about recent drug use. Their responses and the VSA output were compared to a subsequent urinalysis to determine if the VSA programs could detect deception. The results showed the programs were not able to detect deception at a rate any better than chance. Other peer reviewed studies showed similar results. In 2003, the National Research Council concluded “Overall, this research and the few controlled tests conducted over the past decade offer little or no scientific basis for the use of the computer voice stress analyser or similar voice measurement instruments”.
Research published in the Proceedings of the 2005 Hawaii International Conference on System Sciences, identified that VSA technology correctly classified 73% of deceptive examinees. Yet, for truth tellers - with 52% accurate classification - it performed as well as a coin flip. A report from the U.S. Air Force Research Laboratory in Rome New York concluded that the voice stress units tested were indeed able to recognize stress in the spoken voice. Yet, the authors also tested VSA as a deception detector on material of 2 convicted murderers and concluded that 'Although these systems state they detect deception, this was not proven'.
Support for VSA as a deception detector typically comes from poorly executed studies published in journals with a doubtful status. A new Indian open access Journal, for example, (the International Journal of Electrical, Electronics and Computer Engineering (IJEECE)) published a study claiming that Voice Stress Analysis (VSA) technology can identify emotional stress better than polygraph. This study did not, however, make any experimental comparison between VSA and the polygraph, meaning that the evidence for this claims remain mysterious. Also the statement 'better than polygraph' is not very informative, as it does not contain any information on the effect size of either of the techniques.
Similarly, an Ukrainian Journal issued by Kyiv Scientific Research Institute of Court Expertise published a study by James Chapman and Marigo Stathis, evaluating the use of the voice stress analysis technology for the detection of stress associated with possible deception. VSA detected stress associated with criminal activities in 95% of the cases in which a confession was obtained, and there no cases wherein a confession was obtained in the absence of stress. Importantly, such a study suffers from sampling bias. Suspects will only be interrogated when they fail the test. Consequently, confessions will only follow in those cases where the test outcome was deception indicated. When one subsequently rounds up all the cases in which a confession was obtained, it should come as no surprise that the confessions match deception indicated test outcomes to a high degree. This would happen even when a test performs at chance level. Using lie detectors as a prop to elicit confessions is referred to as the bogus pipeline effect, and is not without danger. In the Murder of Stephanie Crowe, for example, three suspects confessed following an interrogation with a VSA, but these confessions were later ruled false.
Notable examples of use
VSA played a decisive role in the three false confessions in the Murder of Stephanie Crowe The manufacturer of the VSA equipment later settled a lawsuit that alleged that the voice analyzer's manufacturers were liable for the harm the three defendants suffered. In a similar case, Donovan Allen falsely confessed to killing his mother after failing a VSA test. He was acquitted 15 years later based on exonerating DNA evidence.
Following the 2012 case in which George Zimmerman fatally shot Florida teenager Trayvon Martin, Zimmerman was given a voice stress analysis test by the police department of Sanford, Florida. He passed the test. A videotape of the test was publicly released in June 2012.
Federal judge approves computer voice stress analysis to monitor sex offenders. A recent ruling from a U.S. Federal court judge may require sex offenders to submit Computer Voice Stress Analyzer (CVSA) examinations throughout the post-release supervision process.
Confession made following a voice stress examination was allowed to be used as evidence in a rape trial.
Work and Pensions Secretary John Hutton announced the introduction of voice-risk analysis software to root out benefit cheats, but in 2010, the government dropped plans to introduce the controversial lie detector tests to after trials found that the technology is not sufficiently reliable. The Guardian newspaper reported in 2009 that the UK Department for Work and Pensions analysis of data from trials conducted between May 2007 and July 2008 in various agencies shows accuracy rates no better than chance. Following a Freedom of Information request by the trade union financed campaign body False Economy The Guardian revisited the story in 2014.
- Ruiz, Selye, & Guell, 1990. "Voice analysis to predict the psychological or physical state of a speaker", Aviation, Space, and Environmental Medicine, 1990. Ruiz et al. report that their "research suggests that psychological stress may be detected as acoustic modifications in the fundamental frequency of a speakers voice" and "that the fundamental frequency of the vocal signal is slowly modulated (8-14 Hz) during speech in an emotionally neutral situation. In situations demanding increased 'mental or psychomotor' activity, the 8-14 Hz modulation then decreases as the striated muscles surrounding the vocal cords contract in response to the arousal, thus limiting the natural trembling."
- Rothkrantz, L. J., Wiggers, P., Wees, J. A., & Vark, R. J. (2004). Voice Stress Analysis. Text, Speech and Dialogue Lecture Notes in Computer Science, 449-456. doi:10.1007/978-3-540-30120-2_57
- Haddad, D., Walter, S., Ratley, R., & Smith, M. (2001). "Investigation and Evaluation of Voice Stress Analysis Technologies." Rome Laboratory Report (AFRL-IF-RS-TM-2001-7), 18-19.
- Eriksson, A., & Lacerda, F. (2007). Charlatanry in forensic speech science: A problem to be taken seriously. International Journal of Speech, Language and the Law, 14(2), 169-193.
- Lykken, D. T. (1998). A tremor in the blood: Uses and abuses of the lie detector. Plenum Press.
- Horvath, F., McCloughan, J., Weatherman, D., & Slowik, S. (2013). The Accuracy of auditors' and layered voice Analysis (LVA) operators' judgments of truth and Deception During Police Questioning. Journal of forensic sciences, 58, 385-392.
- Damphousse, K. R. (2008). Voice stress analysis: Only 15 percent of lies about drug use detected in field test. NIJ Journal, 259, 8-12.
- Damphousse, K. R., Pointon, L., Upchurch, D., & Moore, R. K. (2007). Assessing the validity of voice stress analysis tools in a jail setting. US Department of Justice.
- Harnsberger, J. D., Hollien, H., Martin, C. A., & Hollien, K. A. (2009). Stress and Deception in Speech: Evaluating Layered Voice Analysis. Journal of forensic sciences, 54, 642-650.
- Hollien, H., Harnsberger, J. D., Martin, C. A., & Hollien, K. A. (2008). Evaluation of the NITV CVSA. Journal of forensic sciences, 53(1), 183-193.
- National Research Council (2003). The Polygraph and Lie Detection. Committee to review the scientific evidence on the polygraph. Division of Behavioral and Social Sciences and Education. Washington, DC: The National Academic Press.
- Sprague, R. H. (2005). "Evaluation of Voice Stress Analysis Technology". HICSS 2005 38th Hawaii International International Conference on Systems Science (03-06 January 2005/Big Island, HI). Los Alamitos: IEEE Computer Society Press.
- Patil, V. P., Nayak, K. K., & Saxena, M. "Voice Stress Detection", 2, 148-154. Journal of Electrical, Electronics and Computer Engineering, IJEECE (online) ISSN 1748-8893.
- Chapman, J. L., & Stathis, M. (2012). "Ocenka ehffektivnosti tekhnologii VSA (Voice Stress Analysis) na osnove praktiki ugolovnogo sudoproizvodstva v SShA". Kriminalistika i sudebnaya ehkspertiza: Sbornik nauchnyh trudov, 57, 238-250. [in Russian]]
- Iacono, W. G. (1991). Can we determine the accuracy of polygraph tests? In P. K. Ackles, J. R. Jennings, & M. G. H. Coles (Eds.), Advances in psychophysiology (pp. 201–201). Greenwich, CT: JAI Press
- Sauer, Mark (May 22, 2012). "Michael Crowe Found 'Factually Innocent' In Sister's Murder". KPBS. Retrieved 7 October 2016.. See also Marshall, Scott (May 25, 2005). "Maker of voice-analysis machine settles Crowe lawsuit". The San Diego Union Tribune. Retrieved 7 October 2016.
- Marshall, Scott (May 25, 2005). "Maker of voice-analysis machine settles Crowe lawsuit". The San Diego Union Tribune. Retrieved 7 October 2016.
- http://tdn.com/news/critic-says-longview-police-got-a-false-confession-out-of/article_aa747eed-6b32-58da-ab53-1412700d7e4a.html and http://www.dailymail.co.uk/news/article-3343992/Son-spent-15-years-bars-killing-mother-freed-new-DNA-evidence-proves-innocence.html
- Kovaleski, Serge F. (May 16, 2012). "Trayvon Martin Case Shadowed by Series of Police Missteps". The New York Times. Retrieved May 22, 2012.
- Stutzman, Rene (June 21, 2012). "Tape released of Zimmerman's re-enactment of Martin shooting". Orlando Sentinel.
- "Federal judge approves computer voice stress analysis to monitor sex offenders". Government Security News. March 11, 2014. Retrieved May 27, 2016.
- "Statements in rape case allowed in, made 17 hours after voice stress test, judge rules". The Journal Times. June 19, 2014. Retrieved May 27, 2016.
- http://www.guardian.co.uk/news/datablog/2009/mar/19/dwp-voice-risk-analysis-statistics accessed 4 May 2009