Speech production is the process by which spoken words are selected to be produced, have their phonetics formulated and then finally are articulated by the motor system in the vocal apparatus. Speech production can be spontaneous such as when a person creates the words of a conversation, reaction such as when they name a picture or read aloud a written word, or a vocal imitation such as in speech repetition.
In ordinary fluent conversation people pronounce each second roughly four syllables, ten or twelve phonemes and two to three words out of a vocabulary that can contain 10 to 100 thousand words. Errors in speech production are relatively rare occurring at a rate of about once in every 900 words in spontaneous speech. Words that are commonly spoken or learned early in life or easily imagined are quicker to say than ones that are rarely said, learnt later in life or abstract.
Normally speech is created with pulmonary pressure provided by the lungs that generates sound by phonation in the glottis in the larynx that then is modified by the vocal tract into different vowels and consonants. However speech production can occur without the use of the lungs and glottis in alaryngeal speech by using the upper parts of the vocal trait. An example of such alaryngeal speech is Donald Duck talk.
The first is the processes of conceptualization or conceptual preparation, in which the intention to create speech links a desired concept to a particular spoken word to be expressed. Here the preverbal intended messages are formulated that specify the concepts to be verbally expressed.
The second stage is formulation in which the linguistic form required for the expression of the desired message is created. Formulation includes grammatical encoding, morpho-phonological encoding, and phonetic encoding. Grammatical encoding is the process of selecting the appropriate syntactic word or lemma. The selected lemma then activates the appropriate syntactic frame for the conceptualized message. Morpho-phonological encoding is the process of breaking words down into syllables to be produced in overt speech. This syllabification is dependent on the preceding and proceeding words, for instance: I-com-pre-hend vs. I-com-pre-hen-dit. The final part of the formulation stage is phonetic encoding. This involves the activation of articulatory gestures dependent on the syllables selected in the morpho-phonological process, creating an articulatory score as the utterance is pieced together and the order of movements of the vocal apparatus is completed.
The third stage of speech production is articulation which is the execution of the articulatory score by the lungs, glottis, larynx, tongue, lips, jaw and other parts of the vocal apparatus resulting in overt speech.
Speech production motor control in right handers depends mostly upon areas in the left cerebral hemisphere. These areas include the bilateral supplementary motor area, the left posterior inferior frontal gyrus, the left insula, the left Primary motor cortex and temporal cortex. There are also subcortical areas involved such as the basal ganglia and cerebellum. The cerebellum aids the sequencing of speech syllables into fast, smooth and rhythmically organized words and longer utterances.
Speech production can be affected by several disorders:
- KE family
- Neurocomputational speech processing
- Silent speech interface
- Speech science
- Levelt, WJ (1999). "Models of word production.". Trends in Cognitive Sciences 3 (6): 223–232. doi:10.1016/S1364-6613(99)01319-4. PMID 10354575.
- Garnham, A, Shillcock RC, Brown GDA, Mill AID, Culter A (1981). "Slips of the tongue in the London–Lund corpus of spontaneous conversation". Linguistics 19 (7–8): 805–817. doi:10.1515/ling.1981.19.7-8.805.
- Oldfield RC, Wingfield A (1965). "Response latencies in naming objects". Quarterly Journal of Experimental Psychology 17 (4): 273–281. doi:10.1080/17470216508416445. PMID 5852918.
- Bird, H; Franklin, S; Howard, D (2001). "Age of acquisition and imageability ratings for a large set of words, including verbs and function words". Behavior Research Methods, Instruments, & Computers 33 (1): 73–9. doi:10.3758/BF03195349. PMID 11296722.
- Weinberg, B; Westerhouse, J (1971). "A study of buccal speech". Journal of Speech and Hearing Research 14 (3): 652–8. Bibcode:1972ASAJ...51Q..91W. doi:10.1121/1.1981697. PMID 5163900.
- McNeill D (2005). Gesture and Thought. University of Chicago Press. ISBN 978-0-226-51463-5.
- Levelt, WJM (1989). Speaking: From Intention to Articulation. MIT Press. ISBN 978-0-262-62089-5.
- Jescheniak, JD; Levelt, WJM (1994). "Word frequency effects in speech production: retrieval of syntactic information and of phonological form". Journal of Experimental Psychology: Learning, Memory, and Cognition 20 (4): 824–843. doi:10.1037/0278-7322.214.171.1244. CiteSeerX: 10.1.1.133.3919.
- Levelt, W. (1999). "The neurocognition of language", p.87 -117. Oxford Press
- Indefrey, P; Levelt, WJ (2004). "The spatial and temporal signatures of word production components". Cognition 92 (1–2): 101–44. doi:10.1016/j.cognition.2002.06.001. PMID 15037128.
- Booth, JR; Wood, L; Lu, D; Houk, JC; Bitan, T (2007). "The role of the basal ganglia and cerebellum in language processing". Brain Research 1133 (1): 136–44. doi:10.1016/j.brainres.2006.11.074. PMC 2424405. PMID 17189619.
- Ackermann, H (2008). "Cerebellar contributions to speech production and speech perception: psycholinguistic and neurobiological perspectives". Trends in Neurosciences 31 (6): 265–72. doi:10.1016/j.tins.2008.02.011. PMID 18471906.
- Gow DW (June 2012). "The cortical organization of lexical knowledge: a dual lexicon model of spoken language processing". Brain Lang 121 (3): 273–88. doi:10.1016/j.bandl.2012.03.005. PMC 3348354. PMID 22498237.
- Hickok G (2012). "The cortical organization of speech processing: feedback control and predictive coding the context of a dual-stream model". J Commun Disord 45 (6): 393–402. doi:10.1016/j.jcomdis.2012.06.004. PMC 3468690. PMID 22766458.
- Hickok G, Houde J, Rong F (February 2011). "Sensorimotor integration in speech processing: computational basis and neural organization". Neuron 69 (3): 407–22. doi:10.1016/j.neuron.2011.01.019. PMC 3057382. PMID 21315253.
- Hickok G, Poeppel D (2004). "Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language". Cognition 92 (1-2): 67–99. doi:10.1016/j.cognition.2003.10.011. PMID 15037127.
- Poeppel D, Emmorey K, Hickok G, Pylkkänen L (October 2012). "Towards a new neurobiology of language". J. Neurosci. 32 (41): 14125–31. doi:10.1523/JNEUROSCI.3244-12.2012. PMC 3495005. PMID 23055482.
- Price CJ (August 2012). "A review and synthesis of the first 20 years of PET and fMRI studies of heard speech, spoken language and reading". Neuroimage 62 (2): 816–47. doi:10.1016/j.neuroimage.2012.04.062. PMC 3398395. PMID 22584224.
- Stout D, Chaminade T (January 2012). "Stone tools, language and the brain in human evolution". Philos. Trans. R. Soc. Lond., B, Biol. Sci. 367 (1585): 75–87. doi:10.1098/rstb.2011.0099. PMC 3223784. PMID 22106428.