Jump to content

Speech tempo: Difference between revisions

From Wikipedia, the free encyclopedia
Content deleted Content added
Reduce white space
Nobody has made objection, so here is my rewrite
Line 1: Line 1:
=Tempo of Speech=
{{unreferenced|date=December 2011}}


Speech Tempo is a measure of the number of speech units of a given type produced within a given amount of time. A common measure is that of [[Syllable|syllables]] per second. Speech tempo is believed to vary within the speech of one person according to contextual and emotional factors, between speakers and also between different languages and dialects. However, there are many problems involved in investigating this variance scientifically.
'''Tempo of speech''' is the relative speed or slowness of utterance which is measured by the rate of [[syllable]] succession and the number and duration of pauses in a sentence. The rate of speaking varies constantly. When two strongly stressed syllables stay close together, it is slower; when they are separated by unstressed syllables the speed is faster. Differences of rate are used to help the listener to differentiate the more important(slow rate)and the less important(fast rate)parts of the utterance. Variations of rate and pausing are closely connected with different [[Phonetics|phonetic]] styles, shades of meaning and the structure of the [[Intonation (linguistics)|intonation]] group.

===Problems of definition===
While most people seem to believe that they can judge how quickly someone is speaking, it is generally said that subjective judgements and opinions cannot serve as scientific evidence for statements about speech tempo; J. Laver has written that analyzing tempo can be "dangerously open to subjective bias ... listeners' judgements rapidly begin to lose objectivity when the utterance concerned comes either from an unfamiliar accent or .. from an unfamiliar language".<ref>{{cite book|last=Laver|first=John|title=Principles of Phonetics|date=1994|publisher=Cambridge|page=542}}</ref> Scientific observation depends on accurate segmenting of recorded speech along the time course of an utterance, usually using one of the acoustic analysis software tools available on the internet such as [[Audacity (audio editor)|Audacity]] <ref>{{cite web|title=Audacity|url=http://audacity.sourceforge.net|accessdate=2 April 2014}}</ref> or, specifically for speech research, [[Praat]],<ref>{{cite web|last=Boersma|first=Paul|title=Praat|url=http://www.fon.hum.uva.nl/praat/|accessdate=2 April 2014}}</ref> [[SIL International|SIL]] Speech Analyzer <ref>{{cite web|last=SIL|title=Speech Analyzer|url=http://www-01.sil.org/computing/sa/|accessdate=2 April 2014}}</ref> or SFS.<ref>{{cite web|title=SFS|url=http://www.phon.ucl.ac.uk/resource/sfs/|publisher=UCL|accessdate=2 April 2014}}</ref>

Measurements of speech tempo can be strongly affected by pauses and hesitations. For this reason, it is usual to distinguish between speech tempo ''including'' pauses and hesitations and speech tempo ''excluding'' them. The former is called '''Speaking rate''' and the latter '''Articulation rate'''.<ref>{{cite book|last=Laver|first=John|title=Principles of Phonetics|date=1994|publisher=Cambridge|isbn=0-521-45655-X|page=158}}</ref>

Various units of speech have been used as a basis for measurement. The traditional measure of speed in typing and [[Morse code]] transmission has been words per minute (wpm). However, in the study of speech the [[word]] is not well defined (being primarily a unit of [[grammar]]), and speech is not usually temporally stable over a period as long as a minute. Many studies have used the measure of [[syllable|syllables]] per second, but this is not completely reliable because, although the syllable as a phonological unit of a given language is well-defined, it is not always possible to get agreement on the phonetic syllable. For example, the English word 'particularly' in the form in which it occurs in dictionaries is, phonologically speaking, composed of five syllables /pə.tɪk.jə.lə.li/. Phonetic realizations of the word, however, may be heard as comprising five [pə.tɪk.jə.lə.li], four [pə.tɪk.jə.li], three [pə.tɪk.li] or even two syllables [ptɪk.li], and listeners are likely to have different opinions about the number of syllables heard.

An alternative measure that has been proposed is that of sounds per second. One study found rates varying from an average of 9.4 sounds per second for poetry reading to 13.83 per second for sports commentary.<ref>{{cite journal|last=Fonagy|first=I.|coauthors=K. Magdics|title=Speed of utterance in phrases of different length|journal=Language and Speech|date=1960|volume=4|pages=179-192}}</ref> The problem with this approach is that the researcher must be clear as to whether the "sounds" s/he is counting are [[phoneme|phonemes]] or physically observable phonetic units (sometimes called "phones"). As an example, the utterance 'Don't forget to record it' might in slow, careful speech be pronounced /dəʊnt fəget tə rɪkɔːd ɪt/, with 19 phonemes, each of which is phonetically realized. When the sentence is said at high speed it might be pronounced as [də̃ʊ̃ʔ fɡeʔtrɪkɔːd ɪt], with 16 units. If we are counting only units that can be observed and measured, it is clear that at faster speeds of utterance the number of sounds produced per second does not necessarily increase.<ref>{{cite book|last=Roach|first=P.|title=Some languages are spoken more quickly than others|date=1998}}</ref>

===Within-speaker variability===
Speakers vary their speed of speaking according to contextual and physical factors. A typical speaking rate for English is 4 syllables per second,<ref>{{cite book|last=Cruttenden|first=A.|title=Gimson's Pronunciation of English|date=2014|publisher=Routledge|page=54}}</ref> but in different emotional or social contexts the rate may vary, one study reporting a range between 3.3 and 5.9 syl/sec,<ref>{{cite journal|last=Arnfield|first=S.|coauthors=Roach, Setter, Greasley and Horton|title=Emotional stress and speech tempo variability|journal=Proceedings of the ESCA/NATO Workshop on Speech Under Stress|date=1995|pages=13-15}}</ref> Another study found significant differences in speaking rate between story-telling and taking part in an interview.<ref>{{cite journal|last=Kowal|first=S.|coauthors=Wiese and O'Donnell|title=The use of time in storytelling|journal=Language and Speech|date=1983|volume=26.4|pages=377-392}}</ref>

Speech tempo may be regarded as one of the components of [[prosody]]. Possibly the most detailed analytical framework for the role of tempo in English prosody is that of Crystal<ref>{{cite book|last=Crystal|first=David|title=Prosodic Systems and Intonation in English|date=1976|publisher=Cambridge|pages=152-156}}</ref> . His system, which uses terms mostly borrowed from musical usage, allows for ''simple'' variation away from normal in tempo, where monosyllables may be pronounced as "clipped", "drawled" or "held" and polysyllabic utterances may be spoken at "allegro", "allegrissimo", "lento" and "lentissimo". ''Complex'' variation includes "accelerando" and "rallentando". Crystal claims that "... tempo has probably the most highly discrete grammatical function of all prosodic parameters other than pitch ...". He cites from his [[speech corpus|corpus]]-based analysis instances of increased tempo in cases of speakers' self-corrections of speech errors, and in citing embedded material in the form of titles and names (e.g. "I'm sorry, but we won't be able to to start ''So you think you know what's happening'' for a few moments" and "This is the ''I'll show you a picture and you tell me what it is'' technique" (where the italicized text is spoken at faster tempo).

===Between-language differences===
Subjective impressions of tempo differences between different languages and dialects are difficult to substantiate with scientific data.<ref>{{cite book|last=Roach|first=P.|title=Some languages are spoken more quickly than others|date=1998}}</ref> Counting syllables per second will result in differences caused by the different syllable structures found in different languages; many languages have a predominantly CV (consonant+vowel) syllable structure while English syllables may begin with up to 3 consonants and end with up to 4. Consequently it is likely that a Japanese speaker can produce more syllables in their language per second than an English speaker can in theirs. Counting sounds per second is also problematical for the reason mentioned above, i.e. that the researcher needs to be sure what objects it is that s/he is counting.

Howard Giles has studied the relationship between perceived tempo and perceived competence of speakers of different accents of English, and found a positive linear relationship between the two (i.e. people who speak faster are perceived as more competent).<ref>{{cite book|last=Giles|first=Howard|title=Speech tempo|date=1992|publisher=Oxford|location=in W. Bright (ed.) Oxford international Encyclopedia of Linguistics}}</ref>

Osser and Peng counted sounds per second for Japanese and English and found no significant difference.<ref>{{cite journal|last=Osser|first=H.|coauthors=Peng, F.|title=A cross-cultural study of speech rate|journal=Language and Speech|date=1964|volume=7|pages=120-125}}</ref> The study by Kowal et al, referred to above, comparing story-telling with speaking in an interview, looked at English, Finnish, French, German and Spanish. They found no significant differences in rate between the languages, but highly significant differences between the speaking styles. Similarly, Barik found that differences in tempo between French and English were due to speaking style rather than to the language.<ref>{{cite journal|last=Barik|first=H.C.|title=Cross-linguistic study of temporal characteristics of different types of speech material|journal=Language and Speech|date=1977|volume=20|pages=116-126}}</ref> From the point of view of the perception of tempo differences between languages, Vaane used spoken Dutch, English, French, Spanish and Arabic produced at three different rates and found that untrained and phonetically trained listeners performed equally well at judging the rate of speaking for familiar and unfamiliar languages.<ref>{{cite journal|last=Vaane|first=E.|title=Subjective estimation of speech rate|journal=Phonetica|date=1982|volume=39|pages=136-149}}</ref>

In the absence of reliable evidence to support it, it seems that the widespread view that some languages are spoken more rapidly than others is an illusion. This illusion may well be related to other factors such as differences of [[isochrony|rhythm]] and [[speech disfluency|pausing]].

===Bibliography===

*Roach, P. (1998). 'Some languages are spoken more quickly than others', in L. Bauer and P. Trudgill (eds) ''Language Myths'', pp. 150-158. ISBN 014-02-6023-4 [http://www.personal.reading.ac.uk/~llsroach/phon2/tempopr.htm]
*Zellner, B. (1994). Pauses and the temporal structure of speech, in E. Keller (Ed.) ''Fundamentals of speech synthesis and speech recognition''. (pp. 41-62). Chichester: John Wiley [http://www.cogprints.org/884/3/Zellner.SpeechPauses.pdf]

==References==
<references />


Pause is an act of stopping in the flow of speech. In [[English language|English]] there are three main degrees of pauses: unit pause, double pause and treble pause. The unit pause is the interval of an individual's rhythm cycle from one syllable to the next. It is used to separate intonation groups. The double pause is twice as long as the unit pause: it is used to separate sentences. The [[Treble (sound)|treble]] pause, which is about three times as long as the unit pause, is used to separate [[paragraphs]].





Revision as of 11:06, 2 November 2014

Tempo of Speech

Speech Tempo is a measure of the number of speech units of a given type produced within a given amount of time. A common measure is that of syllables per second. Speech tempo is believed to vary within the speech of one person according to contextual and emotional factors, between speakers and also between different languages and dialects. However, there are many problems involved in investigating this variance scientifically.

Problems of definition

While most people seem to believe that they can judge how quickly someone is speaking, it is generally said that subjective judgements and opinions cannot serve as scientific evidence for statements about speech tempo; J. Laver has written that analyzing tempo can be "dangerously open to subjective bias ... listeners' judgements rapidly begin to lose objectivity when the utterance concerned comes either from an unfamiliar accent or .. from an unfamiliar language".[1] Scientific observation depends on accurate segmenting of recorded speech along the time course of an utterance, usually using one of the acoustic analysis software tools available on the internet such as Audacity [2] or, specifically for speech research, Praat,[3] SIL Speech Analyzer [4] or SFS.[5]

Measurements of speech tempo can be strongly affected by pauses and hesitations. For this reason, it is usual to distinguish between speech tempo including pauses and hesitations and speech tempo excluding them. The former is called Speaking rate and the latter Articulation rate.[6]

Various units of speech have been used as a basis for measurement. The traditional measure of speed in typing and Morse code transmission has been words per minute (wpm). However, in the study of speech the word is not well defined (being primarily a unit of grammar), and speech is not usually temporally stable over a period as long as a minute. Many studies have used the measure of syllables per second, but this is not completely reliable because, although the syllable as a phonological unit of a given language is well-defined, it is not always possible to get agreement on the phonetic syllable. For example, the English word 'particularly' in the form in which it occurs in dictionaries is, phonologically speaking, composed of five syllables /pə.tɪk.jə.lə.li/. Phonetic realizations of the word, however, may be heard as comprising five [pə.tɪk.jə.lə.li], four [pə.tɪk.jə.li], three [pə.tɪk.li] or even two syllables [ptɪk.li], and listeners are likely to have different opinions about the number of syllables heard.

An alternative measure that has been proposed is that of sounds per second. One study found rates varying from an average of 9.4 sounds per second for poetry reading to 13.83 per second for sports commentary.[7] The problem with this approach is that the researcher must be clear as to whether the "sounds" s/he is counting are phonemes or physically observable phonetic units (sometimes called "phones"). As an example, the utterance 'Don't forget to record it' might in slow, careful speech be pronounced /dəʊnt fəget tə rɪkɔːd ɪt/, with 19 phonemes, each of which is phonetically realized. When the sentence is said at high speed it might be pronounced as [də̃ʊ̃ʔ fɡeʔtrɪkɔːd ɪt], with 16 units. If we are counting only units that can be observed and measured, it is clear that at faster speeds of utterance the number of sounds produced per second does not necessarily increase.[8]

Within-speaker variability

Speakers vary their speed of speaking according to contextual and physical factors. A typical speaking rate for English is 4 syllables per second,[9] but in different emotional or social contexts the rate may vary, one study reporting a range between 3.3 and 5.9 syl/sec,[10] Another study found significant differences in speaking rate between story-telling and taking part in an interview.[11]

Speech tempo may be regarded as one of the components of prosody. Possibly the most detailed analytical framework for the role of tempo in English prosody is that of Crystal[12] . His system, which uses terms mostly borrowed from musical usage, allows for simple variation away from normal in tempo, where monosyllables may be pronounced as "clipped", "drawled" or "held" and polysyllabic utterances may be spoken at "allegro", "allegrissimo", "lento" and "lentissimo". Complex variation includes "accelerando" and "rallentando". Crystal claims that "... tempo has probably the most highly discrete grammatical function of all prosodic parameters other than pitch ...". He cites from his corpus-based analysis instances of increased tempo in cases of speakers' self-corrections of speech errors, and in citing embedded material in the form of titles and names (e.g. "I'm sorry, but we won't be able to to start So you think you know what's happening for a few moments" and "This is the I'll show you a picture and you tell me what it is technique" (where the italicized text is spoken at faster tempo).

Between-language differences

Subjective impressions of tempo differences between different languages and dialects are difficult to substantiate with scientific data.[13] Counting syllables per second will result in differences caused by the different syllable structures found in different languages; many languages have a predominantly CV (consonant+vowel) syllable structure while English syllables may begin with up to 3 consonants and end with up to 4. Consequently it is likely that a Japanese speaker can produce more syllables in their language per second than an English speaker can in theirs. Counting sounds per second is also problematical for the reason mentioned above, i.e. that the researcher needs to be sure what objects it is that s/he is counting.

Howard Giles has studied the relationship between perceived tempo and perceived competence of speakers of different accents of English, and found a positive linear relationship between the two (i.e. people who speak faster are perceived as more competent).[14]

Osser and Peng counted sounds per second for Japanese and English and found no significant difference.[15] The study by Kowal et al, referred to above, comparing story-telling with speaking in an interview, looked at English, Finnish, French, German and Spanish. They found no significant differences in rate between the languages, but highly significant differences between the speaking styles. Similarly, Barik found that differences in tempo between French and English were due to speaking style rather than to the language.[16] From the point of view of the perception of tempo differences between languages, Vaane used spoken Dutch, English, French, Spanish and Arabic produced at three different rates and found that untrained and phonetically trained listeners performed equally well at judging the rate of speaking for familiar and unfamiliar languages.[17]

In the absence of reliable evidence to support it, it seems that the widespread view that some languages are spoken more rapidly than others is an illusion. This illusion may well be related to other factors such as differences of rhythm and pausing.

Bibliography

  • Roach, P. (1998). 'Some languages are spoken more quickly than others', in L. Bauer and P. Trudgill (eds) Language Myths, pp. 150-158. ISBN 014-02-6023-4 [1]
  • Zellner, B. (1994). Pauses and the temporal structure of speech, in E. Keller (Ed.) Fundamentals of speech synthesis and speech recognition. (pp. 41-62). Chichester: John Wiley [2]

References

  1. ^ Laver, John (1994). Principles of Phonetics. Cambridge. p. 542.
  2. ^ "Audacity". Retrieved 2 April 2014.
  3. ^ Boersma, Paul. "Praat". Retrieved 2 April 2014.
  4. ^ SIL. "Speech Analyzer". Retrieved 2 April 2014.
  5. ^ "SFS". UCL. Retrieved 2 April 2014.
  6. ^ Laver, John (1994). Principles of Phonetics. Cambridge. p. 158. ISBN 0-521-45655-X.
  7. ^ Fonagy, I. (1960). "Speed of utterance in phrases of different length". Language and Speech. 4: 179–192. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  8. ^ Roach, P. (1998). Some languages are spoken more quickly than others.
  9. ^ Cruttenden, A. (2014). Gimson's Pronunciation of English. Routledge. p. 54.
  10. ^ Arnfield, S. (1995). "Emotional stress and speech tempo variability". Proceedings of the ESCA/NATO Workshop on Speech Under Stress: 13–15. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  11. ^ Kowal, S. (1983). "The use of time in storytelling". Language and Speech. 26.4: 377–392. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  12. ^ Crystal, David (1976). Prosodic Systems and Intonation in English. Cambridge. pp. 152–156.
  13. ^ Roach, P. (1998). Some languages are spoken more quickly than others.
  14. ^ Giles, Howard (1992). Speech tempo. in W. Bright (ed.) Oxford international Encyclopedia of Linguistics: Oxford.
  15. ^ Osser, H. (1964). "A cross-cultural study of speech rate". Language and Speech. 7: 120–125. {{cite journal}}: Unknown parameter |coauthors= ignored (|author= suggested) (help)
  16. ^ Barik, H.C. (1977). "Cross-linguistic study of temporal characteristics of different types of speech material". Language and Speech. 20: 116–126.
  17. ^ Vaane, E. (1982). "Subjective estimation of speech rate". Phonetica. 39: 136–149.