= Standard Chinese phonology =

The phonology of Standard Chinese has historically derived from the Beijing dialect of Mandarin. However, pronunciation varies widely among speakers, who may introduce elements of their local varieties. Television and radio announcers are chosen for their ability to affect a standard accent. The sound system has not only segments—i.e. vowels and consonants—but also tones, and each syllable has one. In addition to the four main tones, there is a neutral tone that appears on weak syllables.

This article uses the International Phonetic Alphabet (IPA) to compare the phonetic values corresponding to syllables romanized with pinyin.

==Consonants==
The sounds shown in parentheses are sometimes not analyzed as separate phonemes; for more on these, see below. Excluding these, and excluding the glides [j], [ɥ], and [w], there are 19 consonant phonemes in the inventory.

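| | Labial | Denti-alveolar | Retroflex | Alveolo-palatal | Velar |
| Nasal | m | n | | | ŋ |
| Plosive, unaspirated | p | t | | | k |
| Plosive, aspirated | pʰ | tʰ | | | kʰ |
| Affricate, unaspirated | | t͡s | ʈ͡ʂ | (t͡ɕ) | |
| Affricate, aspirated | | t͡sʰ | ʈ͡ʂʰ | (t͡ɕʰ) | |
| Fricative | f | s | ʂ | (ɕ) | x ~ h |
| Liquid | | l | ɻ ~ ʐ | | |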

Between pairs of plosives or affricates having the same place and manner of articulation, the primary distinction is not voiced vs. voiceless (as in French or Russian), but unaspirated vs. aspirated (as in Scottish Gaelic or Icelandic). The unaspirated plosives and affricates may, however, become voiced in weak syllables (see below). In pinyin, an unaspirated/aspirated pair such as /p/ and /pʰ/ is represented with b and p respectively.

More details about the individual consonant sounds are given in the following table.

| Phoneme or sound | Approximate description | Pinyin | Zhuyin | Wade–Giles* | Notes |
| /p/ | Like English p but unaspirated – as in spy | b | ㄅ | p | |
| /pʰ/ | Like an aspirated English p, as in pie | p | ㄆ | p῾ | |
| /m/ | Like English m | m | ㄇ | m | |
| /f/ | Like English f | f | ㄈ | f | |
| /t/ | Like English t but unaspirated – as in sty | d | ㄉ | t | See § Denti-alveolar and retroflex series. |
| /tʰ/ | Like an aspirated English t, as in tie | t | ㄊ | t῾ | See § Denti-alveolar and retroflex series. |
| /n/ | Like English n | n | ㄋ | n | See § Denti-alveolar and retroflex series. Can occur in the onset and/or coda of a syllable. |
| /l/ | Like English clear l, as in RP lay (never dark, i.e. velarized) | l | ㄌ | l | |
| /k/ | Like English k, but unaspirated, as in scar | g | ㄍ | k | |
| /kʰ/ | Like an aspirated English k, as in car | k | ㄎ | k῾ | |
| /ŋ/ | Like ng in English sing | ng | | ng | Occurs only in the syllable coda. |
| /x/ ([x] ~ [h]) | Varies between h in English hat and ch in Scottish loch. | h | ㄏ | h | |
| /t͡ɕ/ | Like an unaspirated English ch, but with an alveolo-palatal pronunciation | j | ㄐ | ch | See § Alveolo-palatal series. |
| /t͡ɕʰ/ | Like /t͡ɕ/ (pinyin j), but with aspiration | q | ㄑ | ch῾ | See § Alveolo-palatal series. |
| /ɕ/ | Similar to English sh, but with an alveolo-palatal pronunciation | x | ㄒ | hs | See § Alveolo-palatal series. |
| /ʈ͡ʂ/ | Similar to ch in English chat, but with a retroflex articulation and no aspiration | zh | ㄓ | ch | See § Denti-alveolar and retroflex series. |
| /ʈ͡ʂʰ/ | Like /ʈ͡ʂ/ (pinyin zh), but with aspiration | ch | ㄔ | ch῾ | See § Denti-alveolar and retroflex series. |
| /ʂ/ | Similar to English sh, but with a retroflex articulation | sh | ㄕ | sh | See § Denti-alveolar and retroflex series. |
| /ɻ/ ([ɻ] ~ [ʐ]) | Similar to English z in zoo, but with a retroflex articulation. L2 learners may pronounce it like English r, but the lips are unrounded. | r | ㄖ | j | For pronunciation in syllable-final position, see the discussion of the rhotic coda. |
| /t͡s/ | Like English ts in cats, without aspiration | z | ㄗ | ts | See § Denti-alveolar and retroflex series. |
| /t͡sʰ/ | Like /t͡s/ (pinyin z), but with aspiration | c | ㄘ | ts῾ | See § Denti-alveolar and retroflex series. |
| /s/ | Like English s, but usually with the tongue on the lower teeth. | s | ㄙ | s | See § Denti-alveolar and retroflex series. |
| *In Wade–Giles, the distinction between retroflex and alveolo-palatal affricates, which are both written as ch and ch῾, is indicated by the following vowel, since the two consonant series occur in complementary distribution; for example, chi and chü correspond to pinyin ji and ju, respectively, whereas chih and chu correspond to pinyin zhi and zhu (see § Alveolo-palatal series). | | | | | |
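
The pinyin-to-IPA correspondences in the table can be treated as a simple lookup. The following is a minimal sketch, not a full pinyin parser: the names `PINYIN_INITIAL_TO_IPA`, `split_initial`, and `initial_ipa` are hypothetical, tone marks are assumed to have been removed, and the glide spellings y and w are simply treated as "no listed initial".

```python
# Pinyin initial -> IPA value, following the pinyin column of the table above.
PINYIN_INITIAL_TO_IPA = {
    "b": "p", "p": "pʰ", "m": "m", "f": "f",
    "d": "t", "t": "tʰ", "n": "n", "l": "l",
    "g": "k", "k": "kʰ", "h": "x",
    "j": "t͡ɕ", "q": "t͡ɕʰ", "x": "ɕ",
    "zh": "ʈ͡ʂ", "ch": "ʈ͡ʂʰ", "sh": "ʂ", "r": "ɻ",
    "z": "t͡s", "c": "t͡sʰ", "s": "s",
}


def split_initial(syllable):
    """Split a toneless pinyin syllable into (initial, remainder)."""
    # Try the two-letter initials (zh, ch, sh) before the one-letter ones,
    # so that "zhang" is not split as "z" + "hang".
    for length in (2, 1):
        head = syllable[:length]
        if head in PINYIN_INITIAL_TO_IPA:
            return head, syllable[length:]
    # No listed initial: zero onset, or a glide spelled y/w (assumption).
    return "", syllable


def initial_ipa(syllable):
    """Return the IPA value of the syllable's initial, or "" if there is none."""
    initial, _ = split_initial(syllable)
    return PINYIN_INITIAL_TO_IPA.get(initial, "")
```

For example, `split_initial("zhang")` separates `zh` from `ang`, while `split_initial("ai")` returns an empty initial.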

All of the consonants may occur as the initial sound of a syllable, with the exception of /ŋ/ (unless the zero initial is assigned to this phoneme; see below). Excepting the rhotic coda, the only consonants that can appear in syllable coda (final) position are /n/ and /ŋ/ (although [m] may occur as an allophone of /n/ before labial consonants in fast speech). Final /n/ and /ŋ/ may be pronounced without complete oral closure, resulting in a syllable that in fact ends with a long nasalized vowel.
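
The coda restriction and the fast-speech assimilation described above can be sketched as a small function. This is an illustrative toy under stated assumptions: the name `realize_coda` and the `fast_speech` flag are hypothetical, the rhotic coda is left out, and the labial set is taken to be the labial initials of the inventory.

```python
# Labial initials (IPA), taken from the labial column of the inventory.
LABIAL_INITIALS = {"p", "pʰ", "m", "f"}
# Nasal codas allowed by the description above (rhotic coda not modelled).
PERMITTED_CODAS = {"n", "ŋ"}


def realize_coda(coda, next_initial="", fast_speech=False):
    """Return the surface form of a nasal coda before the next syllable's initial."""
    if coda not in PERMITTED_CODAS:
        raise ValueError(f"{coda!r} cannot stand in coda position")
    # /n/ may surface as [m] before a labial consonant in fast speech.
    if coda == "n" and fast_speech and next_initial in LABIAL_INITIALS:
        return "m"
    return coda
```

The sketch does not model the incomplete oral closure that turns a nasal coda into vowel nasalization.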

===Denti-alveolar and retroflex series===
The consonants listed in the first table above as denti-alveolar are sometimes described as alveolars, and sometimes as dentals. The affricates and the fricative are particularly often described as dentals; these are generally pronounced with the tongue on the lower teeth.

The retroflex consonants (like those of Polish) are actually apical rather than subapical, and so are considered by some authors not to be truly retroflex; they may be more accurately called post-alveolar. Some speakers not from Beijing may lack the retroflexes in their native dialects, and may thus replace them with dentals.

===Alveolo-palatal series===

The alveolo-palatal consonants (pinyin j, q, x) have standard pronunciations of [t͡ɕ, t͡ɕʰ, ɕ]. Some speakers realize them as palatalized dentals [t͡sʲ], [t͡sʰʲ], [sʲ]; this is claimed to be especially common among children and women, although officially it is regarded as substandard and as a feature specific to the Beijing dialect.

In phonological analysis, it is often assumed that, when not followed by one of the high front vowels [i] or [y], the alveolo-palatals consist of a consonant followed by a palatal glide ([j] or [ɥ]). That is, syllables represented in pinyin as beginning ji-, qi-, xi-, ju-, qu-, xu- (followed by a vowel) are taken to begin [t͡ɕj], [t͡ɕʰj], [ɕj], [t͡ɕɥ], [t͡ɕʰɥ], [ɕɥ]. The actual pronunciations are more like [t͡ɕ], [t͡ɕʰ], [ɕ], [t͡ɕʷ], [t͡ɕʰʷ], [ɕʷ] (or, for speakers using the dental variants, [t͡sʲ], [t͡sʰʲ], [sʲ], [t͡sᶣ], [t͡sʰᶣ], [sᶣ]). This is consistent with the general observation that medial glides are realized as palatalization and/or labialization of the preceding consonant (palatalization already being inherent in the case of the palatals).
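
The relation between the assumed consonant-plus-glide sequences and their surface realizations can be written out as a small mapping. This is only a sketch of the pattern described above; the function name `surface_onset` is hypothetical, and the treatment of non-palatal consonants covers only the dental variants mentioned in the text.

```python
ALVEOLO_PALATALS = ("t͡ɕ", "t͡ɕʰ", "ɕ")


def surface_onset(consonant, glide):
    """Fold a medial palatal glide into the preceding consonant."""
    is_palatal = consonant in ALVEOLO_PALATALS
    if glide == "j":
        # Palatalization is already inherent in the alveolo-palatals,
        # so the glide leaves no extra mark on them.
        return consonant if is_palatal else consonant + "ʲ"
    if glide == "ɥ":
        # On a palatal the rounded glide surfaces as labialization;
        # on a dental variant it surfaces as labio-palatalization.
        return consonant + ("ʷ" if is_palatal else "ᶣ")
    raise ValueError(f"unsupported glide: {glide!r}")
```

For instance, `surface_onset("ɕ", "ɥ")` yields the labialized form, while `surface_onset("s", "j")` yields the palatalized dental variant.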

On the above analysis, the alveolo-palatals are in complementary distribution with the dentals [t͡s, t͡sʰ, s], with the velars [k, kʰ, x], and with the retroflexes [ʈ͡ʂ, ʈ͡ʂʰ, ʂ]: none of these three series can occur before high front vowels or palatal glides, whereas the alveolo-palatals occur only before high front vowels or palatal glides. Therefore, linguists often prefer to classify [t͡ɕ, t͡ɕʰ, ɕ] not as independent phonemes, but as allophones of one of the other three series. The existence of the above-mentioned dental variants inclines some to identify the alveolo-palatals with the dentals, but identification with any of the three series is possible (unless the empty rime is identified with /i/, in which case the velars become the only candidate). The Yale and Wade–Giles systems mostly treat the alveolo-palatals as allophones of the retroflexes; Tongyong Pinyin mostly treats them as allophones of the dentals; and Mainland Chinese Braille treats them as allophones of the velars. In pinyin and bopomofo, however, they are represented as a separate series.
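
The complementary distribution can be made concrete with a short sketch. It assumes the analysis described above, works on IPA segments rather than pinyin letters (so the empty rime written i in pinyin does not count as [i]), and uses the hypothetical names `can_precede` and `reassign`.

```python
# High front vowels and palatal glides: the conditioning environment.
HIGH_FRONT = {"i", "y", "j", "ɥ"}

# Each series is listed as (unaspirated affricate/plosive, aspirated, fricative).
SERIES = {
    "alveolo-palatal": ("t͡ɕ", "t͡ɕʰ", "ɕ"),
    "dental": ("t͡s", "t͡sʰ", "s"),
    "velar": ("k", "kʰ", "x"),
    "retroflex": ("ʈ͡ʂ", "ʈ͡ʂʰ", "ʂ"),
}


def can_precede(series, next_segment):
    """True if a consonant of the given series may precede next_segment."""
    before_high_front = next_segment in HIGH_FRONT
    if series == "alveolo-palatal":
        return before_high_front
    return not before_high_front


def reassign(consonant, host_series):
    """Map an alveolo-palatal to its counterpart in the chosen host series."""
    index = SERIES["alveolo-palatal"].index(consonant)
    return SERIES[host_series][index]
```

Because the two environments never overlap, any of the three host series can be chosen: `reassign("ɕ", "velar")` returns `"x"`, mirroring the choice made in Mainland Chinese Braille, while passing `"dental"` mirrors Tongyong Pinyin.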

The alveolo-palatals arose historically from a merger of the dentals [t͡s, t͡sʰ, s] and velars [k, kʰ, x] before high front vowels and glides. Previously, some instances of modern [t͡ɕ(ʰ)i] were instead [k(ʰ)i], and others were [t͡s(ʰ)i]; distinguishing these two sources of [t͡ɕ(ʰ)i] is known as the sharp-round (尖团, jiāntuán) distinction. The change took place in the last two or three centuries at different times in different areas. This explains why some European transcriptions of Chinese names (especially in postal romanization) contain k, ts, or s where an alveolo-palatal might be expected in modern Chinese. Examples are Peking for Beijing ([kiŋ] → [tɕiŋ]), Chungking for Chongqing ([kʰiŋ] → [tɕʰiŋ]), Fukien for Fujian (cf. Hokkien), Tientsin for Tianjin ([tsin] → [tɕin]), Sinkiang for Xinjiang ([sinkiaŋ] → [ɕintɕiaŋ]), and Sian for Xi'an ([si] → [ɕi]). The complementary distribution with the retroflex series arose when syllables that had a retroflex consonant followed by a medial glide lost the medial glide.
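
The merger can be illustrated as a rewrite rule over single syllables. This is a deliberately simplified toy, not a historical reconstruction: the names `MERGER`, `TRIGGERS`, and `palatalize` are hypothetical, syllables are assumed to be pre-segmented broad IPA strings without tie bars, and only the examples given above are exercised.

```python
# Pre-merger dental or velar initial -> modern alveolo-palatal.
MERGER = {
    "k": "tɕ", "kʰ": "tɕʰ", "x": "ɕ",
    "ts": "tɕ", "tsʰ": "tɕʰ", "s": "ɕ",
}
# The merger applies only before high front vowels and palatal glides.
TRIGGERS = ("i", "y", "j", "ɥ")


def palatalize(syllable):
    """Apply the dental/velar merger to one pre-merger syllable."""
    # Check longer initials first so that "kʰ" is not read as plain "k".
    for initial in sorted(MERGER, key=len, reverse=True):
        rest = syllable[len(initial):]
        if syllable.startswith(initial) and rest.startswith(TRIGGERS):
            return MERGER[initial] + rest
    return syllable  # no triggering environment: unchanged
```

Applied to the pre-merger forms above, `palatalize("kiŋ")` gives `"tɕiŋ"` and `palatalize("tsin")` gives `"tɕin"`, while a syllable such as `"kaŋ"` is left unchanged because its vowel is not high and front.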

===Zero onset===
A full syllable such as ai, in which the vowel is not preceded by any of the standard initial consonants or glides, is said to have a null initial or zero onset. This may be realized as a consonant sound: [ʔ] and [ɣ] are possibilities, as are [ŋ] and [ɦ] in some non-standard varieties. San Duanmu has suggested that such an onset be regarded as a special phoneme, or as an instance of the phoneme /ŋ/, although it can also be treated as no phoneme (absence of onset). By contrast, in the case of the particle
