Catalan orthography

From Wikipedia, the free encyclopedia
Jump to: navigation, search

Like those of many other Romance languages, the Catalan alphabet derives from the Latin alphabet and is largely based on the language’s phonology.[1]


The Catalan alphabet consists of the twenty-six letters of the ISO basic Latin alphabet:

Majuscule Forms (also called uppercase or capital letters)
Minuscule Forms (also called lowercase or small letters)
a b c d e f g h i j k l m n o p q r s t u v w x y z

The following letter-diacritic combinations are used, but they do not constitute distinct letters in the alphabet: À, É, È, Í, Ï, Ó, Ò, Ú, Ü and Ç. K and W are only used in loanwords. Outside loanwords, the letters Q and Y appear only in the digraphs qu/ and ny.

The following table shows the letters and their names in Standard Catalan (IEC) and Standard Valencian (AVL):

Letter Catalan Valencian
Name (IEC) Pronunciation Name (AVL) Pronunciation
A a a /ˈa/ a /ˈa/
B b be, be alta /ˈbe/, /ˈbe ˈaltə/ be /ˈbe/
C c ce /ˈse/ ce /ˈse/
D d de /ˈde/ de /ˈde/
E e e /ˈe/ e /ˈe/
F f efa /ˈefə/ efe, ef /ˈefe/, /ˈef/
G g ge /ˈʒe/ ge /ˈdʒe/
H h hac /ˈak/ hac /ˈak/
I i i, i llatina /ˈi/, /ˈi ʎəˈtinə/ i, i llatina /ˈi/, /ˈi ʎaˈtina/
J j jota /ˈʒɔtə/ jota /ˈdʒɔta/
K k ca /ˈka/ ca /ˈka/
L l ela /ˈelə/ ele, el /ˈele/, /ˈel/
M m ema /ˈemə/ eme, em /ˈeme/, /ˈem/
N n ena /ˈenə/ ene, en /ˈene/, /ˈen/
O o o /ˈo/ o /ˈo/
P p pe /ˈpe/ pe /ˈpe/
Q q cu /ˈku/ cu /ˈku/
R r erra /ˈɛrə/ erre, er /ˈɛre/, /ˈɛɾ/
S s essa /ˈesə/ esse, es /ˈese/, /ˈes/
T t te /ˈte/ te /ˈte/
U u u /ˈu/ u /ˈu/
V v ve, ve baixa /ˈve/, /ˈbe ˈbaʃə/ ve /ˈve/
W w ve doble /ˈve ˈdobːlə/, /ˈbe ˈdobːlə/ ve doble /ˈve ˈdoble/
X x ics, xeix /ˈiks/, /ˈʃeʃ/ ics, xeix /ˈiks/, /ˈʃejʃ/
Y y i grega /ˈi ˈɡɾeɡə/ i grega /ˈi ˈɡɾeɡa/
Z z zeta /ˈzetə/ zeta /ˈzeta/

The names be alta ("high B") and ve baixa ("low V") are used by speakers who do not distinguish the phonemes /b/ and /v/. Speakers that do distinguish them use the simple names be and ve.

The feminine form of the letters f (efa), l (ela), m (ema), n (ena), r (erra) and s (essa) can also be used in dialectal Valencian.

Sound to spelling correspondences[edit]

Catalan is a pluricentric language, the pronunciation of some of the letters is different in Eastern Catalan (IEC) and Valencian (AVL). Apart from these variations, the pronunciation of most consonants is fairly straightforward, and similar to French, Occitan or Portuguese pronunciation.

Main letters, letters with diacritics and digraphs
Spelling Catalan Valencian
b /b/
c /k/, /s/[2]
ç /s/
d /d/
f /f/
g /ɡ/, /ʒ/[2] /ɡ/, /dʒ/[2]
gu /ɡw/, /ɡu/, /ɡ/[2]
ig[3] /tʃ/, /itʃ/
ix[3] /ʃ/, /iʃ/ /jʃ/, /iʃ/
j /ʒ/ /dʒ/, /j/
k /k/
l /l/
ll /ʎ/
l·l /l(ː)/ /l/
m /m/
n /n/
ny /ɲ/
p /p/
q /k/
qu /kw/, /k/[2]
r /ɾ/, /r/,[4] /ɾ/, /r/[4]
rr /r/
s /s/, /z/
ss /s/
t /t/
tg /dʒ/[2]
tj /dʒ/
ts /ts/, /(t)s/
tx /tʃ/
tz /dz/ /dz/, /z/
v /b/ or /v/[5] /v/
w /b/ or /v/,[5] /w/ /v/, /w/
x /ʃ/, /ks/, /ɡz/ /tʃ/, /ʃ/, /ks/, /ɡz/
z /z/
Spelling Stressed Unstressed
Catalan Valencian Catalan Valencian
a /a/ /ə/ /a/
à /a/
e /ɛ/, /e/ /ə/ /e/
è /ɛ/ /ɛ/, /e/
é /e/
i /i/, /j/[6]
í /i/
ï /i/
o /ɔ/, /o/ /u/, /w/ /o/
ò /ɔ/
ó /o/
u /u/, /w/[6]
ú /u/
ü /u/, /w/
y /i/, /j/[6]
Other letter combinations
Spelling Catalan Valencian
bb /bː/, /b/
bl /bl/, /bːl/ /bl/
bm /bm/ or /mː/
bs /ps/, /(p)s/
bt /pt/
bv /bː/ or /bv/ /bv/
cc /ks/[2]
ch* /k/, /x/, /ʃ/
cn /gn/ or /nː/, /n/
cs /ks/
ct /kt/, /t/
cz /gz/ or /ks/
dd /d(ː)/
dj /dʒ/
dq(u) /tk/ or /kː/
ds /ts/, /(t)s/
dv /db/ or /bː/ or /dv/ /dv/ (or */bv/)
ff /f/
gd /gd/
gg[7] /ʒ(ː)/ or /dʒ/,[2] /g(ː)/ /dʒ/,[2] /g(ː)/
gl /gl/, /gːl/ /gl/
gm /gm/ or /mː/
gn /gn/ or /nː/, /n/
igc /tʃs/[2]
igd /idʒd/, /igd/
igg /dʒg/, /idʒg/
igj /idʒ/ or /iʒ(ː)/ /idʒ/
igs / (t)jos /tʃ/ or /(d)ʒus/ /tʃ/ or /dʒos/
(t)ja /(d)ʒə/ /dʒa/
(t)ges /(d)ʒəs/ /dʒes/
kh /x/, /k/
lc, lch* /lk/
ld /ld/, /l/ /ld/, /lt/
lds /ls/ /l(t)s/
lls /ʎʃ/
lt /lt/, /l/ /lt/, /(l)t/
lts /ls/ /l(t)s/
mb /mb/, /m/, /m(b)/ /mb/, /mp/, /m(b)/
mbs /ms/ /m(p)s/
mm /mː/, /m/
mn /mn/, /n/
mp /mp/, /m/ /mp/
mps /ms/ /m(p)s/
nc, nch* /ŋk/
ncs /ŋks/
nd /nd/, /n/ /nd/, /nt/
nds /ns/ /n(t)s/
ng /ŋg/, /ŋ(k)/
ngg /ŋg(ː)/
ngs /ŋ(k)s/
nm /mː/
nn /nː/, /n/
nt /nt/, /n/ /nt/
ntm /mː/ /ntm/ or /mː/
nts /ns/ /n(t)s/
nv /mb/ or /nv/ /nv/
nys /ɲʃ/
pb /bː/, /b(ː)/
pm /bm/ or /mː/
pn /bn/ or /nː/, /n/
pp /p(ː)/, /p/
ps /ps/, /s/
pt /pt/, /t/
rc, rch* /ɾk/
rcs /ɾks/
rd /ɾt/, /ɾ(t)/ /ɾt/
rds /ɾ(t)s/
rs /ɾs/, /s/ /ɾs/, /(ɾ)s/
rt /ɾt/, /ɾ(t)/ /ɾt/
rts /ɾ(t)s/, /(ɾ)ts/
sc, sch* /sk/, /ʃk/, /esk/, /s/,[2]/ʃ/
sc(o)s /sk(u)s/ /sk(o)s/
sh /z/, /ʃ/ /z/, /z/ or /s/, /ʃ/
st /st/, /s(t)/, /(s)t/ /st/
st(o)s /st(u)s/ /st(o)s/
tb /db/ or /bː/
tl /dl/ or /lː/
tll (tl) /ʎː/ /l(ː)/, /ʎ/
tm /dm/ or /mː/, /mː/, /m/ /dm/ or /mː/, /m/, /m/
tn /dn/ or /nː/, /nː/ /dn/ or /nː/, /n(ː)/
th /t/, /th/ or /t/, /θ/
tsch /tʃ/
xc /(k)sk/, /ks/[2]
xh /gz/ /gz/ or /ks/
xs /ks/
xt /(k)st/
xt(o)s /(k)st(u)s/ /(k)st(o)s/


Acute and grave accents[edit]

Catalan also uses the acute accent (é í ó ú) to mark stressed close vowels and the grave accent (à è ò) to mark stressed open vowels,[8] examples:

  • també [təmˈbe] ('also')
  • pastís [pəsˈtis] ('pie')
  • córrer [ˈkorə] ('to run')
  • pallús [pəˈʎus] ('fool')
  • ànima [ˈanimə] ('soul')
  • interès [intəˈɾɛs] ('interest')
  • pròxim [ˈpɾɔksim]('nearby')

Standard rules governing the presence of accents are based on word endings and the position of the stressed syllable. In particular, accents are expected for:

  • Oxytones ending in a syllabic vowel, a vowel + -s, or -en/in, examples:
    • parlà [pərˈɫa] ('he spoke')
    • parlés [pərˈɫes] ('that he spoke' past subjunctive)
    • entén [ənˈten] ('he understands')
  • This doesn't occur in words like parleu [pərˈɫɛw] ('you are speaking' plural), or parlem [pəɾˈɫɛm] ('we are speaking').
  • Paroxytones with any other ending, including non-syllabic -i, -u, examples:
    • parlàveu [pərˈɫaβəw] ('you were speaking' plural)
    • parlàvem [pərˈɫaβəm] ('we were speaking')
  • This doesn't occur in words like parla [ˈpaɾɫə] ('he is speaking'), parles [ˈpaɾɫəs] ('you are speaking' singular), or parlen [ˈparɫən] ('they are speaking').
  • Any proparoxytones, examples:
    • química [ˈkimikə] ('chemistry')
    • ciència [siˈɛnsiə] ('science')

Since there is no need to mark the stressed syllable of a monosyllabic word, most of them do not have an accent. Exceptions to this are those with a diacritical accent that differentiates some cases of words that would otherwise be homographic. Example: es [əs] ('it' impersonal) vs és [ˈes] ('s/he is'), te [tə] ('you' clitic) vs [ˈte] ('s/he has'), mes [ˈmɛs] ('month') vs més [ˈmes] ('more'), dona [ˈdɔnə] ('woman') vs dóna [ˈdonə] ('s/he gives'). In most cases, the word bearing no accent is either unstressed (as in the case of 'es' and 'te'), or the word without the accent is more common, usually a function word.

The different distribution of open e vs closed e between Eastern Catalan and Western Catalan is reflected in some orthographic divergences between standard Catalan and Valencian norms, example: anglès [əŋˈɡɫɛs] (Catalan) vs anglés [aŋˈɡɫes] (Valencian) ('English').


The diaeresis has two different uses: to mark hiatus over ï, ü, and to mark that u is not silent in the groups gü, qü.

If a diaeresis appears over an i or u that follows another vowel, it denotes a hiatus, examples:[9]

  • raïm [rəˈim] ('grape')
  • taüt [təˈut] ('coffin')

This diaeresis is not used over a stressed vowel that already should have an accent. Examples: suís [suˈis] ('Swiss' masculine), but suïssa [suˈisə] ('Swiss' feminine), suïs [ˈsuis] ('that you sweat' subjunctive) (without the diaeresis, this last example would be pronounced [ˈsujs], i.e. as only one syllable, like reis [ˈrejs] 'kings').

Certain verb forms of verbs ending in -uir do not receive a diaeresis, although they are pronounced with separate syllables. This concerns the infinitive, gerund, future and conditional forms (for example traduir, traduint, traduiré and traduiria, all with bisyllabic [u.i]). All other forms of such verbs do receive a diaeresis on the ï according to the normal rules (e.g. traduïm, traduïa).

In addition to this, ü represents [w] after a velar consonant /ɡ/ or /k/ and a front vowel (gu and qu are used to represent a hard (i.e. velar) pronunciation before i or e).[10]

  • ungüent [uŋˈɡwen] ('ointment')
  • qüestió [kwəstiˈo] ('topic')

The verb argüir represents a rare case of the sequence [ɡu.i], and the rules for [gu] and [ui] clash in this case. The ambiguity is resolved by an additional rule, which states that in cases where diaereses would appear on two consecutive letters, only the second receives one. This thus gives arguïm and arguïa, but argüir, argüint and argüiré as these forms don't receive a diaeresis on the i normally, according to the exception above.

Ce trencada (c-cedille)[edit]

Catalan ce trencada (Ç ç), literally in English 'broken cee', is a modified c with a cedilla mark ( ¸ ). It is only used before a u o to indicate a "soft c" /s/, much like in Portuguese, Occitan or French (e.g. compare coça [ˈkosə] 'kick', coca [ˈkokə] 'cake' and cosa [ˈkɔzə] 'thing'). In Catalan, ce trencada also appears as last letter of a word when preceded by any vowel (e.g. feliç [fəˈɫis] 'happy'), but then ç may be voiced to [z] before vowels and voiced consonants, e.g. feliçment [fəˈɫizmen] ('happily') and braç esquerre [ˈbɾaz əsˈkɛrə] ('left arm').

Punt volat (middot)[edit]

The so-called punt volat or middot is only used in the group l·l (called ela or el(e) geminada, 'geminate el') to represent a geminated sound /lː/, as ll is used to represent the palatal lateral /ʎ/. This usage of the middot sign is a recent invention from the beginning of twentieth century (in medieval and modern Catalan, before Fabra's standardization, this symbol was sometimes used to note certain elisions, especially in poetry). The only (and improbable) case of ambiguity in the whole language that could arise is the pair cel·la [ˈsɛɫɫə] ('cell') vs cella [ˈsɛʎə] ('eyebrow').


Catalan does not capitalize the days of the week, months, or national adjectives.[11]

dilluns, setembre, anglès
"Monday", "September", "English"


The Catalan punctuation rules are similar to English, with some minor differences.[12]

  • Guillemets (cometes baixes) « » are frequently used instead of double inverted commas. They are used to mark titles of works, or phrases used as proper names.[12]
  • In texts containing dialogue, quoted speech is usually set off with dashes, rather than inverted commas.[12]
Què proposes, doncs?
El que hauriem de fer‒s'atreví a suggerir‒és anar a...
'What do you propose, then?'
'What we should do' she ventured to suggest 'is go to and ...'
  • Questions are ended with ?, as in English.[12] Before 1993, questions could be enclosed with ¿...?, as in Spanish, but this is no longer recommended by the IEC.[12]

Other conventions[edit]

The distribution of the two rhotics /r/ and /ɾ/ closely parallels that of Spanish. Between vowels, the two contrast but they are otherwise in complementary distribution: in the onset, an alveolar trill, [r], appears unless preceded by a consonant; different dialects vary in regards to rhotics in the coda with Western Catalan generally featuring an alveolar tap, [ɾ], and Central Catalan dialects like those of Barcelona or Girona featuring a weakly trilled [r] unless it precedes a vowel-initial word in the same prosodic unit, in which case [ɾ] appears.[13]

In Eastern Catalan and North Western Catalan, most instances of word-final r are silent, but there are plenty of unpredictable exceptions (e.g. in Central Eastern Catalan por [ˈpo] 'fear' but mar [ˈmar] 'sea'). In Central Eastern Catalan monosyllabic words with a pronounced final r get a reinforcement final consonant [t] when in absolute final position (e.g. final r of cor ('heart') in reina del meu cor [ˈrejnə ðəɫ ˈmew ˈkɔrt] 'queen of my heart' vs el cor es mou [əɫ ˈkɔɾ əs ˈmɔw] 'the heart is moving').

In Valencian, most instances of word-final r are pronounced.

See also[edit]


  1. ^ Wheeler (2005:6)
  2. ^ a b c d e f g h i j k l Before i or e.
  3. ^ a b At the end of a syllable only.
  4. ^ a b At the beginning of a word.
  5. ^ a b In many dialects (except Balearic, Alguerese and standard Valencian [see Carbonell & Llisterri (1992:53)]) /v/ has merged into /b/.
  6. ^ a b c After another vowel.
  7. ^ [ Pronunciation of 'gg']
  8. ^ Wheeler (2005:6)
  9. ^ Wheeler (2005:8)
  10. ^ Wheeler (2005:7–8)
  11. ^ Swan 2001, p. 97.
  12. ^ a b c d e Wheeler, Yates & Dols 1999, p. 620.
  13. ^ Padgett (2003:2)


  • Carbonell, Joan F.; Llisterri, Joaquim (1992), "Catalan", Journal of the International Phonetic Association 22 (1-2): 53–56, doi:10.1017/S0025100300004618 
  • Padgett, Jaye (2003). Systemic contrast and Catalan rhotics. University of California, Santa Cruz. 
  • Wheeler, Max W.; Yates, Alan; Dols, Nicolau (1999). Catalan: A Comprehensive Grammar. New York: Routledge. ISBN 0-415-20777-0. 
  • Wheeler, Max W. (2005). The Phonology Of Catalan. Oxford: Oxford University Press. ISBN 0-19-925814-7.