What is a syllable?

Definition
	A syllable is a unit of sound composed of a central peak of sonority (usually a vowel) and the consonants that cluster around this central peak.

Discussion
	Syllable structure, which is the combination of allowable segments and typical sound sequences, is language specific.

Parts

Parts	Description	Optionality
Onset	Initial segment of a syllable	Optional
Rhyme	Core of a syllable, consisting of a nucleus and coda (see below)	Obligatory
– Nucleus	Central segment of a syllable	Obligatory
– Coda	Closing segment of a syllable	Optional

Example (English)
	Here is an example of the syllable structure of the English word limit:

Kinds

Here are some kinds of syllables:

Kind	Description	Example
Heavy	Has a branching rhyme. All syllables with a branching nucleus (long vowels) are considered heavy. Some languages treat syllables with a short vowel (nucleus) followed by a consonant (coda) as heavy.	CV:C, CVCC, CVC
Light	Has a non-branching rhyme (short vowel). Some languages treat syllables with a short vowel (nucleus) followed by a consonant (coda) as light.	CV, CVC
Closed	Ends with a consonant coda.	CVC, CVCC, VC
Open	Has no final consonant.	CV
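
The parts and kinds above can be illustrated with a small sketch. It assumes the template notation of the Example column (C for a consonant, V for a short vowel, V: for a long vowel). The function name `describe_syllable` is hypothetical, and the rule that a coda of two or more consonants counts as heavy is an assumption read off the examples, not a claim about any particular language.

```python
import re


def describe_syllable(template: str) -> dict:
    """Split a CV template such as 'CV:C' into parts and classify it.

    Assumed notation: 'C' = consonant, 'V' = short vowel, 'V:' = long vowel.
    """
    match = re.fullmatch(r"(C*)(V:?)(C*)", template)
    if match is None:
        raise ValueError(f"not a single-syllable template: {template!r}")
    onset, nucleus, coda = match.groups()
    if nucleus == "V:" or len(coda) > 1:
        # Long vowel, or a multi-consonant coda (assumption based on CVCC).
        weight = "heavy"
    elif coda:
        # Short vowel plus one coda consonant: heavy in some languages,
        # light in others.
        weight = "heavy or light (language specific)"
    else:
        weight = "light"
    return {
        "onset": onset,
        "nucleus": nucleus,
        "coda": coda,
        "closure": "closed" if coda else "open",
        "weight": weight,
    }


for template in ["CV", "CVC", "CV:C", "CVCC", "VC"]:
    print(template, describe_syllable(template))
```

The onset and coda fields may be empty, which mirrors their optional status in the Parts table; the nucleus is always present.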

Diagram
	Here is a diagram of a syllable:

Syllable Structure and the Distribution of Phonemes in English Syllables

In describing the phonotactics (patterning of phonemes) of English syllables, linguists have focused on absolute restrictions concerning which phonemes may occupy which slots of the syllable. To determine whether probabilistic patterns also exist, we analyzed the distributions of phonemes in a reasonably comprehensive list of uninflected English CVC (consonant–vowel–consonant) words, some 2,001 words in all. The results showed that there is a significant connection between the vowel and the following consonant (coda), with certain vowel-coda combinations being more frequent than expected by chance. In contrast, we did not find significant associations between the initial consonant (onset) and the vowel. These findings support the idea that English CVC syllables are composed of an onset and a vowel–coda rime. Implications for lexical processing are discussed.

Linguists have often observed absolute restrictions in the patterning of phonemes in syllables. For example, it is often noted that /h/ can occur only at the start of an English syllable and that /N/ can occur only at the end. (See Tables 1 and 2 for an explanation of the phonetic symbols.) By the same token, certain combinations of phonemes occur in the language, whereas others do not. In the General American English accent, /A/ can occur before /r/ at the end of a syllable (car), but /æ/ cannot, and this rule has no exceptions. In some languages, there are many more such gaps or restrictions at the end of the syllable than at the beginning, and such asymmetry has been accepted as evidence that the syllable has a particular kind of internal structure (Goldsmith, 1990, pp. 123-127). In English, however, it is not so obvious that the end of the syllable has many more restrictions than the beginning. As a result, there has been some debate as to whether there is enough imbalance in phonotactic constraints to suggest internal syllable structure.
This impasse can be broken, in our view, by abandoning the idea that only absolute, inviolable restrictions are worthy of note. We take a probabilistic approach in the present study by asking whether some consonants occur in certain positions of the syllable more or less often than expected by chance, and whether some legal combinations occur more or less often than expected by chance. For example, among the words we consider, we find that the sequence /Vf/ occurs much more often than one would expect from the frequency of /V/ and /f/ considered independently, and /æl/ occurs much less often than expected. Even though there are several words that end in /æl/, their unusually low frequency suggests a phonotactic restriction of a nonabsolute, violable kind. Broadening our purview to include restrictions of this kind gives a clearer picture of the phonotactic patterns of English. In the present study, we use statistical techniques to examine the patterns of phoneme co-occurrences in the CVC words of English: single morphemes that consist of a vowel flanked by exactly one consonant on each side.
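
The notion of "expected by chance" can be made concrete with a short sketch. It assumes, as an illustration rather than a statement of the exact computation used in the study, that vowel and coda are chosen independently, so that the expected number of words with a given vowel-coda pair is the product of the two marginal counts divided by the number of words. The function name `expected_count` is hypothetical; the counts are the vowel frequencies of Table 1 and the coda frequencies of Table 3.

```python
def expected_count(vowel_count: int, coda_count: int, n_words: int) -> float:
    """Expected number of words with a vowel-coda pair under independence."""
    return vowel_count * coda_count / n_words


N_WORDS = 2001  # size of the CVC word list

# Vowel counts come from Table 1, coda counts from Table 3.
print(round(expected_count(155, 68, N_WORDS), 2))   # /V/ followed by /f/
print(round(expected_count(198, 230, N_WORDS), 2))  # /æ/ followed by /l/
```

Under this assumption, about 5 words ending in /Vf/ and about 23 ending in /æl/ would be expected, and these are the baselines against which observed counts are judged.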

Relevance of Syllable Phonotactics

A statistical study of syllabic phonotactics is motivated by several considerations. On one level, we are interested in exploring the validity of quantitative approaches to language in general. Some important schools of linguistics hold that language is best described in terms of symbols and categories, and that quantitative tendencies that cannot be reduced to a system of categorical rules are merely accidents that are irrelevant to language as a systematic entity. Davis (1985), for example, rejected all arguments for internal syllable structure, on the grounds that he was able to find exceptions, however rare, to all properties that had been proposed for specific subsyllabic entities. But many other language researchers find quantitative approaches, such as connectionist models, to be very useful. One of our motivations is to show that a quantitative approach to English phonotactics can uncover the same types of patterns that have been noted as absolute rules in other languages. Researchers interested in universal properties of language might take this as evidence that statistical patterns are not necessarily accidental and deserve a closer look.
A second motivation for our study is to contribute toward research in lexical processing. Recognition and production of a word can be affected by how many other words are similar to it in pronunciation or spelling (e.g., Goldinger, Luce, & Pisoni, 1989; Grainger, 1992). The magnitude of such effects is associated with which end of the word one is considering. For example, the beginning of a word is the most salient part for identification (Cutler, 1982, p. 19). Also, word production is slower and more erratic when there are other words with a similar beginning; interference is not so pronounced when there are other words that have a similar ending (Sevald & Dell, 1994). If we were to find that phonotactic patterns in English show a tendency to constrain combinations of elements at the end of the syllable, thus contributing toward making words more similar at their ends than at their beginnings, then researchers will want to take such facts into account when exploring the relation between processing asymmetries and the structure of the English vocabulary.
Our final reason for studying syllable phonotactics is the evidence it may bring to bear on syllable structure. Linguists have adduced every possible configuration for the internal structure of syllables. For CVC syllables, the main concern is whether the vowel is grouped with the preceding consonant (called the onset), with the following consonant (the coda), or with neither. Figure 1 illustrates those three basic theories. The leftmost tree illustrates the theory of the flat syllable, where the vowel groups with neither the onset nor the coda (Clements & Keyser, 1983; Davis, 1985; Hockett, 1955). The second tree illustrates the onset-rime theory, where the vowel groups with the coda to form a constituent called the rime (Fudge, 1969; Goldsmith, 1990; Kuryłowicz, 1973; Selkirk, 1982). The last tree illustrates the theory of body-coda organization, where the vowel is grouped with the onset to form a constituent called the body (McCarthy, 1979, p. 455; Iverson & Wheeler, 1989). More recently, some phonologists have claimed that the components of the syllable are units of weight called moras (Hayes, 1989; Hyman, 1985). As the trees in Figure 2 illustrate, basic moraic theory always has the vowel in the first mora and the coda in the second, with the complication that a long vowel is considered to be simultaneously in both moras.

Fig. 1: Flat, onset-rime, and body-coda theories of syllable structure, illustrated with the word cap.

Fig. 2: Moraic theory of syllable structure, illustrated with cap (short vowel) and keep (long vowel).

Of all these theories, the onset-rime theory is perhaps the most widely accepted, in our opinion rightly so. Many linguistic phenomena are easily described in terms of properties of the vowel and the coda (i.e., the rime), with the onset consonant being irrelevant. These phenomena include verse metrics, word stress, and compensatory lengthening (see Halle & Vergnaud, 1982; Kuryłowicz, 1973; and Hayes, 1989 for descriptions). It is much harder to find important processes that depend on the body to the exclusion of the coda. Rhyming traditions, language games such as Pig Latin, and speech errors often treat rimes as units but rarely treat bodies as units (e.g., MacKay, 1973; Stemberger, 1983). Furthermore, the bulk of experimental evidence favors the onset-rime theory (see Treiman & Kessler, 1995). On the other hand, Davis (1985) argued for the flat syllable; experiments with speakers of Korean suggest a body-coda organization (Derwing, Yoon, & Cho, 1993; Yoon, 1995); and Pierrehumbert and Nair (1995) claimed that moraic theory can account for much of the experimental evidence. The question of internal syllable structure is not yet settled.
We believe that a statistical study of syllable phonotactics can bring light to the issue of syllable structure. Although the theoretical concept of linguistic structure is hard to pin down, many would agree that a structure is the natural domain for a constraint or process. If, for example, different types of consonants may appear before the vowel than after it, then that suggests that those are not undifferentiated consonant slots, but rather that those two elements belong to different structures. We explore this issue in Study 1. Of course, most theories put the onset and coda in different structures, but such information would argue against some versions of flat syllable theory (e.g., Clements & Keyser, 1983). Another approach is to test whether proposed constituent structures like the head or rime constitute a natural domain for phonotactic constraints. If there are many constraints against combining certain vowels with certain codas (such as the aforementioned illegal combination */ær/) but few if any against combining vowels with onsets, then that suggests that the vowel and coda form a structure, the rime. We investigate this matter in Study 2.
The idea of considering phonotactic constraints as evidence for syllable structure is not new. Fudge (1969) argued for an onset-rime structure, claiming that most or all phonotactic constraints in English involve the vowel and coda. But Clements and Keyser (1983, p. 20) favored the flat syllable, stating that "cooccurrence restrictions holding between the nucleus [i.e., vowel] and preceding elements of the syllable appear to be just as common as cooccurrence restrictions holding between the nucleus and following elements." Fudge (1987) responded with extensive tables that showed that vowel-coda pairs in English have many more gaps than do onset-vowel pairs. Part of the reason that the matter remains unsettled is statistical. The problem is twofold, involving both false zeroes and false positives. Some phonemes are fairly uncommon in English, and the number of morphemes is finite, so some possible combinations may fail to exist just because they do not have a reasonable chance to occur. A count of zero co-occurrences does not mean there is a principled constraint against a sequence. On the other hand, finding a few co-occurrences does not mean that the phonemes combine freely. Some phonemes may be so common that one would expect them to appear together dozens or hundreds of times. Previous researchers were by no means naïve on these points: Clements and Keyser admitted that their claim that voiced fricatives do not appear before /U/ may be accidental, and they counted /vu/ as a circumscribed sequence even though they knew about the words voodoo and rendezvous. But statistical tests have rarely been applied. A notable exception is Randolph (1989), who used the likelihood ratio statistic to test the significance of collocational constraints within the syllable, all of which he rejected. But the statistics he reported were so remarkably low that one suspects that scaling factors threw them off by several orders of magnitude.
In this study, we explored co-occurrence patterns by examining a reasonably comprehensive list of uninflected English CVC words. We readily admit that a study of CVC words will not answer all questions about syllable structure. Some of the patterns we uncover may reflect properties of word edges rather than syllable edges; indeed, some phonologists declare that some or all word-final consonants should not even be considered part of a syllable (Kenstowicz, 1994, pp. 260-261). Some patterns may have more to do with whether a consonant is prevocalic or postvocalic, or whether it precedes or follows sentence stress, than with whether it is in the onset or coda of a syllable. Despite these warnings, we believe that CVC words are an ideal place to begin study. The word list is very homogeneous: All single-coda consonants are paired with a single onset consonant, and there are no confounding factors such as extra phonemes or differences in stress or morphemic composition. Nor is there any doubt as to which syllable a consonant belongs to, which is a matter of no small controversy for English intervocalic consonants (Lass, 1984, pp. 262-268). Our use of CVC words should not only make the statistical analyses more straightforward and interpretable, but should also facilitate follow-up studies. If in the future different patterns are found in other carefully constructed databases, the homogeneous nature of our database should help researchers formulate hypotheses as to what factors in the word lists account for those differences.

Study 1

In Study 1, we ask whether there are differences in the frequency of occurrence of the different consonants depending on whether they are in the onset or the coda.

Method

We analyzed the 2,001 monomorphemic CVC words found in the unabridged Random House Dictionary (Flexner, 1987). We were fervid in our zeal to eradicate polymorphemic words: A word was rejected if any part of it is used in the same sense in some other word, so that even words like this and then were omitted on the grounds that th may be a demonstrative morpheme. We omitted all words which the dictionary gave any reason to believe were not in current general use throughout America. Thus we omitted words with foreign phonemes or accented letters, foreign measures, and place and ethnic names that were not obviously Anglicized. We did include given names such as Dave.
We used the first pronunciation listed in the dictionary. The dictionary phonemes were transcribed on a unit-by-unit basis, except that /R/, which is treated as the sequence ûr by Random House (Flexner, 1987), was here treated as a vowel. As a result, words like bird were included in our list of CVCs. We did this because /R/ is a phonetically unitary sound and we wished to avoid any bias from prejudging its underlying properties. For the other vowels before /r/, which are variously treated in different accents, we followed the usage of the dictionary in recognizing diphthongs before /r/, as well as the vowels /i/ (beer), /U/ (boor), /E/ (bear), /O/ (bore), and /A/ (bar). Tables 1 and 2 list the phonemes recognized in this study, and the phonetic classes to which they belong. Note that we treated diphthongs and affricates as units.
The Random House scheme distinguishes /w/ as in wine from /W/ as in whine. It also draws a distinction between the vowels /Q/ (cot), /O/ (caught), and /A/ (khat, spa). Although these are more distinctions than are commonly made in America, we observe the full set of contrasts because they are dialectally neutral: All those vowel distinctions are made in parts of New England and throughout England. By the same token, we count /O/ as a mid round vowel even though for most Americans it is low and perhaps even unrounded.

TABLE 1 Frequency and Features of Vowels.
Vowel	Example	Frequency	Height	Backness	Tenseness
A	alms	30	low	central	tense
æ	ax	198	low	front	lax
Q	odd	128	low	back	lax
ai	ides	136	---	---	tense
Au	out	44	---	---	tense
e	ape	183	mid	front	tense
E	ebb	159	mid	front	lax
R	erg	115	mid	central	tense
i	eat	210	high	front	tense
I	if	207	high	front	lax
o	oats	146	mid	back	tense
O	ought	110	mid	back	tense
Oi	oink	25	---	---	tense
u	ooze	117	high	back	tense
U	ush	38	high	back	lax
V	up	155	mid	central	lax

TABLE 2 Frequency and Features of Consonants.
Phone	Example	Frequency	Place	Manner	Voice
b	boy	216	bilabial	interrupted	voiced
tS	chin	115	nonanterior	interrupted	unvoiced
d	dog	268	anterior	interrupted	voiced
D	this	15	anterior	fricative	voiced
f	fox	160	labiodental	fricative	unvoiced
g	girl	155	postcoronal	interrupted	voiced
h	hop	105	postcoronal	approximant	unvoiced
j	young	30	nonanterior	approximant	voiced
dZ	jump	115	nonanterior	interrupted	voiced
k	kiss	324	postcoronal	interrupted	unvoiced
l	love	365	anterior	approximant	voiced
m	maid	243	bilabial	nasal	voiced
n	new	306	anterior	nasal	voiced
N	sang	46	postcoronal	nasal	voiced
p	pad	240	bilabial	interrupted	unvoiced
r	read	287	nonanterior	approximant	voiced
s	sing	242	anterior	fricative	unvoiced
S	sheep	109	nonanterior	fricative	unvoiced
t	tongue	323	anterior	interrupted	unvoiced
T	thin	56	anterior	fricative	unvoiced
v	vase	99	labiodental	fricative	voiced
w	win	82	labial	approximant	voiced
W	whip	24	labial	approximant	unvoiced
z	zoo	71	anterior	fricative	voiced
Z	rouge	6	nonanterior	fricative	voiced

Table 2 lists the number of times each consonant occurs in the word list. Only word types were considered, unweighted by their frequency. A word type may contain two occurrences of consonants: Thus bib contributes 2 toward the count of /b/. To determine whether the frequencies are affected by syllable position, we performed for each consonant type separate two-cell goodness-of-fit tests with Pearson's χ², computing the expected frequencies under the null hypothesis that consonants would be evenly distributed between onset and coda. Because all words had exactly one onset and one coda consonant, this means that each consonant should occur half the time in an onset, and half the time in a coda. To correct for the fact that the size of the χ² statistic depends in part on the total number of times each consonant occurs, we also computed the φ coefficient of association for the contingency tables. This statistic includes the total number of consonant tokens as a divisor, and so scales from 0 to 1. In order to determine whether there is an overall association between consonant type and syllable slot across the consonantal system as a whole, we also computed the χ² statistic across all 25 consonant types. Finally, we performed a χ² decomposition by phonetic feature class to help understand effects intermediate between those of the entire table and of individual phonemes. Here we used G², the likelihood-ratio version of χ², because it is additive across decompositions. To guard against the danger of finding significant results simply because we made huge numbers of comparisons, we restricted ourselves throughout this study to only making comparisons between feature sets that are immediate children of the same node in the trees presented in Figures 3 and 4. In Study 1, we only looked at the place of articulation, contrasting two classes of phonemes only when they were immediate children of the same node in the topmost tree of Figure 3. 
For example, anterior coronal consonants as a group were only compared against nonanterior coronal consonants, but not against labial consonants.
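
The two-cell test described above can be sketched as follows. This is a minimal illustration, not the original analysis code: the function name `onset_coda_test` is hypothetical, and φ is assumed to be the square root of χ² divided by the number of tokens of the consonant. The example counts for /b/ are those given in Table 3.

```python
def onset_coda_test(onset: int, coda: int) -> tuple[float, float]:
    """Two-cell Pearson goodness-of-fit test against an even onset/coda split.

    Returns (chi_square, phi). With 1 degree of freedom, the critical value
    at the .05 level is about 3.84.
    """
    total = onset + coda
    expected = total / 2  # null hypothesis: half in onsets, half in codas
    chi_square = ((onset - expected) ** 2 + (coda - expected) ** 2) / expected
    phi = (chi_square / total) ** 0.5  # dividing by the total scales to 0..1
    return chi_square, phi


# /b/ occurs 154 times in onsets and 62 times in codas (Table 3).
chi_b, phi_b = onset_coda_test(154, 62)
print(f"chi-square = {chi_b:.2f}, phi = {phi_b:.3f}")
```

For /b/ this gives a χ² of about 39.19 and a φ of about .426, matching the corresponding row of Table 3.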
Figure 3. Hierarchical organization of the three classes of consonant features; the topmost tree shows the place features. This study compares the distribution of two consonant feature classes only when they are immediate children of the same node. Table 2 lists the features of each of the consonant phonemes.

Results

Table 3 shows how often each consonant occurs in onset and coda. The table is arranged by the strength of association between consonant type and syllable position (φ). As one can see by the number of starred χ² statistics, most of the consonants appear in either the onset or the coda more often than one would expect. A χ² test for the consonantal system as a whole gives a significant result (χ²(24) = 496.52, p < .05). The strength of the association, as measured by Cramér's coefficient, is .35.

TABLE 3 Distribution of Consonants Within Onset and Coda.
Phone	Onset	Coda	χ²		φ
j	30	0	30.00	*	1.000
W	24	0	24.00	*	1.000
w	82	0	82.00	*	1.000
N	0	46	46.00	*	1.000
h	105	0	105.00	*	1.000
D	1	14	11.27	*	.867
Z	1	5	---		.667
z	13	58	28.52	*	.634
b	154	62	39.19	*	.426
T	17	39	8.64	*	.393
n	99	207	38.12	*	.353
dZ	74	41	9.47	*	.287
t	119	204	22.37	*	.263
l	135	230	24.73	*	.260
S	65	44	4.05	*	.193
f	92	68	3.60		.150
r	163	124	5.30	*	.136
g	88	67	2.85		.135
k	142	182	4.94	*	.123
v	45	54	0.82		.091
p	128	112	1.07		.067
d	126	142	0.96		.060
m	116	127	0.50		.045
s	126	116	0.41		.041
tS	56	59	0.08		.026

Note. Statistics examine the difference between the frequency of each consonant in the onset and its frequency in the coda.

*p < .05, 1 df.
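
The system-wide figures can be checked against the counts in Table 3. The sketch below is an illustration rather than the original analysis code: it computes Pearson's χ² for the 25 × 2 table of consonants by syllable position, together with Cramér's coefficient. The names `COUNTS` and `pearson_chi_square` are hypothetical.

```python
# (onset, coda) counts for the 25 consonants, copied from Table 3.
COUNTS = {
    "j": (30, 0), "W": (24, 0), "w": (82, 0), "N": (0, 46), "h": (105, 0),
    "D": (1, 14), "Z": (1, 5), "z": (13, 58), "b": (154, 62), "T": (17, 39),
    "n": (99, 207), "dZ": (74, 41), "t": (119, 204), "l": (135, 230),
    "S": (65, 44), "f": (92, 68), "r": (163, 124), "g": (88, 67),
    "k": (142, 182), "v": (45, 54), "p": (128, 112), "d": (126, 142),
    "m": (116, 127), "s": (126, 116), "tS": (56, 59),
}


def pearson_chi_square(table):
    """Pearson chi-square of independence for a list of (onset, coda) rows."""
    col_totals = [sum(row[j] for row in table) for j in range(2)]
    grand_total = sum(col_totals)
    chi_square = 0.0
    for row in table:
        row_total = sum(row)
        for j, observed in enumerate(row):
            expected = row_total * col_totals[j] / grand_total
            chi_square += (observed - expected) ** 2 / expected
    return chi_square, grand_total


rows = list(COUNTS.values())
chi_square, n_tokens = pearson_chi_square(rows)
dof = (len(rows) - 1) * (2 - 1)
# With only two columns, Cramér's coefficient reduces to sqrt(chi2 / N).
cramers_v = (chi_square / n_tokens) ** 0.5
print(f"chi-square({dof}) = {chi_square:.2f}, Cramér's V = {cramers_v:.2f}")
```

Run on the Table 3 counts, this yields a χ² of about 496.52 on 24 degrees of freedom and a coefficient of about .35, in line with the values reported above. Because every word contributes exactly one onset and one coda, the two column totals are equal and the overall χ² is the sum of the per-consonant values.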

In the G² decomposition, the total G² was 601.26. This value is somewhat unreliable because we had to adjust some zero cell frequencies, which are undefined for G². Our first step, therefore, was to partition off any consonants that had structural zeroes, including the glides /j/ and /w/, which cannot occur in codas because we define them to be parts of diphthongs (the vowel slot), and /Z/ and /D/, which arguably occur in the onset in only special circumstances (loan words and function words, respectively). Effects among those consonants, or between them as a group and the other consonants as a group, were each significant, and accounted for two thirds of the deviation (G² of 408.83). But when we decomposed the remaining G² by place of articulation according to the scheme of Figure 3, we found several other significant deviations having nothing to do with structural zeroes; these are summarized in Table 4. The table shows, for example, that coronal consonants prefer the coda significantly more than noncoronals. Among coronals, anterior consonants have a more marked preference for the coda than do nonanterior coronals. In contrast to Table 3, this table shows the direction of the skew within the contrast, not the absolute direction of the preference. Thus /d/ is listed as favoring the onset more than other anterior coronals, even though in absolute numbers (i.e., when compared to all other consonants) it is found in the coda somewhat more than in the onset.

TABLE 4 G² Decomposition of Consonant Distributions By Place of Articulation, Onset vs. Coda
Contrasting	Favoring onset	Favoring coda	G²	Cramér
Consonants	noncoronal	coronal	30.06	.09
coronal	nonanterior	anterior	60.93	.16
anterior	/d, s/	/l, n, T, t, z/	47.11	.17
noncoronal	labial	velar	7.85	.07
labial	/b/	/p, m/	28.53	.20
velar	/g/	/k/	7.05	.12

Note. Omits consonants largely restricted to either onset (/h, j, w, W/) or coda (/N, D, Z/). First column shows the phonetic category within which a significant skew between onset and coda is found. The members of that category are differentiated by whether they appear more often in onset than do other members of the same category. The nonanterior consonants /dZ, S, r, tS/ did not vary significantly among themselves.
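
As a hedged illustration of one cell of this decomposition, the following sketch computes the likelihood-ratio statistic G² = 2 Σ O ln(O/E) for the velar contrast, using the onset and coda counts of /g/ and /k/ from Table 3. The function name `g_squared` is hypothetical, and skipping zero cells is an assumption of the sketch, not necessarily the zero-cell adjustment used in the study.

```python
import math


def g_squared(table):
    """Likelihood-ratio statistic G2 = 2 * sum(O * ln(O / E)) for a 2-D table.

    Cells with an observed count of zero are skipped, because O * ln(O / E)
    tends to 0 as O tends to 0 (an assumption of this sketch).
    """
    row_totals = [sum(row) for row in table]
    col_totals = [sum(col) for col in zip(*table)]
    n = sum(row_totals)
    g2 = 0.0
    for i, row in enumerate(table):
        for j, observed in enumerate(row):
            if observed > 0:
                expected = row_totals[i] * col_totals[j] / n
                g2 += observed * math.log(observed / expected)
    return 2 * g2, n


# Velar contrast: (onset, coda) counts for /g/ and /k/ from Table 3.
velar = [(88, 67), (142, 182)]
g2, n = g_squared(velar)
cramer = math.sqrt(g2 / n)  # two columns, so min(rows, cols) - 1 = 1
print(f"G2 = {g2:.2f}, Cramér = {cramer:.2f}")
```

The result should be close to the 7.05 and .12 listed for the velar row. Applied to the three bilabials /b/, /p/, and /m/ as a 3 × 2 table, the same function gives a value close to the 28.53 listed for the labial row.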

Discussion

Our results show that there is an association between consonant type and syllable position. In the phonemic analysis used by our source dictionary, glides (/h/, /j/, /w/, /W/) can only occur in the onset and /N/ can only occur in the coda. This much is common knowledge. What is not commonly recognized is the skew that is present for several other consonants as well. In particular, /z/, /T/, /n/, /t/, /l/ and /k/ show a significant preference for the coda, and /b/, /dZ/, /S/, and /r/ show a preference for the onset. The fact that consonants as a group prefer particular syllable slots is strong enough to be significant even if one factors out the consonants for which there is an absolute, inviolable restriction as to which slot they can go in. About half of the G² among the remaining consonants can be accounted for by significant differences between contrasting places of articulation, with the strongest effect being the contrast between anterior and nonanterior coronal consonants.
Because the words in our study are all monomorphemic, the prevalence of /z/, /T/, /n/, and /t/ in codas cannot be explained by their use as inflectional or derivational endings. The fact that coronals, especially anterior coronals, appear disproportionately often in codas echoes absolute constraints found in other languages. When languages restrict codas or word endings to consonants of a particular place of articulation, anterior coronals are the least likely to be excluded. The core, native vocabulary of Spanish, for example, has many words ending in the anterior coronals /D/, /T/, /s/, /l/, /n/, and /r/, but almost no words ending in other consonants, even though such consonants are frequent at the beginning of words.

Study 2

As we pointed out earlier, one test of structural constituency is whether the items within a proposed constituent are more strongly associated with each other than they are with items outside of the structure. Put another way, items within a constituent should vary less freely with respect to each other than they do with respect to other items. Study 2 was designed to determine whether the vowel and the coda are more strongly associated with each other than are the vowel and the onset. Such a finding would suggest that the vowel and the coda form a constituent, the rime. In addition, the existence of a rime constituent would allow us to treat the distributional patterns seen in Study 1 as properties of that constituent. That approach would simplify our account by obviating the need to hypothesize that the onset or the coda, or both, are distinguished constituents in their own right.