Sound systems of human language -- phoneme inventories, the International Phonetic Alphabet, articulatory and acoustic phonetics, phonological rules, suprasegmental features (stress, tone, intonation), and ear training for non-native sound perception. Covers place and manner of articulation, voicing contrasts, vowel space, minimal pair analysis, allophonic variation, phonotactic constraints, connected speech phenomena (assimilation, elision, liaison), and prosody across language families. Use when analyzing pronunciation, teaching sound systems, performing phonemic transcription, or diagnosing intelligibility problems in any language.
Every human language is built on a finite inventory of contrastive sounds. Phonetics studies how those sounds are physically produced (articulatory), transmitted (acoustic), and perceived (auditory). Phonology studies how sounds function within a particular language -- which distinctions are meaningful, which are automatic, and how sounds interact with each other in sequence. This skill treats language-universal principles rather than any single language's system, making it applicable to any language a learner encounters.
Agent affinity: crystal (phonetic description, language diversity), chomsky-l (phonological rules, underlying representations)
Concept IDs: lang-phoneme-inventory, lang-ipa-notation, lang-ear-training, lang-suprasegmentals
| Domain | Focus | Key Tools |
|---|---|---|
| Articulatory phonetics | How sounds are produced | Place/manner/voicing, vocal tract diagrams |
| Acoustic phonetics | Sound waves and formants | Spectrograms, F1/F2 vowel plots, VOT |
| Auditory phonetics |
| How sounds are perceived |
| Categorical perception, ear training drills |
| Phonemics | Contrastive sound units | Minimal pairs, complementary distribution |
| Phonotactics | Legal sound sequences | Onset/coda constraints, syllable structure |
| Prosody | Suprasegmental features | Stress, tone, intonation contours |
Every consonant is classified along three dimensions:
Example. English [t] is a voiceless alveolar plosive. Japanese [ts] in "tsunami" is a voiceless alveolar affricate. The distinction matters: substituting one for the other changes meaning or intelligibility in context.
Vowels are classified by tongue position along two axes, plus lip rounding:
The IPA vowel trapezoid plots these dimensions, and formant analysis (F1 for height, F2 for backness) provides acoustic confirmation.
VOT measures the delay between the release of a stop consonant and the onset of voicing for the following vowel. It distinguishes:
This is a major source of cross-linguistic interference: a Spanish speaker's unaspirated [p] sounds like English [b] to an English listener, and vice versa.
The IPA provides a universal transcription system where one symbol corresponds to one sound. This skill assumes familiarity with IPA as a meta-tool for describing any language.
A minimal pair is two words that differ by exactly one sound in the same position and have different meanings: "bat" /bæt/ vs. "pat" /pæt/. Minimal pairs prove that two sounds are separate phonemes. If no minimal pair can be found, the sounds may be allophones of the same phoneme.
Worked example. In English, [l] and [ɫ] (clear and dark L) are allophones: "leaf" [liːf] vs. "feel" [fiːɫ]. No minimal pair distinguishes them because they are in complementary distribution (clear before vowels, dark before consonants or word-finally). In Russian, [l] and [ɫ] ARE separate phonemes: "лук" /luk/ (onion) vs. "лук" /ɫuk/ would be meaningful (Cyrillic spelling distinguishes palatalized from non-palatalized).
| Process | Description | Example |
|---|---|---|
| Assimilation | A sound becomes more like a neighboring sound | English "input" /ɪnpʊt/ often pronounced [ɪmpʊt] -- /n/ becomes [m] before bilabial /p/ |
| Elision | A sound is deleted | French "je ne sais pas" -> [ʃɛpa] in rapid speech |
| Epenthesis | A sound is inserted | Japanese borrows "strike" as "sutoraiku" -- vowels inserted to fit CV syllable structure |
| Metathesis | Sounds swap positions | English "ask" pronounced [æks] in some dialects |
| Lenition | A sound weakens | Spanish intervocalic /b/ -> [β] (fricative): "lobo" [loβo] |
| Fortition | A sound strengthens | Korean word-initial lenis stops become tense after nasals |
| Nasalization | Vowel acquires nasal quality | French nasal vowels: "bon" [bɔ̃] |
| Palatalization | Consonant shifts toward palatal place | Russian consonants before front vowels: /t/ -> [tʲ] |
| Vowel harmony | Vowels in a word agree on a feature | Turkish: "evler" (houses) but "odalar" (rooms) -- front/back harmony |
Phonological rules are written as: A -> B / C __ D
"A becomes B in the context of C preceding and D following."
Example. English aspiration rule: voiceless stops become aspirated word-initially before a stressed vowel.
/p, t, k/ -> [pʰ, tʰ, kʰ] / # __ V[+stress]
This explains why "pin" has [pʰ] but "spin" has unaspirated [p] -- the /p/ in "spin" is not word-initial.
Stress is relative prominence of a syllable through louder volume, higher pitch, longer duration, or fuller vowel quality. Languages differ in whether stress is:
In free-stress languages, stress can be contrastive: English "CONtract" (noun) vs. "conTRACT" (verb); Russian "мука" /mukÁ/ (flour) vs. /múka/ (torment).
Tone uses pitch to distinguish word meaning. Approximately 60-70% of the world's languages are tonal.
Intonation is the pitch pattern over an entire phrase or sentence. Unlike tone, it is not lexical but grammatical and pragmatic:
Perceiving non-native sounds is a trained skill, not an innate ability. The critical period hypothesis suggests children acquire phonemic distinctions more easily, but adults can train perception through:
Research by Flege (1995) and Best & Tyler (2007) shows that adult learners can acquire new phonemic categories, but the difficulty depends on the relationship between L1 and L2 sounds. The hardest case is when an L2 contrast falls within a single L1 category -- the learner must split an existing category, not create a new one.
Fluent speech is not a sequence of isolated sounds. Sounds in running speech interact in ways that isolated-word pronunciation does not predict:
Understanding connected speech is critical for listening comprehension. Learners trained on isolated words often fail to parse natural-speed speech because they have never encountered the connected forms.
When assessing a learner's pronunciation, distinguish between:
When a learner starts a new language, compare its sound system to known languages: