Speech sound is the sound produced by the human voice when people talk.
When we speak, our body creates sound by moving air from the lungs through the vocal cords and shaping that sound with the mouth, tongue, teeth, lips, and nose.
Speech sounds are used to form:
Speech begins with air. The basic process is:
| Body Part | Role in Speech |
|---|---|
| Lungs | Push air upward to begin speech. |
| Diaphragm | Helps control breathing and airflow. |
| Larynx | Contains the vocal cords. |
| Vocal cords | Vibrate to create voiced sounds. |
| Tongue | Shapes many speech sounds. |
| Lips | Help form sounds such as /p/, /b/, and /m/. |
| Teeth | Help form sounds such as /f/, /v/, and /th/. |
| Nose | Allows air to pass through for nasal sounds. |
| Mouth | Main space where speech sounds are shaped. |
Speech sounds can be grouped into voiced and voiceless sounds.
A voiced sound happens when the vocal cords vibrate.
Examples of voiced sounds include:
A voiceless sound happens when the vocal cords do not vibrate.
Examples of voiceless sounds include:
Vowels are speech sounds made when air flows freely through the mouth.
The letters usually associated with vowels are:
Sometimes Y acts like a vowel.
Examples of vowel sounds:
Consonants are speech sounds made when airflow is partly or completely blocked.
The blockage can happen with the:
Examples:
The place of articulation is where in the mouth or throat a speech sound is made.
| Place of Articulation | Description | Example Sounds |
|---|---|---|
| Bilabial | Both lips come together. | /p/, /b/, /m/ |
| Labiodental | Lower lip touches upper teeth. | /f/, /v/ |
| Dental | Tongue touches or nears teeth. | /th/ |
| Alveolar | Tongue touches ridge behind teeth. | /t/, /d/, /s/, /z/, /n/ |
| Velar | Back of tongue touches soft palate. | /k/, /g/ |
| Glottal | Sound made at the vocal cords. | /h/ |
The manner of articulation describes how airflow is controlled when making a sound.
| Manner | Description | Example Sounds |
|---|---|---|
| Stop | Air is completely blocked, then released. | /p/, /b/, /t/, /d/, /k/, /g/ |
| Fricative | Air is forced through a narrow opening. | /f/, /v/, /s/, /z/, /sh/ |
| Nasal | Air flows through the nose. | /m/, /n/ |
| Affricate | Begins like a stop and ends like a fricative. | /ch/, /j/ |
| Liquid | Smooth flowing sound. | /l/, /r/ |
| Glide | Quick vowel-like sound. | /w/, /y/ |
A phoneme is the smallest unit of sound that can change the meaning of a word.
| Word Pair | Changing Sound |
|---|---|
| cat / bat | /c/ and /b/ |
| sit / sat | /i/ and /a/ |
| fan / van | /f/ and /v/ |
| sip / zip | /s/ and /z/ |
A syllable is a beat or unit of sound in a word. Most syllables contain a vowel sound.
| Word | Number of Syllables |
|---|---|
| cat | 1 |
| tiger | 2 |
| computer | 3 |
| communication | 5 |
Syllables help organize speech into smaller parts.
Pitch is how high or low a voice sounds. Pitch depends on how fast the vocal cords vibrate.
People use pitch to show meaning and emotion.
Loudness is how strong or quiet a speech sound is. Loudness depends on the amount of air pressure used when speaking.
Loudness can help show emotion, urgency, or emphasis.
Speech rate is how fast or slow someone talks.
Stress means saying part of a word or sentence more strongly than the rest. Stress can change meaning.
Stress helps listeners understand what information is most important.
Intonation is the rise and fall of pitch during speech. Intonation helps communicate meaning, mood, and sentence type.
| Intonation Pattern | Common Meaning |
|---|---|
| Rising pitch | Question or uncertainty |
| Falling pitch | Statement or command |
| Changing pitch | Emotion or emphasis |
Computers can record, analyze, and recognize speech sounds.
Speech technology is used in:
To process speech, computers analyze features such as:
Speech recognition is when a computer converts spoken words into text or commands.
Examples:
Speech recognition systems must handle challenges such as:
When speech is recorded by a computer, it is converted into digital data.
The process usually includes:
Digital speech can then be edited, stored, transmitted, or analyzed.
A spectrogram is a visual representation of sound. It shows how frequencies change over time.
A spectrogram usually shows:
Speech scientists and computers use spectrograms to study speech patterns.
Spectrograms can help identify:
Understanding speech sound is useful in many fields.
| Field | Use of Speech Sound |
|---|---|
| Computer Science | Speech recognition and AI assistants |
| Medicine | Speech therapy and hearing tests |
| Education | Reading, language learning, and communication |
| Music | Singing and vocal performance |
| Forensics | Voice analysis and speaker identification |
| Engineering | Audio systems and noise reduction |
| Accessibility | Captions, dictation, and assistive technology |
| Term | Meaning |
|---|---|
| Speech sound | Sound made by the human voice to communicate. |
| Speech production | Process of creating speech sounds. |
| Vocal cords | Tissue in the larynx that vibrates to create voiced sounds. |
| Voiced sound | Sound made with vocal cord vibration. |
| Voiceless sound | Sound made without vocal cord vibration. |
| Vowel | Speech sound made with open airflow. |
| Consonant | Speech sound made with blocked or restricted airflow. |
| Place of articulation | Where a sound is made in the mouth or throat. |
| Manner of articulation | How airflow is controlled when making a sound. |
| Phoneme | Smallest sound unit that can change word meaning. |
| Syllable | Beat or unit of sound in a word. |
| Pitch | How high or low a voice sounds. |
| Loudness | How strong or quiet a sound is. |
| Speech rate | Speed of speaking. |
| Stress | Extra emphasis placed on a sound, syllable, or word. |
| Intonation | Rise and fall of pitch in speech. |
| Spectrogram | Visual display of sound frequencies over time. |
| Speech recognition | Technology that converts spoken words into text or commands. |