Stops
Complete closure of vocal tract
Articulators closed
Fricative
Partial closure of vocal tract
Articulators close together but not touching
Phonemes
Contrasting Sounds
Voiced
Vocal folds closed together
Vibration
Unvoiced
Vocal folds apart
No Vibration
Phonetics
Study of speech sounds
Minimal Pairs
2 words that differ with only 1 phoneme [goat, boat]
ALWAYS have a contrast
Consonant Cluster
2 consonants next to each other w/o a vowel in between them
Articulatory Phonetics
Motions made by the vocal tract to produce speech sounds
Fourier
Simple sounds can be added to create complex sounds
Harmonics
Energy at evenly spaced frequencies
Result of vocal chord vibration
Multiple frequencies simultaneously
Laryngeal Vibration
ALWAYS Voiced
Pitch
Spacing between harmonics
Filtering
Caused by shape of vocal tract
Suppresses the energy of certain harmonics so that the overall distribution of energy becomes concentrated around particular frequencies (called formants)
Formants
Frequencies w/ a lot of energy
High F0
High Pitch
Low F0
Low Pitch
High F2
Front Vowel
1300-1500
Low F2
Back Vowel
2250-2800
Do unvoiced sounds have harmonics?
No
Harmonics are a result of vocal chord vibration, unvoiced has no vibration
High Frequency Harmonics have [more/less] energy than low frequency harmonics?
LESS
Typical range of F1
300-1100
[300-500 Low]
[600-1100 High]
Typical range of F2
900-3000
[900-1700 Low]
[1800-3000 High]
High F1 means...
Low Vowel
Low F1 means...
High Vowel
High F2 means...
Front Vowel
Low F2 means...
Back Vowel
p
unvoiced labial stop
b
voiced labial stop
m
voiced labial nasal
f
unvoiced LD fricative
v
voiced LD fricative
�
voiced dental fricative
?
unvoiced dental fricative
t
unvoiced alveolar stop
d
voiced alveolar stop
s
unvoiced alveolar fricative
z
voiced alveolar fricative
n
voiced alveolar nasal
?
unvoiced PA fricative
?
voiced PA fricative
k
unvoiced velar stop
g
voiced velar stop
?
voiced velar nasal
i
High Front Vowel
F1 300
F2 2500
I
Mid-High Front Vowel
F1 500
F2 2400
e
Mid-Low Front Vowel
F1 700
F2 2200
�
Low Front Vowel
F1 900
F2 1900
Order of Front Vowels
i, I, e, �
Beet, Bit, Bay, Bat
Order of Back Vowels
u, ?, o, ?
Boot, Book, Boat, Bot
u
High Back Vowel
F1 400
F2 1000
?
Mid-High Back Vowel
F1 500
F2 1100
o
Mid-Low Back Vowel
F1 700
F2 1200
?
Low Back Vowel
F1 800
F2 1300
Advantages of Rule Based Synthesis
FLEXIBLE
(1) A program can fit in a small space
(2) Easy to adjust to different dialects/languages
Aspiration
puff of air that accompanies the release of air in a consonant
Sibilants
Crazy loud
Consonant sounds articulated with a hissing sound ( s, ? )
Coarticulation
Pronouncing vowels and nasal at same time, fading in/out of the vowel formants
High frequency harmonics = ____ pitch and _______ energy
High Pitch
Low Energy
Low frequency harmonics = ____ pitch and _______ energy
Low Pitch
High Energy
A Rule-Based Synthesis system works by...
Imitating the physical effects of speech on air vibration with something other than a human voice
Did the Voder use recordings of a human voice?
No!
How to change a /o/ into a /u/
Make a tone about 400 Hz loud while making a tone about 700 Hz quieter
What is the bear minimum for creating a synthesized /u/?
Two tones, one at 300 Hz and one at 1000 Hz
A synthesized phoneme has five tones at: 5020 Hz, 5040 Hz, 5700 Hz and 5900 Hz.
Which phoneme is most likely being synthesized?
/s/
A synthesized phoneme has 11 tones each spaced 350 Hz apart from one another. What is most likely being synthesized?
a child production of a vowel
An acoustic property unique to stops in English is
aspiration
Stops
Complete closure of vocal tract
Articulators closed
Fricative
Partial closure of vocal tract
Articulators close together but not touching
Phonemes
Contrasting Sounds
Voiced
Vocal folds closed together
Vibration
Unvoiced
Vocal folds apart
No Vibration
Phonetics
Study of speech sounds
Minimal Pairs
2 words that differ with only 1 phoneme [goat, boat]
ALWAYS have a contrast
Consonant Cluster
2 consonants next to each other w/o a vowel in between them
Articulatory Phonetics
Motions made by the vocal tract to produce speech sounds
Fourier
Simple sounds can be added to create complex sounds
Harmonics
Energy at evenly spaced frequencies
Result of vocal chord vibration
Multiple frequencies simultaneously
Laryngeal Vibration
ALWAYS Voiced
Pitch
Spacing between harmonics
Filtering
Caused by shape of vocal tract
Suppresses the energy of certain harmonics so that the overall distribution of energy becomes concentrated around particular frequencies (called formants)
Formants
Frequencies w/ a lot of energy
High F0
High Pitch
Low F0
Low Pitch
High F2
Front Vowel
1300-1500
Low F2
Back Vowel
2250-2800
Do unvoiced sounds have harmonics?
No
Harmonics are a result of vocal chord vibration, unvoiced has no vibration
High Frequency Harmonics have [more/less] energy than low frequency harmonics?
LESS
Typical range of F1
300-1100
[300-500 Low]
[600-1100 High]
Typical range of F2
900-3000
[900-1700 Low]
[1800-3000 High]
High F1 means...
Low Vowel
Low F1 means...
High Vowel
High F2 means...
Front Vowel
Low F2 means...
Back Vowel
p
unvoiced labial stop
b
voiced labial stop
m
voiced labial nasal
f
unvoiced LD fricative
v
voiced LD fricative
�
voiced dental fricative
?
unvoiced dental fricative
t
unvoiced alveolar stop
d
voiced alveolar stop
s
unvoiced alveolar fricative
z
voiced alveolar fricative
n
voiced alveolar nasal
?
unvoiced PA fricative
?
voiced PA fricative
k
unvoiced velar stop
g
voiced velar stop
?
voiced velar nasal
i
High Front Vowel
F1 300
F2 2500
I
Mid-High Front Vowel
F1 500
F2 2400
e
Mid-Low Front Vowel
F1 700
F2 2200
�
Low Front Vowel
F1 900
F2 1900
Order of Front Vowels
i, I, e, �
Beet, Bit, Bay, Bat
Order of Back Vowels
u, ?, o, ?
Boot, Book, Boat, Bot
u
High Back Vowel
F1 400
F2 1000
?
Mid-High Back Vowel
F1 500
F2 1100
o
Mid-Low Back Vowel
F1 700
F2 1200
?
Low Back Vowel
F1 800
F2 1300
Advantages of Rule Based Synthesis
FLEXIBLE
(1) A program can fit in a small space
(2) Easy to adjust to different dialects/languages
Aspiration
puff of air that accompanies the release of air in a consonant
Sibilants
Crazy loud
Consonant sounds articulated with a hissing sound ( s, ? )
Coarticulation
Pronouncing vowels and nasal at same time, fading in/out of the vowel formants
High frequency harmonics = ____ pitch and _______ energy
High Pitch
Low Energy
Low frequency harmonics = ____ pitch and _______ energy
Low Pitch
High Energy
A Rule-Based Synthesis system works by...
Imitating the physical effects of speech on air vibration with something other than a human voice
Did the Voder use recordings of a human voice?
No!
How to change a /o/ into a /u/
Make a tone about 400 Hz loud while making a tone about 700 Hz quieter
What is the bear minimum for creating a synthesized /u/?
Two tones, one at 300 Hz and one at 1000 Hz
A synthesized phoneme has five tones at: 5020 Hz, 5040 Hz, 5700 Hz and 5900 Hz.
Which phoneme is most likely being synthesized?
/s/
A synthesized phoneme has 11 tones each spaced 350 Hz apart from one another. What is most likely being synthesized?
a child production of a vowel
An acoustic property unique to stops in English is
aspiration