Linguistics

Stops

Complete closure of vocal tract
Articulators closed

Fricative

Partial closure of vocal tract
Articulators close together but not touching

Phonemes

Contrasting Sounds

Voiced

Vocal folds closed together
Vibration

Unvoiced

Vocal folds apart
No Vibration

Phonetics

Study of speech sounds

Minimal Pairs

2 words that differ with only 1 phoneme [goat, boat]
ALWAYS have a contrast

Consonant Cluster

2 consonants next to each other w/o a vowel in between them

Articulatory Phonetics

Motions made by the vocal tract to produce speech sounds

Fourier

Simple sounds can be added to create complex sounds

Harmonics

Energy at evenly spaced frequencies
Result of vocal chord vibration
Multiple frequencies simultaneously
Laryngeal Vibration
ALWAYS Voiced

Pitch

Spacing between harmonics

Filtering

Caused by shape of vocal tract
Suppresses the energy of certain harmonics so that the overall distribution of energy becomes concentrated around particular frequencies (called formants)

Formants

Frequencies w/ a lot of energy

High F0

High Pitch

Low F0

Low Pitch

High F2

Front Vowel
1300-1500

Low F2

Back Vowel
2250-2800

Do unvoiced sounds have harmonics?

No
Harmonics are a result of vocal chord vibration, unvoiced has no vibration

High Frequency Harmonics have [more/less] energy than low frequency harmonics?

LESS

Typical range of F1

300-1100
[300-500 Low]
[600-1100 High]

Typical range of F2

900-3000
[900-1700 Low]
[1800-3000 High]

High F1 means...

Low Vowel

Low F1 means...

High Vowel

High F2 means...

Front Vowel

Low F2 means...

Back Vowel

p

unvoiced labial stop

b

voiced labial stop

m

voiced labial nasal

f

unvoiced LD fricative

v

voiced LD fricative

voiced dental fricative

?

unvoiced dental fricative

t

unvoiced alveolar stop

d

voiced alveolar stop

s

unvoiced alveolar fricative

z

voiced alveolar fricative

n

voiced alveolar nasal

?

unvoiced PA fricative

?

voiced PA fricative

k

unvoiced velar stop

g

voiced velar stop

?

voiced velar nasal

i

High Front Vowel
F1 300
F2 2500

I

Mid-High Front Vowel
F1 500
F2 2400

e

Mid-Low Front Vowel
F1 700
F2 2200

Low Front Vowel
F1 900
F2 1900

Order of Front Vowels

i, I, e, �
Beet, Bit, Bay, Bat

Order of Back Vowels

u, ?, o, ?
Boot, Book, Boat, Bot

u

High Back Vowel
F1 400
F2 1000

?

Mid-High Back Vowel
F1 500
F2 1100

o

Mid-Low Back Vowel
F1 700
F2 1200

?

Low Back Vowel
F1 800
F2 1300

Advantages of Rule Based Synthesis

FLEXIBLE
(1) A program can fit in a small space
(2) Easy to adjust to different dialects/languages

Aspiration

puff of air that accompanies the release of air in a consonant

Sibilants

Crazy loud
Consonant sounds articulated with a hissing sound ( s, ? )

Coarticulation

Pronouncing vowels and nasal at same time, fading in/out of the vowel formants

High frequency harmonics = ____ pitch and _______ energy

High Pitch
Low Energy

Low frequency harmonics = ____ pitch and _______ energy

Low Pitch
High Energy

A Rule-Based Synthesis system works by...

Imitating the physical effects of speech on air vibration with something other than a human voice

Did the Voder use recordings of a human voice?

No!

How to change a /o/ into a /u/

Make a tone about 400 Hz loud while making a tone about 700 Hz quieter

What is the bear minimum for creating a synthesized /u/?

Two tones, one at 300 Hz and one at 1000 Hz

A synthesized phoneme has five tones at: 5020 Hz, 5040 Hz, 5700 Hz and 5900 Hz.
Which phoneme is most likely being synthesized?

/s/

A synthesized phoneme has 11 tones each spaced 350 Hz apart from one another. What is most likely being synthesized?

a child production of a vowel

An acoustic property unique to stops in English is

aspiration

Stops

Complete closure of vocal tract
Articulators closed

Fricative

Partial closure of vocal tract
Articulators close together but not touching

Phonemes

Contrasting Sounds

Voiced

Vocal folds closed together
Vibration

Unvoiced

Vocal folds apart
No Vibration

Phonetics

Study of speech sounds

Minimal Pairs

2 words that differ with only 1 phoneme [goat, boat]
ALWAYS have a contrast

Consonant Cluster

2 consonants next to each other w/o a vowel in between them

Articulatory Phonetics

Motions made by the vocal tract to produce speech sounds

Fourier

Simple sounds can be added to create complex sounds

Harmonics

Energy at evenly spaced frequencies
Result of vocal chord vibration
Multiple frequencies simultaneously
Laryngeal Vibration
ALWAYS Voiced

Pitch

Spacing between harmonics

Filtering

Caused by shape of vocal tract
Suppresses the energy of certain harmonics so that the overall distribution of energy becomes concentrated around particular frequencies (called formants)

Formants

Frequencies w/ a lot of energy

High F0

High Pitch

Low F0

Low Pitch

High F2

Front Vowel
1300-1500

Low F2

Back Vowel
2250-2800

Do unvoiced sounds have harmonics?

No
Harmonics are a result of vocal chord vibration, unvoiced has no vibration

High Frequency Harmonics have [more/less] energy than low frequency harmonics?

LESS

Typical range of F1

300-1100
[300-500 Low]
[600-1100 High]

Typical range of F2

900-3000
[900-1700 Low]
[1800-3000 High]

High F1 means...

Low Vowel

Low F1 means...

High Vowel

High F2 means...

Front Vowel

Low F2 means...

Back Vowel

p

unvoiced labial stop

b

voiced labial stop

m

voiced labial nasal

f

unvoiced LD fricative

v

voiced LD fricative

voiced dental fricative

?

unvoiced dental fricative

t

unvoiced alveolar stop

d

voiced alveolar stop

s

unvoiced alveolar fricative

z

voiced alveolar fricative

n

voiced alveolar nasal

?

unvoiced PA fricative

?

voiced PA fricative

k

unvoiced velar stop

g

voiced velar stop

?

voiced velar nasal

i

High Front Vowel
F1 300
F2 2500

I

Mid-High Front Vowel
F1 500
F2 2400

e

Mid-Low Front Vowel
F1 700
F2 2200

Low Front Vowel
F1 900
F2 1900

Order of Front Vowels

i, I, e, �
Beet, Bit, Bay, Bat

Order of Back Vowels

u, ?, o, ?
Boot, Book, Boat, Bot

u

High Back Vowel
F1 400
F2 1000

?

Mid-High Back Vowel
F1 500
F2 1100

o

Mid-Low Back Vowel
F1 700
F2 1200

?

Low Back Vowel
F1 800
F2 1300

Advantages of Rule Based Synthesis

FLEXIBLE
(1) A program can fit in a small space
(2) Easy to adjust to different dialects/languages

Aspiration

puff of air that accompanies the release of air in a consonant

Sibilants

Crazy loud
Consonant sounds articulated with a hissing sound ( s, ? )

Coarticulation

Pronouncing vowels and nasal at same time, fading in/out of the vowel formants

High frequency harmonics = ____ pitch and _______ energy

High Pitch
Low Energy

Low frequency harmonics = ____ pitch and _______ energy

Low Pitch
High Energy

A Rule-Based Synthesis system works by...

Imitating the physical effects of speech on air vibration with something other than a human voice

Did the Voder use recordings of a human voice?

No!

How to change a /o/ into a /u/

Make a tone about 400 Hz loud while making a tone about 700 Hz quieter

What is the bear minimum for creating a synthesized /u/?

Two tones, one at 300 Hz and one at 1000 Hz

A synthesized phoneme has five tones at: 5020 Hz, 5040 Hz, 5700 Hz and 5900 Hz.
Which phoneme is most likely being synthesized?

/s/

A synthesized phoneme has 11 tones each spaced 350 Hz apart from one another. What is most likely being synthesized?

a child production of a vowel

An acoustic property unique to stops in English is

aspiration