1 / 34

Auditory Perception Hillenbrand SPPA 2060

Auditory Perception Hillenbrand SPPA 2060. Auditory perception is one branch of a larger science called psychophysics . Psychophysics studies the relationships that exist between perceptual dimensions (also psychological, subjective, or mental) and the physical properties of stimuli.

hila
Download Presentation

Auditory Perception Hillenbrand SPPA 2060

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Auditory Perception Hillenbrand SPPA 2060

  2. Auditory perception is one branch of a larger science called psychophysics. Psychophysics studies the relationships that exist between perceptualdimensions (also psychological, subjective, or mental) and the physical properties of stimuli. The distinction between perceptual dimensions and physical dimensions is all important.

  3. Physical dimensions: Any aspect of a physical stimulus that could be measured in a straightforward way with an instrument(e.g., a light meter, a sound level meter, a spectrum analyzer, a fundamental frequency meter, etc.)

  4. Perceptual dimensions: These are the mental experiencesthat occur inside the mind of the observer. These experiences are actively created by the sensory system and brain based on an analysis of the physical properties of the stimulus. Perceptual dimensions can be measured, but not with a meter, Measuring perceptual dimensions requires an observer (e.g., a listener, a “looker”, a smeller, a taster …).

  5. For example, in vision: The percept of hueis created by the eye and brain based (in part) on the visual system’s analysis of the wavelength composition of the stimulus. But: hue ¹ wavelength wavelength: physical dimension (can be measured with a meter) hue: psychological dimension (can be measured, but that requires an observer)

  6. Visual Psychophysics Physical Properties Perceptual Dimensionsof Light Hue Wavelength Brightness Luminance Shape Contour/Contrast Both dimensions can be measured – the physical dimensions can be measured with the right instrument; measuring psychological dimensions requires an observer.

  7. Auditory Psychophysics (aka psychoacoustics or auditory perception) Physical Properties Perceptual Dimensionsof Light Pitch Fund. Freq. (f0) Loudness Intensity Timbre (sound quality) Spectrum env./ Amplitude env.

  8. Perceptual Experiences are Actively Created, Not Passively Received Subjective contour: The triangles, circles and squares are “seen” not so much because they are “there” in the physical sense, but because they are inferred. Unconscious inference lies at the heart of perception. In some sense, “I’ll see it when I believe it.” is more true than “I’ll believe it when I see it.”

  9. Reversible Figures Reversible figuresreveal the active organizationof percepts– the drawing on the left is organized by you brain into a bird, then reorganized into a rabbit, then back to a bird, … Same with the old lady/young lady.

  10. The corridor illusion: Which cylinder is larger? The cylinder to the right appears larger because the visual system infers that it is further away. The inference is unconscious, automatic and obligatory.

  11. The McGurk Effect (McGurk & Macdonald, 1976*) *McGurk, H., and MacDonald, J. (1976). Hearing lips and seeing voices. Nature, 264, 746-748.

  12. Some History on the McGurk Illusion The most striking demonstration of the combined (bimodal) nature of speech understanding appeared by accident. Harry McGurk, a senior developmental psychologist at the University of Surrey in England, and his research assistant John MacDonald were studying how infants perceive speech during different periods of development. For example, they placed a videotape of a mother talking in one location while the sound of her voice played in another. For some reason, they asked their recording technician to create a videotape with the audio syllable "ba" dubbed onto a visual "ga." When they played the tape, McGurk and McDonald perceived "da." Confusion reigned until they realized that "da" resulted from a quirk in human perception, not an error on the technician's part. After testing children and adults with the dubbed tape, the psychologists reported this phenomenon in a 1976 paper humorously titled "Hearing Lips and Seeing Voices," a landmark in the field of human sensory integration. This audio-visual illusion has become known as the McGurk effect or McGurk illusion." Further reading: Dominic W. Massaro & David G. Stork, "Speech Recognition and Sensory Integration", AmericanScientist, 1998, vol. 86, p. 236-244. The McGurk effect has played an important role in audio-visual speech integration and speech reading. McGurk links on the web include the following: http://www.psych.ucr.edu/faculty/rosenblum/AVspeech.html http://www.theshop.net/campbell/mcgurk1.htm http://www.amsci.org/amsci/articles/98articles/massaro.html http://www.sys.uea.ac.uk/~iam/newav/newav.html http://www.media.uio.no/personer/arntm/McGurk_english.html http://macserver.haskins.yale.edu/Haskins/HEADS/BIBLIOGRAPHY/bibliomcgurk.html

  13. The Three Main Perceptual Attributes of Sound • Pitch (not fundamental frequency) • Loudness (not intensity) • Timbre (not spectrum envelope or amplitude envelope) The terms pitch, loudness, and timbre refer not to the physical characteristics of sound, but to the mental experiences that occur in the minds of listeners.

  14. Pitch and Fundamental Frequency Rule 1: All else being equal, the higher the f0, the higher the perceived pitch. Lower f0, lower pitchHigher f0, lower pitch

  15. Rule 2: The ear is more sensitive to f0 differences in the low frequencies than the higher frequencies. This means that: 300 vs. 350 ¹ 3000 vs. 3050 That is, the difference in perceivedpitch (not f0) between 300 and 350 Hz is NOT the same as the difference in pitch between 3000 and 3050 Hz, even though the physical differences in f0 are the same. 300-350 3000-3050

  16. Lower f0, lower pitchHigher f0, lower pitch • Three ways to measure f0 • Frequency domain: MeasureH1(i.e., the lowest frequency harmonic). • Frequency domain: Measure the harmonic spacing. • Time domain: Measure the fundamental period.

  17. The “Problem” of the Missing Fundamental Normal f0: f0 Removed:

  18. Conclusion: The fundamental does not need to be physically present in the signal for a listener to hear a pitch corresponding to where f0ought to be. What Explains This? Even with the 1st harmonic removed, a signal remains periodic at the original f0.

  19. The “Pitch Shift” Effect If the auditory system evaluated pitch by measuring the harmonic spacing, these 2 signals (1200, 1400, 1600 … and 1240, 1440, 1640 …) would have the same pitch. They do not have the same pitch, so we can rule out harmonic spacing. Which theory is left? Measuring the fundamental period.

  20. What does all this mean? Rule 3:The sensation of pitch is probably based on a measurement of the fundamental period. It is definitely not based on a measurement of either (a) the lowest frequency harmonic in a harmonic spectrum (because of the “missing fundamental” effect), or (b) harmonic spacing (because of the “pitch shift” effect).

  21. Loudness and Intensity Rule 1: All else being equal, the higher the intensity, the greater the loudness. Higher intensity, higher loudness Lower intensity, lower loudness

  22. Rule 2: The relationship between intensity and loudness is seriously nonlinear. Doubling intensity does not double loudness. In order to double loudness, intensity must be increased by a factor of 10, or by 10 dB [10 x log10 (10) = 10 x 1 = 10 dB]. This is called the 10 dB rule. Two signals differing by 10 dB: (500 Hz sinusoids) Note that the more intense sound is NOT 10 times louder, even though it is 10 times more intense.

  23. The 10 dB rule means that a 70 dB signal is twice as loud as a 60 dB signal, four times as loud as a 50 dB signal, eight times as loud as a 40 dB signal, etc. A 30 dB hearing loss is considered mild –just outside the range of normal hearing. Based on the 10 dB rule, how much is loudness affected by a 30 dB hearing loss? (Answer: 1/8th. But note that this does not mean that someone with a 30 dB loss will have 8 times more difficulty with speech understanding than someone with normal hearing.)

  24. Rule 3: Loudness is strongly affected by the frequency of the signal. If intensity is held constant, a mid-frequency signal(in the range from ~1000-4000 Hz) will be louder than lower or higher frequency signals. 125 Hz, 3000 Hz, 8000 Hz The 3000 Hz signal should appear louder than the 125 or the 8000 signal, despite the fact that their intensities are (about) equal. (Remember that this is the reason for the dBHL scale.)

  25. Timbre(aka sound qualityor tone color) Timbre, also known as sound quality or tone color, is oddly defined in terms of what it is not: When two sounds are heard that match for pitch, loudness, and duration, and a difference can still be heard between the two sounds, that difference is called timbre.

  26. Example: a clarinet, a saxophone, and a piano all play a middle C at the same loudness and same duration. Each of these instruments has a unique sound quality. This difference is called timbre, tone color, or sound quality. There are also many examples of timbre difference in speech. For example, two vowels (e.g., /A/ and /i/) spoken at the same loudness and same pitch differ from one another in timbre.

  27. There are two physical correlates of timbre: • spectrum envelope • amplitude envelope spectrum envelope: Smooth line drawn to enclose an amplitude spectrum. amplitude envelope: Smooth line drawn to enclose a sound wave (time domain representation). [more later]

  28. Timbre and Spectrum Envelope Timbre differences between one musical instrument and another are partly related to differences in spectrum envelope -- differences in the relative amplitudes of the individual harmonics. In the examples above, we would expect all of these sounds to have the same pitch because the harmonic spacing is the same in all cases. The timbre differences that you would hear are controlled in part by the differences in the shape of the spectrum envelope.

  29. Six Synthesized Sounds Differing in Spectrum Envelope Note the similarities in pitch (due to constant f0/harmonic spacing) and the differencesin timbre or sound quality.

  30. Vowels Also Differ in Spectrum Envelope Shown here are the smoothed envelopes only(i.e., the harmonic fine structure is not shown) of 10 American-English vowels.*Note that each vowel has a unique shape to its spectrum envelope. Perceptually, these sounds differ from one another intimbre. Purely as a matter of convention, the term timbre is seldom used by phoneticians, although it applies just as well here as it does in music. In phonetics, timbre differences among vowels are typically referred to as differences in vowel quality or vowel color. *From Hillenbrand and Houde(2003). “A narrow band pattern-matching model of vowel perception,” Journal of the Acoustical Society of America, 113, 1044-1055.

  31. Aperiodic sounds can also differ in spectrum envelope, and the perceptual differences are properly described as timbre differences.

  32. Amplitude Envelope Timber is also affected by amplitude envelope. Amplitude envelope is a smooth line drawn to enclose a sound wave. It is also sometimes called the amplitude contourof the sound wave. These are both good terms since the amplitude envelope shows how overall signal amplitude varies over time. Amplitude envelope refers mainly to the characteristics of the way sounds are turned on and turned off. The four signals below are sinusoids that differ in their amplitude envelopes. Leading edge = attack Trailing edge = decay The attack especially has a large effect on timbre.

  33. Same melody, same spectrum envelope (if sustained), different amplitude envelopes (i.e., different attack and decay characteristics). Note differences in timbre or sound quality as the amplitude envelope varies.

  34. Timbre differences related to amplitude envelope also play a role in speech. Note the differences in the shape of the attack for /b/ vs. /w/ and /S/ vs. /tS/. abrupt attack more gradual attack more gradual attack abrupt attack

More Related