PAC/AAC audio coding standard


A. Moreno, Georgia Institute of Technology, ECE8873, Spring 2004


Overview

  • Audio Recording
  • Coding: the ultimate goal
  • AAC Encoder Block Diagram
  • Principles of Psychoacoustics
  • Perceptual Entropy
  • Quantization and Coding
  • Samples


Introduction

"If a tree falls in the forest with no one around to hear it, does it make a sound?"


Audio Recording

  • Edison, 1877

  • Philips, 1978

[Figure labels: A/D Converter, PCM]


Coding

  • Ultimate Goal: reduce the number of bits needed to represent the data.

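    Bitrate = Fsa x Wordlength, where Fsa is the sampling frequency (samples per second) and Wordlength is the number of bits per sample.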
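
As a worked example of the bitrate formula, the sketch below evaluates it for uncompressed PCM. The 16-bit word length and the explicit channel count are assumptions (the formula on the slide covers a single channel); the 48 kHz stereo source and the 128 kbps AAC target are the figures quoted on the Samples slide.

```python
def pcm_bitrate_bps(sample_rate_hz, word_length_bits, channels=1):
    """Bitrate = Fsa x Wordlength, multiplied by the number of channels."""
    return sample_rate_hz * word_length_bits * channels


# Assumed: 16-bit words. The 48 kHz stereo source matches the Samples slide.
original_bps = pcm_bitrate_bps(48_000, 16, channels=2)
aac_bps = 128_000
compression_ratio = original_bps / aac_bps

print(original_bps)       # 1536000
print(compression_ratio)  # 12.0
```

Under these assumptions the coder has to discard roughly eleven out of every twelve bits, which is why the perceptual model matters.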


AAC Encoder Block Diagram

[Figure: AAC encoder block diagram. Blocks shown: input s(n), Gain Control, MDCT, TNS, Multi-Channel (M/S, Intensity), Prediction (with z^-1 delay), Iterative Rate Control Loop (Scale Factor Extract, Quant, Entropy Coding), Perceptual Model, Side Information Coding, Bitstream, output to channel.]


Principles of Psychoacoustics

  • Source localization.

    Two ears are necessary.

    The brain uses the intensity differences and time delays between the signals perceived at the two ears.


Principles of Psychoacoustics

  • Absolute Hearing Threshold

[Figure: absolute hearing threshold curve separating the audible and inaudible regions]
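
The absolute hearing threshold (threshold in quiet) is often approximated by a closed-form fit attributed to Terhardt, which is the form quoted in the tutorial by Painter and Spanias [1]. The sketch below evaluates that fit; the function name is hypothetical, and the curve is only an approximation for a young listener with good hearing.

```python
import math


def absolute_threshold_db_spl(freq_hz):
    """Approximate threshold in quiet, in dB SPL, for a frequency in Hz."""
    f_khz = freq_hz / 1000.0
    return (3.64 * f_khz ** -0.8
            - 6.5 * math.exp(-0.6 * (f_khz - 3.3) ** 2)
            + 1e-3 * f_khz ** 4)


# Spectral components below this curve are inaudible even without a masker.
for f in (100, 1000, 3300, 10000, 16000):
    print(f, round(absolute_threshold_db_spl(f), 1))
```

The curve dips below 0 dB SPL around 3 to 4 kHz, where the ear is most sensitive, and rises steeply toward both ends of the audible range.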


Principles of Psychoacoustics

  • Human Ear Loudness characteristic

    Robinson and Dadson equi-loudness contours.


Principles of Psychoacoustics

  • Critical Bands

    Concept introduced by Harvey Fletcher in 1940.

    Frequency-to-place transform: the critical bandwidth is a function of frequency that quantifies the cochlear filter passbands.

    Example: the critical band centered at 1 kHz is about 160 Hz wide. A narrowband noise centered at 1 kHz is perceived with the same loudness as long as its bandwidth stays below 160 Hz.
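
The 160 Hz figure can be checked against the analytic approximations commonly attributed to Zwicker for the Bark scale and the critical bandwidth. The sketch below uses those fits; the function names are hypothetical, and the formulas are empirical approximations rather than part of any codec standard.

```python
import math


def hz_to_bark(freq_hz):
    """Map frequency in Hz to critical-band rate in Bark."""
    return (13.0 * math.atan(0.00076 * freq_hz)
            + 3.5 * math.atan((freq_hz / 7500.0) ** 2))


def critical_bandwidth_hz(freq_hz):
    """Approximate critical bandwidth in Hz around a center frequency in Hz."""
    return 25.0 + 75.0 * (1.0 + 1.4 * (freq_hz / 1000.0) ** 2) ** 0.69


print(round(critical_bandwidth_hz(1000.0)))  # 162
print(round(hz_to_bark(1000.0), 1))          # 8.5
```

At low frequencies the bandwidth stays near 100 Hz, and above roughly 500 Hz it grows with the center frequency.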


Principles of Psychoacoustics

  • Simultaneous Masking: Frequency

[Figure: masking threshold in the frequency domain, with audible and inaudible regions marked]


Principles of Psychoacoustics

  • Simplified paradigms: Noise Masking Tone and Tone Masking Noise

[Figure: two panels, each spanning 1 Bark. Labels: THN, THT, K = 3 dB ... 5 dB (constant)]
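
In the tutorial by Painter and Spanias [1], the two paradigms are summarized as TH_N = E_T - 14.5 - B for tone masking noise and TH_T = E_N - K for noise masking tone, where E_T and E_N are the masker levels in dB, B is the critical band number in Bark, and K lies between 3 and 5 dB. The sketch below assumes the THN, THT and K labels on the slide refer to these quantities; the function names are hypothetical.

```python
def tone_masking_noise_threshold_db(tone_level_db, bark_band):
    """TH_N = E_T - 14.5 - B: noise threshold under a tonal masker."""
    return tone_level_db - 14.5 - bark_band


def noise_masking_tone_threshold_db(noise_level_db, k_db=4.0):
    """TH_T = E_N - K: tone threshold under a noise masker, K in 3..5 dB."""
    if not 3.0 <= k_db <= 5.0:
        raise ValueError("K is expected to lie between 3 and 5 dB")
    return noise_level_db - k_db


print(tone_masking_noise_threshold_db(70.0, 10))  # 45.5
print(noise_masking_tone_threshold_db(70.0))      # 66.0
```

The asymmetry is the point: a noise masker hides a tone only a few dB below its own level, while a tonal masker needs a much larger margin to hide noise.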


Principles of Psychoacoustics

  • Spread of Masking

[Figure: masking threshold (th) spreading across neighboring critical bands; horizontal scale marked in 1 Bark units]
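
One common analytic model for the spread of masking is the spreading function attributed to Schroeder, which is also the form given by Painter and Spanias [1]. The sketch below is an illustration under that assumption; the slide does not say which spreading function the author used, and the function name is hypothetical.

```python
import math


def spreading_function_db(delta_bark):
    """Attenuation (dB) of a masker's effect at a distance of delta_bark Bark.

    Positive delta_bark means the maskee lies above the masker in frequency.
    """
    x = delta_bark + 0.474
    return 15.81 + 7.5 * x - 17.5 * math.sqrt(1.0 + x * x)


for dz in (-2, -1, 0, 1, 2, 3):
    print(dz, round(spreading_function_db(dz), 1))
```

The function is close to 0 dB at the masker position, falls by about 25 dB per Bark toward lower frequencies, and by about 10 dB per Bark toward higher frequencies, so masking extends further upward than downward.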


Principles of Psychoacoustics

  • Masking: Temporal


Perceptual Entropy

  • Perceptual Entropy (PE): an objective metric of perceptually relevant information, introduced by J. Johnston.

    The information perceived from an audio signal is only a fraction of the total information emitted by the source.


Perceptual Entropy

  • Procedure:

    • Window the signal and transform it to the frequency domain.

    • Compute the masking threshold using perceptual rules.

    • Determine the number of bits required to quantize the spectrum without injecting perceptible noise.


Perceptual Entropy

[Figure: perceptual entropy estimation block diagram. Blocks shown: input s(n), Hann Window, MDCT, Spectral Flatness Measure, Determine nature (Noise-like / Tone-like), Coefficient of ‘Tonality’, Offset, Apply Thresholding rules, JND Estimates.]
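
The tonality branch of this diagram can be sketched as follows. The constants follow Johnston's model as summarized by Painter and Spanias [1]: a spectral flatness of -60 dB is treated as fully tonal, and the threshold offset blends the tone-masking-noise margin (14.5 + i dB in band i) with the noise-masking-tone margin (5.5 dB). The function names are hypothetical, and the power spectrum is assumed to contain only positive values.

```python
import math

SFM_DB_MAX = -60.0  # flatness treated as "entirely tone-like" in this model


def spectral_flatness_db(power_spectrum):
    """Ratio of geometric to arithmetic mean of the power spectrum, in dB."""
    n = len(power_spectrum)
    geometric_mean = math.exp(sum(math.log(p) for p in power_spectrum) / n)
    arithmetic_mean = sum(power_spectrum) / n
    return 10.0 * math.log10(geometric_mean / arithmetic_mean)


def tonality_coefficient(sfm_db):
    """Coefficient of tonality: 0 for noise-like, 1 for tone-like signals."""
    return max(0.0, min(sfm_db / SFM_DB_MAX, 1.0))


def threshold_offset_db(alpha, bark_band):
    """Offset subtracted from the spread masker energy in a given band."""
    return alpha * (14.5 + bark_band) + (1.0 - alpha) * 5.5


flat = [1.0, 1.0, 1.0, 1.0]      # noise-like: flat spectrum
peaky = [1e-9, 1e-9, 1.0, 1e-9]  # tone-like: one dominant line
for spectrum in (flat, peaky):
    alpha = tonality_coefficient(spectral_flatness_db(spectrum))
    print(alpha, threshold_offset_db(alpha, bark_band=5))
```

A flat spectrum yields the small noise-like offset, while a spectrum dominated by one line yields the larger tone-like offset, which lowers the masking threshold accordingly.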


Perceptual Entropy

  • i: index of the critical band;

  • bl_i, bh_i: lower and upper bounds of band i;

  • k_i: number of transform components in band i;

  • T_i: masking threshold in band i;

  • nint: rounding to the nearest integer.
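
The equation these definitions belong to was an image on the slide and is not in the transcript. The definitions match Johnston's perceptual entropy expression as it is usually quoted, for example by Painter and Spanias [1], so the form below is offered as a likely reconstruction rather than a copy of the slide. Re(ω) and Im(ω), the real and imaginary parts of the transform component at index ω, and the 25-band upper limit are assumptions taken from that formulation and are not defined on the slide.

```latex
\mathrm{PE} = \sum_{i=1}^{25} \; \sum_{\omega = bl_i}^{bh_i}
\left[
  \log_2\!\left( 2 \left| \operatorname{nint}\!\left(
      \frac{\operatorname{Re}(\omega)}{\sqrt{6 T_i / k_i}} \right) \right| + 1 \right)
  +
  \log_2\!\left( 2 \left| \operatorname{nint}\!\left(
      \frac{\operatorname{Im}(\omega)}{\sqrt{6 T_i / k_i}} \right) \right| + 1 \right)
\right]
\quad \text{(bits per sample)}
```

Each term counts the bits needed to represent a component once it is quantized with a step size derived from the masking threshold of its band, so components that fall below the threshold contribute nothing.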


Returning

  • "If a tree falls in the forest with no one around to hear it, does it make a sound?"

    From a Perceptual Coding standpoint, if no one can hear it, THERE IS NO TREE.


AAC Encoder Block Diagram

[Figure: AAC encoder block diagram, repeated from the earlier slide]


Quantization and Coding

  • Power-law quantizer

  • Huffman coding (the table can be chosen)

  • Global gain: sets the quantization step size

  • Scale factors: per-band noise-shaping factors
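
A minimal sketch of a power-law quantizer in the style used by AAC is shown below: magnitudes are compressed with a 3/4 exponent before rounding and expanded with a 4/3 exponent on decoding, and the step size changes by a factor of 2^(1/4) per scale factor unit. The function names and the 0.4054 rounding offset are assumptions for illustration; the slide gives no formula.

```python
import math

ROUNDING_OFFSET = 0.4054  # assumed rounding constant, not taken from the slide


def quantize(x, scale_factor):
    """Quantize one spectral coefficient with a 3/4 power law."""
    step = 2.0 ** (scale_factor / 4.0)
    magnitude = math.floor((abs(x) / step) ** 0.75 + ROUNDING_OFFSET)
    return int(math.copysign(magnitude, x))


def dequantize(q, scale_factor):
    """Invert the power law: |q|^(4/3) scaled back by the step size."""
    step = 2.0 ** (scale_factor / 4.0)
    return math.copysign(abs(q) ** (4.0 / 3.0) * step, q)


for sf in (0, 8):
    q = quantize(1000.0, sf)
    print(sf, q, round(dequantize(q, sf), 1))
```

Because of the power law, large coefficients are quantized more coarsely in absolute terms than small ones, which gives a built-in form of noise shaping before the scale factors are applied.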


Quantization and Coding

    % Outer loop: noise control. Inner loop: rate control.
    while NOISE_CTL
        FINDING_RATE = 1;    % re-check the rate after every scale factor change
        while FINDING_RATE
            Nr_bits = get_bits_needed();
            if (Nr_bits > max_bits)
                adjust_global_gain();
            else
                FINDING_RATE = 0;
            end
        end
        q_noise = get_quant_noise_level();
        if (q_noise > Th(band))
            adjust_band_scale_factor();
        else
            NOISE_CTL = 0;
        end
    end
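
The two nested loops can be exercised with a toy model. The sketch below is not the AAC reference encoder: the bit count is a stand-in estimate of log2(2|q| + 1) bits per coefficient instead of Huffman coding, the noise measure is the mean squared error per band, and every name, the rounding offset, and the iteration cap are assumptions made for illustration.

```python
import math


def quantize(x, step):
    """3/4 power-law quantizer (rounding offset 0.4054 is an assumption)."""
    return int(math.copysign(math.floor((abs(x) / step) ** 0.75 + 0.4054), x))


def dequantize(q, step):
    return math.copysign(abs(q) ** (4.0 / 3.0) * step, q)


def band_step(global_gain, scale_factor):
    # Larger global gain coarsens every band; a larger scale factor refines one band.
    return 2.0 ** ((global_gain - scale_factor) / 4.0)


def bits_needed(bands, global_gain, scale_factors):
    # Toy stand-in for Huffman coding: about log2(2|q| + 1) bits per coefficient.
    total = 0.0
    for band, sf in zip(bands, scale_factors):
        step = band_step(global_gain, sf)
        total += sum(math.log2(2 * abs(quantize(x, step)) + 1) for x in band)
    return total


def band_noise(band, step):
    # Mean squared quantization error within one band.
    return sum((x - dequantize(quantize(x, step), step)) ** 2 for x in band) / len(band)


def rate_distortion_loop(bands, thresholds, max_bits, max_outer=32):
    global_gain = 0
    scale_factors = [0] * len(bands)
    for _ in range(max_outer):
        # Inner loop: coarsen the global step until the frame fits the budget.
        while bits_needed(bands, global_gain, scale_factors) > max_bits:
            global_gain += 1
        # Outer loop: refine bands whose noise exceeds the masking threshold.
        noisy = [b for b, band in enumerate(bands)
                 if band_noise(band, band_step(global_gain, scale_factors[b])) > thresholds[b]]
        if not noisy:
            break
        for b in noisy:
            scale_factors[b] += 1
    # Final rate pass so the returned settings always respect the bit budget.
    while bits_needed(bands, global_gain, scale_factors) > max_bits:
        global_gain += 1
    return global_gain, scale_factors


bands = [[100.0, -80.0, 60.0], [5.0, -4.0, 3.0]]
gain, sfs = rate_distortion_loop(bands, thresholds=[50.0, 0.5], max_bits=40)
print(gain, sfs, round(bits_needed(bands, gain, sfs), 1))
```

The iteration cap reflects a practical point: when the bit budget is too small, the two loops pull against each other and the noise target may be unreachable, so a real encoder also needs a stopping rule.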


Samples

  • Original: 48 kHz stereo

  • Coded: 128 kbps AAC stereo (48 kHz)

  • Test items: castanets, piano, timpani


References

[1] Ted Painter and Andreas Spanias. Perceptual coding of digital audio. Proceedings of the IEEE, 88(4):449-513, April 2000.

[2] Karlheinz Brandenburg. MP3 and AAC explained. AES 17th International Conference on High Quality Audio Coding, 1999.

[3] J.D. Johnston and A.J. Ferreira. Sum-difference stereo transform coding. Proc. ICASSP, 1992.

[4] Deepen Sinha and James D. Johnston. Audio compression at low bit rates using a signal adaptive switched filterbank. Proc. ICASSP, 1996, pp. 1053-1056.

