1 / 26

Fundamentals of Multimedia Chapter 9 Image Compression Standards

Fundamentals of Multimedia Chapter 9 Image Compression Standards. Ze-Nian Li & Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈. Outline. 9.1 The JPEG Standard 9.2 The JPEG2000 Standard (skip) 9.3 The JPEG-LS Standard (skip) 9.4 Bi-level Image Compression Standards (skip). 9.1 The JPEG Standard.

gates
Download Presentation

Fundamentals of Multimedia Chapter 9 Image Compression Standards

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Fundamentals of MultimediaChapter 9 Image Compression Standards Ze-Nian Li & Mark S. Drew 건국대학교 인터넷미디어공학부 임 창 훈

  2. Outline 9.1 The JPEG Standard 9.2 The JPEG2000 Standard (skip) 9.3 The JPEG-LS Standard (skip) 9.4 Bi-level Image Compression Standards (skip) Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  3. 9.1 The JPEG Standard • JPEG is an image compression standard that was developed by the “Joint Photographic Experts Group”. • JPEG was formally accepted as an international standard in 1992. • JPEG is a lossy image compression method. • It employs a transform coding method using the DCT (Discrete Cosine Transform). Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  4. The JPEG Standard • An image is a function of iand j (or xand y) in the spatial domain. • The 2D DCT is used as one step in JPEG in order to yield a frequency response which is a function F(u, v) in the spatial frequency domain, indexed by uand v. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  5. Observations for JPEG Image Compression • The effectiveness of the DCT transform coding method in JPEG relies on 3 major observations: • Observation 1: Useful image contents change relatively slowly across the image in a small area, for example, within an 8×8 image block. • Much of the information in an image is repeated, hence “spatial redundancy". Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  6. Observations for JPEG Image Compression • Observation 2: Psychophysical experiments suggest that humans are much less likely to notice the loss of very high spatial frequency components than the loss of lower frequency components. • The spatial redundancy can be reduced by largely reducing the high spatial frequency contents. • Observation 3: Visual acuity is much greater for gray (luminance) than for color (chrominance). • Chroma subsampling (4:2:0) is used in JPEG. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  7. Fig. 9.1: Block diagram for JPEG encoder. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  8. 9.1.1 Main Steps in JPEG Image Compression • Transform RGB to YIQ or YUV and subsample color. • DCT on image blocks. • Quantization. • Zig-zag ordering and run-length encoding. • Entropy coding. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  9. DCT on image blocks • Each image is divided into 8×8 blocks. • The 2D DCT is applied to each block image f(i, j), with output being the DCT coefficients F(u, v) for each block. • Using blocks, however, has the effect of isolating each block from its neighboring context. • This is why JPEG images look blocky when a high compression ratiois specified by the user. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  10. Quantization • F(u, v) represents a DCT coefficient, Q(u, v) is a quantization matrix entry, and represents the quantized DCT coefficients which JPEG will use in the entropy coding. • The quantization step is the main source for loss in JPEG compression. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  11. Quantization • The entries of Q(u, v) tend to have larger values towards the lower right corner. This aims to introduce more loss at the higher spatial frequencies - a practice supported by Observations 1 and 2. • Table 9.1 and 9.2 show the default Q(u, v) values obtained from psychophysical studies with the goal of maximizing the compression ratio while minimizing perceptual losses in JPEG images. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  12. Table 9.1 The Luminance Quantization Table Table 9.2 The Chrominance Quantization Table Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  13. Fig. 9.2: JPEG compression for a smooth image block. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  14. Fig. 9.2 (cont’d): JPEG compression for a smooth image block. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  15. Fig. 9.2: JPEG compression for a textured (complex) image block. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  16. Fig. 9.3 (cont’d): JPEG compression for a textured (complex) image block. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  17. Run-length Coding (RLC) on AC coefficients • RLC aims to turn the values into sets {#-zeros-to-skip , next non-zero value}. • To make it most likely to hit a long run of zeros: a zig-zag scan is used to turn the 8×8 matrix into a 64-vector. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  18. Fig. 9.4: Zig-Zag Scan in JPEG. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  19. DPCM on DC coefficients • The DC coefficients are coded separately from the AC ones. Differential Pulse Code Modulation (DPCM)is the coding method. • If the DC coefficients for the first 5 image blocks are 150, 155, 149, 152, 144, then the DPCM would produce 150, 5,-6, 3, -8, assuming di= DCi− DCi-1, and d0=DC0. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  20. Entropy Coding • The DC and AC coefficients finally undergo an entropy coding step to gain a possible further compression. • Use DC as an example: each DPCM coded DC coefficient is represented by (SIZE, AMPLITUDE), where SIZE indicates how many bits are needed for representing the coefficient, and AMPLITUDE contains the actual bits. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  21. In the example we are using, codes 150, 5, −6, 3, −8 will be turned into (8, 10010110), (3, 101), (3, 001), (2, 11), (4, 0111) . • SIZE is Huffman coded since smaller SIZEs occur much more often. • AMPLITUDE is not Huffman coded, its value can change widely so Huffman coding has no appreciable benefit. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  22. Table 9.3 Baseline entropy coding details - size category. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  23. 9.1.2 Four Commonly Used JPEG Modes • Sequential Mode - the default JPEG mode. Each gray-level image or color image component is encoded in a single left-to-right, top-to-bottom scan. • Progressive Mode. • Hierarchical Mode. • Lossless Mode - discussed in Chapter 7, to be replaced by JPEG-LS (Section 9.3). Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  24. Progressive Mode • Progressive JPEG delivers low quality versions of the image quickly, followed by higher quality passes. • Spectral selection: Takes advantage of the spectral (spatial frequency spectrum) characteristics of the DCT coefficients: higher AC components provide detail information. Scan 1: Encode DC and first few AC components, e.g., AC1, AC2. Scan 2: Encode a few more AC components, e.g., AC3, AC4, AC5. ... Scan k: Encode the last few ACs, e.g., AC61, AC62, AC63. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  25. Progressive Mode (Cont’d) 2. Successive approximation: Instead of gradually encoding spectral bands, all DCT coefficients are encoded simultaneously but with their most significant bits (MSBs) first. Scan 1: Encode the first few MSBs, e.g., Bits 7, 6, 5, 4. Scan 2: Encode a few more less significant bits, e.g., Bit 3. ... Scan m: Encode the least significant bit (LSB), Bit 0. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

  26. 9.1.3 A Glance at the JPEG Bitstream Fig. 9.6: JPEG bitstream. Li & Drew; 건국대학교 인터넷미디어공학부 임창훈

More Related