1 / 20

A 16-Bit Invertible Architecture for H.26L

A 16-Bit Invertible Architecture for H.26L. Pankaj Topiwala FastVDO Inc. Columbia, MD 21044 VCEG-N24. Proposal Features. Our solution ( uniquely ) offers low-complexity, invertible 16-bit architecture for 12-bit input Approach first proposed at Austin Mtg, April, 01

Download Presentation

A 16-Bit Invertible Architecture for H.26L

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. A 16-Bit Invertible Architecture for H.26L Pankaj Topiwala FastVDO Inc. Columbia, MD 21044 VCEG-N24

  2. Proposal Features • Our solution (uniquely) offers low-complexity, invertible 16-bit architecture for 12-bit input • Approach first proposed at Austin Mtg, April, 01 • Full disclosure (M16) • Word doc, PPT, Excel, SW insertion (tml5.2), Example reconstructions • Now offered conditionally royalty-free

  3. 16-Bit Proposals • N20 (Sharp) and N22 (TI) – keep current transform, refactorize quantization method. Get 9-bits in, 16-bits out • Encoder not 16-bit for N20 • New 4-pt transform methods: • N24 (FastVDO): [1 1 1 1; 5-2 2-5; 1-1-1 1; 1-2 2 -1]* • N44 (MS): [1 1 1 1; 2 1-1-2; 1-1-1 1; 1-2 2-1] • N43 (Nokia): [a a a a; b c -c -b;a -a -a a;c -b b -c] *FastVDO actually offers a series of 4-pt transforms, of which this is the lowest complexity solution.

  4. Relative Merits • Comparisons • All 5 virtually match current 32-bit performance • All offer reduced, nearly identical complexity • Only N24 offers16-bit architecture for 12-bit input • Room for improvement by scaling tricks • Only N24 offers exact invertibility • Fast, low-bit 8-pt, 16-pt transforms ready (e.g. ABT)

  5. Motivation • Similar coding performance as the DCT • Supporting a 16-bit (or less) architecture • Low complexity, multiplierless implementation (adds, right shifts) • Invertible integer-to-integer mapping • In-place computation

  6. DCT • Coding gain: 7.57 dB; Complexity: 8 adds, 6 mults in floating point • Integer approximation:

  7. Lifting Structure

  8. BinDCT

  9. Inverse BinDCT

  10. 1 2 3 4 Previous Integer DCT p 7/16 3/8 1/2 1/2 - - u 3/8 3/8 3/8 1/2 - - # adds 10 10 9 8 8 (16) 8 # shifts 5 5 4 3 0 (8) 0 # mults 0 0 0 0 6 (0) 6 CG in dB 7.57 7.56 7.55 7.55 7.57 7.57 Performance-Complexity

  11. Dynamic Range

  12. Transform Matrices

  13. x[0] Y[0] >>1 x[1] Y[2] x[2] Y[3] >>1 >>3 >>1 >>4 x[3] Y[1] Detailed Implementation • Using only binary right-shifts • Biorthogonal (nearly orthog.) • Linear (up to bit constraint)

  14. Coding Performance in H.26L • PSNRs in Excel file (N24.xls) • No performance loss comparing to the previous integer transform

  15. Scaling Matrix • Approximate uniform quantization for all transform coefficients

  16. Approximation Accuracy • MSE between the basis functions • MSE between the produced coefficients

  17. All-Lifting BinDCT

  18. 8 x 8 BinDCT Coding gain: 8.77 to 8.82 dB for AR(1) process with p=0.95

  19. 16 x 16 BinDCT Coding gain: 9.4499 dB for AR(1) process with p=0.95

  20. Conclusion • Fast multiplier-free transform • Supporting a 16-bit architecture for 12-bit input • Explicit 16-bit quantization tables provided (V24qtable.zip) • Similar coding performances to current transform • Invertible integer-to-integer mapping

More Related