1 / 5

Home Exam 1 : Codec c63 on Intel x86 using Streaming SIMD Extensions (SSE)

INF5063: Programming heterogeneous multi-core processors. … because the OS-course is just to easy!. Home Exam 1 : Codec c63 on Intel x86 using Streaming SIMD Extensions (SSE). Your task.

topper
Download Presentation

Home Exam 1 : Codec c63 on Intel x86 using Streaming SIMD Extensions (SSE)

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. INF5063: Programming heterogeneous multi-core processors … because the OS-course is just to easy! Home Exam 1:Codec c63 on Intel x86using Streaming SIMD Extensions (SSE)

  2. Your task • Utilize the CPU vector unit to get the most performance out of a single core for a simple video encoder • Start by profiling the encoder to see which parts of the encoder that are the bottlenecks • Optimize the bottleneck • Remember, after optimizing one part of the code, more profiling might be needed to find new bottlenecks • Write a report with details on which parts of the encoding process that benefited from your optimizations. The report should also explain how your code works. • You can use your own machines for this assignment, but you have to make sure that the code works on bush or clinton at the Simula network (login will be provided later)

  3. Precode The precode is a basic single threaded c63 encoder written in C You should NOT change the algorithms in the code You are NOT allowed to use code from other projects – the precode and your own code only Your implementation is supposed to be single threaded, and optimized to use the data parallelism available in a single x86 core. Otherwise, you are free to use and modify it as you see fit

  4. Formal Information • Work in groups of 2 • Deadline:24thSeptember – 23:59:59.99 • The assignment will be graded andwill count 33% of the final grade. • Deliver your code and report to: https://devilry.ifi.uio.no/ • Prepare a short (5 - 10 minutes) presentation for the class on the 25thSeptember.

  5. Good Luck!

More Related