1 / 30

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors. Nagesh Lakshminarayana Sushma Rao Hyesoon Kim Computer Science Georgia Institute of Technology. Outline. Background and Problem Application characteristics on AMP/SMP LJFPF Policy CJFPF Policy Conclusion. PE B. PE B.

yul
Download Presentation

Asymmetry Aware Scheduling Algorithms for Asymmetric Processors

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Asymmetry Aware Scheduling Algorithms for Asymmetric Processors Nagesh Lakshminarayana Sushma Rao Hyesoon Kim Computer Science Georgia Institute of Technology

  2. Outline • Background and Problem • Application characteristics on AMP/SMP • LJFPF Policy • CJFPF Policy • Conclusion

  3. PEB PEB PEA Interconnect PEB PEB Heterogeneous Architectures • A particularly interesting class of parallel machines is Heterogeneous Architecture: • Multiple types of Processing Elements (PEs) available on the same machine

  4. Special accelerator Multicore CPU + GPU IBM Cell processor Heterogeneous Architectures • Heterogeneous architectures are becoming very common: Focus of this talk Fast core Fast core Slow core Slow core Slow core Slow core Asymmetric Processors

  5. Scheduling Problem: Multiple applications Non-scalable applications Fast core Fast core Slow core Slow core Slow Core Slow core Slow core Scalable applications Fast Core

  6. Scheduling Problem: Multi-threaded application Fast core Fast core Slow core Slow core Slow core Slow core

  7. Problem How to schedule multi-threaded applications on Asymmetric Multiprocessors (AMP)?

  8. Outline • Background and Problem • Application characteristics on AMP/SMP • LJFPF Policy • CJFPF Policy • Conclusion

  9. Experimental Methodology • Use a 1.87GHz two-socket Quad-core machine to measure the performance • Use SpeedStep technology to emulate an AMP

  10. Performance Results on AMP/SMP

  11. Slow-Limited Applications Fast core Fast core Slow core Slow core Slow core Slow core barrier

  12. Middle-perf Benchmarks Similar to a slow-limited benchmark but sequential section is much longer barrier

  13. Unstable Benchmarks barrier barrier Asymmetric workloads Lots of barriers

  14. PARSEC Benchmarks

  15. Outline • Background and Problem • Applications on AMP/SMP • LJFPF Policy • CJFPF Policy • Conclusion

  16. LJFPF Policy • Longest Job to a Fast Processor First Slow core Fast core Slow core Fast core barrier

  17. How Does the Scheduler Know • Length of work? • Current mechanism: application sends the information • On-going work: Prediction mechanism

  18. Evaluation • Matrix Multiplication Sequential version Parallel version Symmetric workload Parallel version Asymmetric workload

  19. Asymmetric Workload (Matrix Multiplication)

  20. Real Application • ITK (Medical image processing tool kit) • Open source but a real application

  21. Evaluation: MultiRegistration • Kernel loop has 50 iterations 50 % 8 ≠0 • Divide 50 iterations into 7, 7, 7, 7, 6, 6, 5, 5

  22. Results: ITK Benchmark 2.3%

  23. Outline • Background and Problem • Application characteristics on AMP/SMP • LJFPF Policy • CJFPF Policy • Conclusion

  24. Critical Section Lock Lock

  25. Critical Section Limited Workloads Case (a) Case (b) Critical section Useful work waiting

  26. Critical Section Effects Half-half performs similar to all-fast

  27. CJFPF Policy • Critical Job to a Fast Processor First Policy Fast core Slow core Slow core Slow core

  28. CJFPF Results Longer critical section The benefit of the CJFPF policy decreases

  29. Conclusion • We evaluated the characteristics of multi-threaded applications on AMPs. • Barriers and critical sections are important factors. • Propose two new scheduling policies: Longest job to fast core first (LJFPF), critical job to fast core first (CJFPF) • Scheduling polices improve performance for asymmetric workloads. • Future work • Develop a prediction mechanism • Evaluate symmetric workloads on AMPs • Other kinds of heterogeneous architectures

  30. Thank you!

More Related