Evaluating the Error Resilience of Parallel Programs. Bo Fang , Karthik Pattabiraman , Matei Ripeanu , The University of British Columbia Sudhanva Gurumurthi AMD Research. Reliability trends.
Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author.While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server.
Investigate the error-resilience characteristics
Find correlation between error resilience characteristics and application properties
Application-specific, software-based, fault tolerance mechanisms
image = image_setup(image_orig);
#pragma omp parallel for shared(var_list1)
Add support for structure and thread selection(C1)
Fault injection instruction/register selector
Instrument IR code of the program with function calls
Fault injection executable
Specify code region and thread to inject (C1 & C2)
Collect statistics on per-thread basis (C1)
Custom fault injector