A report outline for a CUDA project: present the final flow charts of your algorithms and how they exploit parallelism, describe the kernels, functions, and data flow, explain the testing procedure that shows the implementation is robust and correct, and assess whether the performance goals were met and how much further current GPUs could be pushed.
 
                
Design Overview
Give an overview of your design; include the final flow charts of your algorithms, and discuss how parallelism is exploited in them.
Implementation
Describe in more detail the kernels and functions you created. Trace the data flow through each kernel, describing any data marshaling where applicable.
Verification
What is your testing procedure to make sure your implementation is robust and correct? For cases where accuracy may be compromised in order to use the single-precision hardware and/or to achieve a higher level of parallelism, provide an acceptance test for your output.
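One possible shape for such an acceptance test is sketched below: compare the single-precision output copied back from the GPU against a double-precision reference computed on the host, and accept the run only if every element falls within a mixed absolute and relative tolerance. The function name `passes_acceptance`, its parameters, and the default tolerances are hypothetical placeholders, not part of any CUDA API; choose tolerances that fit your own application.

```cpp
#include <cassert>
#include <cmath>
#include <cstddef>
#include <vector>

// Hypothetical helper (not a CUDA API): returns true when every
// single-precision result lies within abs_tol + rel_tol * |reference|
// of the double-precision reference value.
// The default tolerances are assumed placeholders, not recommended values.
bool passes_acceptance(const std::vector<float>& gpu_out,
                       const std::vector<double>& cpu_ref,
                       double rel_tol = 1e-5,
                       double abs_tol = 1e-6) {
    assert(rel_tol >= 0.0 && abs_tol >= 0.0);

    // Outputs of different length can never match.
    if (gpu_out.size() != cpu_ref.size()) {
        return false;
    }

    for (std::size_t i = 0; i < gpu_out.size(); ++i) {
        const double got = static_cast<double>(gpu_out[i]);

        // Reject NaN or infinity outright.
        if (!std::isfinite(got)) {
            return false;
        }

        const double err = std::fabs(got - cpu_ref[i]);
        if (err > abs_tol + rel_tol * std::fabs(cpu_ref[i])) {
            return false;
        }
    }
    return true;
}
```

In use, the host would copy the kernel output into `gpu_out`, compute `cpu_ref` with a straightforward double-precision loop, and report the result of `passes_acceptance(gpu_out, cpu_ref)`. The absolute term keeps the check meaningful when reference values are near zero, where a purely relative bound would be too strict.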
Performance
Did you achieve your performance goals? What went well with CUDA in terms of the performance gained? What hindered performance? Given enough development time, what is the best performance you think you could ultimately achieve with this application on the current generation of GPUs? What are the features in the toolset or hardware that would allow you to increase performance even more?