
CUDA Based LDI Generation Rui Hu, Xitong Liu
Outline • Algorithm Implementation • Correctness Validation • Performance Evaluation • CUDA Programming Tricks • Conclusion
Depth Sorting Thread Each thread processes one fragment __shared__ float sortBufS[MaxFragPerThread][LDImaximumLayer];
Correctness Validation • Compare our results of Simple Cube with the result generated by SOFA Framework • Both results match
CUDA Programming Tricks • Register variable reuse – Save register space • Check CUDA runtime errors • cutilCheckMsg • cudaGetLastError • cutilCheckError • cutilSafeCall • Select the appropriate compiling options