1 / 39

GPU Shading and Rendering: OpenGL Shading Language

GPU Shading and Rendering: OpenGL Shading Language. Marc Olano UMBC. OpenGL Shading. High level language OpenGL Shading Language = GLslang = GLSL Integrated into OpenGL API (no extra run-time). Organization. API Vertex Shading Fragment Shading Lots of demos.

eden
Download Presentation

GPU Shading and Rendering: OpenGL Shading Language

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. GPU Shading and Rendering:OpenGL Shading Language Marc Olano UMBC

  2. OpenGL Shading • High level language • OpenGL Shading Language = GLslang = GLSL • Integrated into OpenGL API (no extra run-time)

  3. Organization • API • Vertex Shading • Fragment Shading • Lots of demos. • 2-year old Apple PowerBook G4/1.5GHz • ATI Mobility Radeon 9700

  4. API-integrated • Compiler built into driver • Presumably they know your card best • IHV’s must produce (good) compilers • Use built-in parameters (glColor, glNormal, …) • Add your own • Other options can still produce low-level code • Cg, ASHLI, RapidMind, … • With loss of integration

  5. Using High-level Code • Create shader objectS = glCreateShader(GL_VERTEX_SHADER)S = glCreateShaderObjectARB(GL_VERTEX_SHADER_ARB) • Vertex or Fragment • Load shader into objectglShaderSource(S, n, shaderArray, lenArray)glShaderSourceARB(S, n, shaderArray, lenArray) • Array of strings • Compile objectglCompileShader(S)glCompileShaderARB(S)

  6. Loading Shaders • glShaderSource(S, n, shaderArray, lenArray) • One string containing entire mmap’d file • Strings as #includes • Varying variables between vertex and fragment • Strings as lines • Null-terminated if lenArray is Null or length=-1

  7. Using High-level Code (2) • Create program objectP = glCreateProgram()P = glCreateProgramObjectARB() • Attach all shader objectsglAttachShader(P, S)glAttachObjectARB(P, S) • Vertex, Fragment or both • Link togetherglLinkProgram(P)glLinkProgramARB(P) • UseglUseProgramObject(P)glUseProgramObjectARB(P)

  8. Using High-level Code (3) • Where is my attributes/uniforms parameter?i=glGetAttribLocation(P,”myAttrib”)i=glGetUniformLocation(P,”myAttrib”) • Set themglVertexAttrib1f(i,value)glVertexAttribPointer(i,…)glUniform1f(i,value)

  9. Using Low-level Code • Load shaderglProgramStringARB(GL_VERTEX_PROGRAM_ARB, GL_PROGRAM_FORMAT_ASCII_ARB, length, shader) • Vertex or fragment • Single string (vs. array) • EnableglEnable(GL_VERTEX_PROGRAM_ARB)

  10. Shader debugger Immediate updates Choose model/texture Tweak parameters Examine/dump frames Several available Not hard to build OpenGL debugger Trace of calls made Examine resources Breakpoints/actions Graph performance A couple of choices Useful Tools

  11. gDEBugger – A Professional OpenGL Debugger and Profiler • Provides graphic pipeline information needed to find bugs and to optimize application performance: • Shortens debugging and profiling time • Improves application quality • Optimizes application performance

  12. Free gDEBugger License for Academic Users! • OpenGL ARB and Graphic Remedy Academic Program: • Annual program for all OpenGL Academic users • License of the full feature version for one year • Includes all software updates • A limited number of free licenses available fornon-commercial developers who are not in academia • More details: http://academic.gremedy.com

  13. Non-windows OS • Linux • gDEBugger in progress • Apple OpenGL Profiler and Driver Monitor • Free part of OS / Developer tools

  14. Vertex Demo:Blend Positions

  15. High-level Code void main() { float Kin = gl_Color.r; // key input // screen position from vertex and texture vec4 Vp = ftransform(); vec4 Tp = vec4(gl_MultiTexCoord0.xy*1.8-.9, 0.,1.); // interpolate between Vp and Tp gl_Position = mix(Tp,Vp,pow(1.-Kin,8.)); // copy to output gl_TexCoord[0] = gl_MultiTexCoord0; gl_TexCoord[1] = Vp; gl_TexCoord[3] = vec4(Kin); }

  16. Main Function void main() { float Kin = gl_Color.r; // key input // screen position from vertex and texture vec4 Vp = ftransform(); vec4 Tp = vec4(gl_MultiTexCoord0.xy*1.8-.9, 0.,1.); // interpolate between Vp and Tp gl_Position = mix(Tp,Vp,pow(1.-Kin,8.)); // copy to output gl_TexCoord[0] = gl_MultiTexCoord0; gl_TexCoord[1] = Vp; gl_TexCoord[3] = vec4(Kin); }

  17. Use Standard OpenGL State void main() { float Kin = gl_Color.r; // key input // screen position from vertex and texture vec4 Vp = ftransform(); vec4 Tp = vec4(gl_MultiTexCoord0.xy*1.8-.9, 0.,1.); // interpolate between Vp and Tp gl_Position = mix(Tp,Vp,pow(1.-Kin,8.)); // copy to output gl_TexCoord[0] = gl_MultiTexCoord0; gl_TexCoord[1] = Vp; gl_TexCoord[3] = vec4(Kin); }

  18. Built-in Types void main() { float Kin = gl_Color.r; // key input // screen position from vertex and texture vec4 Vp = ftransform(); vec4 Tp = vec4(gl_MultiTexCoord0.xy*1.8-.9, 0.,1.); // interpolate between Vp and Tp gl_Position = mix(Tp,Vp,pow(1.-Kin,8.)); // copy to output gl_TexCoord[0] = gl_MultiTexCoord0; gl_TexCoord[1]= Vp; gl_TexCoord[3]= vec4(Kin); }

  19. Swizzle / Channel Selection void main() { float Kin = gl_Color.r; // key input // screen position from vertex and texture vec4 Vp = ftransform(); vec4 Tp = vec4(gl_MultiTexCoord0.xy*1.8-.9, 0.,1.); // interpolate between Vp and Tp gl_Position = mix(Tp,Vp,pow(1.-Kin,8.)); // copy to output gl_TexCoord[0] = gl_MultiTexCoord0; gl_TexCoord[1] = Vp; gl_TexCoord[3] = vec4(Kin); }

  20. Vector Construction void main() { float Kin = gl_Color.r; // key input // screen position from vertex and texture vec4 Vp = ftransform(); vec4 Tp = vec4(gl_MultiTexCoord0.xy*1.8-.9, 0.,1.); // interpolate between Vp and Tp gl_Position = mix(Tp,Vp,pow(1.-Kin,8.)); // copy to output gl_TexCoord[0] = gl_MultiTexCoord0; gl_TexCoord[1] = Vp; gl_TexCoord[3] = vec4(Kin); }

  21. Built-in Functions void main() { float Kin = gl_Color.r; // key input // screen position from vertex and texture vec4 Vp = ftransform(); vec4 Tp = vec4(gl_MultiTexCoord0.xy*1.8-.9, 0.,1.); // interpolate between Vp and Tp gl_Position = mix(Tp,Vp,pow(1.-Kin,8.)); // copy to output gl_TexCoord[0] = gl_MultiTexCoord0; gl_TexCoord[1] = Vp; gl_TexCoord[3] = vec4(Kin); }

  22. Vertex + Fragment Demo:Fresnel Environment Map

  23. Trick #1: Where is the Eye ObjectSpace Projection Matrix ModelViewMatrix EyeSpace ClipSpace • Where is the Eye in Eye Space? • (0,0,0)? Not necessarily! • Know where it is in Clip Space • (0,0,-1,0), looking in the (0,0,1,0) direction • Assuming GL_LESS depth test • Invert projection to find the eye! • Works for any eye position, or even parallel projection.

  24. Trick #2: Subtract Homogeneous Points • Homogeneous point: vec4(V.xyz, V.w) • 3D equivalent: V.xyz/V.w • Defers division, makes perspective, translation, and many things happy • Vector subtraction: V–E • V.xyz/V.w – E.xyz/E.w • (V.xyz*E.w – E.xyz*V.w)/(V.w*E.w)

  25. Trick #3: Skip Division for Normalize • normalize(V.xyz/V.w) = normalize(V.xyz) • If V.w isn’t negative • Put it all together: • normalize(V-E) • = normalize(V.xyz*E.w - E.xyz*V.w)

  26. OpenGL State Demo:Vertex Lighting

  27. Lighting Vectors in Eye Space void main() { // convert shading-related vectors to eye space vec4 P = gl_ModelViewMatrix*gl_Vertex; vec4 E = gl_ProjectionMatrixInverse*vec4(0,0,-1,0); vec3 V = normalize(E.xyz*P.w-P.xyz*E.w); vec3 N = normalize(gl_NormalMatrix*gl_Normal); …

  28. Accumulate Each Light … // accumulate contribution from each light gl_FrontColor = vec4(0); for(int i=0; i<gl_MaxLights; i++) { vec3 L = normalize(gl_LightSource[i].position.xyz*P.w - P.xyz*gl_LightSource[i].position.w); vec3 H = normalize(L+V); float diff = dot(N,L); gl_FrontColor += gl_LightSource[i].ambient; if (diff > 0.) { gl_FrontColor +=gl_LightSource[i].diffuse * diff; gl_FrontColor +=gl_LightSource[i].specular * max(pow(dot(N,H), gl_FrontMaterialShininess),0.); } } …

  29. Standard Vertex Shader Stuff … // standard texture coordinate and position stuff gl_TexCoord[0] = gl_TextureMatrix[0]*gl_MultiTexCoord0; gl_Position = ftransform(); }

  30. Noise • Controlled, repeatable randomness • Still spotty implementation • Can use texture or compute

  31. Noise Characteristics • Repeatable • Locally continuous but distant points uncorrolated • values [-1,1], average 0 • 1/2 – 1 cycle per unit • Versions for n-D input

  32. Noise Subtleties • Many noise functions based on a lattice • Like a spline between integer coordinates • Hash of integer coordinates  control points • Interpolating values easy but poor • Even with higher-order interpolation • Perlin’s noise • Passes through 0 at each integer • Hash gives gradient

  33. Modified Noise [Olano 2005] • Three relatively independent modifications • New computable hash • Change gradient computation • Reorder computation • Variety of computation/texture options • Can just store in a texture • Can compute with some texture accesses • Can compute with no texture accesses

  34. Computable Hash • Normal hash chains access to permutation texture • Want totally computable hash • mod(k*x2, m) • Still chain for higher-D • hash(floor(P.x) + hash(floor(P.y))) • Not quite as good, but cheap & computable • Noise usually not used alone

  35. Gradient • 3D Gradient = (±fract(P.x), ±fract(P.y)) • Each sign from one bit of hash • Made slightly more difficult without bitwise ops • Allows noise(x) = noise(x,0) • If 2D noise is stored in a texture • Can share the same texture for 1D noise as well • Not normally true!

  36. Reordered Computation • Refactor to be able to build n-D noise from two shifted calls to n-1 D noise • If 2D noise is stored in a texture • Can build 3D noise from 2 texture accesses • Can build 4D noise from 4 texture accesses

  37. Shader Design Strategies • Learn and adapt from RenderMan • Noise • Layers • Multiple Passes • Baked computation

More Related