1 / 374

DataStage Enterprise Edition

DataStage Enterprise Edition. Day 1 Review of EE Concepts Sequential Access Best Practices DBMS as Source Day 2 EE Architecture Transforming Data DBMS as Target Sorting Data. Day 3 Combining Data Configuration Files Extending EE Meta Data in EE Day 4 Job Sequencing

paul2
Download Presentation

DataStage Enterprise Edition

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. DataStage Enterprise Edition

  2. Day 1 Review of EE Concepts Sequential Access Best Practices DBMS as Source Day 2 EE Architecture Transforming Data DBMS as Target Sorting Data Day 3 Combining Data Configuration Files Extending EE Meta Data in EE Day 4 Job Sequencing Testing and Debugging Proposed Course Agenda

  3. The Course Material Course Manual Exercise Files and Exercise Guide Online Help

  4. Using the Course Material • Suggestions for learning • Take notes • Review previous material • Practice • Learn from errors

  5. IntroPart 1 Introduction to DataStage EE

  6. What is DataStage? • Design jobs for Extraction, Transformation, and Loading (ETL) • Ideal tool for data integration projects – such as, data warehouses, data marts, and system migrations • Import, export, create, and managed metadata for use within jobs • Schedule, run, and monitor jobs all within DataStage • Administer your DataStage development and execution environments

  7. DataStage Server and Clients

  8. DataStage Administrator

  9. Client Logon

  10. DataStage Manager

  11. DataStage Designer

  12. DataStage Director

  13. Developing in DataStage • Define global and project properties in Administrator • Import meta data into Manager • Build job in Designer • Compile Designer • Validate, run, and monitor in Director

  14. DataStage Projects

  15. Quiz– True or False • DataStage Designer is used to build and compile your ETL jobs • Manager is used to execute your jobs after you build them • Director is used to execute your jobs after you build them • Administrator is used to set global and project properties

  16. IntroPart 2 Configuring Projects

  17. Module Objectives • After this module you will be able to: • Explain how to create and delete projects • Set project properties in Administrator • Set EE global properties in Administrator

  18. Project Properties • Projects can be created and deleted in Administrator • Project properties and defaults are set in Administrator

  19. Setting Project Properties • To set project properties, log onto Administrator, select your project, and then click “Properties”

  20. Licensing Tab

  21. Projects General Tab

  22. Environment Variables

  23. Permissions Tab

  24. Tracing Tab

  25. Tunables Tab

  26. Parallel Tab

  27. IntroPart 3 Managing Meta Data

  28. Module Objectives • After this module you will be able to: • Describe the DataStage Manager components and functionality • Import and export DataStage objects • Import metadata for a sequential file

  29. What Is Metadata? Data Source Target Transform Meta Data Meta Data Meta Data Repository

  30. DataStage Manager

  31. Manager Contents • Metadata describing sources and targets: Table definitions • DataStage objects: jobs, routines, table definitions, etc.

  32. Import and Export • Any object in Manager can be exported to a file • Can export whole projects • Use for backup • Sometimes used for version control • Can be used to move DataStage objects from one project to another • Use to share DataStage jobs and projects with other developers

  33. Export Procedure • In Manager, click “Export>DataStage Components” • Select DataStage objects for export • Specified type of export: DSX, XML • Specify file path on client machine

  34. Quiz: True or False? • You can export DataStage objects such as jobs, but you can’t export metadata, such as field definitions of a sequential file.

  35. Quiz: True or False? • The directory to which you export is on the DataStage client machine, not on the DataStage server machine.

  36. Exporting DataStage Objects

  37. Exporting DataStage Objects

  38. Import Procedure • In Manager, click “Import>DataStage Components” • Select DataStage objects for import

  39. Importing DataStage Objects

  40. Import Options

  41. Exercise • Import DataStage Component (table definition)

  42. Metadata Import • Import format and column destinations from sequential files • Import relational table column destinations • Imported as “Table Definitions” • Table definitions can be loaded into job stages

  43. Sequential File Import Procedure • In Manager, click Import>Table Definitions>Sequential File Definitions • Select directory containing sequential file and then the file • Select Manager category • Examined format and column definitions and edit is necessary

  44. Manager Table Definition

  45. Importing Sequential Metadata

  46. IntroPart 4 Designing and Documenting Jobs

  47. Module Objectives • After this module you will be able to: • Describe what a DataStage job is • List the steps involved in creating a job • Describe links and stages • Identify the different types of stages • Design a simple extraction and load job • Compile your job • Create parameters to make your job flexible • Document your job

  48. What Is a Job? • Executable DataStage program • Created in DataStage Designer, but can use components from Manager • Built using a graphical user interface • Compiles into Orchestrate shell language (OSH)

  49. Job Development Overview • In Manager, import metadata defining sources and targets • In Designer, add stages defining data extractions and loads • And Transformers and other stages to defined data transformations • Add linkss defining the flow of data from sources to targets • Compiled the job • In Director, validate, run, and monitor your job

  50. Designer Work Area

More Related