IBM InfoSphere Advanced DataStage V8 (KM400)

Request a Quote for this class

About this Course

This course works with Information Server V8.5.

This course is designed to introduce advanced job development techniques in DataStage V8.5.This course replaces course number (DX455).

Audience Profile

This advanced course is for Experienced DataStage developers seeking training in more advanced DataStage techniques and who seek an understanding of the parallel framework architecture.

At Course Completion

Upon completing this course, students will be able to:

  • Describe the parallel processing architecture and development and runtime environments
  • Describe the compile process and the runtime job execution process
  • Describe how partitioning and collection works in the parallel framework
  • Describe sorting and buffering in the parallel framework and optimization techniques
  • Describe and work with parallel framework data types
  • Create reusable job components
  • Use loop processing in a Transformer stage
  • Process groups in a Transformer stage
  • Extend the functionality of DataStage by building custom stages and creating new Transformer functions
  • Use Connector stages to read and write from relational tables and handle errors in Connector stages
  • Process XML data in DataStage jobs using the XML stage
  • Design a job that processes a star schema database with Type 1 and Type 2 slowly changing dimensions
  • List job and stage best practices

Prerequisites

You should complete:

  • DataStage Essentials course or equivalent
  • and have at least one year of experience developing parallel jobs using DataStage

 

Course Outline

  1. Unit 1 - Introduction to the Parallel Framework Architecture
  2. Unit 2 - Compilation and Execution
  3. Unit 3 - Partitioning and Collecting Data
  4. Unit 4 - Sorting Data
  5. Unit 5 - Buffering in Parallel Jobs
  6. Unit 6 - Parallel Framework Data Types
  7. Unit 7 - Reusable components
  8. Unit 8 - Advanced Transformer Logic
  9. Unit 9 - Extending the Functionality of Parallel Jobs
  10. Unit 10 - Accessing Databases (start if there is time)
  11. Unit 11 - Processing XML Data
  12. Unit 12 - Slowly Changing Dimensions Stages
  13. Unit 13 - Best Practices