1-678-619-2366

 

Datastage Online Training

 

Introduction to Data warehousing

  • What is Data warehousing
  • Architecture of Data warehousing
  • Definitions
  • ETL Process
  • Types of Tables in D/W
  • Types of FACTS tables
  • Types of DIMENSION tables
  • Types of Schemas in D/W
  • What is Data Mart
  • Warehouse Approaches
  • Introduction to Data Modelling
  • Entity Relationship model (E-R model)
  • Data Modeling for Data Warehouse
  • Dimensions and fact tables
  • Star Schema and Snowflake Schemas
  • Coverage Tables
  • Fact less Tables
  • What to look for in modelling tools
  • Modelling tools

ETL Design process

  • Introduction to Extraction, Transformation & Loading
  • Types of ETL Tools
  • What to look for in ETL Tools
  • Key tools in the market
  • ETL Trends & New Solution Options

Data stage installation

  • Datastage Installation
  • Prerequisites to install Datastage
  • Installation process

DATA STAGE

Introduction to Datastage version 8.x

  • Datastage Introduction
  • IBM Information Server architecture
  • DataStage within the IBM Information Server architecture
  • Datastage components
  • DataStage main functions
  • Client components

Datastage Administrator:

  • Datastage project Administration
  • Editing projects and Adding projects
  • Deleting projects
  • Cleansing up project files
  • Auto purging
  • Permissions to users
  • Runtime Column Propagation
  • Enable Remote Execution of Parallel jobs
  • Add checkpoints for sequencer
  • Project protect
  • .APT Config file

Datastage Designer:

  • Introduction to Datastage Designer
  • Importing table definitions
  • Importing flat file definitions
  • Managing the meta data environment
  • Dataset management
  • Deletion of Dataset
  • Importing jobs
  • Exporting jobs(Back up)
  • Configuration file view
  • Explanation of Menu Bar
  • Palette
  • Passive stages
  • Active stages
  • Database stages
  • Debug stages
  • File stages
  • Processing stages
  • Mutiple Instances
  • Runtime Column Propagation(RCP)
  • Job design overview
  • Designer work area
  • Annotations
  • Creating jobs,deleting jobs
  • Compiling jobs
  • Batch compiling
  • Aggregator stage ,Copy stage
  • Change Capture stage,Compress stage
  • Filter stage,Funnel stage
  • Modify stage
  • Join stage,Lookup stages
  • Difference between join and Lookup stages
  • Merge stage
  • Difference between Lookup and Merge stages
  • Remove duplicate stage
  • Sort stage,Pivot stage
  • Surrogate key stage, switch stage
  • Types of Lookups
  • Types of Transformer stages
  • Basic transformer stage
  • Transformer stage
  • Null handling in Transformer stage
  • If Then Else in Transformer
  • Stage variables
  • Constraints
  • Derivations
  • Peek stage, Head stage, Tail stage
  • Job properties
  • Local variables
  • Functions in Transformers
  • String,Date,Null handling functions
  • All properties in all stages
  • Slowly changing Dimensions (SCD)
  • SCD Type-1
  • SCD Type-2
  • SCD Type-3
  • Implementation of SCD T ype-1 in Datastage
  • Implementation of SCD T ype-2 in Datastage

Datastage Director:

  • Introduction to Datastage director
  • Datastage Director window
  • Jobs status view
  • Datastage director options
  • Running Datastage jobs
  • Validating a job
  • Running a job
  • Batch Running
  • Stopping a job and resetting job
  • Monitoring a job
  • Job scheduling
  • Unscheduling a job
  • Rescheduling a job
  • Deleting a job
  • Unlocking jobs
  • View Logfile
  • Clear log
  • Fatal error description
  • Warning description
  • Info description
  • Difference between Compile and Validate
  • Difference between Validate and Run

JOB SEQUENCER:

  • Arrange job activities in Sequencer
  • Triggers in Sequencer
  • Reset method
  • Recoverability
  • Notification Activity
  • Terminator Activity
  • Wait for file Activity
  • Start Look Activity
  • Execute Command Activity
  • Sequencer
  • CONTAINERS:
  • Reusability
  • Minimizing complexity
  • Local container
  • Shared container
  • Some jobs in container

PARALLEL PROCESSING AND PARTIONING METHODS:

  • Parallel
  • Pipeline Parallelism
  • Partition Parallelism
  • Partitioning and Collecting
  • Configuration file
  • Fastname, Pools, Resource Disk, Resource Scratch Disk
  • Running Job with different nodes
  • Symmetric Multi Processing
  • Massively Parallel Processing
  • Partition techniques
  • Round Robin
  • Random
  • Hash
  • Entire
  • Same
  • Modulus
  • Range
  • DB2
  • Auto
  • Datastage components
  • Server components
  • Clients components
  • Datastage Server
  • Datastage Repository
  • Naming Standards of jobs
  • Document preparation
  • ETL specs preparation
  • Unit testcases preparation

KEY SERVICE I

· Potential Migration approach and techniques

      • Datastage version upgrade migration (ie DS 7.5.2 to 8.1\8.5\ 8.7\9.1)
      • Datastage Server job to Parallel Job migration
      • ETL tool migration(ie Informatics\Abinito to Datastage)
      • DWH Database Migration (ie Oracle to Teradata )
      • DWH concept migration (SCD –1 Type structure to Type 2)

KEY SERVICE II

    • Estimation Templates (Simple /Medium/Complex Job)
    • Test case Vs Bug report templates
    • Check list for Datastage developers

Online Training Courses Online Training Courses

 1-678-619-2366
  support@khamuritsolutions.com