Datastage Online Training
Introduction to Data warehousing
- What is Data warehousing
- Architecture of Data warehousing
- Definitions
- ETL Process
- Types of Tables in D/W
- Types of Fact tables
- Types of Dimension tables
- Types of Schemas in D/W
- What is a Data Mart
- Warehouse Approaches
- Introduction to Data Modelling
- Entity Relationship model (E-R model)
- Data Modelling for Data Warehouse
- Dimensions and fact tables
- Star Schema and Snowflake Schemas
- Coverage Tables
- Factless Tables
- What to look for in modelling tools
- Modelling tools
ETL Design process
- Introduction to Extraction, Transformation & Loading
- Types of ETL Tools
- What to look for in ETL Tools
- Key tools in the market
- ETL Trends & New Solution Options
Datastage installation
- Datastage Installation
- Prerequisites to install Datastage
- Installation process
DATASTAGE
Introduction to Datastage version 8.x
- Datastage Introduction
- IBM Information Server architecture
- DataStage within the IBM Information Server architecture
- Datastage components
- DataStage main functions
- Client components
Datastage Administrator:
- Datastage project Administration
- Editing projects and Adding projects
- Deleting projects
- Cleaning up project files
- Auto purging
- Permissions to users
- Runtime Column Propagation
- Enable Remote Execution of Parallel jobs
- Add checkpoints for sequencer
- Protecting a project
- .apt configuration file
Datastage Designer:
- Introduction to Datastage Designer
- Importing table definitions
- Importing flat file definitions
- Managing the metadata environment
- Dataset management
- Deletion of Dataset
- Importing jobs
- Exporting jobs (backup)
- Configuration file view
- Explanation of Menu Bar
- Palette
- Passive stages
- Active stages
- Database stages
- Debug stages
- File stages
- Processing stages
- Multiple Instances
- Runtime Column Propagation (RCP)
- Job design overview
- Designer work area
- Annotations
- Creating jobs, deleting jobs
- Compiling jobs
- Batch compiling
- Aggregator stage, Copy stage
- Change Capture stage, Compress stage
- Filter stage, Funnel stage
- Modify stage
- Join stage, Lookup stage
- Difference between Join and Lookup stages
- Merge stage
- Difference between Lookup and Merge stages
- Remove Duplicates stage
- Sort stage, Pivot stage
- Surrogate Key stage, Switch stage
- Types of Lookups
- Types of Transformer stages
- Basic transformer stage
- Transformer stage
- Null handling in Transformer stage
- If Then Else in Transformer
- Stage variables
- Constraints
- Derivations
- Peek stage, Head stage, Tail stage
- Job properties
- Local variables
- Functions in Transformers
- String, Date, and Null handling functions
- All properties in all stages
- Slowly changing Dimensions (SCD)
- SCD Type-1
- SCD Type-2
- SCD Type-3
- Implementation of SCD Type-1 in Datastage
- Implementation of SCD Type-2 in Datastage
Datastage Director:
- Introduction to Datastage Director
- Datastage Director window
- Job status view
- Datastage Director options
- Running Datastage jobs
- Validating a job
- Running a job
- Batch Running
- Stopping a job and resetting job
- Monitoring a job
- Job scheduling
- Unscheduling a job
- Rescheduling a job
- Deleting a job
- Unlocking jobs
- View log file
- Clear log
- Fatal error description
- Warning description
- Info description
- Difference between Compile and Validate
- Difference between Validate and Run
JOB SEQUENCER:
- Arrange job activities in Sequencer
- Triggers in Sequencer
- Reset method
- Recoverability
- Notification Activity
- Terminator Activity
- Wait For File Activity
- Start Loop Activity
- Execute Command Activity
- Sequencer
CONTAINERS:
- Reusability
- Minimizing complexity
- Local container
- Shared container
- Some jobs in container
PARALLEL PROCESSING AND PARTITIONING METHODS:
- Parallel processing
- Pipeline Parallelism
- Partition Parallelism
- Partitioning and Collecting
- Configuration file
- Fastname, Pools, Resource Disk, Resource Scratch Disk
- Running Job with different nodes
- Symmetric Multi Processing
- Massively Parallel Processing
- Partition techniques
- Round Robin
- Random
- Hash
- Entire
- Same
- Modulus
- Range
- DB2
- Auto
- Datastage components
- Server components
- Client components
- Datastage Server
- Datastage Repository
- Naming Standards of jobs
- Document preparation
- ETL specs preparation
- Unit test case preparation
KEY SERVICE I
- Potential migration approaches and techniques
- Datastage version upgrade migration (e.g. DS 7.5.2 to 8.1/8.5/8.7/9.1)
- Datastage Server job to Parallel Job migration
- ETL tool migration (e.g. Informatica/Ab Initio to Datastage)
- DWH Database Migration (e.g. Oracle to Teradata)
- DWH concept migration (SCD Type-1 structure to Type-2)
KEY SERVICE II
- Estimation Templates (Simple/Medium/Complex Job)
- Test case vs. bug report templates
- Checklist for Datastage developers