PowerCenter Basic Concepts
Ale Ribeiro
June 6, 2006
1
Agenda
• What is PowerCenter?
• PowerCenter Client Applications
• Demo
• PowerCenter – Designer, Workflow Manager, Workflow Monitor
• PowerCenter Architecture
2
PowerCenter
3
PowerCenter Client Applications
Administration
• Manage repository: connections, folders, objects, users and groups
• Perform domain and repository service tasks: create/configure nodes and repository services, upgrade/delete, start/stop, backup/restore
Development
• Create ETL mappings
• Create and start workflows
• Monitor and control workflows
4
Designer Tools – Create mappings
5
Mapping
Note: Sources and targets can be flat files, relational tables, XML files,
application systems, message queues, etc.
Unit 1 6
Mapping (cont’d)
• A mapping is a set of source and target definitions linked by transformation
objects that define the rules for data transformation. Mappings represent the
data flow between sources and targets. When the Integration Service runs a
session, it uses the instructions configured in the mapping to read,
transform, and write data.
• Every mapping must contain the following components:
Source definition. Describes the characteristics of a source table or file.
Transformation. Modifies data before writing it to targets. Use different transformation objects to
perform different functions.
Target definition. Defines the target table or file.
Links. Connect sources, targets, and transformations so the Integration Service can move the
data as it transforms it.
• A mapping can also contain one or more mapplets. A mapplet is a set of
transformations that you build in the Mapplet Designer and can use in
multiple mappings.
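The four required components can be pictured as a small data structure. The following is a minimal sketch only: the `Mapping` class, its fields, and all object names are hypothetical illustrations, not a PowerCenter API.

```python
from dataclasses import dataclass, field


@dataclass
class Mapping:
    """Hypothetical in-memory model of a mapping (not a PowerCenter API)."""
    name: str
    sources: list = field(default_factory=list)          # source definition names
    transformations: list = field(default_factory=list)  # transformation names
    targets: list = field(default_factory=list)          # target definition names
    links: list = field(default_factory=list)            # (from_object, to_object) pairs

    def missing_components(self):
        """Return the required component types that have no entries yet."""
        required = {
            "source definition": self.sources,
            "transformation": self.transformations,
            "target definition": self.targets,
            "links": self.links,
        }
        return [kind for kind, items in required.items() if not items]


# All object names below are made up for illustration.
m = Mapping(
    name="m_load_customers",
    sources=["CUSTOMERS_FILE"],
    transformations=["SQ_CUSTOMERS", "EXP_clean_names"],
    targets=["DIM_CUSTOMER"],
    links=[
        ("CUSTOMERS_FILE", "SQ_CUSTOMERS"),
        ("SQ_CUSTOMERS", "EXP_clean_names"),
        ("EXP_clean_names", "DIM_CUSTOMER"),
    ],
)
print(m.missing_components())                    # complete mapping: empty list
print(Mapping("m_empty").missing_components())   # all four component types missing
```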
7
Example
8
Transformations
Unit 1 9
PowerCenter Transformations (partial list)
Source Qualifier: reads data from flat file and relational sources
Expression: performs row-level calculations
Filter: drops rows conditionally
Sorter: sorts data
Aggregator: performs aggregate calculations
Joiner: joins heterogeneous sources
Lookup: looks up values and passes them to other objects
Update Strategy: tags rows for insert, update, delete, reject
Router: routes rows conditionally
Transaction Control: allows data-driven commits and rollbacks
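To make the one-line descriptions concrete, here is a rough Python analogy for a few of these transformations applied to rows held as dictionaries. It only imitates the behaviour described above; the function names, column names, and sample rows are assumptions for illustration, and the real transformations are configured in the Designer rather than coded this way.

```python
from collections import defaultdict

# Sample rows (made up for illustration).
rows = [
    {"region": "EU", "product": "A", "qty": 2, "price": 10.0},
    {"region": "US", "product": "B", "qty": 0, "price": 5.0},
    {"region": "EU", "product": "B", "qty": 1, "price": 5.0},
    {"region": "US", "product": "A", "qty": 3, "price": 10.0},
]


def expression(rows):
    """Expression analogy: row-level calculation that adds an 'amount' column."""
    return [{**r, "amount": r["qty"] * r["price"]} for r in rows]


def filter_rows(rows, condition):
    """Filter analogy: drop rows for which the condition is false."""
    return [r for r in rows if condition(r)]


def sorter(rows, key):
    """Sorter analogy: sort rows by one column."""
    return sorted(rows, key=lambda r: r[key])


def aggregator(rows, group_by, value):
    """Aggregator analogy: sum one column per group."""
    totals = defaultdict(float)
    for r in rows:
        totals[r[group_by]] += r[value]
    return dict(totals)


def router(rows, groups):
    """Router analogy: send each row to every group whose condition it meets;
    rows that meet no condition go to a DEFAULT group."""
    out = {name: [] for name in groups}
    out["DEFAULT"] = []
    for r in rows:
        matched = False
        for name, condition in groups.items():
            if condition(r):
                out[name].append(r)
                matched = True
        if not matched:
            out["DEFAULT"].append(r)
    return out


staged = sorter(filter_rows(expression(rows), lambda r: r["qty"] > 0), "region")
totals = aggregator(staged, "region", "amount")
routed = router(staged, {"LARGE": lambda r: r["amount"] >= 20})
print(totals)
print({name: len(group) for name, group in routed.items()})
```

Note the difference between the two conditional transformations in this sketch: the filter discards rows that fail its condition, while the router keeps them and places them in a separate group.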
10
Advanced PowerCenter Transformations
11
Mapplet – A reusable set of transformations
Mapplet
Input & Output transformations (pass data from or to the mapping)
Unit 14 12
Example: Data Sources Defined Outside Mapplet
[Figure: a mapping containing a mapplet that uses Mapplet Input and Mapplet Output transformations]
Unit 14 13
Recap
14
Workflow Manager Tools – Create and Start Workflows
15
Task
16
Session
17
Decision Task
Tests for a condition during the workflow and sets a flag based on
the condition
Use a link condition (or a Control task) downstream to test the flag
and control execution flow
Can use workflow variables in condition
Options on all tasks to fail parent and disable
Treat inputs as AND/OR
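The flag-then-link pattern can be sketched in Python as follows. This is an analogy under stated assumptions, not PowerCenter syntax: the function names, the workflow variables `rows_loaded` and `errors`, and the task names are all hypothetical.

```python
def decision_task(condition, variables):
    """Evaluate a condition against workflow variables and return a flag."""
    return bool(condition(variables))


def run_workflow(variables):
    """Run the decision, then follow only the links whose condition holds."""
    executed = []
    flag = decision_task(
        lambda v: v["rows_loaded"] > 0 and v["errors"] == 0, variables
    )
    # Each downstream link carries its own condition that tests the flag.
    links = [
        (lambda f: f, "s_load_warehouse"),
        (lambda f: not f, "email_failure_notice"),
    ]
    for link_condition, task_name in links:
        if link_condition(flag):
            executed.append(task_name)
    return flag, executed


print(run_workflow({"rows_loaded": 10, "errors": 0}))
print(run_workflow({"rows_loaded": 10, "errors": 2}))
```

Keeping the test in one Decision task and letting several links read its flag avoids repeating the same condition on every branch.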
Unit 16 18
Email Task
Sends an email within a workflow
Note: emails can also be sent post-session in a Session task
Can be used with a link condition to notify success or failure of prior
tasks
Unit 16 19
Event Wait Task
Pauses processing of the pipeline until a specified event occurs
Events can be:
Pre-defined – file watch
User-defined – created by an Event Raise task elsewhere in the workflow
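A file watch can be thought of as polling for an indicator file. The sketch below shows that idea in plain Python; the function name, the polling interval, the timeout, and the file name are assumptions added for the example and do not describe how the Event Wait task is implemented or configured.

```python
import os
import tempfile
import time


def wait_for_file(path, poll_seconds=0.05, timeout_seconds=2.0):
    """Poll until the file exists; return False if the timeout passes first."""
    deadline = time.monotonic() + timeout_seconds
    while time.monotonic() < deadline:
        if os.path.exists(path):
            return True
        time.sleep(poll_seconds)
    return os.path.exists(path)


with tempfile.TemporaryDirectory() as folder:
    indicator = os.path.join(folder, "orders.done")   # hypothetical indicator file
    missing = wait_for_file(indicator, timeout_seconds=0.2)
    open(indicator, "w").close()                      # the upstream system "delivers" the file
    found = wait_for_file(indicator, timeout_seconds=0.2)

print(missing, found)
```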
Unit 17 20
Event Wait Task (cont’d)
Events Tab
21
Event Raise Task
Represents the location of a user-defined event in the workflow
User-defined events are triggered when the Integration Service executes
the Event Raise task
User-defined events must be declared in the workflow Events tab
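The relationship between declaring, raising, and waiting for a user-defined event resembles the use of `threading.Event` in Python. The sketch below is an analogy only; the `Workflow` class, its methods, and the event name are hypothetical.

```python
import threading


class Workflow:
    """Toy workflow that only knows events declared up front."""

    def __init__(self, declared_events):
        self._events = {name: threading.Event() for name in declared_events}

    def event_raise(self, name):
        # Raises KeyError if the event was never declared.
        self._events[name].set()

    def event_wait(self, name, timeout=None):
        # Blocks until the event is raised; returns False on timeout.
        return self._events[name].wait(timeout)


wf = Workflow(declared_events=["staging_loaded"])
log = []


def downstream_branch():
    # This branch pauses until another branch raises the event.
    if wf.event_wait("staging_loaded", timeout=2.0):
        log.append("downstream started")


branch = threading.Thread(target=downstream_branch)
branch.start()
log.append("staging finished")
wf.event_raise("staging_loaded")
branch.join()
print(log)
```

In this sketch, raising an undeclared event fails with `KeyError`, which mirrors the rule that events must be declared before they are used.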
22
Command Task
23
Command Task (cont’d)
[Screenshot callouts: Add Cmd and Remove Cmd buttons]
24
Reusable Tasks
Unit 17 25
Worklet
An object representing a set or grouping of Tasks
Can contain any Task available in the Workflow Manager
Worklets expand and execute inside a Workflow
A Workflow that contains a Worklet is called the “parent Workflow”
Worklets CAN be nested
Reusable Worklets – create in the Worklet Designer
Non-reusable Worklets – create in the Workflow Designer
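Nesting and expansion can be illustrated with a small recursive structure. This is a sketch under simplifying assumptions: tasks run strictly in sequence, and the `Task` and `Worklet` classes and all task names are hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Task:
    name: str


@dataclass
class Worklet:
    name: str
    tasks: list = field(default_factory=list)   # may contain Tasks and other Worklets


def expand(tasks):
    """Expand worklets depth-first into a flat, ordered list of task names."""
    order = []
    for item in tasks:
        if isinstance(item, Worklet):
            order.extend(expand(item.tasks))
        else:
            order.append(item.name)
    return order


inner = Worklet("wklt_dimensions", [Task("s_dim_customer"), Task("s_dim_product")])
outer = Worklet("wklt_nightly", [Task("cmd_prepare"), inner, Task("s_fact_sales")])
parent_workflow = [Task("s_stage"), outer, Task("email_done")]

print(expand(parent_workflow))
```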
Unit 18 26
Workflow
Unit 1 27
Workflow Structure
• Workflow 1
  • Session 1 (1)
  • Worklet A
    • Session A1 (1)
    • Session A2 (2)
    • Session A3 (3)
  • Worklet B
    • Session B1 (1)
    • Session B2 (2)
  • Worklet C
    • Session C1 (3)
    • Session C2 (4)
28
Workflow Schedule
• A workflow can be scheduled to run continuously, to repeat at a given time or
interval, or to start manually.
• The Integration Service runs a scheduled workflow as configured unless the prior workflow run fails.
• When a workflow fails, the Integration Service removes the workflow from the
schedule, and you must reschedule it.
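The remove-on-failure rule can be sketched with a toy scheduler. The `Scheduler` class, its methods, and the workflow name are hypothetical; the sketch models only the rule stated above, not the actual scheduling options.

```python
class Scheduler:
    """Toy scheduler: a workflow that fails is removed from the schedule."""

    def __init__(self):
        self.scheduled = {}   # workflow name -> callable returning True on success
        self.history = []     # (workflow name, succeeded) per run

    def schedule(self, name, run):
        self.scheduled[name] = run

    def tick(self):
        """Run every scheduled workflow once; unschedule the ones that fail."""
        for name, run in list(self.scheduled.items()):
            succeeded = run()
            self.history.append((name, succeeded))
            if not succeeded:
                del self.scheduled[name]


outcomes = iter([True, False, True])   # simulated run results
sched = Scheduler()
sched.schedule("wf_daily_load", lambda: next(outcomes))

sched.tick()   # run 1 succeeds
sched.tick()   # run 2 fails, so the workflow is unscheduled
sched.tick()   # nothing runs
runs_before_reschedule = len(sched.history)

sched.schedule("wf_daily_load", lambda: next(outcomes))   # manual reschedule
sched.tick()   # run 3 succeeds
print(sched.history)
```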
29
Workflow Monitor
30
Recap
Unit 1 31
PowerCenter Architecture
Domain
Sources Targets
PowerCenter Client
Repository
32
Architecture – Components
• A domain is a collection of nodes and services. It is the primary unit of administration.
• The Repository Service manages connections to the PowerCenter repository from
client applications. The Repository Service is a separate, multi-threaded process that
retrieves, inserts, and updates metadata in the repository database tables. The
Repository Service ensures the consistency of metadata in the repository.
• The Integration Service reads mapping and session information from the repository.
It extracts data from the mapping sources and stores the data in memory while it
applies the transformation rules that you configure in the mapping. The Integration
Service loads the transformed data into the mapping targets.
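The division of labour between repository metadata and the Integration Service can be sketched as follows. Everything here is an illustrative assumption: the dictionaries stand in for the repository, sources, and targets, and `run_session` is a hypothetical function, not a PowerCenter call.

```python
# Stand-in for repository metadata: which source, target, and rules a session uses.
repository = {
    "s_load_orders": {
        "source": "orders_src",
        "target": "orders_tgt",
        "rules": [
            # Transformation rule: add a computed column to each row.
            lambda row: {**row, "total": row["qty"] * row["unit_price"]},
        ],
    }
}

# Stand-ins for the physical source and target.
sources = {
    "orders_src": [
        {"id": 1, "qty": 2, "unit_price": 4.5},
        {"id": 2, "qty": 1, "unit_price": 10.0},
    ]
}
targets = {"orders_tgt": []}


def run_session(session_name):
    """Read metadata, extract rows, transform them in memory, load the target."""
    meta = repository[session_name]              # read mapping and session information
    in_memory = list(sources[meta["source"]])    # extract data from the source
    for rule in meta["rules"]:                   # apply the transformation rules
        in_memory = [rule(row) for row in in_memory]
    targets[meta["target"]].extend(in_memory)    # load the transformed data
    return len(in_memory)


loaded = run_session("s_load_orders")
print(loaded, [row["total"] for row in targets["orders_tgt"]])
```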
33
Metadata
34
Recap
Unit 1 35
Where do we use PowerCenter?
36
PowerCenter Connect Options
37
Questions?
38