Informatica Overview
The Informatica Platform Simplifies Your ETL Processes
Initiate ETL Projects Quickly and Cost-Effectively
Serving as the foundation for all data integration projects, the Informatica Platform lets IT organizations initiate the ETL process from virtually any business system, in any format. As part of the Informatica Platform, Informatica PowerCenter delivers robust yet easy-to-use ETL capabilities that simplify the development and deployment of smaller departmental data marts and data warehouses. In addition, the ETL capabilities facilitate reuse from one project to another.
Enhance ETL with Universal Data Access Capabilities
PowerCenter improves the flexibility of your ETL process with the ability to extract more enterprise data-types than any other technology on the market. Complemented by Informatica PowerExchange and PowerCenter Options, PowerCenter delivers successful ETL initiatives with access to virtually any enterprise data-type, including:
- Structured, unstructured, and semi-structured data
- Relational, mainframe, file, and standards-based data
- Message queue data
- Automate Most ETL Processes for Fewer Errors and Greater Productivity
PowerCenter makes your ETL developers' jobs easier with cross-functional tools, reusable components, and an enterprise-wide platform that automates many ETL processes. For data warehousing and ETL developers, that means fewer ETL errors and emergency fixes, less risk of rework, faster development time, and greater productivity.
Features
PowerCenter Enterprise forms the foundation for all your data and enterprise integration initiatives—including data governance, data migration, and enterprise data warehousing—setting the standard for high-performance enterprise data integration and quality software. PowerCenter Enterprise scales to support large volumes of disparate data sources and meets demands for security and performance.
Development Agility
Collaborative,
Team-Based Development
Team-Based Development
PowerCenter provides a variety of graphical development environments designed for data integration developers as well as business users. The shared metadata repository allows groups of users to collaborate on integration projects, enabling rapid iteration cycles that result in significant time savings
Prototype to Production
with a Click
with a Click
Users can create virtual prototypes of reports and integration jobs without having to move data from its original sources. They'll be able to profile, integrate, and cleanse data on the fly, prototyping integration work in hours instead of weeks. With a few clicks, prototypes can be converted to a physical integration without recoding.
Automated Test Development
Typically, 30 percent of software development is spent on testing code. Data integration projects are no different. Only PowerCenter provides testing tools that automatically generate test cases—saving anywhere from 50 to 80 percent of testing effort while providing significantly more test coverage than manual approaches.
Management Confidence
Reliability and High Availability
Informatica customers count on PowerCenter Enterprise to run their critical business processes. Our high-availability products provide checkpoint recovery so that on the rare occasion that you do have a failure, we pick up the integration job right where it left off.
Traceability and Lineage
Maintaining code and tracking issues can be a daunting challenge when you're hand coding. With an extensible metadata repository that tracks data lineage for you, PowerCenter Enterprise traces the path of data back to the source or its final destination, providing the detailed documentation required by government regulations.
Proactive Monitoring
Only Informatica protects data integration projects with an automated early warning system that alerts your IT team to processes and data quality that differ from the norm as soon as issues arise. PowerCenter Enterprise lets IT monitor workflows, sessions, change control activities, and correlate events across multiple systems
Informatica PowerCenter architecture
Informatica PowerCenter uses a client-server architecture containing several components, described in general terms below, and illustrated in Figure 1. You may find it useful to familiarize yourself with PowerCenter’s architecture before beginning the installation.
For a detailed description of the components that make up PowerCenter, see Chapter 1, “Product Overview,” in Informatica PowerCenter Getting Started.
Informatica PowerCenter contains the following components licensed for use with AX Datasource:
- Informatica domain – The primary unit for management and administration of services in PowerCenter. Your license agreement restricts you to a single domain.
- Node – A logical representation of a machine in a domain. The node that hosts the domain is the master gateway for the domain. Your license agreement restricts you to a single node.
- Informatica Services – A Windows service that starts the Service Manager on a node.
- Service Manager – Starts and runs the application services on a machine in a domain.
- Integration Service – Reads workflow information from the PowerCenter repository, and runs sessions and workflows that extract, transform, and load data.
- Repository Service – Manages connections to the PowerCenter repository.
- Informatica Administrator – A Web application for managing the Informatica domain, PowerCenter security, and the PowerCenter repository.
- Informatica domain configuration database – Stores the information (metadata) related to the configuration of the Informatica domain.
- PowerCenter repository – Stores the information (metadata) required to extract, transform, and load data. Resides in a relational database.
- PowerCenter Client, which consists of:
- Designer – Allows you to define sources and targets, and create mappings with transformation instructions, for use in workflows.
- Workflow Manager – Allows you to create, schedule, and run workflows.
- Workflow Monitor – Allows you to monitor scheduled and running workflows.
- Repository Manager – Allows you to administer the PowerCenter repository: assign permissions to users and groups, manage folders, and view PowerCenter repository metadata.
Name changes
The following name changes have occurredPrevious name (in Informatica PowerCenter 8.6.1) | New name (in Informatica PowerCenter 9.0.1) |
PowerCenter Server | PowerCenter |
PowerCenter Administration Console | Informatica Administrator |
PowerCenter domain | Informatica domain |
Deployment Flexibility
Get the Connectivity You Deserve
Whether it's structured data in a database, unstructured data like emails or PDF files, social media data in the Cloud, or enterprise applications like SAP or Oracle applications, PowerCenter Enterprise has a high-speed connector that makes data integration quick and easy.
Map Once, Deploy Anywhere
Only Informatica provides a single graphical environment that lets developers create data integration and quality mappings that can be implemented across a variety of technologies. Powered by Vibe™, PowerCenter Enterprise gives you the flexibility to support deployment virtually, on proprietary ETL engines, even on Hadoop, without any recoding.
Meet Your Data Delivery Needs
Not all data is created equal—and neither are your data delivery needs. PowerCenter Enterprise provides a wide variety of technologies for performing data integration, meeting needs from big data and batch to real-time ultra-messaging for high-speed trading.
· Advanced XML Data Integration Option
The Informatica PowerCenter Advanced XML Data Integration Option enables real-time access to hierarchical data otherwise locked in XML files and messages.
· Data Integration Analyst Option
Available for Informatica PowerCenter and Informatica Data Services, the Data Integration Analyst option empowers business analysts to perform data integration tasks themselves while IT retains control of the overall data integration process.
· Data Validation Option
The Informatica PowerCenter Data Validation Option reduces the time and costs of upgrade testing, data integration project testing, and production data auditing and verification by up to 90%—with no programming skills required.
· Enterprise Grid Option
The Informatica PowerCenter Enterprise Grid Option adds PowerCenter's native data integration grid capabilities, including partitioning and high availability, for more cost-effective performance, dynamic scalability and reliability.
· High Availability Option
The Informatica PowerCenter High Availability Option minimizes service interruptions during hardware and/or software outages.
· Metadata Exchange Option
The Informatica PowerCenter Metadata Exchange Options provide access to technical and business metadata from third-party data modeling tools, business intelligence software, and source and target database catalogs.
· Partitioning Option
The Informatica PowerCenter Partitioning Option helps IT organizations take advantage of parallel data processing in multiprocessor and grid-based hardware environments.
· Pushdown Optimization Option
The Informatica PowerCenter Pushdown Optimization Option enables data transformation processing, where appropriate, to be pushed down into relational databases or appliances to improve overall performance and throughput.
· Unstructured Data Option
The Informatica PowerCenter Unstructured Data Option expands PowerCenter's data access capabilities to include unstructured data formats, providing virtually unlimited access to all data formats.
Components under Workflow Manager Tool
Here is a concise idea about the Workflow Manager in Informatica.
1. Workflow: This is top level object and the entire task (process) for data loading has to be defined under the workflow. It is like a Mapping that integrates different kind of tasks as a Unit.
2. Task: A task is an individual process to perform a very specific activity during data loading. There are 10 different kinds of tasks that can be grouped under a Workflow:
2.1 Session:
This is a compulsory task for data loading.
A session is an instance of Mapping Program or in other words a running instance of a mapping is referred as Session. For one Mapping Program we can create one or more Sessions. Generally we require one session for one mapping but for Parallel data loading we may create multiple sessions.
2.2 Command:
To execute operating System Commands or Programs. For example: If we need to inform all the users about data loading process, we can write Shell Script at Operating System and execute them via 'Command Task' just before the session execution.
2.3 Email:
To send emails to users using Mail Server (if configured). This job can be done via Command Task also but Email Task is integrated part of Workflow Manager and is much simple compared to Command Task.
2.4 Decision:
This task is used to evaluate condition based on other tasks' values to decide next course of actions. It is like IF statement.
2.5 Control:
This task is to control the flow of tasks within the Workflow. For example: If we need that control should not reach to specific task (like a Command Task) when a condition fails then we can use Control Task.
2.6 Event Wait:
This task is used to define an event and when the particular event fires (activates) then process continues.
2.7 Event Raise:
This task is used to fire (activate) an event forcefully.
2.8 Assignment:
This task is used to assign values to Parameters and Variables used within workflow.
2.9 Timer:
This task is used to specify time of execution (delay) for a task.
2.10 Worklet:
This task is to define reusable Workflow. If we need to execute set of tasks again and again under different Workflows then it is better to define them as Worklet and use under different Workflows.
Note:
Three types of tasks (Session, Command and Email) can be defined as Reusable Tasks. Reusable task means: A task is created as an Independent Task and it is used within WorkFlow or Worklet. So if a task is created within workflow or worklet directly then it is non-reusable task.
So if we need a task to be executed once within a Workflow or Worklet then create it as non-reusable task otherwise create them as reusable (independent) task.
Worklet Task also can be defined as Reusable Task via separate Menu Interface.
No comments:
Post a Comment