Enterprise Data Catalog User Guide: Informatica 10.2.2
Enterprise Data Catalog User Guide: Informatica 10.2.2
10.2.2
This software and documentation are provided only under a separate license agreement containing restrictions on use and disclosure. No part of this document may be
reproduced or transmitted in any form, by any means (electronic, photocopying, recording or otherwise) without prior consent of Informatica LLC.
Informatica and the Informatica logo are trademarks or registered trademarks of Informatica LLC in the United States and many jurisdictions throughout the world. A
current list of Informatica trademarks is available on the web at https://www.informatica.com/trademarks.html. Other company and product names may be trade
names or trademarks of their respective owners.
U.S. GOVERNMENT RIGHTS Programs, software, databases, and related documentation and technical data delivered to U.S. Government customers are "commercial
computer software" or "commercial technical data" pursuant to the applicable Federal Acquisition Regulation and agency-specific supplemental regulations. As such,
the use, duplication, disclosure, modification, and adaptation is subject to the restrictions and license terms set forth in the applicable Government contract, and, to the
extent applicable by the terms of the Government contract, the additional rights set forth in FAR 52.227-19, Commercial Computer Software License.
Portions of this software and/or documentation are subject to copyright held by third parties, including without limitation: Copyright DataDirect Technologies. All rights
reserved. Copyright © Sun Microsystems. All rights reserved. Copyright © RSA Security Inc. All Rights Reserved. Copyright © Ordinal Technology Corp. All rights
reserved. Copyright © Aandacht c.v. All rights reserved. Copyright Genivia, Inc. All rights reserved. Copyright Isomorphic Software. All rights reserved. Copyright © Meta
Integration Technology, Inc. All rights reserved. Copyright © Intalio. All rights reserved. Copyright © Oracle. All rights reserved. Copyright © Adobe Systems Incorporated.
All rights reserved. Copyright © DataArt, Inc. All rights reserved. Copyright © ComponentSource. All rights reserved. Copyright © Microsoft Corporation. All rights
reserved. Copyright © Rogue Wave Software, Inc. All rights reserved. Copyright © Teradata Corporation. All rights reserved. Copyright © Yahoo! Inc. All rights reserved.
Copyright © Glyph & Cog, LLC. All rights reserved. Copyright © Thinkmap, Inc. All rights reserved. Copyright © Clearpace Software Limited. All rights reserved. Copyright
© Information Builders, Inc. All rights reserved. Copyright © OSS Nokalva, Inc. All rights reserved. Copyright Edifecs, Inc. All rights reserved. Copyright Cleo
Communications, Inc. All rights reserved. Copyright © International Organization for Standardization 1986. All rights reserved. Copyright © ej-technologies GmbH. All
rights reserved. Copyright © Jaspersoft Corporation. All rights reserved. Copyright © International Business Machines Corporation. All rights reserved. Copyright ©
yWorks GmbH. All rights reserved. Copyright © Lucent Technologies. All rights reserved. Copyright © University of Toronto. All rights reserved. Copyright © Daniel
Veillard. All rights reserved. Copyright © Unicode, Inc. Copyright IBM Corp. All rights reserved. Copyright © MicroQuill Software Publishing, Inc. All rights reserved.
Copyright © PassMark Software Pty Ltd. All rights reserved. Copyright © LogiXML, Inc. All rights reserved. Copyright © 2003-2010 Lorenzi Davide, All rights reserved.
Copyright © Red Hat, Inc. All rights reserved. Copyright © The Board of Trustees of the Leland Stanford Junior University. All rights reserved. Copyright © EMC
Corporation. All rights reserved. Copyright © Flexera Software. All rights reserved. Copyright © Jinfonet Software. All rights reserved. Copyright © Apple Inc. All rights
reserved. Copyright © Telerik Inc. All rights reserved. Copyright © BEA Systems. All rights reserved. Copyright © PDFlib GmbH. All rights reserved. Copyright ©
Orientation in Objects GmbH. All rights reserved. Copyright © Tanuki Software, Ltd. All rights reserved. Copyright © Ricebridge. All rights reserved. Copyright © Sencha,
Inc. All rights reserved. Copyright © Scalable Systems, Inc. All rights reserved. Copyright © jQWidgets. All rights reserved. Copyright © Tableau Software, Inc. All rights
reserved. Copyright© MaxMind, Inc. All Rights Reserved. Copyright © TMate Software s.r.o. All rights reserved. Copyright © MapR Technologies Inc. All rights reserved.
Copyright © Amazon Corporate LLC. All rights reserved. Copyright © Highsoft. All rights reserved. Copyright © Python Software Foundation. All rights reserved.
Copyright © BeOpen.com. All rights reserved. Copyright © CNRI. All rights reserved.
This product includes software developed by the Apache Software Foundation (http://www.apache.org/), and/or other software which is licensed under various
versions of the Apache License (the "License"). You may obtain a copy of these Licenses at http://www.apache.org/licenses/. Unless required by applicable law or
agreed to in writing, software distributed under these Licenses is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express
or implied. See the Licenses for the specific language governing permissions and limitations under the Licenses.
This product includes software which was developed by Mozilla (http://www.mozilla.org/), software copyright The JBoss Group, LLC, all rights reserved; software
copyright © 1999-2006 by Bruno Lowagie and Paulo Soares and other software which is licensed under various versions of the GNU Lesser General Public License
Agreement, which may be found at http:// www.gnu.org/licenses/lgpl.html. The materials are provided free of charge by Informatica, "as-is", without warranty of any
kind, either express or implied, including but not limited to the implied warranties of merchantability and fitness for a particular purpose.
The product includes ACE(TM) and TAO(TM) software copyrighted by Douglas C. Schmidt and his research group at Washington University, University of California,
Irvine, and Vanderbilt University, Copyright (©) 1993-2006, all rights reserved.
This product includes software developed by the OpenSSL Project for use in the OpenSSL Toolkit (copyright The OpenSSL Project. All Rights Reserved) and
redistribution of this software is subject to terms available at http://www.openssl.org and http://www.openssl.org/source/license.html.
This product includes Curl software which is Copyright 1996-2013, Daniel Stenberg, <daniel@haxx.se>. All Rights Reserved. Permissions and limitations regarding this
software are subject to terms available at http://curl.haxx.se/docs/copyright.html. Permission to use, copy, modify, and distribute this software for any purpose with or
without fee is hereby granted, provided that the above copyright notice and this permission notice appear in all copies.
The product includes software copyright 2001-2005 (©) MetaStuff, Ltd. All Rights Reserved. Permissions and limitations regarding this software are subject to terms
available at http://www.dom4j.org/ license.html.
The product includes software copyright © 2004-2007, The Dojo Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to
terms available at http://dojotoolkit.org/license.
This product includes ICU software which is copyright International Business Machines Corporation and others. All rights reserved. Permissions and limitations
regarding this software are subject to terms available at http://source.icu-project.org/repos/icu/icu/trunk/license.html.
This product includes software copyright © 1996-2006 Per Bothner. All rights reserved. Your right to use such materials is set forth in the license which may be found at
http:// www.gnu.org/software/ kawa/Software-License.html.
This product includes OSSP UUID software which is Copyright © 2002 Ralf S. Engelschall, Copyright © 2002 The OSSP Project Copyright © 2002 Cable & Wireless
Deutschland. Permissions and limitations regarding this software are subject to terms available at http://www.opensource.org/licenses/mit-license.php.
This product includes software developed by Boost (http://www.boost.org/) or under the Boost software license. Permissions and limitations regarding this software
are subject to terms available at http:/ /www.boost.org/LICENSE_1_0.txt.
This product includes software copyright © 1997-2007 University of Cambridge. Permissions and limitations regarding this software are subject to terms available at
http:// www.pcre.org/license.txt.
This product includes software copyright © 2007 The Eclipse Foundation. All Rights Reserved. Permissions and limitations regarding this software are subject to terms
available at http:// www.eclipse.org/org/documents/epl-v10.php and at http://www.eclipse.org/org/documents/edl-v10.php.
This product includes software licensed under the terms at http://www.tcl.tk/software/tcltk/license.html, http://www.bosrup.com/web/overlib/?License, http://
www.stlport.org/doc/ license.html, http://asm.ow2.org/license.html, http://www.cryptix.org/LICENSE.TXT, http://hsqldb.org/web/hsqlLicense.html, http://
httpunit.sourceforge.net/doc/ license.html, http://jung.sourceforge.net/license.txt , http://www.gzip.org/zlib/zlib_license.html, http://www.openldap.org/software/
release/license.html, http://www.libssh2.org, http://slf4j.org/license.html, http://www.sente.ch/software/OpenSourceLicense.html, http://fusesource.com/downloads/
license-agreements/fuse-message-broker-v-5-3- license-agreement; http://antlr.org/license.html; http://aopalliance.sourceforge.net/; http://www.bouncycastle.org/
licence.html; http://www.jgraph.com/jgraphdownload.html; http://www.jcraft.com/jsch/LICENSE.txt; http://jotm.objectweb.org/bsd_license.html; . http://www.w3.org/
Consortium/Legal/2002/copyright-software-20021231; http://www.slf4j.org/license.html; http://nanoxml.sourceforge.net/orig/copyright.html; http://www.json.org/
license.html; http://forge.ow2.org/projects/javaservice/, http://www.postgresql.org/about/licence.html, http://www.sqlite.org/copyright.html, http://www.tcl.tk/
software/tcltk/license.html, http://www.jaxen.org/faq.html, http://www.jdom.org/docs/faq.html, http://www.slf4j.org/license.html; http://www.iodbc.org/dataspace/
iodbc/wiki/iODBC/License; http://www.keplerproject.org/md5/license.html; http://www.toedter.com/en/jcalendar/license.html; http://www.edankert.com/bounce/
index.html; http://www.net-snmp.org/about/license.html; http://www.openmdx.org/#FAQ; http://www.php.net/license/3_01.txt; http://srp.stanford.edu/license.txt;
http://www.schneier.com/blowfish.html; http://www.jmock.org/license.html; http://xsom.java.net; http://benalman.com/about/license/; https://github.com/CreateJS/
EaselJS/blob/master/src/easeljs/display/Bitmap.js; http://www.h2database.com/html/license.html#summary; http://jsoncpp.sourceforge.net/LICENSE; http://
jdbc.postgresql.org/license.html; http://protobuf.googlecode.com/svn/trunk/src/google/protobuf/descriptor.proto; https://github.com/rantav/hector/blob/master/
LICENSE; http://web.mit.edu/Kerberos/krb5-current/doc/mitK5license.html; http://jibx.sourceforge.net/jibx-license.html; https://github.com/lyokato/libgeohash/blob/
master/LICENSE; https://github.com/hjiang/jsonxx/blob/master/LICENSE; https://code.google.com/p/lz4/; https://github.com/jedisct1/libsodium/blob/master/
LICENSE; http://one-jar.sourceforge.net/index.php?page=documents&file=license; https://github.com/EsotericSoftware/kryo/blob/master/license.txt; http://www.scala-
lang.org/license.html; https://github.com/tinkerpop/blueprints/blob/master/LICENSE.txt; http://gee.cs.oswego.edu/dl/classes/EDU/oswego/cs/dl/util/concurrent/
intro.html; https://aws.amazon.com/asl/; https://github.com/twbs/bootstrap/blob/master/LICENSE; https://sourceforge.net/p/xmlunit/code/HEAD/tree/trunk/
LICENSE.txt; https://github.com/documentcloud/underscore-contrib/blob/master/LICENSE, and https://github.com/apache/hbase/blob/master/LICENSE.txt.
This product includes software licensed under the Academic Free License (http://www.opensource.org/licenses/afl-3.0.php), the Common Development and
Distribution License (http://www.opensource.org/licenses/cddl1.php) the Common Public License (http://www.opensource.org/licenses/cpl1.0.php), the Sun Binary
Code License Agreement Supplemental License Terms, the BSD License (http:// www.opensource.org/licenses/bsd-license.php), the new BSD License (http://
opensource.org/licenses/BSD-3-Clause), the MIT License (http://www.opensource.org/licenses/mit-license.php), the Artistic License (http://www.opensource.org/
licenses/artistic-license-1.0) and the Initial Developer’s Public License Version 1.0 (http://www.firebirdsql.org/en/initial-developer-s-public-license-version-1-0/).
This product includes software copyright © 2003-2006 Joe WaInes, 2006-2007 XStream Committers. All rights reserved. Permissions and limitations regarding this
software are subject to terms available at http://xstream.codehaus.org/license.html. This product includes software developed by the Indiana University Extreme! Lab.
For further information please visit http://www.extreme.indiana.edu/.
This product includes software Copyright (c) 2013 Frank Balluffi and Markus Moeller. All rights reserved. Permissions and limitations regarding this software are subject
to terms of the MIT license.
DISCLAIMER: Informatica LLC provides this documentation "as is" without warranty of any kind, either express or implied, including, but not limited to, the implied
warranties of noninfringement, merchantability, or use for a particular purpose. Informatica LLC does not warrant that this software or documentation is error free. The
information provided in this software or documentation may include technical inaccuracies or typographical errors. The information in this software and documentation
is subject to change at any time without notice.
NOTICES
This Informatica product (the "Software") includes certain drivers (the "DataDirect Drivers") from DataDirect Technologies, an operating company of Progress Software
Corporation ("DataDirect") which are subject to the following terms and conditions:
1. THE DATADIRECT DRIVERS ARE PROVIDED "AS IS" WITHOUT WARRANTY OF ANY KIND, EITHER EXPRESSED OR IMPLIED, INCLUDING BUT NOT LIMITED TO,
THE IMPLIED WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NON-INFRINGEMENT.
2. IN NO EVENT WILL DATADIRECT OR ITS THIRD PARTY SUPPLIERS BE LIABLE TO THE END-USER CUSTOMER FOR ANY DIRECT, INDIRECT, INCIDENTAL,
SPECIAL, CONSEQUENTIAL OR OTHER DAMAGES ARISING OUT OF THE USE OF THE ODBC DRIVERS, WHETHER OR NOT INFORMED OF THE POSSIBILITIES
OF DAMAGES IN ADVANCE. THESE LIMITATIONS APPLY TO ALL CAUSES OF ACTION, INCLUDING, WITHOUT LIMITATION, BREACH OF CONTRACT, BREACH
OF WARRANTY, NEGLIGENCE, STRICT LIABILITY, MISREPRESENTATION AND OTHER TORTS.
The information in this documentation is subject to change without notice. If you find any problems in this documentation, report them to us at
infa_documentation@informatica.com.
Informatica products are warranted according to the terms and conditions of the agreements under which they are provided. INFORMATICA PROVIDES THE
INFORMATION IN THIS DOCUMENT "AS IS" WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING WITHOUT ANY WARRANTIES OF
MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND ANY WARRANTY OR CONDITION OF NON-INFRINGEMENT.
4 Table of Contents
Viewing Axon Glossary Assets in Informatica Axon. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Business Glossary Assets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30
Glossary Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 32
Business Term Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 33
Category Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
Policy Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 36
Viewing Business Glossary Assets in Informatica Analyst . . . . . . . . . . . . . . . . . . . . . . . . 37
Tabular Assets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 37
Tabular Asset Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
Checking the Data Quality of Tabular Assets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
Column and Field Assets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 43
Column and Field Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
Creating an Example-based Data Domain. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Data Domain Curation for Columns and Fields. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
Data Domain Assets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
Data Domain Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 51
Data Domain Group Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
Data Domain Curation. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 54
Composite Data Domain Assets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
Composite Data Domain Overview . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
Mapping Assets. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
Mapping Overview. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
Table of Contents 5
Difference Between the Lineage and Impact View and Relationships View. . . . . . . . . . . . . . . . . 76
Opening Different Assets from the Relationships View. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 77
6 Table of Contents
Configuring Custom Attributes. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 98
Configuring Filter Settings . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 99
Index. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 105
Table of Contents 7
Preface
The Enterprise Data Catalog User Guide is intended to help a data analyst or data architect in an enterprise to
use Enterprise Data Catalog.
This guide covers the tasks that you can perform using the Enterprise Data Catalog interface. This guide
assumes that you have installed and configured Enterprise Data Catalog. This guide also assumes that you
have installed and configured the services to access the Enterprise Data Catalog. See the Enterprise Data
Catalog Installation and Configuration Guide and the Catalog Administration Guide for more information.
Informatica Resources
Informatica provides you with a range of product resources through the Informatica Network and other online
portals. Use the resources to get the most from your Informatica products and solutions and to learn from
other Informatica users and subject matter experts.
Informatica Network
The Informatica Network is the gateway to many resources, including the Informatica Knowledge Base and
Informatica Global Customer Support. To enter the Informatica Network, visit
https://network.informatica.com.
To search the Knowledge Base, visit https://search.informatica.com. If you have questions, comments, or
ideas about the Knowledge Base, contact the Informatica Knowledge Base team at
KB_Feedback@informatica.com.
8
Informatica Documentation
Use the Informatica Documentation Portal to explore an extensive library of documentation for current and
recent product releases. To explore the Documentation Portal, visit https://docs.informatica.com.
Informatica maintains documentation for many products on the Informatica Knowledge Base in addition to
the Documentation Portal. If you cannot find documentation for your product or product version on the
Documentation Portal, search the Knowledge Base at https://search.informatica.com.
If you have questions, comments, or ideas about the product documentation, contact the Informatica
Documentation team at infa_documentation@informatica.com.
Informatica Velocity
Informatica Velocity is a collection of tips and best practices developed by Informatica Professional Services
and based on real-world experiences from hundreds of data management projects. Informatica Velocity
represents the collective knowledge of Informatica consultants who work with organizations around the
world to plan, develop, deploy, and maintain successful data management solutions.
You can find Informatica Velocity resources at http://velocity.informatica.com. If you have questions,
comments, or ideas about Informatica Velocity, contact Informatica Professional Services at
ips@informatica.com.
Informatica Marketplace
The Informatica Marketplace is a forum where you can find solutions that extend and enhance your
Informatica implementations. Leverage any of the hundreds of solutions from Informatica developers and
partners on the Marketplace to improve your productivity and speed up time to implementation on your
projects. You can find the Informatica Marketplace at https://marketplace.informatica.com.
To find your local Informatica Global Customer Support telephone number, visit the Informatica website at
the following link:
https://www.informatica.com/services-and-training/customer-success-services/contact-us.html.
To find online support resources on the Informatica Network, visit https://network.informatica.com and
select the eSupport option.
Preface 9
Chapter 1
Enterprise Data Catalog extracts metadata from external sources such as databases, data warehouses,
business glossaries, data integration resources, or business intelligence reports. For ease of search, the
catalog maintains an indexed inventory of all the assets in an enterprise. Assets represent the data objects
such as tables, columns, reports, views, and schemas. Metadata and statistical information in the catalog
include profile results, information about data domains, and information about data relationships.
You can use Enterprise Data Catalog to perform the following tasks on the catalog:
10
• Collaborate on assets by adding reviews, ratings, questions, following assets, and certifying assets.
1. Open a browser.
2. In the Address field, type the URL for the Enterprise Data Catalog login page in the following format:
http://<host>:<port>/ldmcatalog
The host is the gateway node host name. The port is the Informatica catalog service Enterprise Data
Catalog port number.
3. On the Enterprise Data Catalog login page, type the user name and password.
Use the same log in credentials that you used to log in to the Catalog Administrator.
Enterprise Data Catalog displays the total number of assets in the catalog along with the number of
resources. Clicking the number of resources displays all the resources on the Search Results page.
Business Examples
The business examples provide some of the scenarios in which you can use Enterprise Data Catalog. These
business examples assume that Enterprise Data Catalog is installed and configured and the catalog is
populated with metadata from all data sources.
Example 1
Susan works as a data analyst in a retail corporation that has outlets across the country. She is asked to
analyze the product and sales distribution for the corporation and produce a report based on her findings.
The input that Susan received is the sales distribution report for the retail corporation that exists in one of the
regional data warehouses. Susan must search in thousands of databases maintained at different regional
data warehouses for the retail corporation across the country.
The following table lists the tasks that Susan must complete and the steps that she must perform in
Enterprise Data Catalog to accomplish the tasks:
Search for the sales Use the search feature provided by Enterprise Data Catalog to search for the report
distribution report. in all the databases. Susan can search based on the report name or perform a
generic search using the wildcard search feature. Susan can further customize the
search using the search filters.
Discover the source tables After finding the report, Susan can view the details of the report to identify the
that contribute data to the tables from where data is taken for the report. She can view the report details using
report. the Asset Details tab.
Ensure the quality of source Susan can check and verify the profiling information displayed for the report in the
tables used in the report. Asset Details tab to ensure the quality of data.
Identify the related assets of Susan can view the relationship between the assets using the Relationships tab. She
the source tables in the report can also find the flow of data using the Lineage and Impact tab. These steps help
to gather the required assets her identify the different assets that she must consider to build the new report.
that she needs to make the
new report.
Enrich assets used in the new Enriching the assets with the attributes makes search easier. For example, enriching
report to tag these assets with the sales figures based on the regional location makes it easier to find the sales
additional information for figures for a particular region. Susan designs a new custom attribute based on
ease of search. regions in the Catalog Administrator and names the attributes North, South, East,
and West. She then uses the Edit Attributes feature in Enterprise Data Catalog to tag
these region attributes to the corresponding sales figure assets used in the report.
Susan can now search using any of the attributes to get a list of assets for the
specific region.
Example 2
Alex, the data architect in a financial institution, must make sure that there are no security violations in the
institution. Recently, the institution has had multiple instances where customers have reported fraudulent
transactions on their credit cards. Alex must search in millions of databases to find the tables and reports
where the credit card numbers exist. He must then report the violations and take the necessary steps to
remove or mask credit card details in tables where the credit card details must not be stored.
The following table lists the tasks that Alex must complete and the steps that he must perform in Enterprise
Data Catalog to accomplish the tasks:
Search all the Use the search feature provided by Enterprise Data Catalog to search for the report in all the
databases for the databases. Alex can search based on the credit card asset name, the column name, or
credit card data perform a generic search using the wildcard search. Alex can further customize the search
domain. using the search filters.
Identify the tables and After finding the assets that include the credit card details, Alex must shortlist the assets
reports where the that violate the security policy of the institution by storing the credit card details. Alex finds
credit card details the details of each asset by viewing the Asset Details tab. Alex also uses the Lineage and
exist. Impact tab to view the data flow for the asset. If the asset must not have the credit card
details according to the policy of the institution, Alex marks that asset as a violation.
Annotate tables that Alex annotates the assets that violate the policy of the institution with a custom attribute
violate the security named Non Compliance. Alex uses the Enterprise Data Catalog to create this custom
policy with attributes. attribute and uses Enterprise Data Catalog to annotate the violating assets with the custom
attribute. The IT team can find the required assets from millions of assets and take the
necessary corrective action.
Business Examples 13
Chapter 2
When you choose to show details in the search results page, the page displays the details of the asset, such
as the asset description, certify icon next to the asset name, rating for the asset, path to the asset, and type
of the asset. From the search results displayed, you can sort the results based on the relevance and asset
name. You can edit the properties of the selected assets to assign custom attributes and add a business
title. You can use the search filters displayed to refine your search further based on the Certified Assets,
Ratings, Resource Name, Asset Type, Resource Type, Last Updated, Rows, and Data Domains.
After finding the required asset, you can click the asset to view the profiling details, the lineage, and the
relationship of the asset with other assets.
14
Searching for Assets
You can search for assets from the Enterprise Data Catalog home page.
2. Type the asset that you want to search for in the search box and click the search icon.
The matching results appear as shown in the following image:
Note:
• The search results show a summary of the details of the asset.
• The number in parentheses under the Filter by panel sections indicates the number of matching
results found for the searched asset under those categories.
Wildcard Search
You can use the asterisk character (*) and the question mark character (?) to perform a wildcard search.
If you enter * in the Search box and click the search icon ( ), Enterprise Data Catalog lists all the assets in
the catalog. You can also use the asterisk with parts of an asset name. For example, to search for all the
assets that end with the string "_name," enter *_name in the Search box. Enterprise Data Catalog lists all of
the assets that match the search criteria, for example, Associate_Name, party_name, and cust_name.
Use the ? character to substitute individual characters in the name of an asset. For example, if the asset that
you want to search for begins with the letters HR, is followed by two numbers that denote the year, and ends
with REPORT, enter the following string in the Search box: HR??REPORT. Enterprise Data Catalog lists all of the
assets that match the search criteria, for example, HR12REPORT, HR13REPORT, and HR14REPORT.
The following image shows an example of search results for a wildcard search:
Search Suggestions
When you type the first few letters of an asset name into the Search box, Enterprise Data Catalog suggests
matching asset names.
If you type the name of an asset incorrectly, Enterprise Data Catalog compares the typed letters with names
of existing assets in the catalog. The probable matches then appear as search suggestions. For example, if
you enter sela to search for an asset named salary, Enterprise Data Catalog suggests salary as a probable
match.
Keyword Search
You can use a keyword when you want to search specifically for a type of asset in the catalog.
• Table
• Column
• File
• Report
You can use keywords in regular sentences or phrases.
For example, to search for all tables with customer details, you enter tables with customer in the Search
box. Enterprise Data Catalog returns the following results:
• To sort assets based on the relevance to the search, click and select Relevance.
Note: If you sort the assets based on name, the system attributes, or the custom attributes, the sort options
list displays an up arrow (↑) or a down arrow (↓) before the sort criteria selected. An up arrow indicates that
the assets are sorted in the ascending order based on the selected criteria and a down arrow indicates that
the assets are sorted in the descending order.
Select the required assets and click Show Details to view additional details about the asset.
Note: The additional details displayed depend on the settings configured in the Enterprise Data Catalog
Application Configuration dialog box. See the Configure Search Filters section for more details.
Customize Search
This chapter includes the following topics:
19
The following image shows the search filters in the Filter By panel:
1. Click Add.
2. Start typing the name of the resource in the box. A list of matching resource names appears.
3. Select the resource name. Enterprise Data Catalog refreshes the search results based on the selected
resource name.
1. Click Show All. The Select Resource Name dialog box appears.
Note: The Show All option appears if the number of resources created in the catalog is more than five.
2. From the Name column, select a resource name.
3. Click OK. Enterprise Data Catalog refreshes the search results based on the selected resource name.
• Column
• Table
• Resource
• View
For example, when you select Table from the Asset Type list, Enterprise Data Catalog lists all the assets of
type Table.
1. Click Add.
2. Start typing the name of the asset in the box. A list of matching asset types appears.
3. Select the asset type that you want. Enterprise Data Catalog refreshes the search results based on the
type of selected asset.
1. Click Show All. The Select Asset Type dialog box appears.
Note: The Show All option appears if the number of assets is more than five in the catalog.
2. From the Name column, select the types of assets
3. Click OK. Enterprise Data Catalog refreshes the search results based on the attribute selected.
1. Click Add.
2. Start typing the name of the data source in the box. A list of matching data sources appears.
3. Select the data source that you want. Enterprise Data Catalog refreshes the search results based on the
type of selected data source.
1. Click Show All. The Select Resource Type dialog box appears.
Note: The Show All option appears if the number of resource types is more than five in the catalog.
2. Select the required data sources from the Name column.
3. Click OK. Enterprise Data Catalog refreshes the search results based on the type of selected data
source.
You can select one of the following options from the Last Updated list:
Option Description
Today Displays the search results for asset details updated on the present day.
Yesterday Displays the search results for asset details updated on the previous day.
Last 7 days Displays the search results for asset details updated during the last seven days.
Last 1 month Displays the search results for asset details updated during the last one month.
Last 6 months Displays the search results for asset details updated during the last six months.
Last 1 year Displays the search results for asset details updated during the last year.
From the beginning Displays the search results for asset details present in the catalog from the time the catalog
was created.
Custom Displays the search results based on a range of dates specified by you. To specify the date
ranges, perform the following steps:
1. Select the Custom option from the Last Updated list.
2. Click the boxes to launch the calendar and select the date ranges.
3. Click GO to display the search results.
• Select All to specify that assets of all sizes must be considered for a search.
• Select a specific range listed.
• Specify a range for the asset sizes if you do not find the range of asset in the displayed list.
To specify a range of sizes for the searched assets, type the ranges in the boxes provided and click GO.
You can categorize data domains based on data domain groups. For example, you might want to view all the
data domains that belong to the Payment Card Industry (PCI) or Personal Health Information (PHI) data
domain group.
Note: If you are a user without administrative privileges, you must make sure that you have the required
permissions for the data domain resource in Catalog Administrator before you view or search for a data
domain.
1. Click Add.
2. Start typing the name of a data domain. A list of matching data domains appears.
3. Select the data domains. Enterprise Data Catalog refreshes the search results based on the selected
data domains.
The Certified Assets list displays the options, such as All Assets and Certified Assets. If you select the All
Assets option, the search results page displays all the assets that are certified along with the assets that are
not certified, and if you select the Certified Assets option, the search results page displays only the certified
assets.
Refine by Rating
You can search for assets based on the ratings that users provide on assets in the catalog. The Ratings list
displays ratings on a scale of one to five and the asset count in parentheses.
You can select one of the following options from the Ratings list:
Option Description
Displays the search results for assets that are rated between one and five.
Displays the search results for assets that are rated between two and five.
Displays the search results for assets that are rated between three and five.
Displays the search results for assets that are rated either four or five.
1. In the Composite Data Domains section of the Filter By panel, click Add .
2. Start typing the name of the composite data domain in the box. A list of matching composite data
domains appears.
3. Select a composite data domain. Enterprise Data Catalog refreshes the search results based on the
selected composite data domain.
1. Click Show All. The Select Composite Data Domains dialog box appears.
Note: The Show All option appears if the number of composite data domains inferred in the catalog is
more than five.
2. From the Name column, select a composite data domain.
3. Click OK. Enterprise Data Catalog refreshes the search results based on the selected composite data
domain.
Refine by Attributes
You can refine your search based on the system attributes or the custom attributes listed.
You can select All from the attribute list to select all the attributes included in the specific attribute type.
Adding Attributes
1. Click Add.
View Assets
This chapter includes the following topics:
For example, for a relational column, the Overview tab displays the details, such as the asset description,
people associated with the asset, business classifications, business terms, system attributes, composite
data domains, and data domains that are assigned to the column. If you configure the profile settings for a
resource in the Catalog Administrator, you can also view the profile results in the Overview tab, such as the
value frequency, similar columns, pattern, and inferred data types.
You can view the term properties from the Analyst tool for business glossary terms. You can also view the
glossary properties from the Informatica Axon for Axon glossaries in the Overview tab. The tab displays the
details, such as the technical, classified, glossary, and other assets to which the business term is related.
The tasks that you can perform from the Overview tab vary based on the asset type. For example, you can
associate a business title with any asset, assign custom attribute values to an asset from different sections,
check the quality of data for relational tables, relational views, CSV files, JSON files, XML files, and
Salesforce objects, and assign data domains to certain column and field assets.
27
Axon Glossary Assets
You can view and associate an Axon glossary as a business title with an asset in Enterprise Data Catalog. An
Axon glossary defines the organization's agreed definitions for data, activities, business terms, and other
objects. A set of well-described glossary terms allows users from different business functions to link their
data and activities to definitions that everyone within the organization can understand.
When you view details of an Axon glossary, Enterprise Data Catalog displays the details, such as the source
description, related Axon glossary assets, classified assets, and related technical assets in the Overview tab.
The following image shows the sample Overview tab for an Axon glossary:
For more information about Axon glossaries, see the Informatica Axon Data Governance User Guide.
Enterprise Data Catalog displays the following details for an Axon glossary:
The Source Description section displays the description of the Axon glossary. The description that you
view in the section comes from the description of the Axon glossary in Informatica Axon.
Related Glossaries
The Related Glossary Assets section displays other glossaries that are related to the Axon glossary that
you are viewing. The Axon glossaries are related through relationships in Informatica Axon.
The Related Technical Assets section displays the technical assets and data domains in the catalog that
are related to the glossary. If you associate the glossary with one or more assets in the catalog, the
section lists the assets that are associated with the glossary.
Classified Assets
The Classified Assets section displays all the assets that are grouped based on an Axon glossary that is
assigned as a custom attribute value to the assets from the Business Terms section. You can assign
multiple Axon glossaries as custom attribute values to an asset. The Classified Assets section also
displays the path to the listed assets and the name of the custom attribute associated with the assets.
Use a glossary as a custom attribute if you want to group assets based on a specific attribute and
provide more business context to the assets. For example, if you have tables named Emp_data_2015,
Emp_data_2016, and Emp_data_2017, you might want to group these tables based on the location they
belong to. You can assign an glossary called Employee Data as a custom attribute to all the three tables.
After you have assigned the custom attribute to the tables, the Overview tab for the glossary Employee
Data displays Emp_data_2015, Emp_data_2016, and Emp_data_2017 table names in the classified
assets section.
To view the Classified Assets section for a glossary, click the glossary link in the Business Terms
section of the asset.
People
The People section displays the users associated with the Axon glossary. The section by default
displays the data owner and data steward fields. Use the Edit Properties icon ( ) to add or change
the users associated with the glossary. The additional user fields that you see in the section come when
the catalog administrator creates and configures a custom attribute of User data type for an Axon
glossary in Catalog Administrator.
Business Logic
The Business Logic section displays the business context in which the Axon glossary is used. The
business logic that you view in the section comes from the business context of the Axon glossary in
Informatica Axon. You also use to Edit Properties icon ( ) to add your own business logic for the
Axon glossary.
Examples
The Examples section displays the contextual example of the Axon Glossary. The examples that you
view in the section come from the contextual example of the Axon glossary in Informatica Axon. You
also use to Edit Properties icon ( ) to add your own examples for the Axon glossary.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for an Axon glossary in the Catalog
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section.
You can change the custom attributes that are displayed in the Overview tab when you configure search
filters in the Application Configuration dialog box.
System Attributes
The System Attributes section displays the general properties about the Axon glossary. The properties
include the following information:
• Security classification of the glossary. Specifies whether the glossary is classified for public, internal,
confidential, or secret usage.
• Stage of the glossary in the approval lifecycle.
• Type of the glossary. Specifies whether the glossary refers to an entity, domain, or business term.
• Access type of the glossary. Specifies whether the glossary needs to be available for public viewing
or not.
• Current status of the glossary in Informatica Axon.
• Name of the resource that includes the glossary.
• Alias. Alternate names for the glossary that is commonly used in your organization.
Contains
The Contains section displays the child glossaries that are related through hierarchical relationship with
the glossary that you are viewing. The section also displays names of the listed glossaries and paths to
the glossaries.
You can view the following types of business glossary assets in Enterprise Data Catalog:
Glossaries
Glossaries are collections of business terms, categories, and policies. Different terms within an
organization might create and maintain separate glossaries, such as Finance and Human Resource.
When you view the details for a glossary, Enterprise Data Catalog displays the business glossary assets
in the glossary.
Business terms are words or phrases that use business language to define relevant concepts for
business users in an organization. When you view the details for a business term, Enterprise Data
Catalog displays the term properties, such as the name, description, and usage.
Categories
Categories are descriptive classifications of business terms and policies that define a structure for a
business glossary. For example, an Analyst tool user might create a category called Financial
Statements and assign the terms related to financial statements to this category. When you view the
details for a category, Enterprise Data Catalog displays the terms in the category.
Policies
Policies define the business purposes, processes, or protocols that govern business practices that are
related to business terms. For example, an Analyst tool user might create a policy called US GAAP to
represent a framework of accounting standards and apply the policy to specific business terms. When
you view the details for a policy, Enterprise Data Catalog displays the related terms and categories.
You can view details for business glossary assets in the Overview tab. You can also open business glossary
assets in the Analyst tool from the Overview tab.
Note: You cannot view lineage and impact for business glossary assets.
The following image shows the Overview tab for a business term:
For more information about business terms, categories, and policies, see the Informatica Business Glossary
Guide.
Note: The properties that Enterprise Data Catalog displays in the Overview tab vary based on the glossary
template that the business glossary administrator uses in the Analyst tool.
People
The People section lists the business glossary users that create, approve, and manage terms. The
People section includes the following users:
Use the Edit Properties icon ( ) to add or change the users associated with the glossary. The
additional user fields that you see in the section come when the catalog administrator creates and
configures a custom attribute of User data type for a glossary in the Catalog Administrator.
System Attributes
The System Attributes section includes the following information about the glossary:
Custom Attributes
The properties that the Custom Attributes section displays depends upon different types of custom
attributes the catalog administrator creates and configures for an axon glossary in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section.
Contains
The Contains section lists all the business glossary assets the glossary contains. Glossary assets
include categories, terms, and policies. The section by default lists a maximum of five business glossary
assets with details, such as the business glossary asset name, associated business title, and type of the
asset. Click Show All to view the complete list of assets the glossary contains.
Business Terms
The Business Terms section displays custom attributes of data types, such as BusinessTerm, Glossary,
AxonGlossary. The section displays related terms when the catalog administrator creates and configures
the custom attribute for the glossary asset in the Catalog Administrator. You can associate multiple
business terms as custom attribute values with the asset. Use the Edit Properties icon to add,
remove, or change the business term values.
Note: The properties that Enterprise Data Catalog displays in the Overview tab can vary based on the
business term template that the business glossary administrator uses in the Analyst tool.
Enterprise Data Catalog displays the following details for business terms:
Source Description
The Source Description section displays the description of the business term. The description that you
view in the section comes from the description of the business term in Informatica Analyst tool.
People
The People section displays the business glossary users that create, approve, and manage the term. The
people properties can include the following users:
Context
The Context section displays properties that provide background information about the term. The
context properties can include the following information:
Categories
The Categories section displays the categories to which the term belongs. Enterprise Data Catalog
displays the full path for each category in the following format:
System Attributes
The System Attributes section displays general properties about the term. The properties can include the
following information:
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a business term in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section.
Business Rules
The Business Rules section displays mapplet rules or rule definitions that contain transformations or
business logic.
For example, a mapplet rule that defines how a Social Security number is masked might be linked to the
term "Social Security Number." When you view the Overview for "Social Security Number," this property
displays the mapplet rule used to mask the data.
Alternatively, the rule definition used to calculate business term "Net Price" is Net Price = List Price
- Trade Discount Amount. When you view the Overview for "Net Price," this property displays the rule
definition.
The Related Technical Assets section displays the technical assets and data domains in the catalog that
are related to the term. If you associate the business term with one or more assets in the catalog, this
section lists the assets that are associated with the term.
The Related Glossary Assets section displays business glossary assets that are related to the term.
Related glossary assets can include business glossary terms that are related to the term through custom
relationships, categories that include the term, and policies that are related to the term.
Classified Assets
The Classified Assets section displays all the assets that are grouped based on a business term that is
assigned as a custom attribute to the assets from the Business Terms section. You can assign multiple
business terms as custom attributes to an asset. The section also displays the path to the listed assets
and the name of the custom attribute associated with the assets.
Use a business term as a custom attribute if you want to group assets based on a specific attribute and
provide more business context to the assets. For example, if you have tables named LA_Sales_2014,
LA_Sales_2015, and LA_Sales_2016, you might want to group these tables based on the location they
belong to. You can assign a business term called Los Angeles as a custom attribute to all the three
tables. After you have assigned the custom attribute to the tables, the Overview tab for the business
term Los Angeles displays LA_Sales_2014, LA_Sales_2015, and LA_Sales_2016 table names in the
classified assets section.
To view the Classified Assets section for a glossary, click the business term link in the Business Terms
section of the asset.
Other Assets
The Other Assets section displays assets in other Informatica applications that are related to the term.
The assets can include Metadata Manager objects and Analyst tool objects such as data domains, data
objects, profiles, and rule specifications.
Enterprise Data Catalog lists assets in this section when the assets exist within the same domain as the
Catalog Service.
Note: The exact properties that Enterprise Data Catalog displays in the Overview tab can vary based on the
category template that the business glossary administrator uses in the Analyst tool.
Source Description
The Source Description section displays the description of the category. The description that you view in
the section comes from the description of the category in Informatica Analyst tool.
People
The People section displays business glossary users that create, approve, and manage the category. The
people properties can include the following users:
• Category owner.
• Data steward. The user who manages the category.
• Stakeholders. Users who participate in the approval process before the data steward publishes the
category.
Classified Assets
The Classified Assets section displays all the assets that are grouped based on a category. The category
is assigned as a custom attribute to the assets from the Business Terms section. You can assign
multiple categories as custom attribute value to an asset. The section also displays the path to the listed
assets and the name of the custom attribute associated with the assets.
Use a category as a custom attribute if you want to group assets based on a specific attribute and
provide more business context to the assets. For example, if you have tables named
AccountsPayable_Receivable_2016, Payrol_2016, and Inventory_2016, you might want to group these
tables based on the department they belong to. You can assign a category called Accounting
Department as a custom attribute to all the three tables. After you have assigned the custom attribute to
the tables, the asset details view for the category Accounting Department brings together and displays
AccountsPayable_Receivable_2016, Payrol_2016, and Inventory_2016 table names in the classified
assets section.
To view the Classified Assets section for a glossary, click the category link in the Business Terms
section of the asset.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a category in the Catalog Administrator.
The section displays custom attributes of data types, such as the date, string, boolean, decimal, integer,
and URL. Use the Edit Properties icon ( ) to add or change values of the properties in the section.
System Attributes
The System Attributes section displays general properties about the category. The properties can
include the following information:
The Related Glossary Assets section displays business glossary assets that are related to the category.
Related glossary assets can include terms in the category in addition to policies and other categories
that are related to the category.
Policy Overview
The Overview tab for a policy displays the details, such as the status and phase of the policy, business rules
applied to the policy, and related glossary assets.
Note: The properties that Enterprise Data Catalog displays in the Overview tab vary based on the policy
template that the business glossary administrator uses in the Analyst tool.
People
The People section shows business glossary users that create, approve, and manage the policy. The
people properties can include the following users:
• Policy owner.
• Data steward. The user who manages the policy.
• Stakeholders. Users who participate in the approval process before the data steward publishes the
policy.
Business Rules
The Business Rules section displays the rule intent for the policy. The rule intent describes how an
organization implements a policy and specifies the compliance criteria for terms within the policy. For
example, a rule intent for the "Privacy" policy might specify the encryption method used for terms within
the policy.
The Related Glossary Assets section displays business glossary assets that are related to the policy.
Related glossary assets can include terms to which the policy is applied in addition to categories and
other policies that are related to the policy.
System Attributes
The System Attributes section displays general properties about the policy. The properties can include
the following information:
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a policy in the Catalog Administrator. The
section displays custom attributes of data types, such as the date, string, boolean, decimal, integer, and
URL. Use the Edit Properties icon ( ) to add or change values of the properties in the section.
Tabular Assets
You can view tabular assets such as relational tables, CSV files, and dimension tables in Enterprise Data
Catalog. You can view details about tabular assets, check the data quality of tabular assets, rate and review
tabular assets, and ask questions about tabular assets in the Overview tab.
A tabular asset is any asset that contains data that you can view in a tabular format.
If the columns or fields in a tabular asset have profile results, you can check the quality of the column or field
data from the Overview view.
Tabular Assets 37
The following image shows the sample Overview tab for a relational table:
Enterprise Data Catalog displays the following details for a tabular asset:
Description
The Description section displays the tabular asset description. Use the Edit Properties icon ( ) to edit
or add description for the asset. You can enter rich text as the description for the asset. Use the text
formatting options to format the rich text. To add more descriptive context to the asset, you can also
add multiple sections under the Description section. For example, you create and configure a rich text
type custom attribute called Additional Description for the tabular asset in the Catalog Administrator.
Enterprise Data Catalog displays the custom attribute as an another section under the Description
section in the Overview tab.
Sample Columns
The Overview tab displays the sample columns for tabular assets, such as view columns, table columns,
CSV fields, XML file fields, and JSON file fields, other semi-structured sources, and for sources that you
can run the column profile on.
The Sample Columns section displays the following details for the sample columns:
• Name. Displays the column or field name. To view the column or field asset details, click the column
name. The column asset details appear in the Overview tab.
• Business title. Displays the business title associated with the column. Hover the mouse over the
business title to view the name and confidence score. You can accept or reject the business title.
• Data Domains. Displays the data domains associated with the column. Hover the mouse over a data
domain to view the Name Match, Data Match, Proximity Match, System Suggested, and Assigned by
In the Columns tab, you can view the following details for each column in the tabular asset:
• Name. Displays the column or field name. To view the column or field asset details, click the column
name. The column asset details appear in the Overview tab.
• Business Title. Displays the business title associated with the column. You can also view the
recommended business terms if you want to associate a business term as a business title with the
column.
• Data domains. Displays the data domains associated with the column. Hover the mouse over a data
domain to view the Name Match, Data Match, Proximity Match, System Suggested, and Assigned by
details. The inferred data domains appear in orange color and the curated data domains appear in
green color. The Name Match displays Yes when the column name matches with the data domain.
The Proximity Match displays a score that is a ratio of the number of proximal data domains
discovered in the data asset to the number of configured proximal data domains for an inferred data
domain.
• Null Distinct Non-Distinct %. Displays the percentage of null, distinct, and non-distinct values in a
column. Hover the mouse over the horizontal bar chart to view the number rows that contain null,
distinct, and non-distinct values.
• Source Data Type Inferred Data Types. Displays the documented data type and inferred data types of
the column. Hover the mouse to view the list of documented data type and inferred data types with
percentages.
Reviews
The Reviews section displays the reviews and ratings for a tabular asset. By default, the section displays
a text box and five star rating scale next to the text box. To review the tabular asset, enter the review in
the text box and rate the tabular asset, and then click Submit.
The section displays the review that you provided at the top of the list of the existing reviews. You can
mark any review as helpful if you find the review useful.
The following image displays the Reviews section in the Overview tab:
The section displays a maximum of five latest reviews. To view all the reviews, click Show All. The
Reviews tab displays the complete list of reviews about the tabular asset, average ratings of the asset,
and total user ratings on the tabular asset.
The following image displays the Reviews tab for a tabular asset:
Tabular Assets 39
In the Reviews tab, you can review the tabular asset and filter the reviews based on the ratings.
You can also sort the list based on the following options:
• Recent Reviews. Sorts the reviews list based on the latest reviews.
• Voted Most Helpful. Sorts the reviews list based on reviews that are voted as most helpful.
• Ratings. Sorts the reviews list based on reviews with highest ratings.
For more information about reviews, see “Rate and Review Asset” on page 91.
Questions
The Questions section displays the questions that are asked about the tabular asset. To ask a question,
enter the question in the text box, and then click Submit. The section displays the question that you ask
at the top of the existing questions. The questions include the answer and delete options. A user who
has knowledge about the tabular asset can answer the questions asked. You can also mark an answer
as helpful if you find the answer useful.
The following image displays the Questions section in the Overview tab:
The section displays a maximum of five latest questions that are asked. To view all the questions, click
Show All. The Questions tab displays the complete list of questions asked about the tabular asset.
The following image displays the Questions tab for a tabular asset:
• My Questions. Displays all the questions that you asked about the tabular asset.
• Answered Questions. Displays all the questions that have answers.
• Unanswered Questions. Displays all the questions without answers.
For more information about questions, see “Asset Queries ” on page 92.
People
The People section displays the users associated with the tabular asset. The section by default displays
the data owner and data steward fields. The section also displays the followers of the tabular asset. Use
the Edit Properties icon ( ) to add or change the users associated with the glossary. The additional
user fields that you see in the section appear after the catalog administrator creates and configures a
custom attribute of User data type for a tabular asset in the Catalog Administrator.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a tabular asset in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section.
System Attributes
The Overview tab displays the System Attributes section for tabular assets. The system attributes that
are shown by default vary based on the parent resource type. For example, the default system attributes
for a relational table include comments, data source UUID, native type, and resource name.
You can change the system attributes that are displayed in the Overview tab when you configure search
filters in the Application Configuration dialog box.
The following image shows the Composite Data Domain section in the Overview tab of a tabular asset:
Tabular Assets 41
Business Terms
The Related Business Terms section displays custom attributes of data types, such as BusinessTerm,
Glossary, AxonGlossary. The section displays related terms when the catalog administrator creates and
configures the custom attribute for the tabular asset in the Catalog Administrator. You can associate
multiple business terms as custom attribute values with the asset. Use the Edit Properties ( ) to add,
remove, or change the business term values.
Business Classifications
The Business Classification section displays all the asset classifications based on custom attributes of
reference data types. The custom attributes of a reference data type depend on how the custom
attributes are defined in Informatica Business Glossary or Informatica Axon. For example, if you have a
table called Sales Details and you want to classify the table based on a region. To classify the table, you
can assign a custom attribute of reference data type called APAC to the table. Use the Edit Properties
Links
The Overview tab displays the Links section for tabular assets in resources that do not support profiling.
The Links section displays the child assets of the tabular asset. For example, if you view the Overview
for a pivot table, the Links section displays the axes in the table. If you view a presentation table, the
Links section displays the presentation columns and derived presentation columns.
The following image shows the sample of the Columns view for a relational table with profile results:
1. Bar charts that show the percentage of null values, distinct values, and non-distinct values in each field.
2. Colums view that displays profile results, data domains, business title, and column names.
The bar chart for each column or field shows the percentage of null values, distinct values, and non-distinct
values in the column or field. To view the number of rows associated with each percentage, hover the cursor
over the bar chart.
The following types of assets are examples of column and field assets:
For column and field assets in resources that support profiling, you can accept or reject the data domains
that are inferred for the asset. You can also assign one or more data domains to the asset. If an appropriate
data domain does not exist, you can create one.
Curation is the process of accepting and rejecting data domains for assets. You can curate data domain
results for a column or field asset from the Overview tab.
The following image shows the Overview tab for a relational column that has profile results:
Enterprise Data Catalog displays the following details for a column and field assets:
Description
The Description section displays the column or field asset description. Use the Edit Properties icon
( ) to edit or add description for the asset. You can enter rich text as the description for the asset.
Use the text formatting options to format the rich text. To add more descriptive context to the asset, you
People
The People section displays the users associated with the column or field asset. The section by default
displays the data owner and data steward fields. Use the Edit Properties icon ( ) to add or change
the users associated with the glossary. The additional user fields that you see in the section appear after
the catalog administrator creates and configures a custom attribute of User data type for a column or
field asset in the Catalog Administrator.
Business Terms
The Related Business Terms section displays custom attributes of data types, such as Business Term,
Glossary, Axon Glossary. The section displays related terms after the catalog administrator creates and
configures the custom attribute for a column or field asset in the Catalog Administrator. You can
associate multiple business terms as custom attribute values with the asset. Use the Edit Properties
Business Classifications
The business Classification section displays all the asset classifications based on custom attributes of
reference data types. The custom attributes of a reference data type depend on how the custom
attributes are defined in Informatica Business Glossary or Informatica Axon. For example, if you have a
column or field containing the sales details belonging to a particular region, you might want to classify
the assets based that region. To classify the asset, you can assign a custom attribute of reference data
type called APAC to the asset. Use the Edit Properties ( ) to add, remove, or change the business
classification values.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a column or field asset in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section.
System Attributes
The Overview tab displays the System Attributes section for all column and field assets. The system
attributes that are shown by default vary based on the parent resource type. For example, the default
system attributes for an Oracle Business Intelligence Enterprise Edition report field include the data set
UUID, native ID, native type, position, and resource name.
You can change the system attributes that are displayed in the Overview tab when you configure search
filters in the Application Configuration dialog box.
Value Frequency
You can view the value frequency for view column, table column, CSV field, XML file field, and JSON file
data assets in the Overview tab. In the value frequency section, you can view the distribution of values
across the data asset, the maximum and minimum values in the data asset, the distribution of distinct
values in the data asset, and a list of values and their frequency in a tabular format.
The following image displays the value frequency section in the Overview tab:
1. Value distribution chart. Displays the number of NULL values, distinct values, and non-distinct values in the
first 1000 rows in the data asset.
2. Maximum and minimum value. Displays the maximum value and minimum value in the data asset.
3. Slider. Displays the values in the slider along with the frequency and percentage of values in a tabular format
below the graph.
The value distribution for the first 1000 rows in the data asset appears as a horizontal bar chart. The
horizontal bar chart displays the distribution of NULL values, distinct values, and non-distinct values in
different colors. When you move the pointer over the chart, the number of rows and the percentage of
values pertaining to the data point appears in a data label. A data label is similar to a tool tip and it
displays the data that pertains to the data point in the chart over which you hover the pointer.
After you run the value frequency for a data asset, you can view the following information about the data
asset in the value frequency section:
• Max. Displays the maximum value in the data asset. After you click Max, the maximum value along
with the frequency and percentage appears in a tabular format below the value frequency graph.
• Min. Displays the minimum value in the data asset. After you click Min, the minimum value and the
next 19 values in the data asset along with the frequency and percentage appear in a tabular format
below the value frequency graph.
• Value frequency graph. The graph displays the distribution of distinct values in the data asset as
vertical bars in the column chart. The X-axis in the column chart represents values, and the Y-axis
represents the frequency of values. The values are arranged from maximum to minimum number of
occurrences in the chart. The chart displays the first 1000 distinct values in the data asset and has a
slider. The values within the slider appear in a tabular format below the value frequency graph along
with the frequency and percentage of the value.
• Values in tabular format. The values along with the frequency and percentage appear in a tabular
format. The values within the slider appear in the table or after you click the Max or Min value. At a
time, you can view only 20 values in the table.
Similar Columns
The Similar Columns section displays all the columns that are similar to the column you are viewing.
Enterprise Data Catalog discovers similar columns based on column names, column patterns, unique
values, and value frequencies. Click View All icon to view all the similar columns.
You can view the following details in the Similar Columns section:
• Column Name. Displays the column or field name. To view the column or field asset details, click the
column name. The column asset details appear in the Overview tab.
• Business Term. Displays the associated business term.
• Data Domain. Displays the inferred data domains for the column and the percentage of rows that
conform to the data domain. The inferred data domains appear in orange color and the curated data
domains appear in green color.
• Confidence %. Displays the score and the parameters based on which the score is calculated. The
confidence score is calculated by an internal algorithm based on the discovered similar column
names, similar column patterns, similar unique values, and value frequencies for the column that you
are viewing. You can view the Data, Distinct Values, Name, and Pattern parameters.
Data Domain
The Data Domain section appears for column and field assets that support profiling, such as relational
table and view columns, fields in CSV, JSON, and XML files, and fields in Salesforce objects.
A data domain is a predefined or user-defined Model repository object that enables you to discover the
functional meaning of column data or column names in a data source. Examples of data domains
include Social Security number, account status, IP address, and UPC code.
The inferred data domains appear in orange boxes with the inference percentage. For example,
The data domains that you accept are assigned to the asset. The assigned data domains appear in green
boxes. For example, . You can reject any assigned data domain, if required.
You can create an example-based data domain if you want to discover assets that have similar columns
or fields. For example, your organization uses a unique pattern for employee ID and you want to discover
all the assets that contain the employee ID pattern. To accomplish this task, you can either create a data
domain in Catalog Administrator or create an on-the-fly data domain, also called example-based data
domain in Enterprise Data Catalog.
You can create a example-based data domain instantly with the existing pattern of the column or field.
This data domain does not contain any rule. To use the column or field pattern as a data domain, click
the icon to create an example-based data domain. After you create the data domain, run the
SimilarityDiscovery scanner and DataDomainPropagation scanner in Catalog Administrator to propagate
and discover assets that contain a similar pattern. You can view a example-based data domain in
Catalog Administrator. To convert an example-based data domain to a rule-based data domain, you can
add one or more rules, conformance values, and proximity data domains.
In the Pattern section, you can view the data patterns for the column or field values along with the
percentage and frequency. Each inferred pattern appears a horizontal bar chart along with the pattern
format, number of rows, and percentage of rows.
In the Inferred data types section, you can view all of the inferred data types for the column or field. Each
inferred data type appears as a horizontal bar chart along with the data type, number of rows, and
percentage of rows.
To create a data domain in Enterprise Data Catalog, you must have the Catalog Management: Domain
Curation privilege.
1. In the Search Results screen, identify the column or field asset for which you want to assign the data
domain.
2. Click the asset.
The asset appears in the Overview tab.
3.
In the Data Domain section, click the icon.
The Edit Data Domains dialog box appears.
4. Enter a name for the data domain. Click Add.
5. In the New Data Domain dialog box, you can enter the following details:
• Optionally, enter a description for the data domain.
• Optionally, you can select a data domain group to which you want to assign the data domain.
6. Click OK.
In the Edit Data Domains dialog box, the data domain appears in the Accepted & Inferred Data Domains
list. The Assigned By column lists your user name as the person who assigned the data domain to the
asset.
7. Click OK.
In Catalog Administrator, you can run data domain discovery on resources. In Enterprise Data Catalog, you
can view the inferred data domains for column and field assets. When you configure the Auto accept if more
than conformance criteria option for a resource, Enterprise Data Catalog accepts the data domain
automatically when the data domain match exceeds the configured value. Data domain inference is also
based on the data domains that are assigned to similar assets in Enterprise Data Catalog. When you create a
data domain in Enterprise Data Catalog and assign it to a column or field, Enterprise Data Catalog infers the
data domain for similar assets.
You can perform the following curation tasks for a column or field in the Overview tab:
Note: If you accept or reject an inferred data domain for an asset, you cannot set the data domain back to the
inferred status.
Viewing and Curating the Data Domains for Tabular, Column, and Field Asset
You can assign a data domain to a column or field even though the data domain was not inferred for the
asset. You can also accept a previously rejected data domain or reject an assigned data domain.
1. In the Enterprise Data Catalog home page, type a table name that contains columns or fields assets, and
click the search icon.
2. In the Search Results page, click the table asset.
3. In the Overview tab for the asset, you can view the inferred data domains in the Sample Columns
section. The Data Domain column displays the data domain name, if column name matches the data
domain, and if there is a proximity match in the asset.
4. Click a column or field in the table asset.
The column or field asset appears in an Overview tab.
5. In the Overview tab, you can view asset details, such as Description, People, Business Terms, Business
Classifications, Custom Attributes, System Attributes, Value Frequency, Similar Columns, Pattern,
Inferred Data Types, and Data Domain.
6. Scroll to the Similar Columns section to view all the columns that are similar to the column you are
viewing.
The Data Domain column in the Similar Columns section displays the inferred data domains for the
column and the percentage of rows that conform to the data domain.
7. Scroll to the Data Domain section to view the data domains that are inferred for the asset from profile
results or from similar columns. This section appears only for column and field assets in resources that
support profiling such as relational table and view columns, fields in CSV, JSON, and XML files, and
fields in Salesforce objects.
8. Data domains that appear in an orange box with the inference percentage are inferred data domains. For
example, .
You can perform the following tasks on an inferred data domain:
• Click the icon to accept the data domain. After you accept the data domain, the box turns green.
The data domains that are auto-accepted appear in a green box. For example,
• Click the icon to reject the data domain. After you reject a data domain, the data domain no longer
appears in the Data Domain section for the asset.
9.
In the Data Domain section, click the icon to view the list of data domains that are assigned or
inferred for the asset and the list of data domains that are rejected for the asset.
10. To assign a data domain to the asset, perform one of the following tasks:
A data domain is a predefined or user-defined asset that is based on the semantics of column or field data or
a column or field name. Some examples of data domains are Social Security number, account status, IP
address, and UPC code. Assigning a data domain to a column or field makes the asset easier to identify and
understand.
You can organize data domains that apply to similar types of data in data domain groups. For example, the
Bank Account data domain group might contain data domains such as Account Status, Account Number, and
Credit Card Number.
Catalog administrators, Analyst tool users, and Developer tool users can create data domains and data
domain groups. You can also create an example-based data domain in Enterprise Data Catalog when you
view the overview details for a column or field that supports profiling. Data domains and data domain groups
are stored in the Model repository.
Data domains are inferred for column and field assets based on the profiles that users run in Catalog
Administrator and based on the data domains that are assigned to similar assets.
Curation is the process of accepting and rejecting data domains for assets. As you curate data domains for
columns and fields, Enterprise Data Catalog becomes more accurate in making data domain inferences for
other columns and fields. You can accept or reject the assets that are inferred for a data domain. You can
also reject the assets to which the data domain is assigned.
For more information about data domains, see the Informatica Catalog Administrator Guide.
Enterprise Data Catalog displays the following details for data domain assets:
Description
The Description section displays the data domain description. Use the Edit Properties icon ( ) to edit
or add description for the asset. You can enter rich text as the description for the asset. Use the text
formatting options to format the rich text. To add more descriptive context to the asset, you can also
add multiple sections under the Description section. For example, you create and configure a rich text
type custom attribute called Alternative Description for the data domain in the Catalog Administrator.
Enterprise Data Catalog displays the custom attribute as an another section under the Description
section in the Overview tab.
Source Description
The Source Description section displays the data domain description that you provide while creating the
data domain.
System Attributes
The System Attributes section displays the following system attributes for the data domain:
• Is Domain by Example. Displays Yes for an example-based data domain, and No for a rule-based or
pre-defined data domain.
• Resource Name. Displays DataDomain by default. "DataDomain" is the name of the internal job that
synchronizes data domains between the catalog and the Model repository.
• Last Modified. Displays the date, time, and year the data domain was last updated.
• Asset type. Displays the type of the asset.
The People section displays the users associated with the data domain . The section by default displays
the data owner and data steward fields. Use the Edit Properties icon ( ) to add or change the users
associated with the glossary. The additional user fields that you see in the section appear after the
catalog administrator creates and configures a custom attribute of User data type for a data domain in
the Catalog Administrator.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a data domain in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section.
Assigned Columns
The Assigned Columns section lists the columns and fields for which the data domain is assigned or
inferred.
You can view the following details in the Assigned Columns section:
You can reject an assigned data domain. You can choose a data domain, and click to accept or reject
a data domain for a table, column, or field asset.
Rejected Columns
The Rejected Columns section lists the columns and fields for which the data domain was rejected.
You can view the following details in the Rejected Columns section:
Enterprise Data Catalog displays the following details for data domain groups:
The Description section displays the data domain group description. Use the Edit Properties icon ( )
to edit or add description for the asset. You can enter rich text as the description for the asset. Use the
text formatting options to format the rich text. To add more descriptive context to the asset, you can
also add multiple sections under the Description section. For example, you create and configure a rich
text type custom attribute called Additional Description for the data domain group in the Catalog
Administrator. Enterprise Data Catalog displays the custom attribute as an another section under the
Description section in the Overview tab.
Source Description
The Source Description section displays the data domain group description that you provide while
creating the data domain group.
Contains
The Contains section lists the data domains in the group. The section by default lists maximum a five
data domains. Click Show All to view the complete list of assets the glossary contains.
System Attributes
The System Attributes section displays the following system attributes for the data domain:
• Last Modified. Displays the date, time, and year the data domain group was last updated.
• Resource Name. Displays DataDomainGroup by default. "DataDomain" is the name of the internal job
that synchronizes data domains between the catalog and the Model repository.
• Asset type. Displays Data Domain Group as the asset type.
People
The People section displays the users associated with the data domain group . The section by default
displays the data owner and data steward fields. Use the Edit Properties icon ( ) to add or change
the users associated with the glossary. The additional user fields that you see in the section come when
the catalog administrator creates and configures a custom attribute of User data type for a data domain
group in the Catalog Administrator.
Business Terms
The Business Terms section displays custom attributes of data types, such as BusinessTerm, Glossary,
AxonGlossary. The section displays related terms when the catalog administrator creates and configures
the custom attribute for the data domain group in the Catalog Administrator. You can associate multiple
business terms as custom attribute values with the asset. Use the Edit Properties ( ) to add,
remove, or change the business term values.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a data domain group in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
When you curate a data domain, you make the data domain more accurately reflect the type of data that
belongs in the data domain. Curating a data domain also makes Enterprise Data Catalog more accurate when
it infers data domains for similar assets.
You can perform the following curation tasks for a data domain:
• Accept or reject the inferred assets for the data domain. When you accept an asset, the data domain
becomes assigned to the asset.
• Reject any assigned asset for the data domain.
• Accept any rejected asset for the data domain.
Note: If you accept or reject an inferred asset for a data domain, you cannot set the asset to back the inferred
status.
You can accept or reject one asset for a data domain, or you can accept or reject multiple assets at once.
1. In the Enterprise Data Catalog home page, type a data domain name, and click the search icon.
2. In the Search Results page, click the data domain asset.
The Overview tab for the asset appears.
3. To accept an asset for the data domain, perform the following tasks:
1. In the Assigned Columns list, find the asset that you want to accept.
If the list of assets is long, you can search for an asset in the list by entering the first few characters
of the asset name in the search field.
2. To accept the asset for the data domain, click the icon.
4. To accept multiple assets for the data domain, perform the following tasks:
1. In the Assigned Columns list, select the assets that you want to assign.
If the list of assets is long, you can search for an asset in the list by entering the first few characters
of the asset name in the search field. To select all the assets, select the Name checkbox.
2. To accept the assets for the data domain, click the icon, and click Accept.
5. To reject an asset for the data domain, perform the following tasks:
1. In the Rejected Columns list, find the asset that you want to reject.
If the list of assets is long, you can search for an asset in the list by entering the first few characters
of the asset name in the search field.
2. To reject the assets for the data domain, click the icon, and click Reject.
The following image shows a sample data domain asset:
Catalog administrator creates composite data domains and defines rules based on a combination of existing
data domains or composite data domains in Catalog Administrator.
You can search for composite data domains and view the details about composite data domains in the
Overview tab. When you search for an asset using a composite data domain, the search results page includes
the composite data domain you searched for along with a list of assets for which the composite data domain
is inferred.
For example, when you search for a composite data domain called Customer Details, the catalog displays the
Customer Details composite data domain and the associated assets such as Sales_Details, Order_Invoice,
Customer_Addresses, and Customer_Orders_Delivery in the search results page.
1. Found in. Displays a list of resources that contain assets for which composite data domains are inferred.
For more information about composite data domains, see the Informatica Catalog Administrator Guide.
Enterprise Data Catalog displays the following details for composite data domain assets:
Description
The Description section displays the composite data domain description. Use the Edit Properties icon
( ) to edit or add description for the asset. You can enter rich text as the description for the asset.
Use the text formatting options to format the rich text. To add more descriptive context to the asset, you
can also add multiple sections under the Description section. For example, you create and configure a
rich text type custom attribute called Alternative CDD Description for the composite data domain in the
Catalog Administrator. Enterprise Data Catalog displays the custom attribute as an another section
under the Description section in the Overview tab.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a composite data domain in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section. .
Business Terms
The Business Terms section displays custom attributes of data types, such as Business Term, Glossary,
Axon Glossary. The section displays related terms after the catalog administrator creates and configures
the custom attribute for a composite data domain in the Catalog Administrator. You can associate
People
The People section displays the users associated with the composite data domain. The section by
default displays the data owner and data steward fields. Use the Edit Properties icon ( ) to add or
change the users associated with the glossary. The additional user fields that you see in the section
appear after the catalog administrator creates and configures a custom attribute of User data type for a
composite data domain in the Catalog Administrator.
Found in
The Found in section displays a list of resources that contain assets for which composite data domains
are inferred. The section also displays the type of the resource that contains the composite data domain,
and business glossary term that is associated with the resource.
When you expand a resource in the Found in section, you can view a list of instances. Each instance
displays assets on which the composite data domain is inferred. An instance can contain multiple
assets depending upon the composite data domain rules that you define in Catalog Administrator.
For example, you create a composite data domain called Customer Details using First_Name AND
Last_name rule in Catalog Administrator. Based on the composite data domain rule that you have
defined, Enterprise Data Catalog infers Order_Invoice and Delivery_details tables and associates the
composite data domain to those tables. When you open the Customer Details composite data domain in
the Overview tab, the Found in section displays tables Order_Invoice and Delivery_details under an
instance.
Mapping Assets
A mapping is a set of inputs and outputs that represent the data flow between sources and targets. A
mapping contains components such as source objects, target objects, and transformations.
You can view details for mapping assets in the Overview tab. The mapping assets in the catalog come from
the Informatica PowerCenter and Informatica Cloud Services. To display Cloudera data flow asset details,
open an operation asset type in the Overview tab.
Note: You cannot view lineage and impact for mapping and operation assets.
The following image shows a sample Overview tab for a mapping asset:
Mapping Assets 57
1. Source And Target. Displays the source and target assets that the mapping contains.
Mapping Overview
The Overview tab for a mapping asset displays the details, such as the system attributes, and source and
target assets the mapping contains.
Enterprise Information Catalog displays the following details for mapping assets:
Description
The Description section displays the mapping asset description. Use the Edit Properties icon ( ) to
edit or add description for the asset. You can enter rich text as the description for the asset. Use the text
formatting options to format the rich text. To add more descriptive context to the asset, you can also
add multiple sections under the Description section. For example, you create and configure a rich text
type custom attribute called Additional Description for the mapping asset in the Catalog Administrator.
Enterprise Data Catalog displays that custom attribute as an another section under the Description
section in the Overview tab.
People
The People section displays the users associated with the mapping asset . The section by default
displays the data owner and data steward fields. Use the Edit Properties icon ( ) to add or change
the users associated with the glossary. The additional user fields that you see in the section appear after
the catalog administrator creates and configures a custom attribute of User data type for a mapping
asset in the Catalog Administrator.
The Related Business Terms section displays all the BusinessTerm, Glossary, AxonGlossary data type
custom attributes that an catalog administrator creates and configures for the mapping asset. You can
associate multiple business terms as custom attribute values to the asset in the Business Terms
section. Use the Edit Properties icon ( ) to edit the section to add or remove business term values.
The System Attributes section displays the mapping description, type of resource that contains the
mapping, and folder name where the mapping is created.
Custom Attributes
The properties that the Custom Attributes section displays depends upon the different types of custom
attributes the catalog administrator creates and configures for a mapping asset in the Catalog
Administrator. The section displays custom attributes of data types, such as the date, string, boolean,
decimal, integer, and URL. Use the Edit Properties icon ( ) to add or change values of the properties
in the section.
Source and Target
The Source and Target section displays all the source and target assets for a mapping. The Source and
Target section also displays details, such as, the asset name, business term associated with the asset,
type of the asset, and path to the asset.
Mapping Assets 59
Chapter 5
Lineage describes the flow of data from the origins to an asset. Lineage shows you where the data for an
asset comes from and which assets affect the asset that you are studying. When you view an asset in a
lineage and impact diagram, the lineage includes the asset that you are viewing and all of the upstream
assets in the data flow.
Impact describes the flow of data from an asset to the destinations. Impact shows you where the data is
used and which assets might be affected if you change the asset that you are studying. When you view an
asset in a lineage and impact diagram, the impact includes the asset that you are viewing and all of the
downstream assets in the data flow.
You can view the lineage and impact for an asset in the Lineage and Impact view. By default, the Lineage and
Impact view displays the lineage and impact diagram for the asset. You can display the tabular asset
summary. The asset summary lists all of the assets that affect and are affected by the asset that you are
viewing. The view also displays transformation logic for assets that contain transformations.
View the lineage and impact for an asset to answer the following types of questions:
An analyst at your organization needs to find out which fields or columns are used to calculate the Cost
Ratio field in a Tableau report. She searches for "Cost Ratio" in the catalog and opens the Lineage and
Impact view for the asset. The analyst views the lineage and impact diagram and discovers that Cost
Ratio is calculated directly from a table in a data warehouse. The analyst can then perform further
analysis on the table.
If I replace an asset, which other assets might be affected? Who owns the affected assets?
A developer at your organization owns relational table "Transactions_DB" that is obsolete and needs to
be replaced with a different table. He searches for "Transactions_DB" in the catalog and opens the
Lineage and Impact view for the asset. The developer discovers that the table is used in multiple
60
Business Intelligence reports. He opens the asset summary and finds the names of the users that are
associated with the report assets. The developer then notifies them that the table will be replaced.
The asset from which you open the Lineage and Impact view is called the seed asset. By default, the lineage
and impact diagram displays the origins of the data, the seed asset, and the destinations for the data. The
seed asset appears within its own frame even if there are other assets in the data flow that are within the
same resource.
Intermediate assets are the assets between an origin and the seed asset and between the seed asset and a
destination. All intermediate assets are hidden by default. Hiding the intermediate assets decreases the time
it takes to display the diagram and simplifies the view.
The assets shown in the diagram vary based on the seed asset type. For example, if the seed asset is a
relational table, the lineage and impact diagram shows other tables and data structures that impact and are
impacted by the table. The lineage and impact diagram does not display the columns or fields within the
other tables and data structures.
The following image shows the default diagram view for a relational table:
1. Origins
2. Seed asset
3. Destination
The arrow between adjacent assets is called a link. A diagram can contain the following types of links:
• A direct link ( ) means that there are no other assets between the two assets. Direct links
are marked with solid arrows.
• An indirect link ( ) means that the link contains intermediate assets. Indirect links are
marked with dashed arrows.
To view intermediate assets, use the Lineage and Impact sliders at the top of the diagram or click the plus
sign (+) that appears when you hover over the link.
To get information about any asset in the diagram, hover over the asset name. When you hover over the
name, Enterprise Data Catalog displays a tooltip that contains information about the asset such as asset
type, resource name, resource type, and associated business term. To open the asset, click the asset name.
The following table describes the controls that you can use to customize the view:
Control Description
Lineage and Display the intermediate assets that are upstream and downstream of the seed asset.
Impact sliders
Zoom controls Zoom in, zoom out, select the zoom percentage, view actual size, or fit the diagram to the
window.
You can also use the scroll wheel on your pointing device to zoom in and out.
Center control Reposition the diagram so that the seed asset is in the center.
Menu Print the diagram, export the diagram, and show or hide the legend.
When the sliders are at the first position to the right of the Hide position, the diagram displays the origins,
seed asset, and destinations. To display additional upstream assets, move the Lineage slider to the right. To
display additional downstream assets, move the Impact slider to the right.
The sliders display additional assets by distance from the seed asset. A hop indicates the distance of an
asset from the seed asset. For example, the complete data flow for asset D is A -> B -> C -> D -> E -> F -> G.
Assets C and E are one hop from asset D. Assets B and F are two hops from asset D.
The number of positions on the sliders varies based on the number of hops as follows:
• If a diagram has 10 or fewer hops, each slider position represents one hop. If you move the Lineage slider
one position to the right, the diagram displays the assets in the next upstream hop.
• If a diagram has more than 10 hops, each slider position represents multiple hops. For example, in a
diagram with 100 hops, each slider position represents 10 hops. If you move the Impact slider two
positions to the right, the diagram displays the assets in the first 20 downstream hops.
To fully expand the diagram, move the Lineage and Impact sliders to the Show All positions.
To hide all upstream assets, move the Lineage slider to the Hide position. To hide all downstream assets,
move the Impact slider to the Hide position.
The following image shows the default lineage and impact diagram for a relational column:
When you move the Impact slider two positions to the right, the diagram displays the first and second
downstream hops.
When you move the Lineage slider one position to the right and the Impact slider to the Hide position, the
diagram displays the asset in the first upstream hop and hides all downstream assets.
The following image shows the diagram with the first upstream hop displayed and the downstream assets
hidden:
When you move both sliders to the Show All positions, the diagram is fully expanded.
When you click +, the link expands and displays all of the intermediate assets within the link.
The following image shows an expanded link between two relational tables:
1. The link between the origin and the seed asset is expanded to show an intermediate asset.
2. In the configuration panel, the Expanded Paths list shows the expanded links. To collapse a link, click X.
To view a list of the expanded links, click << to open the configuration panel. Each link that you expand is
listed in the Expanded Paths list.
In the Expanded Paths list, the expanded links are named as follows:
• If you expand a link that is upstream of the seed asset, the expanded link is named for the upstream
asset. For example, if you expand the link between origin asset "BUDGET" and seed asset "costfact," the
Expanded Paths list shows "BUDGET" as an expanded link.
• If you expand a link that is downstream of the seed asset, the expanded link is named for the downstream
asset. For example, if you expand the link between seed asset "costfact" and downstream asset
"CostRatioReport," the Expanded Paths list shows "CostRatioReport" as an expanded link.
You can collapse any expanded link from the Expanded Paths list in the configuration panel. To collapse an
expanded link, click X next to the link that you want to collapse.
You can display child assets for data structures such as relational tables, database queries, and business
intelligence workbooks and reports. Columns and fields do not have child assets.
When you display child assets, you can display all child assets or select specific child assets to display. If the
child assets that you select are within other data structures, the diagram displays the child assets within the
data structures that contain them. For example, if the seed asset is an Oracle Business Intelligence
Enterprise Edition report, and you select a report field, the diagram displays the field within the report page.
1. Seed asset
2. Child assets
3. Reset icon. Hides all child assets in the seed asset.
4. Drill down icon. Allows you to select the child assets to display in the diagram.
1.
Click the drill down icon ( ) in the upper right corner of the seed asset.
2. In the Drill Down dialog box, select the child assets that you want to display.
To search for a child asset, enter the first few characters of the asset name in the Find field. To select all
child assets, check the check box next to the Name column.
3. Click OK.
The lineage and impact diagram displays the child assets that you select and the assets to which the
child assets are linked.
1. Click << in the lower right corner of the diagram to open the configuration panel.
2. Enable the Show Business Terms option.
The diagram shows the business terms for all assets that have associated business terms.
To hide business terms, disable the Show Business Terms option.
You can choose to display the transformation logic for assets that contain transformations in Lineage and
Impact view.
Transformation logic view displays all the mappings or data flow relation types between source assets and
target assets for the resource type that you select. The transformation logic view, by default, displays the
source assets, transformations, and target assets. The view also displays the properties panel to view details
about assets in a mapping or data flow relation type.
If an asset contains multiple transformation logic between the source assets and target assets, the lineage
and Impact view displays a drop-down menu when you hover over the transformation icon. The drop-down
menu lists all the resource types that contain transformation logic. You can select a resource from the drop-
down menu for which you want to view the transformation logic.
The transformation view displays transformation logic for data structures such as tables and columns. The
view displays various types of transformations such as filter, joiner, lookup, expression, sorter, union, and
aggregate.
The following image shows a sample transformation logic for a relational table in the Lineage and Impact
view:
1. Source asset
2. Transformation logic
3. Target asset
4. Menu
5. Fit to window
A developer in your organization needs to find out how a field named Cost_Ratio in a Tableau report is
derived using transformations. The developer searches for Cost_Ratio in the catalog and opens the Lineage
and Impact view for the asset. The Lineage and Impact view displays different tables that contribute to the
asset. To analyze the asset further, in the Lineage and Impact view, he chooses to display transformation
icons and opens the asset Cost_Ratio which contains a transformation. He then finds out that the asset is
derived using a PowerCenter transformation.
In the diagram view, you can choose to display transformation logic for assets that contain
transformations.
To view the transformation logic in the Lineage and Impact view, click the transformation logic button
When you enable transformations, the view displays transformation icons ( ) next to the asset
names indicating that there are mappings or data flow relation types between the source assets and
target assets.
To go back to the Lineage and Impact view, click the asset name link at the upper left corner of the
transformation logic view.
Note: The Lineage and Impact view does not display transformation icons next to the asset names if
assets do not contain transformations.
You can drill down on a table if you want to select and display fields in the transformation logic view.
When you drill down on the table, the transformation logic view displays source fields, ports in the
mapping, and target fields. Perform the following steps to expand a table in the transformation logic
view:
1.
Click the ( ) drill down icon next to the table name in the transformation logic view. The Select
window appears.
2. In the Select window, select the fields you want to display in the transformation logic view.
3. Click OK. The transformation logic view then displays the source fields, mapping with ports, and
target fields for the selected fields.
4. To go back to the table-level transformation, click the back icon button at the upper right corner of
the transformation logic view.
You can view the properties panel in the transformation logic view. The properties panel displays general
details of assets in a mapping such as the name of the resource, type of the resource, and last
modification date of the resource. The asset information that you view depends on the type of mapping
you see in the transformation logic view.
To view the properties panel in the transformation logic view, hover over the asset name in the mapping
and click the asset name.
You can download the property details to a flat file ( .txt) file. To download the property details, click the
download icon ( ) at the upper right corner of the properties panel.
When you display the table-level transformation, the transformation logic view by default displays the
source table, mappings, and the target table. The mapping includes transformations, source qualifier,
and target transformation. You can expand the source table or target table to view fields and ports in the
transformation logic view.
When you display the field-level transformation, the transformation logic view by default displays the
source fields, mapping, and target fields. The mapping includes transformation ports, source qualifier
ports, and target transformation ports.
To view transformation at the field level, you must expand the table and select fields you want to view in
the transformation logic view. Alternatively, you can open a field in the diagram view and then enable
transformation to view if the field contains transformation logic.
Customize the view using the controls at the top of the diagram and the configuration panel. You can also
pan the diagram by clicking and dragging the diagram.
The following table describes the controls that you can use to adjust the view:
Control Description
Print the diagram, export the diagram, and show or hide the legend.
To print a lineage and impact diagram, open the menu ( ) and select Print.
To export the diagram to an .svg file, open the menu ( ) and select Export. The options to download the
diagram vary based on the browser.
To open the asset summary from the lineage and impact diagram, click the Asset Summary View icon ( )
at the top of the diagram. The asset summary lists the tables and data structures that affect and are affected
by the seed asset. The asset summary does not list columns or fields.
The assets in the asset summary view are independent of the assets shown in the lineage and impact
diagram. For example, the default diagram view for any asset shows the origins, seed asset, and
destinations. However, the default asset summary lists all assets in the data flow.
• The Asset Lineage Summary tab lists all assets in the data flow from the origins to the seed asset. These
assets impact the seed asset.
• The Asset Impact Summary tab lists all assets in the data flow from the seed asset to the destinations.
These assets are impacted by the seed asset.
The columns on the tabs display information about the assets such as asset name, asset type, resource
name, and associated business terms. You can filter the assets listed in the tabs by asset type, resource
name, and resource type. You can also export the information to a Microsoft Excel file.
Column Description
Resource type Type of resource that contains the asset, for example, Oracle or PowerCenter.
Users Users that are associated with the asset. Click a user name to display information about the user
such as department and email address.
Path Path of the asset from its parent. For example, the path for a table is <resource name>/<database>/
<schema>/<table>.
Tip: If the cells in a column contain long text strings, hover over the cell to display the tooltip. The tooltip
displays the entire string.
You can perform the following actions with the columns in the asset summary:
To hide a column, right-click the title bar, and uncheck the column that you want to hide.
To rearrange the order, click a column name and drag the column to a different location.
When you open the asset summary view, the filter panel is displayed by default. If the filter panel is not
displayed, click < on the right side of the asset summary view to open the filter panel.
To filter the asset summary, select only the asset types, resource names, and resource types that you want to
view. To remove all filters of a particular type, select All.
Note: The filters that you apply in the asset summary view have no effect on the lineage and impact diagram.
To close the filter panel, click > on the left side of the panel.
If you applied filters to the asset summary, the filtered assets do not appear in the export file.
1.
Click the Export icon ( ).
2. Select Lineage Summary, Impact Summary, or both.
3. Click OK.
The options to download the file vary based on your browser.
View Relationships
This chapter includes the following topics:
The related assets that you view for a selected asset depend on the asset type. For example, if the selected
asset is a column, the Relationships view shows all the data domains, similar columns, business term, and
users that are related to the column. If the selected asset is a data domain, the Relationships view shows all
the columns, business term, users, and data domain groups that are related to the data domain.
By default, the Relationships view displays the relationships for the selected asset. You can hover the cursor
over any related asset in the Relationships view for additional details such as the asset type, resource type,
and the time and date the asset was last updated.
The Relationships view for a business term displays all the related technical assets, related glossary objects,
classified assets, users, and data domains.
Relationships View
When you open an asset in the Relationships view, the selected asset appears at the center of the
Relationships view, and the related assets appear around the selected asset.
The Relationships view shows different circles that represent a specific asset or a group of assets. By
default, the selected asset is highlighted in blue and the related assets of the same type are represented as
small icons within the asset type circle.
The assets that you see in the Relationships view vary based on the selected asset type. For example, if you
select a table, the Relationships view displays related assets such as data domains, business term, reports,
and synonyms.
74
To expand the asset type circle, click the number icon ( ) at the top of the circle, and then click Show
All to display all the assets of the same asset type which are included in the selected asset. When you click
Show All, a dialog box appears, which provides you details about all the assets of the same asset type.
The following image shows the default Relationships view for a data domain named Product ID Information:
1. Selected asset
2. Related assets
3. Zoom controls
4. Reset the view
5. Find an asset
6. Show asset types
The following tables describes the controls that you can use to customize the view:
Control Description
Reset Reset the Relationships view back to the original view size.
Relationships View 75
Types of Relationships
The type of asset relationships that you see in the Relationships view depends on the asset that you open in
the Relationships view.
The following table lists some of the types of asset relationships the you can see in the Relationships view:
Business Term Related glossary assets, related technical assets, and classified assets.
The related glossary assets include categories, policies, and business terms.
The related technical assets include all the technical assets and data domains in the catalog.
The classified assets include all the assets that are grouped based on a business term that is
assigned as a custom attribute to the assets.
Columns Data domains, synonyms, similar columns, reports, users, and business term.
Data Domains Columns, data domain groups, fields, files tables, views, synonyms users, and business term.
The fields include CSV fields, JSON fields, and XML fields.
The files include CSV files, JSON files, and XML files.
Tables Views, synonyms, reports, users, other tables, data domains, and business term.
Views Tables, data domains, users, reports, synonyms, and business term.
The lineage and Impact view shows you the end-to-end data flow for an asset. You can use this view to
understand how a change to one or more assets affects the remaining assets in the data flow.
Lineage shows you where the data for an asset comes from and which assets affect the asset that you are
studying. Impact shows you where the data is used and which assets might be affected if you change the
asset that you are studying.
To open the assets in the circle, click the name of the asset that you want to open.
If the asset is a calculated field, associate a business term as a business title with the asset to see the
business rules that explain how the field should be calculated. For example, a Tableau report in the catalog
contains the field called Cost Ratio. The enterprise business glossary contains a term called Cost Ratio with
the following rule definition: Cost Ratio = Actual Cost / Planned Cost * 100. Associate the term Cost
Ratio with the report field so that users can see how the value of Cost Ratio should be calculated.
Associate a business term as a business title with a data domain to make it easier to associate the term with
assets to which the data domain is assigned. For example, you associate a business term called UPC with
data domain Product ID Information. When you associate a business term with any asset to which data
domain Product ID Information is assigned, Enterprise Data Catalog displays UPC as a recommended term
for the asset.
78
Adding a Business Title to an Asset
You can add a business title to any asset in the catalog that is not a business glossary or an Axon glossary
asset. Add a business title with an asset to provide more business context to the asset. You can either
associate a business term or provide a display name to add a business title to an asset. You can add one
business title to an asset.
After you add a business title to an asset, Enterprise Data Catalog displays the business title with the asset
name in the following places:
• To associate a business term as a business title for the asset, click the Edit Properties icon ( ) in
the search results. The Edit Properties dialog box appears.
• In the Edit Properties dialog box, click Business Terms, select a business term, and then click OK.
2. In the Add Business Title dialog box, you can either select a business term, an inferred business term, or
provide a display name to add a business title to an asset:
• You can provide a display name if the asset has no business terms or inferred terms.
• If one or more data domains with an associated business term are added to the asset, the inferred
terms appear in the Recommended Business Terms section. To accept an inferred term, click the
check mark icon next to the term.
• You can either select a business glossary or Axon resource to associate a business term if the asset
has no inferred terms.
3. To associate a business glossary asset with an asset, select the business glossary resource from the
Select a Resource list, and then select the business glossary from the Glossary list. Then, select the
term that you want to associate from the list of terms.
• If the list of terms is long, you can search for a term in the list by entering the first few characters of
the term name in the search field.
4. To associate an Axon glossary with an asset, select the Axon resource from the Select a Resource list,
and then select the Axon glossary that you want to associate from the list of Axon glossaries.
• If the list of Axon glossaries is long, you can search for a glossary in the list by entering the first few
characters of the glossary name in the search field.
5. Click OK.
A business term with the confidence score of 80 percent and above is automatically assigned as a business
title to a column. The confidence score is calculated by an internal algorithm based on accepted business
terms on data domains, column similarity, and name similarity between column and business term. If there
are multiple business terms that meet the threshold of 80 percent and higher, the term with the highest
The catalog displays the recommended business terms if confidence score is between 60 and 80 percent.
Business term recommendations appear in the Add Business Title window and Sample Columns section of a
table. The Add Business Title window displays a maximum of three recommended business terms.
You can curate the recommended business terms. As a part of the curation task, you can accept or reject the
business term recommendation. Accept the recommended business term as a business title if you find the
term appropriate for the column asset. Reject the recommended business term if you do not find it
appropriate. If you reject the recommended business terms or the auto-assigned business title, the terms
move to the Rejected Business Terms section.
Enterprise Data Catalog does not consider the business terms that are in the Rejected Business Terms
section until you clear the rejected terms from the list.
You can accept or reject multiple recommended business terms from the Overview tab of a resource. In the
Resource Contains section of a resource, click the Assets in the resource link to open the list of assets in the
resource. The Assets in the resource window displays the details, such as the name of the asset, type of the
asset, business term associated with the asset, and recommended business terms. You can filter the list
based on the asset type, assets with business title recommendation, assets with business title association,
and assets without business title association. To accept a recommended term, click the check mark icon
next to the term. If you have manually assigned a business title to a column, other business terms are not
inferred on the column.
Note: To view the inferred business terms in the catalog, ensure that you have enabled the Run Similarity
Profile option for a resource in the Catalog Administrator.
1.
Click the Edit Properties icon ( ) next to the assigned business title name of the asset.
The Add Business Title dialog box appears.
You can assign custom attribute values to a technical asset if the catalog administrator has configured
custom attributes for the asset type. For example, the catalog administrator configures a custom attribute of
string data type called Data Center Location for a CSV field. The catalog administrator sets the possible
values for the attribute to Denver, Milwaukee, and New York. When you launch the overview for a CSV field,
you can set the Data Center Location to Denver, Milwaukee, or New York in the Custom Attributes section of
the asset,
1.
In the Overview tab for an asset, click the Edit Properties icon ( ) from the sections, such as the
Description, People, Custom Attributes, Business Terms, and Business Classifications.
Alternatively, you can click the Edit Properties icon ( ) for the asset in the search results.
The Edit Properties dialog box appears.
2. In the Edit Properties dialog box, click Custom Attributes.
The Custom Attributes section on the left side of the dialog box lists the custom attributes that you can
select for the asset.
3. Select a custom attribute from the Custom Attributes section.
The options to enter the attribute values vary based on how the attribute is configured:
• If the attribute is configured with a predefined set of values, you can see a checklist of possible
values to the right of the dialog box. Select one or more values that you want to assign to the asset.
If the list of values is long, you can search for a value by entering the first few characters of the
attribute value in the search field.
• If the attribute allows you to enter a text string as the attribute value, you can see a text box to the
right of the dialog box. Enter a string value in the text box.
The exported CSV file displays all the custom attributes, asset types, Axon glossaries, business glossary
terms, and data domains that you selected during export. You can use the exported CSV file to assign or edit
custom attribute values to multiple assets at the same time, view data domains that are associated with the
assets, associate or edit Axon glossaries, and associate or edit business glossary terms. When you import
the CSV file back into the catalog, the catalog updates the assets based on the attribute values that you
specify in the exported CSV file.
For example, if relational tables in a resource called Finance_Department contains details about the payroll,
you might want to assign a custom attribute to all the tables containing payroll information and search for
the tables using the assigned custom attribute. You can export the custom attribute called Payroll to a CSV
file. In the exported CSV file, you can assign the Payroll custom attribute to the tables that contain details
about payroll and import the file into the catalog. The catalog updates the tables based on the attribute
values you assigned in the exported CSV file.
Before you import, you can choose to configure email notifications to send an email notification on the
import status. The email notification message contains a log file with the details of the import job.
After the import job is complete, Enterprise Data Catalog sends the email to the email address that you
specify during the import job. You can send the email notification to more than one person. To send email
notifications to multiple recipients, use a comma to separate the email address entries.
The following table lists the default order of the columns, column headings, and the format for the values in
the CSV file:
Name Specifies the name of the assets that you exported from a resource.
Title (Informatica Displays business glossary terms that are associated with the assets you exported. Use
Business Glossary) this column to edit or associate business glossary terms to assets.
Title (Informatica Axon) Displays Axon glossaries that are associated with the assets you exported. Use this
column to edit or associate Axon glossaries to assets.
Inferred and Accepted Displays data domains that are inferred or accepted for the assets you exported. Do not
Data Domains edit the values of this column.
Custom Attribute Name Specifies the name of the custom attribute. Use this column to edit or assign custom
attributes to assets.
1.
From the overview tab of a resource, open the menu ( ) and select Export.
The Export Custom Attributes window appears.
2. Select the asset types that you want to export along with custom attributes. Optionally, select the Select
Asset Types check box to export all asset types that are in the resource.
3. Click Next.
The Export Custom Attributes window displays a list of custom attributes that are configured in the
resource.
4. Select the custom attributes that you want to export.
• Optionally, select the Choose attributes to export check box to export all the custom attributes that
are configured in the resource.
• Optionally, clear the data domains and business terms options that are selected by default.
5. Click OK.
Note: Do not edit or modify the values of the custom attributes that are not applicable in the exported
CSV file.
1.
From the overview tab of a resource, open the menu ( ) and select Import.
The Import Custom Attributes window appears.
2. Click Browse to select the CSV file you want to import from the local machine. Optionally, to import the
file, you can drag and drop the CSV file into the dotted rectangle in the Select file to import section.
3. Click Next.
4. To receive notifications on the import status, select the Send email notification check box and enter a
valid email address in the address field. Enter multiple email addresses separated by a comma.
Note: You must enable the Receive Alerts through Email option for the catalog service and enable email
service in the Informatica Administrator to receive notification on the import status.
5. Click OK.
6. Optionally, choose to allow the import job to run in the background, or click OK to submit the import job.
The Enterprise Data Catalog sends you a notification after the import job is complete and updates
assets based on the attribute values that were assigned in the exported CSV file.
You can export data from data assets, such as tables, and views to a TDE file. After you export asset data,
view the TDE file in Tableau Desktop. To export asset data to a TDE file, verify that you have enabled the data
read privilege and data read permission for the resource in Informatica Administrator. To view classified
information of the asset data in Tableau Desktop, verify that you have enabled the sensitive data read
privilege in Informatica Administrator.
Prerequisites
Before you export the asset data to a TDE file, contact your administrator and make sure that the following
prerequisites are met:
• Create the necessary database connections in the Informatica Administrator console for resources, such
as Teradata, Hive, and Oracle.
• Assign required permissions and privileges to the non-administrator users to export and view the asset
data in the Tableau Desktop.
• Set the row count value that you want to export in the Custom Properties of Informatica Administrator.
Configure the dpRowLimit custom property to a value that you want to export. By default, Enterprise Data
Catalog exports 50,000 row values to a TDE file.
1.
From the overview tab of a table, or view, open the menu ( ) and select Export.
The Export Data window appears.
2. From the Select a connection drop-down list, select the database connection, and then click OK.
Enterprise Data Catalog downloads the TDE file to your machine.
Note: If you use a Hive connection to export, TDE export file does not include the data types, such as
Array, Map, and Struc. If you use any other connection type, TDE export file does not include the binary
data types.
3. Open the exported TDE file in Tableau Desktop application to view the asset details.
Business Example
You are a data steward who works in the retail industry, and business analysts in your enterprise need to
analyze the product sales for the quarter. You need to prepare a report on the quarterly sales.
You log into Enterprise Data Catalog and search for tables, such as the Sales_details_Q1, Sales_details_Q2,
and Sales_details_Q3, for your report. You want to know if the tables are reliable for the report. To find out if
the tables are reliable, you view the reviews, ratings, or comments on the tables. You also ask a query to the
data owner, data steward, or the subject matter expert of the tables.
Follow Asset
You can follow assets to monitor asset changes in the catalog. For example, as a data analyst, you can
follow an asset to be informed about the changes that are made to the asset, so that you can manage the
asset and take necessary actions. You can follow data assets, such as resources, tables, views, reports, and
files. You can follow basic changes, source changes, and collaboration changes of the assets.
If you follow the basic changes of an asset, you get notified on changes when users edit or associate a
custom attribute, add or remove a business title, add or remove a description, and assign people or remove
people.
86
If you follow the source changes of an asset, you get notified whenever there is a change in the source of the
asset. For example, you have followed a table named Customer_details to know the source changes. A new
column named New_Address is added to the table. Enterprise Data Catalog notifies you about the change.
If you follow the collaboration changes of an asset, you get notified when users add reviews, ratings, and
questions or when users certify the asset. You implicitly follow the collaboration changes of an asset when
you add ratings, reviews, questions, or answers to the asset.
When you follow an asset, Enterprise Data Catalog sends you a notification alert or email notification when a
user makes changes to the asset that you are following. You can view the asset change notifications in the
Notifications page except the resource change notifications. You can view the resource change notifications
in the Change Summary page. You can also view all the assets that you follow in the Followed Assets page.
Following an Asset
Follow assets to keep a track of asset changes in the catalog. The assets that you follow appear in the
Followed Assets page. Enterprise Data Catalog sends you notification alerts and email notifications when
users make changes to the assets that you follow.
1. In the Search Results page, click the asset name link to open the asset that you want to follow.
The asset opens in the Overview tab.
2. In the Overview tab of the asset, click Follow.
The follow menu appears. The menu displays All Changes, Enrichments, Source Changes, and
Collaboration Changes options.
3. You can follow assets by selecting any or all of the following options:
• To follow all the changes of the asset, select the All Changes option from the menu.
• To follow the enrichment changes of the asset, select the Enrichments option from the menu.
• To follow the source level changes of the asset, select the Source Changes option from the menu.
• To follow the collaboration changes of the asset, select the Collaboration Changes option from the
menu.
Enterprise Data Catalog notifies that you are following the asset.
Notifications
Enterprise Data Catalog generates different types notifications when changes are made to assets that you
follow. The notification types include application notifications, change email notification, and digest email
notification.
Application Notification
You can view application notifications when users make changes to the asset that you follow. An alert
on the Enterprise Data Catalog header indicates that you have a notification. You can click the ( ) alert
icon to view the list of recent notifications. To view the complete list of notifications in the Notifications
page, click Show All.
The Notifications page displays the details, such as the name of the asset that was changed, path to the
asset, change description of the asset, and the last modified time of the asset. You can filter the list of
notifications based on the changes that you have followed, last modified time of the asset, and asset
name. You can also click the asset name link in the notifications list to open the asset in the catalog.
Follow Asset 87
Change Email Notification
You can enable email notifications for every change in assets so that you receive a separate notification
email when there is a change to the asset that you follow. For example, when a user adds a description
and business title to the asset that you follow, Enterprise Data Catalog sends you two separate emails
for the two changes.
You can enable email notifications to receive emails on daily, weekly, and monthly asset changes.
• Daily changes. Email digest of all the change notifications you received for the day.
• Weekly changes. Email digest of all the change notifications for the week. The notification email is
sent every Sunday.
• Monthly changes. Email digest of all the change notifications for the month. The notification email is
sent on the last day of the month.
Note: You can manually run the Change Notification resource in the Catalog Administrator to receive
digest emails before the stipulated period of time.
• When you approve or reject the association of data domains with an asset.
• When Enterprise Data Catalog infers data domains and composite data domains for an asset.
• When you add, remove, or edit the business title of an asset.
• When you add or edit the description of an asset.
• When you add, edit, or delete the custom attribute values from an asset.
• When you certify or decertify an asset.
• When you edit the comment or tag of an asset.
• When you add, edit, or delete a question about an asset.
• When you add, edit, or delete an answered question about an asset.
• When you add, edit, or delete a review about an asset.
• When you add or remove assets from the data source of an asset.
• When you import and export the custom attributes.
• When custom attribute value is applied to all assets in the resource.
The data owner, data steward, and subject matter expert receive email notifications on the data source
changes even if they do not follow an asset. You can choose to configure email notifications to receive
emails on every change or at different notification frequencies, such as daily, weekly, and monthly.
Prerequisites
Before you enable email notifications in the catalog, ensure that the following prerequisites are met:
Following a Resource
You can follow a resource to keep a track of all the source changes in the resource. Enterprise Data Catalog
displays the resource changes in the Change Summary page. You can also configure email notifications to
receive notifications in an email.
To follow a resource, open the resource in the Overview tab, and then click Follow.
When you follow a resource, Enterprise Data Catalog displays the change summary notifications whenever
there are changes to the resource after the resource scan.
The Change Summary page appears as a separate tab when you open the resource that you follow. When you
open the Change Summary page, the page displays the details, such as the name of the asset that was added
or deleted, path to the asset, change details of the asset, and the last modified time of the asset.
You can filter the list of change summary notifications based on the resource scan time. Click the asset
name link in the change summary notifications list to open the asset in the catalog.
Follow Asset 89
Managing Assets Followed
You can view and manage all the assets that you follow in the Followed Assets page.
In the Enterprise Data Catalog header, click the ( ) user icon, and then select the Followed Assets option
from the list to open the Followed Assets page. The Followed Assets page displays all the assets that you
follow in the Assets tab and resources that you follow in the Resources tab.
By default, the Assets and Resources tabs display assets in a grid layout. To change the layout of the
section, you can click the ( ) icon. In the Assets tab, you can filter the assets based on the asset type,
resource name, and type of change you have followed.
You can also choose to follow or unfollow the assets and resources. To follow or unfollow, select the asset
or resource from the list, and then click Following.
Unfollowing an Asset
Unfollow an asset to stop receiving notification alerts on changes made to the asset. You can either unfollow
all the changes or choose to unfollow a specific change depending upon the type of change you chose when
you decided to follow an asset.
Note: You automatically unfollow an asset when there is a change in the source of the asset.
Unfollowing a Resource
Unfollow a resource to stop receiving notification alerts on changes made to the resource.
To unfollow the resource, open the resource in the Overview tab, and then click Following.
For example, Lisa, a data analyst in a retail industry wants to prepare a report based on the coupon sales for
the year 2018. In the catalog, she searches for assets which include the coupon sales details. Based on the
findings, she finds out that a few tables are useful for her report. She then provides a five-star rating for the
table assets and adds a review explaining why she found the assets useful for her report.
A user who has access to Enterprise Data Catalog can rate and review an asset. The data owner and user
who adds a rating and review can delete the rating and review. When you rate and review an asset, you
implicitly follow all the collaboration changes of the asset. The collaboration changes include other ratings
and reviews about the asset, and questions and answers about the asset. You can mark a review as helpful if
you find the review useful. You can rate and review data assets, such as the tables, views, files, and reports.
After you rate and review an asset, catalog displays the average asset ratings under the asset name in the
Overview tab and Search Results page. You can also filter assets based on the ratings in the Search Results
page.
1. From the Search Results page, open the asset that you want to rate and review.
The asset opens in the Overview tab.
2. In the Reviews section, enter the review in the review box, and then rate the asset using the five-star
rating next to the review box.
3. Optionally, you can just rate the asset using the five-star ratings.
4. Click Submit.
5. Optionally, you might want to replace your existing review. To replace your existing review, enter the
review in the review box and rate the asset.
6. Click Submit.
The Replace Review dialog box appears.
7. Click OK to confirm.
Enterprise Data Catalog replaces your existing review and ratings.
Deleting a Review
A user who rates and reviews an asset and data owner can delete the review.
To delete a review, click the ( ) icon under the review, and then click OK in the Delete Review dialog
box.
You can ask a question that is descriptive, exploratory, predictive, or causal in nature. You can ask questions
about all data assets in the catalog. You can add multiple questions and answers about a single asset.
For example, John, a data analyst who wants to prepare a sales report using relational tables containing
sales details. The analyst opens a table containing sales details in the Catalog. After viewing the details of
the table, the analyst is unsure if he can use the table for his report. He posts a question stating if he can use
the table for his sales analysis in the Overview tab of the table.
A user who has access to the catalog can ask a question and answer the question about the asset. The data
owner and user who asks the question can delete the question. Similarly, the user who answers the question
and data owner can delete the answer.
When you delete a question, the answers associated with the questions also get deleted. The user who asks
a question and answers a question, implicitly follows all the collaboration changes of the asset.
Adding Queries
You can ask a question about an asset in the Overview tab of the asset. You can ask multiple questions
about the same asset. When you ask a question, a user who has access to the Catalog and knowledge about
the asset can answer the question.
To ask a question, in the Questions section, enter the question, and then click Submit.
Deleting Queries
You might want to delete a question that is irrelevant. The user who adds the question and data owner of the
asset can delete the question. When you delete a question, the answers that are associated with the question
get deleted along with the question.
To delete a question, choose the question that you want to delete, and then click Delete in the Questions
section.
Adding Answers
You can answer a question about an asset in the Overview tab of the asset. A user who has access to the
Catalog and knowledge about the asset can answer the question.
You can add multiple answers for the same question. You can also mark an answer as helpful if you find the
answer reliable so that the other users can consider the answer.
1. In the Questions section, identify the question that you want to answer.
2. Click Answer.
3. In the answer field, enter the answer.
4. Click Submit.
5. Optionally, mark an answer as helpful from the list of available answers.
1. In the Questions section of an asset, identify the answer that you want to delete from the list of answers
under the question.
2. Click Delete next to the answer.
The Delete Answer dialog box appears.
3. Click OK.
Certify Asset
You can certify an asset to endorse the asset so that other users can use the asset as a trustworthy one over
the assets that are not certified. Certifying the asset indicates the reliability of the asset in the Catalog.
The data owner, data steward, and subject matter expert can certify and decertify the asset. When you
decertify an asset, you remove the certification associated with the asset. You can certify data assets, such
as the tables, views, files, columns, synonyms, and reports. You can add a description and a maximum of 10
tags to the asset when you certify an asset. You add tags to make the asset easily searchable in the catalog.
You can also filter assets based on the certified assets. To certify the asset, ensure that you have the read
and write permission for the DomainUsers resource in the Catalog Administrator.
For example, the data owner of the asset called TicketSalesDetails finds out that there are multiple assets
with the same name as TicketSalesDetails in the Catalog. The data owner wants to ensure that the right
asset is endorsed to the users as the duplicate assets did not contain the latest asset information. To
highlight the right asset in the Search Results page, he certifies the asset by adding a description and tags so
that you can easily search the for the asset and the asset is quickly available in catalog search results.
When you certify an asset with a tag and search for the tag, the certified asset appears at the top of the
search results in the Search Results page. To indicate that the asset is certified, Enterprise Data Catalog
displays the certify icon ( ) under the asset name in the Overview tab and next to asset name in the Search
Results page.
Certifying an Asset
Certify an asset to endorse the asset in the catalog. The resource stakeholders, such as data owner, data
steward, and subject matter expert, can certify all the assets in the resource.
1. From the overview tab of an asset, open the menu and select Certify Asset.
The Certify Asset dialog box appears.
2. Optionally, in the Certify Asset dialog box, add a comment and associate tags.
Note: You can add a maximum of 10 tags in the Add Tags field.
3. Click OK.
Enterprise Data Catalog displays the certified icon next to the asset name after you certify the asset.
Certify Asset 93
Decertifying an Asset
The data owner, data steward, and subject matter expert can decertify an asset. Decertify an asset if the
asset contains obsolete information.
To decertify an asset, from the Overview tab of the asset, open the menu, and then select Decertify.
Configure Application
This chapter includes the following topics:
You can view the Application Configuration page after the catalog administrator assigns the Catalog
Privileges : Application Configuration privilege in the Informatica Administrator tool. For more information
about privileges and permissions, see the Informatica Administrator Reference for Catalog guide.
1.
Click the Application Configuration icon ( ).
The Application Configuration page appears.
2. Click Asset Types.
95
The list of asset types appears as shown in the following image:
3. Select the required asset types from the Asset Types section.
4. Click Save.
Enterprise Data Catalog adds the selected data types to the Asset Type list on the Filter by panel.
Note: You cannot search for the assets in the search results page if you clear to remove the assets types
from the asset types list.
1.
Click the Application Configuration icon ( ).
The Application Configuration page appears.
3. Select the required system attributes and select the following details for each attribute based on your
requirements:
• Display in Search Results. Displays the attribute in search results.
• Display in Object Overview. Displays the attribute in the System Attributes section on the Overview
tab.
• Allow Searching. Searches based on the attribute. You can specify the search priority for the attribute
in the Search Rank list by selecting one of the following options:
- Low
- Medium
- High
• Allow Sorting. Allows you to sort the assets based on the attribute.
• Allow Filtering. Allows you to make the attribute a search filter on the Filter by panel.
4. Click Save.
The selected system attributes appear on the Filter by panel.
1.
Click the Application Configuration icon ( ).
The Application Configuration page appears and a list of available custom attributes appears in the
Custom Attributes tab as shown in the following image:
2. Select the required custom attributes and select the following details for each attribute based on your
requirements:
• Display in Search Results. Displays the attribute in search results.
• Allow Searching. Searches based on the attribute. You can specify the search priority for the attribute
in the Search Rank list by selecting one of the following options:
- Low
- Medium
- High
• Display in Object Overview. Displays the attribute in the different sections depending on the type of
the custom attribute on the Overview tab.
• Allow Sorting. Allows you to sort the assets based on the attribute.
1.
Click the Application Configuration icon ( ).
The Application Configuration page appears.
2. Click Filter Settings.
3. To change the order of the search filters, select a filter and then click the Move up or Move down icon
( ). Optionally, you can select a search filter, and then drag the filter to change the order.
4. Clear the check boxes if you want to collapse the search filters in the Filter By panel.
5. Click Save.
Open the Google Chrome Web Store and add the Enterprise Data Catalog Plug-in to the browser.
You can launch the Enterprise Data Catalog Plug-in from any website that you open in a Chrome tab. By
default, the plug-in displays the plug-in home page. The home page lists the most searched keywords based
on the recent search that you perform using Enterprise Data Catalog.
You can open a URL in a Chrome tab to launch the plug-in after you enrich an asset with a URL in Enterprise
Data Catalog.
When you view an asset in Enterprise Data Catalog Plug-in, it displays details, such as the business title
associated with the asset, description of the asset, business terms, people associated with the asset, and
source assets that are related to the asset.
You can use Enterprise Data Catalog Plug-in to search for an asset based on the name, or perform a wildcard
search to find all the catalog assets. You can use the filter and sort options to refine your search. To get a
comprehensive view of the asset, you can launch Enterprise Data Catalog application from the Enterprise
Data Catalog Plug-in to display the asset details view or the lineage and impact view.
Example
Susan, a business user in your organization, looks at a Tableau report called Cost_Ratio_Report. She needs to
find out more about the business context and technical assets associated with the report so that she can
understand the report better. She opens the Enterprise Data Catalog Plug-in extension within the Tableau
Server and finds out that the business terms, such as Variable Cost, Cost Ratio, and Fixed Cost are
associated with the report. She also finds out that James is the owner of the asset and Mary, Patricia, and
Jennifer are associated as data stewards for the asset. Susan launches the Enterprise Data Catalog
application from the extension to display the asset details view or the lineage and impact view for the report.
101
Enterprise Data Catalog Plug-in User Interface
The following image displays the user interface of Enterprise Data Catalog Plug-in:
Note: You cannot launch the Enterprise Data Catalog Plug-in from dynamic web pages that refresh the
content automatically.
1.
Right-click the Enterprise Data Catalog Plug-in extension icon from the header area and choose
Options from the menu.
The Enterprise Data Catalog Plug-in Configuration dialog box appears.
2. Enter the following properties for the connection:
• Username. Username to log in to the Enterprise Data Catalog Plug-in.
• Password. Password to log in to the Enterprise Data Catalog Plug-in.
• Security Domain. Name of the Security Domain.
• Enterprise Data Catalog URL. Enterprise Data Catalog application URL. Use the following URL format:
http[s]://<host>:<port>/ldmcatalog
1.
Click the Enterprise Data Catalog Plug-in ( ) icon on the top right area of the browser window.
2.
Click the Enterprise Data Catalog Plug-in ( ) icon on the right side of the page.
The Enterprise Data Catalog Plug-in home page appears.
3. In the home page, type the name of the asset that you want to search for in the search box and click the
search icon.
The matching results appear in the results page.
4. Click an asset name that you want to view in the Enterprise Data Catalog Plug-in.
A Assets
Following 90
add Axon glossaries
answers 92 details 28
adding Overview 28
queries 92 Axon glossary assets
application configuration viewing in Informatica Axon 30
asset types 95
custom attributes 98
filter settings 99
overview 95
B
system attributes 96 business example
asset use cases 12
queries 92 business glossary assets
Asset Impact Summary overview 30
columns 71 viewing in Analyst tool 37
displaying 70 business terms
exporting 73 associating with assets 79
Asset Lineage Summary associating with data domains 79
columns 71 details 33
displaying 70 displaying in lineage and impact diagrams 66
exporting 73 removing a term association 80
asset summary view viewing in Analyst tool 37
column descriptions 71
displaying 70
exporting 73
filtering 72
C
hiding columns 71 categories
overview 70 details 35
rearranging column order 71 viewing in Analyst tool 37
asset tasks certify
overview 27 assets 93
asset types certifying
application configuration 95 assets 93
assets changes 89
assigning custom attribute values 81 child assets
associating a business term 79 displaying in the diagram view 65
Axon glossaries 28 column assets
Axon glossary details 28 accepting data domains 49
business glossary assets 30 assigning data domains 48
business glossary categories 35 creating data domains 48
business glossary glossaries 32 curating data domains 48
business glossary policies 36 details 44
business term details 33 overview 43
column assets 43 rejecting data domains 49
composite data domain assets 55 composite data domains
data domain assets 50 asset details 56
export and import 82 configuration panel
field assets 43 opening and closing 62
overview 27 configuring
removing a business term association 80 emailnotificartion 88
removing custom attribute values 81 curation
search overview 14 assigning one asset 54
searching from the home page 15 for individual assets 48
tabular assets 37 overview 54
viewing additional details 18 rejecting one asset 54
105
custom attributes examples
application configuration 98 Lineage and Impact sliders 63
assigning attribute values 81
export and import 82
removing attribute values 81
F
field assets
E L
Enterprise Data Catalog legend
logging in 11 displaying and hiding 62
overview 10 lineage and impact
user interface 11 adjusting the diagram view 62
Enterprise Data Catalog Plug-in asset summary view 70
Configuration 103 assets within a link 65
Installation 103 diagram view 61
Overview 101 difference with Relationships view 76
Prerequisites 103 displaying business glossary terms 66
user interface 102 displaying child assets 65
view Tableau assets 104 exporting the diagram 70
hiding business glossary terms 66
106 Index
lineage and impact (continued) search (continued)
printing the diagram 70 customizing 19
seed asset definition 61 from the home page 15
slider controls 63 keyword search 16
slider examples 63 overview 14
tabular view 70 refine by asset size 23
Lineage Asset Summary refine by asset type 21
filtering 72 refine by attributes 25
lineage definition 60 refine by certified assets 24
Lineage slider refine by composite data domains 25
examples 63 refine by data domains 23
using 63 refine by last updated time 22
links refine by ratings 24
definition 61 refine by resource name 20
expanding 65 refine by resource type 22
logging in search suggestions 16
Enterprise Data Catalog 11 wildcard search 16
search results
sorting 17
O
overview 60
T
Tableau Data Extract File
Export 85
P Overview 84
Prerequisites 84
policies tabular assets
details 36 checking data quality 43
viewing in Analyst tool 37 details 38
overview 37
R
rate
U
review 91 unfollowing
ratings assets 90
reviewing 91 resource 90
relationships user
overview 74 collaboration 86
Relationships view
adjusting the diagram 75
asset type circles 74
difference with Lineage and Impact view 76
V
opening assets from 77 view
overview 74 lineage and impact 60
selected assets 74 notifications 87
resource
summary 89
W
S wildcard search
performing 16
search
configuring search filters 95
Index 107