Interview

Download as txt, pdf, or txt
Download as txt, pdf, or txt
You are on page 1of 8

Capgemini interview questions:

1)Self introduction
2)About project

3)input 1,2,3,4,5,6,7,8,13 output 1,2,3,5,8,13 how do you write logic -wrong


questions

4) how do you load data to a table based on target load order


-> we can define whihch target will load frist by using target load order

5) send first three records to one target, next 5 records to another target,
remaining to another target.
-> 1.TAke expression drag column to expression
2.take sequence generator and start value with 1 and drag nexvalue to
expression
3.take router and take both column to the router and make condtion
nexvalue<=3 , nexvalue>3 and nexvalue<=8 and nexvalue>8
4. connect to the output
***********************************************************************************
***********************************************************
Cognizant Interview questions

1. Tell me about your self

2. How to remove duplicate from table in oracle


-> DELETE employee
WHERE rowid NOT IN
(SELECT MAX(rowid)
FROM employee

GROUP BY EMPID);

-> WITH cte AS (SELECT EMPID ,ROW_NUMBER()OVER (PARTITION BY EMPID, ORDER BY EMPID)
as row_num FROM employee)
DELETE FROM cte
WHERE row_num > 1;

3. What are the column names of Fact table in your project


-> STUDY: id,name,country,date, study description SITE:id, country, SUBJECT:

4. what is the purpose of loading the Dimention Table in your project


-> To make the reports and dashboards ,data monitor

5. What the use of Agg. Transformation


-> To perform aggregate function like sum ,avg, min ,max

So be careful guys

28-5-22
Cognizant Interview Questions
1.explain about scd type 2
-> Transformation needed : expression, Lookup, sequence-generator, router,update-
strategy

2.how to load unique records into one table duplicate records into another table
-> 1. take aggregate transformation find the count of the cloumn
2. take router and make two condtion one is count =1 for unique and count!=1
or default gruop can be used for duplicate
3.indirect file method
-> Used to load the source files with same structure by using singal source
qulifier to the target

4.how to pass one session parameter into another session

->we can not pass parameter

this is for variable


-> Yes, By giving the
In mapping1 create a mapping variable say var1 and set its value to cloumn.
Create a workflow variable in the workflow, say var_wkf
In session1, in Post-session on success variable assignment, assign var_wkf = var1
In mapping2, create a mapping variable, say var2
In session2, in Pre-session variable assignment, assign var2=var_wkf
You can use the var2 variable in mapping2, it should have the value set in mapping1

5.what is joiner transformation , explain about how to improve performance


-> joiner is used to join the two heteroguoues source in the mapping ,
-> use sorted data whenever posibale ,use source with fewer rows as master so that
iterations will be less with detail ,Perform joins in a database

6.what is transaction control transformation


->Use the Transaction Control transformation to commit or roll back transactions
from transactional targets
we can use this whenever we have large amount ot data and we can commit after
some interval

7.what are the process for code deployment in your project


->

8.do you have knowledge on MD5 function


->You want to write changed data to a database. Use MD5 to generate checksum values
for rows of data you read from a source. When you run a mapping, compare the
previously generated checksum values against the new checksum values. Then, write
the rows with updated checksum values to the target. You can conclude that an
updated checksum value indicates that the data has changed.

9.explain about router transformation


->Router transformatio is used to apply multiple conditions on same input data
rather than using multiple filter

10.explain about performance tuning


->To optimize session performance by eliminating performance bottlenecks.
performance bottlenecks mean bez of one object or compoent the entire session or
workflow is getting stopped or running for long time

11.explain about update strategy transformation


->The Update Strategy transformation is an active transformation. When you design a
data warehouse, you need to decide what type of information to store in targets. As
part of the target table design, you need to determine whether to maintain all the
historic data or just the most recent changes. there are four tranf we can use
insert, delete, update, or reject.

11.How to update the records without update strategy transformation


->use "insert else update" option at session, the informatica first look in to
target table to see if the data exeist for coming primary key valye. If yes, it
will update otherwise insert.

12.explain about incremental loading(cdc)


->the process of loading the selective data which is newly created or updated from
source system to the target system
***********************************************************************************
***********************************************************************
Tcs Interview questions

1. HLD and LLD documents


-> HLD : High level document is the overall system design - covering the system
architecture and database design. It describes the relation between various
modules and functions of the system. data flow, flow charts and data
structures are covered under HLD
LLD: (LLD) is like detailing the HLD. It defines the actual logic for each
and every component of the system. Class diagrams with all the methods and relation
between classes comes under LLD. Programs specs are covered under LLD

2. How do u get requirements


-> Through JIRA and DMS

3 .explain complex thing you had done in your project


->
4. How will you give the parameterized connections
-> We will create one file in BWParam location in that we will mention all target
and source and if needed parameter and will give location of that file in seesion
or in workflow

5.tell connected and unconnected lookup


-> connected lkp: Is connected in pipeline or mapping . we can retrun multiple
ports in connected lkp ,Use a dynamic or static cache
UN-connected : Is not connected in pipeline or mapping . we can retrun only
one ports in un-connected lkp, Use a static cache

6 . Scd 2
-> Transformation needed : expression, Lookup, sequence-generator, router,update-
strategy

7. Star schema and snowflake schema


-> Star schema: Fact table sourdded by dimension table .
Snowflake: Fact table sourdded by dimension table and that dim table is again
connected to dimension

8. Partitioning
->Partitioning Option to increases the performance of ETL through parallel data
processing.

9. Hash Auto-Keys
->The Integration Service uses all grouped or sorted ports to generate compound
Partition keys

10.Hash User-Keys
->The Integration Service uses a hash function to group rows of data among
Partitions, We define the number of ports to generate the partition key

10.performance tuning
->to optimize session performance by eliminating performance
bottlenecks.performance bottlenecks mean bez of one object or compoent the entire
session or workflow is getting stopped or running for long time

11. Joins
-> To join the two tables ,inner,left ,right,full

12. Dense , rank , rownum


->
13. Delete duplicate records
->WITH cte AS (
SELECT EMPID ,
ROW_NUMBER() OVER (PARTITION BY EMPID, ORDER BY EMPID) as row_num FROM employee
)
DELETE FROM cte
WHERE row_num > 1;
14.stored procedures
->
15. Dynamic lookup cache
-> dynamic LKP chnages the cache every time the service perocess the lookup. use
this when we want insert or update the target

16. How do you load alternate records.


-> take expression sequence-generator cycle 1-2 only , then router 1 will go to the
one target and 2 will another target

17. Primary key and foreign key


-> Primary key is used to uniquely identify a row .Foreign key is used to connect
two tables or maintain relationship between them

18. Indexes
->Used to speed up the data retrieval form the database . liek select , or
delete,update , insert

19.views
->virtual table of an SQL statement.The fields in a view are fields from one or
more real tables in the database

20 Sorter active or passive why??


-> Sorter is passive bez it does not changing no of rows

21 Active and connected


->An active transformation can change the number of rows that pass through the
transformation
A passive transformation does not change the number of rows that pass through
the transformation

22.How do you create single lookup transformation using multiple tables?


->we have the lOOKUP OVERRIDE Query in the Lookup transformation. Use the SQl Query
to join the tables you lookup on

23.what is worklet and what use of worklet and in which situation we can use it?..
->Worklet:- Group of tasks taken to accomplih a task is known as a Worklet.
Use of Worklet:- You can bind many of the tasks in one place so that they can
easily get identified and also they can be of a specific purpose.

24.what is a junk dimension ?.


->A "junk" dimension is a collection of random transactional codes, flags and/or
text attributes that are unrelated to any particular dimension
25.Conformed Dimensions
->This is used in multiple locations. It helps in creating consistency so that the
same can be maintained across the fact tables

26.Degenerate Dimensions
->When the dimension attribute is stored in the fact table and not in a separate
dimension table

***********************************************************************************
**************************************
Infosys Interview questions
--------------------------------
1)SQl operations
->
2)difference between fact and dimension
-> Fact: Quantitative data , we can make calulation on them
Dimension : Descriptive data, information regarding facts fields
3)push down optimization
-> Convrt the Transformation logic into sql queries and push them into either
source ot target side

4)partitions in informatica
-> parallel processing for workflow

5)persistent lookup cache


-> Integration service will save the cache file and it will reuse for the next time

6)dynamic lookup
-> its type of lookup cache . whenever IS peocess the lookup every time the cache
will chnage it is called the lookup cache

7)MD5 function in informatica


->You want to write changed data to a database. Use MD5 to generate checksum values
for rows of data you read from a source. When you run a mapping, compare the
previously generated checksum values against the new checksum values. Then, write
the rows with updated checksum values to the target. You can conclude that an
updated checksum value indicates that the data has changed.

8)why sorter is active transformation


->when we check distict option then it will acts as active

9)why union is active transformation


-> it combines two or more data streams into one an no of rows are not chnaging

10)constraint based load


->Constraint-Based Loading is used to load the corresponding transformed row first
to the primary key table, then to any foreign key tables. In the Workflow Manager,
you can specify constraint-based loading for a session. When you select this
option, the Integration Service orders the target load on a row-by-row basis

11)target load order


->we can define whihch target will load frist by using target load order

13)difference between oltp and olap and real time examples


->https://www.guru99.com/oltp-vs-olap.html

14)sql index ing


->It is used to retrive the data fastly from database
15)sql performance tuning
->Reduce Table Size,Simplify Joins,Use SELECT Fields FROM Instead of SELECT *
FROM,Use WHERE Instead of HAVING

16)Analytic functions in sql


->rank ,row_num,dense_rank
17)delete vs truncate
->Delete
Truncate
The DELETE command is used to delete specified rows(one or more).
While this command is used to delete all the rows from a
table.
It is a DML(Data Manipulation Language) command.
While it is a DDL(Data Definition Language) command.
WHERE clause in the DELETE
While there may not be WHERE clause in the

18)which is the better one of delete and truncate and why?


->TRUNCATE is faster than DELETE,TRUNCATE is used to delete all the rows from a
table

19)do you have knowledge about IICS


->NO
20)difference between union and union all
->union: will eliminate the duplicate
union all: will not eliminiate duplicate

21)data driven types and update startegy and explain them


->To insert, update, or delete row values by an identifying key, choose the Data
Driven Load type.

22)temporary due for pushdown


->Allows the PowerCenter Integration Service to create temporary view objects in
the database when it pushes the session to the database
********************************************************************************

1- How to find duplicates in a table


-> Select empId , Count(*) from employees
group by empId
having count(*)>1
2- How to delete duplicates from a table
->delete from EMPLOYEES
where employee_id in (
Select e.employee_id , Count(*) from EMPLOYEES e
group by e.employee_id
having count(*)>1
)

3- Difference between union and union all


-> uninon will remove the duplicate , but union all will not remove the duplicate

If you have ever given an SQL interview some of the questions

would be definitely from below list:


4- Difference between rank,row_number and dense_rank

5- Find records in a table which are not present in another


table
->SELECT e.employee_id
FROM EMPLOYEES e
WHERE NOT EXISTS (
SELECT *
FROM job_history jh
WHERE e.employee_id =jh.employee_id
)

&****************

SELECT *
FROM EMPLOYEES
WHERE employee_id NOT IN (SELECT employee_id FROM job_history)

5. second highest salary employees

select e.department_id, e.salary,


dense_rank () over ( order by e.salary desc ) rnk
from EMPLOYEES e

6- Find second highest salary employees in each department


-> with high_salary as (
select e.department_id, e.salary,
dense_rank() over (partition by e.department_id order by e.salary desc) rnk

from EMPLOYEES e)

select * from high_salary


where rnk=2;

7- Find employees with salary more than their manager's


salary
->SELECT *
FROM employees e,
employees m
WHERE e.manager_id = m.employee_id
AND e.salary > m.salary;

8- Difference between inner and left join


->
9- update a table and swap gender values.
>update employees
set
gender = case gender
when 'Male' then 'Female'
when 'Female' then 'Male'
else 'Other' end

10- Number of records in output with different kinds of join.

Active Transformation – An active transformation changes the number of rows that


pass through the mapping.

Source Qualifier Transformation


Sorter Transformations
Aggregator Transformations
Filter Transformation
Union Transformation
Joiner Transformation
Normalizer Transformation
Rank Transformation
Router Transformation
Update Strategy Transformation
Advanced External Procedure Transformation

Passive Transformation – Passive transformations do not change the number of rows


that pass through the mapping.

Expression Transformation
Sequence Generator Transformation
Lookup Transformation
Stored Procedure Transformation
XML Source Qualifier Transformation
External Procedure Transformation
Input Transformation(Mapplet)
Output Transformation(Mapplet)

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy