Eliminating duplicate records without using dynamic lookups


Answer / jakka

Below are the ways to eliminate duplicate records:
1. Select Distinct in the Source Qualifier.
2. Depending on your requirement, set the Number of Sorted
Ports in the Source Qualifier and, if needed, override the
query with an ORDER BY clause. Then use an Aggregator with
Sorted Input checked and the key columns as group-by ports.
3. Use a Sorter to sort the data and then use an Aggregator
as mentioned in point 2.
4. Use a Sorter to sort the data and check Distinct in the
Sorter properties.
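
As a rough illustration of options 2 and 3 (sort first, then group on the key ports), here is a minimal Python sketch. It is not Informatica code: the sample rows and the helper name dedupe_sorted are made up for the example, and it keeps the first row of each group, which makes no difference when the rows in a group are exact duplicates.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical sample rows (id, name); not taken from the original thread.
rows = [(2, "bob"), (1, "ann"), (2, "bob"), (3, "cy"), (1, "ann")]

def dedupe_sorted(records, key_cols=(0,)):
    """Sort on the key columns, then keep one row per key group."""
    key = itemgetter(*key_cols)
    # Plays the role of the Sorter (or the ORDER BY in the Source Qualifier).
    ordered = sorted(records, key=key)
    # Plays the role of the Aggregator with sorted input and group-by ports:
    # groupby only merges adjacent equal keys, so the sort above is required.
    return [next(group) for _, group in groupby(ordered, key=key)]

unique_rows = dedupe_sorted(rows)
print(unique_rows)  # [(1, 'ann'), (2, 'bob'), (3, 'cy')]
```

Grouping adjacent rows only works because the input is already sorted, which is the same reason the Aggregator needs sorted data when Sorted Input is checked.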

Hope this will help you...

Jakka


Answer / kal_leo@hotmail.com

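Hi, you can find the duplicate records with a simple one-line
SQL query:

select id, count(*) from seq1 group by id having count(*) > 1;

Note that this query only lists the ids that occur more than
once. To actually drop the duplicates, select the distinct
rows instead (for example with select distinct).

You can run the query directly in SQL*Plus, or use it as the
SQL override in the Source Qualifier.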
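
To make the behaviour of that query concrete, here is a small sketch that runs it against an in-memory SQLite table from Python. The table name seq1 and the id column come from the answer; the name column and the sample rows are assumptions added for the example. It also runs a SELECT DISTINCT for comparison, since the HAVING query reports which ids are duplicated rather than removing them.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The 'name' column is an assumption; the answer only mentions 'id'.
conn.execute("CREATE TABLE seq1 (id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO seq1 VALUES (?, ?)",
    [(1, "ann"), (2, "bob"), (2, "bob"), (3, "cy"), (3, "cy"), (3, "cy")],
)

# The query from the answer: lists ids that occur more than once.
duplicated = conn.execute(
    "SELECT id, COUNT(*) FROM seq1 GROUP BY id HAVING COUNT(*) > 1 ORDER BY id"
).fetchall()

# One row per distinct record: this is what drops the duplicates.
deduplicated = conn.execute(
    "SELECT DISTINCT id, name FROM seq1 ORDER BY id"
).fetchall()

print(duplicated)    # [(2, 2), (3, 3)]
print(deduplicated)  # [(1, 'ann'), (2, 'bob'), (3, 'cy')]
conn.close()
```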

Hope this is of any help.

Kal


Answer / vikram

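The usual way to eliminate duplicates is Source, Source
Qualifier, Lookup transformation, Router transformation and
Target, with a dynamic cache on the Lookup.

Without a dynamic lookup, use the following instead.
For example, the emp table consists of id, salary, name and
deptno, and we want to eliminate the duplicates of deptno.

Build the mapping as Source, Source Qualifier, Sorter
transformation, Expression transformation, Router
transformation, and split the target into two: duplicates and
distinct. Sort on deptno in the Sorter. In the Expression
transformation create three ports, in this order:
v_flag = IIF(deptno = v_pre_deptno, 1, 0)
v_pre_deptno = deptno
o_flag = v_flag
Variable ports are evaluated top to bottom, so v_flag compares
the current deptno with the previous row's value before
v_pre_deptno is overwritten. In the Router, send rows with
o_flag = 0 to the distinct target and rows with o_flag = 1 to
the duplicates target.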
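
The sort, flag and route steps can be sketched in plain Python as follows. This is only an illustration of the logic, not Informatica code: the function name flag_duplicates and the sample emp rows are hypothetical, and the variable prev stands in for the v_pre_deptno port.

```python
def flag_duplicates(rows, key="deptno"):
    """Yield (row, flag): 0 for the first row of each key value, 1 for repeats."""
    ordered = sorted(rows, key=lambda r: r[key])  # Sorter transformation
    prev = object()  # stands in for v_pre_deptno; matches nothing on the first row
    for row in ordered:
        flag = 1 if row[key] == prev else 0       # v_flag, computed first
        prev = row[key]                           # then v_pre_deptno is updated
        yield row, flag                           # flag is passed out as o_flag

# Hypothetical emp rows; only the column names come from the answer.
emp = [
    {"id": 1, "name": "ann", "salary": 100, "deptno": 10},
    {"id": 2, "name": "bob", "salary": 200, "deptno": 20},
    {"id": 3, "name": "cy", "salary": 300, "deptno": 10},
    {"id": 4, "name": "dee", "salary": 400, "deptno": 30},
    {"id": 5, "name": "eli", "salary": 500, "deptno": 20},
]

flagged = list(flag_duplicates(emp))
# Router transformation: one group per target.
distinct = [row for row, flag in flagged if flag == 0]
duplicates = [row for row, flag in flagged if flag == 1]

print([r["id"] for r in distinct])    # [1, 2, 4]
print([r["id"] for r in duplicates])  # [3, 5]
```

The order of the two assignments inside the loop matters: the comparison has to happen before the previous value is overwritten, otherwise every row would be flagged as a duplicate of itself.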
