How to delete duplicate record in Informatica?
Answers were Sorted based on User's Feedback
Answer / suresh
if the source is database means we can delete the duplicate
records by enabling the option select distinct in source
qualifier t/r properties or by writing the following query
in source qualifier filter
delete from emp where rowid not in (select min(rowid) from
emp group by empno);
if the source is flat file means by enabling the option
distinct in sorter t/r we can delete the duplicate records
| Is This Answer Correct ? | 27 Yes | 1 No |
Answer / mayuri
Select ditinct in properties of source qualifier
transformation.
| Is This Answer Correct ? | 21 Yes | 6 No |
Answer / infa dev
Source and the target will the same table from which the
duplicate needs to be deleted from. Then use a lookup trans
on to this same table and then use an update trans with
DD_DELETE if the row is duplicate else do nothing.
| Is This Answer Correct ? | 12 Yes | 3 No |
Answer / julius caeser
Hi
There are 2 ways to do this and both of them are efficient.
Method 1: Sorter -Filter.
Send all the data to a sorter and , sort by all feilds that
u want to remove duplicacy from . note that in the
preoperties tab, select Unique .
This will select and send forward only Unique Data .
Method 2; Use an Aggregator
Use AGG Transformation and group by the keys /feilds that u
want to remove duplicacy from.
| Is This Answer Correct ? | 11 Yes | 3 No |
Answer / aswin
use aggregator and do not apply any aggregation function, it will return only last row in the group..if you dont provide any groupby and aggregate value it will return only last row..so you can aviod dups
| Is This Answer Correct ? | 0 Yes | 1 No |
Answer / mayuresh
Hi,
how to delete duplicate values from table if source and
target are same
my Question is which logic we need to apply before using
DD_DELETE option in update strategy ?
Do we require to use aggregator to group the values and if
there count > 1 then DD_DELETE
PLZ reply
| Is This Answer Correct ? | 0 Yes | 2 No |
Answer / sarvesh
create table newtable as select distinct * from original
tablename;
by doing this we can elimate the duplicate rows
| Is This Answer Correct ? | 6 Yes | 10 No |
what is the exact meaning of domain?
What are the different options used to configure the sequential batches?
What are the different threads in DTM process?
how to calculate the optimum cache size in aggregator transformation?
What is the approximate size of data warehouse?
6 Answers Accenture, i3 Solutions,
What is a taget load order?
Performance wise which is better joiner or look up ? Give me with example?
What is the difference between Connected and UnConnected Lookup Transformation.Give me one or two examples please?
how did u understand business requirement?
PLS REAL TIMERS IAM WAITING FOR FAVOROBLE REPLY..REGARDING ETL TESTING INFORMATION...ETL TESTER NEEDS ETL DEVOLPMENT KNOWLEDGE OR NOT..?. IF HE NEED DEVLPMNT KNOW WHAT HE LEANR FIRST... AS A DEVOLOPER U KNOW THE ETL TESTING PROCESS.. PLS EXPLAIN IN DETAIL THANK U
How many ways are there to create ports?
What is the Router transformation?