How to delete duplicate records in Informatica?
Answers were Sorted based on User's Feedback
Answer / suresh
If the source is a database, we can remove the duplicate
records by enabling the Select Distinct option in the source
qualifier transformation properties, or by running the following
Oracle query against the source table. It is a DELETE statement,
so it cannot go in the source qualifier filter itself; run it in
the database, for example as pre-session SQL:
delete from emp where rowid not in (select min(rowid) from
emp group by empno);
If the source is a flat file, we can remove the duplicate records
by enabling the Distinct option in the sorter transformation.
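To see what the rowid query does, here is a minimal sketch using Python's built-in sqlite3 module. SQLite also exposes a rowid, so the same statement runs unchanged; the sample rows and the delete_duplicates helper are made up for illustration, and Oracle itself is not involved.

```python
import sqlite3

def delete_duplicates(conn):
    """Keep the row with the smallest rowid for each empno; delete the rest."""
    conn.execute(
        "DELETE FROM emp WHERE rowid NOT IN "
        "(SELECT MIN(rowid) FROM emp GROUP BY empno)"
    )
    conn.commit()

# Hypothetical sample data: empno 1 and 2 each appear twice.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (empno INTEGER, ename TEXT)")
conn.executemany(
    "INSERT INTO emp VALUES (?, ?)",
    [(1, "SMITH"), (2, "ALLEN"), (1, "SMITH"), (3, "WARD"), (2, "ALLEN")],
)

delete_duplicates(conn)
remaining = conn.execute(
    "SELECT empno, ename FROM emp ORDER BY empno"
).fetchall()
print(remaining)
```

Only one row per empno is left in the table afterwards.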
Is This Answer Correct ? | 27 Yes | 1 No |
Answer / mayuri
Select Distinct in the properties of the source qualifier
transformation.
Is This Answer Correct ? | 21 Yes | 6 No |
Answer / infa dev
The source and the target will be the same table from which the
duplicates need to be deleted. Use a lookup transformation
on this same table, and then an update strategy transformation with
DD_DELETE if the row is a duplicate; otherwise do nothing.
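The lookup and update strategy logic can be sketched in plain Python. This is only a simulation of the idea, not Informatica code. It assumes every row carries a hypothetical row_id column (something like Oracle's ROWID) so that duplicates can be told apart; flag_duplicates and the sample rows are made up for illustration.

```python
DD_DELETE = 2  # numeric value of Informatica's DD_DELETE constant

def flag_duplicates(rows, key):
    """Return (row_id, DD_DELETE) for every row that is not the first for its key."""
    # "Lookup" step: smallest row_id seen for each key value.
    first_id = {}
    for row in rows:
        k = row[key]
        if k not in first_id or row["row_id"] < first_id[k]:
            first_id[k] = row["row_id"]
    # "Update strategy" step: flag duplicates, do nothing for the first row.
    return [
        (row["row_id"], DD_DELETE)
        for row in rows
        if row["row_id"] != first_id[row[key]]
    ]

# Hypothetical sample data: empno 10 appears three times.
rows = [
    {"row_id": 1, "empno": 10},
    {"row_id": 2, "empno": 20},
    {"row_id": 3, "empno": 10},
    {"row_id": 4, "empno": 10},
]
to_delete = flag_duplicates(rows, "empno")
print(to_delete)
```

Without some column that distinguishes the duplicate rows from each other, a delete keyed only on empno would remove every copy, including the one meant to be kept.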
Is This Answer Correct ? | 12 Yes | 3 No |
Answer / julius caeser
Hi,
There are two ways to do this, and both of them are efficient.
Method 1: Sorter - Filter.
Send all the data to a sorter and sort by all the fields that
you want to remove duplicates from. Note that in the
properties tab, you select Distinct (unique rows only).
This will pass forward only the unique data.
Method 2: Use an Aggregator.
Use an aggregator transformation and group by the keys/fields that you
want to remove duplicates from.
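As a rough illustration of Method 1, the sketch below mimics a sort followed by dropping adjacent identical rows. It is plain Python, not Informatica; sorter_distinct and the sample rows are made up for illustration.

```python
def sorter_distinct(rows):
    """Sort the rows, then keep a row only if it differs from the previous one."""
    result = []
    for row in sorted(rows):
        if not result or row != result[-1]:
            result.append(row)
    return result

# Hypothetical sample data: (empno, ename) tuples.
rows = [(2, "ALLEN"), (1, "SMITH"), (2, "ALLEN"), (1, "SMYTH")]
unique_rows = sorter_distinct(rows)
print(unique_rows)
```

Note that (1, "SMITH") and (1, "SMYTH") both survive: rows count as duplicates only when every field being compared is equal, which is why the sort has to include all the fields you care about.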
Is This Answer Correct ? | 11 Yes | 3 No |
Answer / aswin
Use an aggregator and do not apply any aggregate function; it will return only the last row in each group. If you provide neither a group-by port nor an aggregate function, it will return only the last row of the whole input. This way you can avoid duplicates.
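The "last row per group" behaviour described here can be mimicked in a few lines of Python. This is a sketch of the idea only, with a made-up aggregator_last_row helper and sample rows; the order of the output rows is not meant to match what Informatica produces.

```python
def aggregator_last_row(rows, group_by):
    """Keep the last row seen for each combination of group-by values."""
    last = {}
    for row in rows:
        group_key = tuple(row[col] for col in group_by)
        last[group_key] = row  # later rows overwrite earlier ones
    return list(last.values())

# Hypothetical sample data: empno 1 appears twice.
rows = [
    {"empno": 1, "sal": 100},
    {"empno": 2, "sal": 200},
    {"empno": 1, "sal": 150},
]
grouped = aggregator_last_row(rows, ["empno"])
ungrouped = aggregator_last_row(rows, [])
print(grouped)
print(ungrouped)
```

With a group-by on empno, one row per empno comes out; with no group-by at all, every row falls into the same group and only the final input row is returned.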
Is This Answer Correct ? | 0 Yes | 1 No |
Answer / mayuresh
Hi,
How do we delete duplicate values from a table if the source and
target are the same?
My question is: which logic do we need to apply before using
the DD_DELETE option in the update strategy?
Do we need to use an aggregator to group the values and, if
the count is > 1, apply DD_DELETE?
Please reply.
Is This Answer Correct ? | 0 Yes | 2 No |
Answer / sarvesh
create table newtable as select distinct * from
original_tablename;
By doing this we can eliminate the duplicate rows.
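A quick way to try this approach is with Python's built-in sqlite3 module, which accepts the same CREATE TABLE ... AS SELECT DISTINCT form. The table name original_table and the sample rows are placeholders made up for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE original_table (empno INTEGER, ename TEXT)")
# Hypothetical sample data: two fully identical pairs of rows.
conn.executemany(
    "INSERT INTO original_table VALUES (?, ?)",
    [(1, "SMITH"), (1, "SMITH"), (2, "ALLEN"), (2, "ALLEN"), (3, "WARD")],
)

# Copy only the distinct rows into a new table; the original is untouched.
conn.execute("CREATE TABLE newtable AS SELECT DISTINCT * FROM original_table")

deduped = conn.execute(
    "SELECT empno, ename FROM newtable ORDER BY empno"
).fetchall()
original_count = conn.execute(
    "SELECT COUNT(*) FROM original_table"
).fetchone()[0]
print(deduped, original_count)
```

The duplicates are still present in the original table, so this gives a clean copy rather than deleting anything in place.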
Is This Answer Correct ? | 6 Yes | 10 No |