How do we eliminate duplicate records in a flat file without using Sorter and Aggregator?
Answer Posted / priyank
There are several ways of achieving this. We can do it
through expression transformation and other is look up on
the target.
Expression transformation:
Create ports,
Var_PREV_KEY=Key
Var_CURR_KEY=Var_PREV_KEY
Var_CHK_DUPLICATE --> IIF(Var_CURR_KEY=Key,'DUP','NODUP')
OUT_DUPLICATE --> Var_CHK_DUPLICATE
Note: I have taken a scenario where the target table
contains only 1 Key. In case of multiple keys, will have to
create a few more Variable ports for both CURR and PREV and
in the Var_CHK_DUPLICATE port, we need to add those checks
with an 'AND' operator.E.g. For 2 keys,
Var_PREV_KEY1=Key1
Var_CURR_KEY1=Var_PREV_KEY1
Var_PREV_KEY2=Key2
Var_CURR_KEY2=Var_PREV_KEY2
Var_CHK_DUPLICATE --> IIF(Var_CURR_KEY1=Key1 AND
Var_CURR_KEY2=Key2,'DUP','NODUP')
OUT_DUPLICATE --> Var_CHK_DUPLICATE
If the Informatica version is Unix installation, then in
the pre session command you can give an unix command to
remove the duplicates from the file like
sort <file_name> | uniq > <file_name>.new
Hope it helps.
| Is This Answer Correct ? | 4 Yes | 12 No |
Post New Answer View All Answers
How you prepared reports for OLAP?
What are session parameters ? How do you set them?
How identifying bottlenecks in various components of informatica and resolving them?
How to update a particular record in target with out running whole workflow?
What is olap (on-line analytical processing?
How many number of sessions can one group in batches?
Can we override a native sql query within informatica?
How can one identify whether mapping is correct or not without connecting session?
Hello, Can you please send the admin and advanced mapping design sample questions to ns_sharmin@yahoo.com?
Make a note of the quantity vaults made in informatica?
What are the tuning techniques you applied during your last project
Can I use same Persistent cache(X.Dat) for 2 sessions running parallely? If it is not possible why?If yes How?
Why is sorter an active transformation?
Which are the different editions of informatica powercenter that are available?
What is Cognos script editor?