How to delete duplicate records if we have huge volume of
records in a table ?
(rowid is not the correct approach)
Answers were Sorted based on User's Feedback
Answer / keyrun
Hi, There are many ways to delete the duplicates!
By Using the following Transformations you can delete the Duplicates:
Source Qualifier: Use SQ and check 'Distinct' property (If the Source is Relational)
Sorter: Use sorter and in Properties tab Check 'Distinct'(If the Source is Flat File)
Aggregator: Use Aggregator & Group by on the key port.
| Is This Answer Correct ? | 7 Yes | 0 No |
Answer / chandu
select * from from <table_name> where rowid not in(select
min(rowid) from <table_name> group by column(primary or
unique key column))
| Is This Answer Correct ? | 5 Yes | 0 No |
First we can count the no of records using aggregator
Transformation with group by (port) for ex, in emp table
take empno. Second we can take the Filter condition like
count=1. Other records are rejected not loaded into the target.
Hope it will help little bit...
| Is This Answer Correct ? | 1 Yes | 1 No |
Answer / cmanojkumar
Hi,
Thanks for your answer.
I am sorry this question i need to post in oracle forum not
informatica.
Any way your answer is usefull for me.
Could you please tell me how can we delete with our rowid in
oracle ?
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / ravikumar2614
DELETE FROM EMP E1 WHERE ROWID<
(SELECT MAX(ROWID) FROM EMP E2 WHERE E1.ROWID=E2.ROWID)
IF I AM WRONG PLEASE CORRECT ME ON ravi.info2614@gmail.com
| Is This Answer Correct ? | 1 Yes | 1 No |
Answer / dilip ingole
Delete from emp where rowid not in(select min(rowid) from emp group by eid,ename)
in group by column you need mention all column in table
| Is This Answer Correct ? | 0 Yes | 0 No |
What is predefined event?
delete data from staging table as it loads to target table.here is the case we are getting data from 3 different server.a b and c.the data from server A loaded into staging table and we ran the task and data loaded to target table.now today data from server B and C also got loaded to the staging table.now what techniques and what transformations should be used to delete only the data which has been loaded only to the target.we need to delete only that data from staging which has been loaded into the target.looking for your responses
What is a predefined event?
What are the different transaction levels available in transaction control transformation?
How the Informatica Server reads perameter file?
Explain the etl program with few examples.
how many ways can we implement SCD2?
What is the difference between a router and a filter transformation?
what is mapping parameter?
How to create or import flat file definition in to the warehouse designer?
i have a source which relational.I am trying to populate to target flat file with one column for daily date which is sysdate. I want to populate the sysdate coulmn with DD/MM/YYYY format. kindly provide a solution for this..my clear that my target is flat file.
In EMP table, for those emp whose Hiredate is same, update their sal by "sal+500" or else for others keep the sal as it is, how to do it by SQL query