Write a single SQL to delete duplicate records from the a
single table based on a column value. I need only Unique
records at the end of the Query.
Answers were Sorted based on User's Feedback
Answer / nunna
Query to find duplicates in a table:(Custname, Prod,
Order_amt)
select custname,count(*) from sales1 a where a.rowid > ANY
(select b.rowid from sales1 b where a.custname=b.custname
and a.prod=b.prod and a.order_amt=b.order_amt) group by
custname;
Query to delete duplicates:
delete from sales1 a where a.rowid > ANY (select b.rowid
from sales1 b where a.custname=b.custname and a.prod=b.prod
and a.order_amt=b.order_amt);
| Is This Answer Correct ? | 7 Yes | 15 No |
Answer / manny
One need have atleast a unique column such as timestamp col
(and assumption is to keep lowest tmpstmp) OR some key col
say IPID (again keep lowest value)..
One determined - Have a nested Select on all rows (except
that key col) with group by rest of the columns + having
count(*) > 0 + aggreate MIN(key_col).
Now said that, have another outer SEL on all columsn &
do a inner join with above nested Sel .. WHERE outer
key_col <> MIN value of nested SEL..
See if it works..
| Is This Answer Correct ? | 5 Yes | 16 No |
Answer / milind
Nested query method might be required in other databases
how ever in TD we don’t need to follow such a difficult way
to just find out the unique rows.
In TD we have functions like Rank () and Rownum() in the
combination of Qualify, helps you to select out the rows
which you wants to delete.
you can add a condition like ‘Where Rank() > 1’
| Is This Answer Correct ? | 3 Yes | 16 No |
tomorrow i have interview in infosys.can someone pour any suggestions or any interview questions. thank you
What is the difference between correlated sub queries and inner queries?
Is it necessary to add? Quit statement after a bteq query when I am calling it in a unix environment?
Explain the term 'columns' related to relational database management system?
What are the uses of bynets in multi-node systems?
What is the purpose of upsert command?
How to copy 1 million records from one DB to another, Which is effective interms of performance?
Hello all, There is a table with 4 columns in that 3 columns has been already loaded with 5 million records.4th column is empty,Now I have got 5 million records data which has to be loaded into 4th column.How can I load this data fastly in to the 4th column with out using update
What is Difference b/w PI & PPI?how to implement PPI?
Why MultiLoad Utility supports only Non Unique Secondary Index(NUSI) in the Target Table ?
What are the functions involved in shared information architecture?
How teradata makes sure that there are no duplicate rows being inserted when its a set table?