how many types of remove the duplicate records?

Answers were Sorted based on User's Feedback



how many types of remove the duplicate records?..

Answer / prabhu rathnam

1. remove duplicate stage
2. sort stage
3. copy stage

Is This Answer Correct ?    8 Yes 0 No

how many types of remove the duplicate records?..

Answer / rohit babar

There are 3 ways by which we can remove the duplicate records
1. By using remove duplicate stage. We have control here on which record we want to remove it's first or last. There is option record to retain in remove duplicate stage where we select first or last option.

2. By using sort stage. there is option allow duplicate in sort stage. We set this option true when we want duplicate records & If we want unique records we set it to the false

3. By in-line sorting. In Partition tab of any stage when we select key base partition tech. then perform sort option will enable if we check this option next 2 option will enable which is stable & unique if we select stable that means we allow duplicate records & if we select unique that means we remove duplicate records and getting unique records

Is This Answer Correct ?    5 Yes 0 No

how many types of remove the duplicate records?..

Answer / soumya

1. Sort the Data using a key column then Use 3 transformer stage Variable
SV1 = Col1
SV2 = If SV1 = SV3 then 'DUP' else 'UNQ'
SV3 (Initial value = 0)
In the constraint SV2 = 'DUP' or SV2 = 'UNQ'

2. Aggregater stage group by key col (col1) count the column
then a filter stage use count col > 1.  

Is This Answer Correct ?    1 Yes 2 No

Post New Answer

More Data Stage Interview Questions

What is the project in datastage?

0 Answers  


What is the Difference Between DataStage 7.5 version and 8.1 Version?

10 Answers   IBM,


source file having the data like aabbccc, i want target file result like a1a2b1b2c1c2c3.

2 Answers  


what r the sources u need to create the datastage?

1 Answers  


How did you reconcile source with target?

3 Answers   HP,






what is the main difference between sorragate key n primary key in one word

2 Answers  


HOW CAN YOU DO SCD2 IN DATA STAGE USING SEQUENTIAL FILES?

1 Answers   HCL, IBM, Infosys,


Star Schema vs. Snow Flake Schema. What is the best model for DW? Give example for both the models performance wise?

3 Answers   IBM, TCS,


eno ename 1 qaz 1 wsx 1 edc 2 zxc 2 asd 3 qwe 3 wer 3 tru 4 rgj Output: eno ename count 1 qaz,wsx,edc 3 2 zxc,asd 2 3 qwe,wer,tru 3 4 rgj 1 I want the above output to be solved by DataStage as well and I have to write SQL query for the same output.

4 Answers   Wipro,


Scenario : I have 2 jobs say job A and Job B with parameters x and y respectively. I need to create a sequence job. If we pass parameter x then Job A should run, If we pass parameter y then Job B should run, if we dont pass any parameter then Both Job A & B should run.

4 Answers   iGate, Thirdware,


What a datastage macro?

0 Answers  


A table containg 100 records B table containg 20 records we have to join two tables in left outer it containg target 100 records but target containg 101 record at that time what is the issue arise

3 Answers   Polaris,


Categories