how many types of remove the duplicate records?

Answers were Sorted based on User's Feedback



how many types of remove the duplicate records?..

Answer / prabhu rathnam

1. remove duplicate stage
2. sort stage
3. copy stage

Is This Answer Correct ?    8 Yes 0 No

how many types of remove the duplicate records?..

Answer / rohit babar

There are 3 ways by which we can remove the duplicate records
1. By using remove duplicate stage. We have control here on which record we want to remove it's first or last. There is option record to retain in remove duplicate stage where we select first or last option.

2. By using sort stage. there is option allow duplicate in sort stage. We set this option true when we want duplicate records & If we want unique records we set it to the false

3. By in-line sorting. In Partition tab of any stage when we select key base partition tech. then perform sort option will enable if we check this option next 2 option will enable which is stable & unique if we select stable that means we allow duplicate records & if we select unique that means we remove duplicate records and getting unique records

Is This Answer Correct ?    5 Yes 0 No

how many types of remove the duplicate records?..

Answer / soumya

1. Sort the Data using a key column then Use 3 transformer stage Variable
SV1 = Col1
SV2 = If SV1 = SV3 then 'DUP' else 'UNQ'
SV3 (Initial value = 0)
In the constraint SV2 = 'DUP' or SV2 = 'UNQ'

2. Aggregater stage group by key col (col1) count the column
then a filter stage use count col > 1.  

Is This Answer Correct ?    1 Yes 2 No

Post New Answer

More Data Stage Interview Questions

what is A Datastage?

2 Answers  


What is the command line function to import and export the ds jobs?

0 Answers  


What is the difference between the local container and shared container?

0 Answers  


What is the Difference Between DataStage 7.5 version and 8.1 Version?

10 Answers   IBM,


How can remove duplicates in a file using UNIX?

3 Answers   Cap Gemini,


What are the benefits of datastage?

0 Answers  


tab1 tab2 1,a 1,d 2,b 3,c perfoms outerjoin what is the o/p? write sql query for outerjoin?

3 Answers   Infotech,


Describe stream connector?

0 Answers  


This is UNIX question asked in DataStage Interview. Say I have n numbers of records in a text file. I want first 3 records in 1st file, last three records in 3rd file and remaining n-6 records in 2nd file. (Note: we don't know how many records are there in the File. I am getting one file on daily basis and I want three target files as asked above)

2 Answers   CTS,


how to define satge variables in transformer stage

2 Answers  


What can we do with datastage director?

0 Answers  


convert yyyy mm dd to dd mm yyyy?

4 Answers  


Categories