one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?
Answer Posted / reddymkl.dwh
Job Design:
Agg--->Filter1---------->|
| | Unique
file-->cp-------------------->Join---->Filter2---->target1
|
|-->Duplicate
Target2
Agg: use aggregator and select Agg_type=count rows and then give the Count O/P column=Cnt (User defined).
Filter1: give the condition Where=Cnt=1
U will get unique values like 200,400,500,600
Use Join (Or) Lookup stage: select left outer join
Filter2:
Where=Column_name='' (Duplicate values like 100,100,300,300,300)
Where=Column_name<>'' (Unique Values like 200,400,500,600)
u will get the right output....what ever the duplicate records.
Plz correct me if am wrong.....
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers
what is repositery?
what is ds administrator used for?
how many rows sorted in sort stage by default in server jobs
What is the purpose of interprocessor stage in server jobs?
Can you implement SCD2 using join, transformer and funnel stage?
What is the difference between operational data stage (ods) and data warehouse?
What are the different types of lookups in datastage?
What are datastage sequences?
What is the difference between account and directory options ?
what are .ctl(control files) files ? how the dataset stage have better performance by this files?
What is the difference between hashfile and sequential file?
How many types of stage?
How to convert RGB Value to Hexadecimal values in datastage?
How a server job can be converted to a parallel job?
What is the use of datastage designer?