one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?
Answer Posted / reddymkl.dwh
Job Design:
Agg--->Filter1---------->|
| | Unique
file-->cp-------------------->Join---->Filter2---->target1
|
|-->Duplicate
Target2
Agg: use aggregator and select Agg_type=count rows and then give the Count O/P column=Cnt (User defined).
Filter1: give the condition Where=Cnt=1
U will get unique values like 200,400,500,600
Use Join (Or) Lookup stage: select left outer join
Filter2:
Where=Column_name='' (Duplicate values like 100,100,300,300,300)
Where=Column_name<>'' (Unique Values like 200,400,500,600)
u will get the right output....what ever the duplicate records.
Plz correct me if am wrong.....
Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers
Terminate Activity
How to manage date conversion in Datastage?
what is use of SDR function?
What are the different kinds of views available in a datastage director?
what is flow of project?
how to abort the job its matain duplicates?
How to implement complex jobs in data stage?
how to delete one row in target dataset
Explain the datastage parallel extender (px) or enterprise edition (ee)?
What is the use of datastage designer?
if we using two sources having same meta data and how to check the data in two sources is same or not? and if the data is not same i want to abort the job ?how we can do this?
What can we do with datastage director?
Explain entity, attribute and relationship in datastage?
tell me 5 situations when we r using oracle db stages like orecle connector, oracle enterprise
how to export or import the jobs in .ISX file