one file contains
col1
100
200
300
400
500
100
300
600
300
from this i want to retrive the only duplicate like this
tr1
100
100
300
300
300 how it's possible in datastage?can any one plz explain
clearley..........?
Answer Posted / reddyvaraprasad
Job Design:
|----->Agg--->Filter1-->|
| |
| |
file-->cp-------------------->Join---->Filter2---->target
Agg: use aggregator and select Agg_type=count rows and then give the Count O/P column=Count (User defined).
Filter1: give the condition Count<>1
Join: select left outer join
Filter2: give the condition Count<>0
u will get the right output....what ever the duplicate records.
and if u want unique records, give the condition Count=0
| Is This Answer Correct ? | 0 Yes | 0 No |
Post New Answer View All Answers
What is process model?
root tree will find which is server job and which is parallel job?
how to run a sequential file stage in parallel if the stage is used on the TARGET side
How you Implemented SCD Type 1 & Type 2 in your project?
What are the main features of datastage?
What is quality stage?
Demonstrate experience in maintaining quality process standards?
What are the processing stages?
Describe stream connector?
Source has 2 columns: USA,NewYork INDIA,MUMBAI INDIA,DELHI UDS,CHICAGO INDIA,PUNE i want data in target like below: INDIA,MUMBAI1 INDIA,DELHI2 INDIA,PUNE3 USA,NEWYORK1 USA,CHICAGO2
Describe the main features of datastage?
What are data elements?
What is orabulk stage?
What is a quality stage in datastage tool?
what should be ensure to run the sequence job so that if its get aborted in 10th job before 9job should get succeeded?