I have 2 files 1st contains duplicate records only, 2nd file contains Unique records.EX:
File1:
1 subhash 10000
1 subhash 10000
2 raju 20000
2 raju 20000
3 chandra 30000
3 chandra 30000
File2:
1 subhash 10000
5 pawan 15000
7 reddy 25000
3 chandra 30000
Output file:-- capture all the duplicates in both file with count.
1 subhash 10000 3
1 subhash 10000 3
1 subhash 10000 3
2 raju 20000 2
2 raju 20000 2
3 chandra 30000 3
3 chandra 30000 3
3 chandra 30000 3
Answer Posted / subbuchamala
File1,File2====Funnel-----Copy=======1st link AGG, 2nd link JOIN----Filter----OutputFile
1. pass the 2 files to funnel stage and then copy stage.
2. from copy stage 1st link to AGG stage, 2nd link to JOIN stage
3. In AGG stage, Group by Key column say ID, NAME take the count and JOIN based on KEY column
4. Filter on COUNT>1 send the output OutputFile
we get desired output
| Is This Answer Correct ? | 14 Yes | 0 No |
Post New Answer View All Answers
What can we do with datastage director?
root tree will find which is server job and which is parallel job?
What is meta stage?
What are some prerequisites for datastage?
What is the difference between datastage and datastage tx?
What are the processing stages?
Demonstrate experience in maintaining quality process standards?
How do y read Sequential file from job control?
Can you highlight the main features of ibm infosphere information server?
What is size of a transaction and an array means in a datastage?
Can we use target hash file as a lookup ?
Is it possible to implement parallelism in Mainframe Jobs ? If Yes how ? If no why ?
Is the value of staging variable stored temporarily or permanently?
What are the some differences between 7.x and 8.x version of datastage?
How to write a expression to display the first letter in Caps in each word using transformer stage ? Please let me know ASAP Thanks in advance...