Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


source
1123445
I WANT OUTPUT AS
DUPLICATES TO TARGET1 LIKE
TARGET1
1144
NON-DUPLICATES TO TARGET2
TARGET2
235

Answers were Sorted based on User's Feedback



source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TAR..

Answer / siva

seq.file-->copy-->agg
! !secondary link
!.....>join-------->filter-->t1
primarylink !---->t2
agg:countrows
give condition in filter
count=1-->t1
count<>1-->t2

Is This Answer Correct ?    10 Yes 0 No

source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TAR..

Answer / subhash

Source
..|
..|
copy--->agg
..|.....|
..|.....|
join stage
....|
....|
Filter stage -----> target1
..|
..|
target2

the main data is:
1
1
2
3
4
4
5
from aggregator stage, the output is:
1,2
2,1
3,1
4,2
5,1

If you join these two links then the output will be:
1,2
1,2
2,1
3,1
4,2
4,2
5,1

Then specify the count<>1 in the Filter for target1 then you get the duplicate records. means YOU get:
1
1
4
4

in another link for target2, give count=1. means YOU get:
2
3
5

Is This Answer Correct ?    8 Yes 0 No

source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TAR..

Answer / s

agg-->filter-->trg2
^ |
| v
seq-->copy-->join-->trg1

agg:countrows
join:innerjoin
filter:count=1
:count>1

Is This Answer Correct ?    0 Yes 0 No

source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TAR..

Answer / chaint

source1(112345) ----- lookup stage- reference lookup
on(source2 output)--reject link(1144) --output(235)
source2(112345) -- sort( get count) -> filter(only unique)

we would require two source..
one original and other only (non repeated records)

we will have a lookup stage with source1 as input and
source2 as reference lookup..
in lookup stage we will have a reject link(1144) non matched
records.. and output will be(235).

Kindly correct me if i m wrong

Is This Answer Correct ?    0 Yes 2 No

source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TAR..

Answer / nagam

seq.file----->sort------>filter----2datasets

in sort stge create key change column and then filter stage
write the condition on based on keychange column keychange
column =1 uniq data keychange<>0 duplicate data we can get

If wrong please tell me

Is This Answer Correct ?    0 Yes 5 No

source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TAR..

Answer / shreeram

Seq(source)------> Transform stage-------> Tgt1
!.........>Tgt2


use substring in transform stage.

Is This Answer Correct ?    0 Yes 11 No

Post New Answer

More Data Stage Interview Questions

How do you reject records in a transformer?

0 Answers  


What is the difference between validate and compile?

1 Answers   CTS,


Can you implement SCD2 using join, transformer and funnel stage?

0 Answers   Cognizant,


What are stage variables, derivations and constants?

0 Answers  


how to clear the source file? 2) diff b/t odbc & Oracle?what is node? 3) explain one complex job u face in ur project

2 Answers   Wipro,


Triggers,VIEW,Procedures

0 Answers   CTS,


I have 100 records how can I load at a time from the single time

1 Answers  


How do you schedule or monitoring the job?

0 Answers  


4) source target c1 c1 c2 c3 c2 c4 c4 c5 c3 c6 c7 c4 c5 c6 c7 please send me answer this question my mail

1 Answers  


where the log files or tables can store in DS?

2 Answers   IBM,


in one scenario source flat file like Fileld1 00122001550056200568 00256002360014500896 00123004560078900258 00147004560025800256 divide each 5 numbers as one column i.e here i need field1 field2 field3 field4 00122 00155 00562 00568 00256 00236 00145 00896 00123 00456 00789 00258 00147 00456 00258 00256 plz help me....

4 Answers  


what is time dimension? and how to populate time demension

2 Answers   TCS,


Categories