when U have a remove dublicate option in sort stage, why we
have a remove dublicate stage in PX, thought it is
recamended to sort data before using a remove dublicate
stage. I hae been thinking this from days....

Answers were Sorted based on User's Feedback



when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / prasu

In Duplicate Stages we have more number of optionscompare
to sort while removing duplicates.If you have less number
if data you can go with Sort stage to remove duolicats.If
you have large number of data go for Remove Duplicates
Stage.

Is This Answer Correct ?    8 Yes 0 No

when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / phani kumar

Sort stage is used to sort the data and having option of
identifying the duplicate records with the value of Key
change column. But, to perform sort and remove duplicates is
leads to decrease the performance. So, it is preferable for
less amount of data.

Remove duplicates stage is used to get only unique records
either first occurrence or last occurrences. For large
amount of data, sorted data is required for better performance.

Correct me if iam wrong..........

Thanks and regards....
Phani kumar

Is This Answer Correct ?    8 Yes 0 No

when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / data master

Sort Stage do Sorting of data and performing Remove
Duplicate records, which will slow the performance of job
(Hence it is better to sort data at database level).

If the data is already sorted than use the Remove Duplicate
Stage to remove duplicate records, Which will give better
performance of job than above situation.

Is This Answer Correct ?    3 Yes 2 No

when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / swati

In Remove Duplicate stage you will get only unique records.

In sort Stage you will get both unique and duplicate records based on key change column.

Is This Answer Correct ?    1 Yes 0 No

Post New Answer

More Data Stage Interview Questions

In the source seq file have 2 columns, i.e input: col1,col2 1,1 2,rajesh 3,15000 4,2 5,suresh 6,16000 7,3 8,veeru 9,17000 I want in the output like: eno,ename,sal 1,rajesh,15000 2,suresh,16000 3,veeru,17000 any one answer this scenario

3 Answers  


On which interface you will be working as a developer?

0 Answers  


4) source target c1 c1 c2 c3 c2 c4 c4 c5 c3 c6 c7 c4 c5 c6 c7 please send me answer this question my mail

1 Answers  


what is the main differences between Lookup, Join and Merge stages?

9 Answers   HCL,


in aggregator , how can i get the sum in readable format

4 Answers   IBM,






What is aggtorec restructure operator?

0 Answers  


guys pls tell me where we use sequence jobs exactly in realtime proj explain pls with example.

2 Answers   TCS,


With out using Funnel Stage, how to populate the data from different sources to single target

12 Answers   Wipro,


what is diff b/w datastage 8.1,8.5,8.7?

1 Answers   IBM,


Can you filter data in hashed file?

0 Answers  


Which is the best institute in Hyderabad to learn OBIEE and DataStage?Please tell me faculty name also

0 Answers  


How many input links can you give to a Transformer stage?

6 Answers  


Categories