when U have a remove dublicate option in sort stage, why we
have a remove dublicate stage in PX, thought it is
recamended to sort data before using a remove dublicate
stage. I hae been thinking this from days....

Answers were Sorted based on User's Feedback



when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / prasu

In Duplicate Stages we have more number of optionscompare
to sort while removing duplicates.If you have less number
if data you can go with Sort stage to remove duolicats.If
you have large number of data go for Remove Duplicates
Stage.

Is This Answer Correct ?    8 Yes 0 No

when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / phani kumar

Sort stage is used to sort the data and having option of
identifying the duplicate records with the value of Key
change column. But, to perform sort and remove duplicates is
leads to decrease the performance. So, it is preferable for
less amount of data.

Remove duplicates stage is used to get only unique records
either first occurrence or last occurrences. For large
amount of data, sorted data is required for better performance.

Correct me if iam wrong..........

Thanks and regards....
Phani kumar

Is This Answer Correct ?    8 Yes 0 No

when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / data master

Sort Stage do Sorting of data and performing Remove
Duplicate records, which will slow the performance of job
(Hence it is better to sort data at database level).

If the data is already sorted than use the Remove Duplicate
Stage to remove duplicate records, Which will give better
performance of job than above situation.

Is This Answer Correct ?    3 Yes 2 No

when U have a remove dublicate option in sort stage, why we have a remove dublicate stage in PX, t..

Answer / swati

In Remove Duplicate stage you will get only unique records.

In sort Stage you will get both unique and duplicate records based on key change column.

Is This Answer Correct ?    1 Yes 0 No

Post New Answer

More Data Stage Interview Questions

source file contains 100 records, i want 10 records in target file how it possible in datastage

6 Answers   IBM,


my source is sequencial file and my target is dataset. i am running the job in two node configuration file. my source having 10 records how the data move to target?

3 Answers   TCS,


How we can convert rows to columns in datastage?

4 Answers   IBM,


Can aggregator and transformer stages use to sort the data? How ?

2 Answers  


What is the roundrobin collector?

0 Answers  






how will u design file watch jobs?

2 Answers  


Hi frnds, my scenario is like i'm having a record 1234"1323£3434%343434^23232!1212$23232 in the above record all the special characters must be removed.how can we do it in datastage 8.0.1.can any one please ans this? thanx in advance

2 Answers   IBM,


In my project source data comes from MAINFRAME in files.so,This time data is coming as a binary file...I know for binary data we use Complex flat file stage..I have used it also..but on 'view data' data is not coming correctly..as it in MAINFRAME.give me some ideas..

2 Answers  


I have file with empid,empname and I want to load these two fields along with sal in my target 1)salary must be same for all the records 2)I want pass the salary at run time

7 Answers   TCS,


Hi guys, please design a job with derivation(solution). write exact conditions. My requirement Source table emp_no qualification 1 a 1 c 2 a 3 c 3 b Target table emp_no qualification 1 b 2 b 2 c 3 a Here every employer have three qualifications i.e a,b and c. what ever source table dont have some qualification, that will be move to target table. Like above. Hope u get the point. Thanks.

4 Answers   UHG,


how to handle null values using transformer stage?

1 Answers  


To see hidden files in LINIX?

0 Answers   CTS,


Categories