if we take 2 tables(like emp and dept), we use join stage and
how to improve the performance?

Answers were Sorted based on User's Feedback



if we take 2 tables(like emp and dept), we use join stage and how to improve the performance? ..

Answer / kiran

when ever join 2 tables based on key columns if the key
column is numeric ,set modulus,if the key column is non
numeric set hash partition technique.and compare to look up
join give better performance coz join has sort operation
by default.

Is This Answer Correct ?    11 Yes 2 No

if we take 2 tables(like emp and dept), we use join stage and how to improve the performance? ..

Answer / ashok

above answer has one mistake
i.e join doesn't has sort operation bydefault we explicitly
specify

Is This Answer Correct ?    9 Yes 3 No

if we take 2 tables(like emp and dept), we use join stage and how to improve the performance? ..

Answer / poorna chandra

Hi this is Poorna ,

We can Improve the performance on join stage by doing
pre sorting for both left and right data based on
key .Then we can Improve the performance in join stage .

Plz correct me if any mistake in thinking .

Is This Answer Correct ?    6 Yes 1 No

if we take 2 tables(like emp and dept), we use join stage and how to improve the performance? ..

Answer / rajeshchunduri

in emp and dept tables key column is deptno so it is key
based and datatype for key column is int . At this time we
change partion tech from hash to modulus.



chunduri

Is This Answer Correct ?    1 Yes 1 No

if we take 2 tables(like emp and dept), we use join stage and how to improve the performance? ..

Answer / professional

Hi,
For the above query to improve the performance based on key columns in emp and dept joins by default sort in datastage for better performance if you have already a sorted data just go for environmental variables and do the operation #APT_Not_SORTDATA option then performance increase automatically...

Is This Answer Correct ?    0 Yes 0 No

Post New Answer

More Data Stage Interview Questions

How to RD using transformer?

0 Answers   CTS,


what is normalization and denormalization

2 Answers   CTS,


In work load management there are three options of Low priority, Medium priority and High Priority Jobs which can be used for resource management. why this feature is developed when there is already jobs prescheduled by scheduler or autosys. what will be the use of workload management then?

0 Answers  


What is the difference between account and directory options ?

0 Answers  


what is set mapping in datastage??

2 Answers   ADP,






how many types of remove the duplicate records?

3 Answers   Caterpillar,


Differentiate between operational datastage (ods) and data warehouse?

0 Answers  


input Name Salay Sam 10 Sam 30 Sam 20 Ram 40 Ram 50 Output should be Name Salary Count Sam 10 3 Ram 40 2 get min salary and count of name . OutPut 2 Name Seq Sam 1 Sam 2 Sam 3 Ram 1 Ram 2

1 Answers   HCL,


Terminate Activity

0 Answers   CTS,


How do you remove duplicate values in datastage?

0 Answers  


How many types of hash files are there?

0 Answers  


i have data like sam ram sam raj ram I want two targets trgt1 ram sam trgt2 raj how can i do this in datastage?

4 Answers   IBM,


Categories