How to do performance tuning in DataStage?



Answer / prams

1. Stage the data coming from ODBC/OCI/DB2 UDB stages, or
from any database, on the server using hashed or sequential
files, both for optimum performance and for data recovery in
case the job aborts.
2. Tune the OCI stage's 'Array Size' and 'Rows per
Transaction' numerical values for faster inserts, updates
and selects.
3. Tune the 'Project Tunables' in Administrator for
better performance.
4. Use sorted data for the Aggregator.
5. Sort the data as much as possible in the database and
reduce the use of the DS Sort for better job performance.
6. Remove unused data from the source as early as
possible in the job.
7. Work with the DB admin to create appropriate indexes
on tables for better performance of DS queries.
8. Convert some of the complex joins and business logic
in DS into stored procedures for faster execution of the
jobs.
9. If an input file has an excessive number of rows
and can be split up, use standard logic to run the jobs in
parallel.
10. Before writing a routine or a transform, make sure
that the required functionality is not already available in
one of the standard routines supplied in the sdk or ds
utilities categories.
Constraints can be CPU intensive and take a significant
amount of time to process, particularly when the constraint
calls routines or external macros. If the constraint is
inline code, the overhead is minimal.
11. Try to put the constraints in the 'Selection'
criteria of the job itself. This eliminates unnecessary
records before they get in and before joins are made.
12. Tuning should occur on a job-by-job basis.
13. Use the power of the DBMS.
14. Try not to use a Sort stage when you can use an
ORDER BY clause in the database.
15. Using a constraint to filter a record set is much
slower than performing a SELECT … WHERE ….
16. Make every attempt to use the bulk loader for your
particular database. Bulk loaders are generally faster than
using ODBC or OLE.
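
To illustrate points 11, 14 and 15 outside of DataStage, here is a rough Python sketch using the standard sqlite3 module. It is not DataStage code. The orders table, its columns and both function names are made up for the example. It only shows the difference between pulling everything and filtering/sorting downstream, versus letting the database do the WHERE and ORDER BY.

```python
import sqlite3

# Hypothetical in-memory table standing in for a source database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
conn.executemany(
    "INSERT INTO orders VALUES (?, ?, ?)",
    [(1, "EU", 30.0), (2, "US", 10.0), (3, "EU", 20.0), (4, "APAC", 5.0)],
)


def filter_in_job(conn):
    """Pull every row and column, then filter and sort downstream
    (the analogue of a constraint plus a sort stage)."""
    rows = conn.execute("SELECT * FROM orders").fetchall()
    kept = [(r[0], r[2]) for r in rows if r[1] == "EU"]
    return sorted(kept, key=lambda r: r[1])


def filter_in_database(conn):
    """Let the database select, filter and sort, so only the needed
    rows and columns ever leave it."""
    return conn.execute(
        "SELECT id, amount FROM orders WHERE region = ? ORDER BY amount",
        ("EU",),
    ).fetchall()


pushed = filter_in_database(conn)
# Both approaches give the same rows; the second moves far less data.
assert pushed == filter_in_job(conn)
```

In a real job the same idea would usually go into the user-defined SQL or selection criteria of the database stage, so treat this only as a picture of where the work happens.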

Is This Answer Correct ?    28 Yes 3 No

Answer / raji

1. Avoid using a Transformer stage just to rename
columns, because it slows down the job. Use a Copy stage
for renaming the columns instead.
2. Take only the required columns in table-level
lookups. Remove all the unnecessary columns.
3. Use partitioning techniques. The right one depends on the
requirement, and it increases performance as well.
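
For point 2, a small Python sketch of the idea (again not DataStage code, and every name in it is invented for illustration): the reference data has several columns, but the lookup only keeps the key and the one column the job needs, so much less has to sit in memory.

```python
# Hypothetical reference rows: several columns, but the lookup needs only one.
reference_rows = [
    {"cust_id": 1, "name": "Asha", "city": "Pune", "segment": "gold"},
    {"cust_id": 2, "name": "Ravi", "city": "Chennai", "segment": "silver"},
]


def build_lookup(rows, key, needed):
    """Build a key -> record map that carries only the needed columns."""
    return {row[key]: {col: row[col] for col in needed} for row in rows}


lookup = build_lookup(reference_rows, "cust_id", ["segment"])

# Hypothetical input stream to be enriched from the lookup.
stream = [{"cust_id": 2, "amount": 15}, {"cust_id": 1, "amount": 40}]

# Unmatched keys get segment=None instead of failing.
enriched = [
    {**rec, **lookup.get(rec["cust_id"], {"segment": None})} for rec in stream
]
```

The name and city columns never enter the lookup at all, which is the whole point of trimming the column list on the reference link.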

Is This Answer Correct ?    29 Yes 4 No

Answer / venugopal [patni]

1. Using the Hashed File stage can improve performance.
In a Hashed File stage we can define the read cache size
and write cache size; the default size is 128 MB.
2. Using active-to-active links can also improve
performance.
Here we improve performance by enabling the row
buffer; the default row buffer size is 128 KB.
3. Remove unwanted columns.
4. Select appropriate update actions.
5. In parallel jobs, replacing a Transformer with a Copy or
Filter stage can improve performance. If you use more than
5 Transformers in a job the performance will degrade, so
where a Transformer can be avoided, use Copy or Filter
instead.
6. In server jobs, the Link Partitioner, Link Collector and
IPC stages can also improve performance.
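
The partition/collect pattern from point 6 can be pictured with a short Python sketch. This is only an analogy, not how the Link Partitioner or Link Collector stages are implemented. The function names, the modulo partitioning rule and the use of threads are all assumptions made for the example.

```python
from concurrent.futures import ThreadPoolExecutor


def partition(rows, n_partitions):
    """Split rows into n_partitions lists by key (modulo partitioning),
    so rows with the same key always land in the same partition."""
    parts = [[] for _ in range(n_partitions)]
    for row in rows:
        parts[row["key"] % n_partitions].append(row)
    return parts


def transform(part):
    """Stand-in for the per-row work done on each partition."""
    return [{"key": r["key"], "value": r["value"] * 2} for r in part]


def run_partitioned(rows, n_partitions=3):
    parts = partition(rows, n_partitions)
    # Each partition is processed by its own worker.
    with ThreadPoolExecutor(max_workers=n_partitions) as pool:
        results = list(pool.map(transform, parts))
    # Collector: concatenate the partition outputs in partition order.
    return [row for part in results for row in part]


rows = [{"key": k, "value": k * 10} for k in range(7)]
out = run_partitioned(rows)
```

Note that the collected output is no longer in the original row order, so anything downstream that needs ordering has to sort or use an ordered collection method.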

Is This Answer Correct ?    20 Yes 3 No

Answer / veera

Hi,

1. Sort the data as much as possible in the source database.

2. Remove the unwanted columns at the source DB.

3. Drop the indexes before loading the data and recreate
them after loading the data.

4. Do not use more than 20 stages in a job.

5. Reduce the number of Transformer (Tx) stages.

6. Use a Sort stage before an Aggregator stage (in sort mode).

7. Tune the 'Project Tunables' in Administrator for better
performance.
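
Why sorted input helps the Aggregator (point 6) can be shown with a few lines of Python. This is a sketch of the general technique, not the Aggregator stage itself, and the dept/salary data is made up. When rows arrive sorted on the grouping key, each group can be totalled and emitted as soon as the key changes, so only one group is held at a time.

```python
from itertools import groupby
from operator import itemgetter


def aggregate_sorted(rows):
    """rows must already be sorted by 'dept'.
    Yields (dept, total_salary) one group at a time."""
    for dept, group in groupby(rows, key=itemgetter("dept")):
        yield dept, sum(r["salary"] for r in group)


# Hypothetical unsorted input.
rows = [
    {"dept": "ops", "salary": 50},
    {"dept": "hr", "salary": 40},
    {"dept": "ops", "salary": 70},
    {"dept": "hr", "salary": 60},
]

# The sort step that comes before the aggregation.
sorted_rows = sorted(rows, key=itemgetter("dept"))
totals = list(aggregate_sorted(sorted_rows))
```

If the rows were fed in unsorted, groupby would emit "ops" twice with partial totals, which is the same reason an aggregation in sort mode needs its input sorted on the grouping keys.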

Is This Answer Correct ?    12 Yes 2 No
