Main Function of the Staging area in DWH ?
Answers were Sorted based on User's Feedback
Answer / guru avula
1.In a project source systems are differnet and also all
source systems are not availble in same time.for eg 1
source is availble at 1AM and 2n one at 2 AM etc
But our schedule jobs run at different time ,so we have to
pick the data from source system based on source availble
time and put into staging area.
2.All source systems table formats and column formats are
diffrent ,so we have to sync all .
For above two reasons we go for staging area
| Is This Answer Correct ? | 5 Yes | 1 No |
Answer / srinivas
I will add some points to first answer
Source system send the raw data what ever data they have simply they will send to us.
In staging we do the cleansing, remove-duplicate and null handling process and load the data into staging tables.
Then we applying business logic's and loading into dim and fact tables.
| Is This Answer Correct ? | 2 Yes | 1 No |
HOW WILL YOU IMPLEMENT SURROGATE KEY IN SCD BY USING SURR_KEY GENERATOR,THE VALUE OF S_KEY SHOULD NOT REPEAT EVEN IF THE JOB IS COMPILED REPEATEDELY?
Define Data Stage?
i 10 jobs first two jobs are runing in 2nodes,next 2 jobs are running in 4 nodes, next 4 jobs are running in 6 nodes and the remaining jobs are running on 10 nodes. how to change the node configuration?
How one source columns or rows to be loaded in to two different tables?
Anyone has Datastage certification free dumps for 000-418 , 000-421 codes, mail me @ manik.dwh@gmail.com 000-418 : InfoSphere DataStage v8.0 000-421 : InfoSphere DataStage v8.5
How do you load 10 different sources with 10 different layouts to 10 different tables?
How do you schedule or monitoring the job?
How can we improve the performance in datastage?
EXPLAIN SCD
eno ename 1 qaz 1 wsx 1 edc 2 zxc 2 asd 3 qwe 3 wer 3 tru 4 rgj Output: eno ename count 1 qaz,wsx,edc 3 2 zxc,asd 2 3 qwe,wer,tru 3 4 rgj 1 I want the above output to be solved by DataStage as well and I have to write SQL query for the same output.
what is time dimension? and how to populate time demension
If seg file having 10 records ex:eid 1 2 " " 10 if oracle database having 100 records ex:eid 1 2 " " 100 how to delete matched records permenently from oracle database using datastage ?