What is the main function of the staging area in a DWH?
Answer / guru avula
1. In a project the source systems differ, and they are not all
available at the same time. For example, one source is available
at 1 AM, the second at 2 AM, and so on.
Our scheduled jobs also run at different times, so we have to
pick up the data from each source system when that source is
available and put it into the staging area.
2. Table formats and column formats differ across the source
systems, so we have to bring them all into a consistent format.
For these two reasons we use a staging area.
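The two points above can be illustrated with a small Python sketch (not DataStage code). It assumes two hypothetical sources, A and B, whose column names and date formats differ; each extract is mapped into one common staging layout whenever that source becomes available. All field names, formats, and function names here are assumptions made for illustration.

```python
from datetime import datetime

# Hypothetical common staging layout:
# customer_id (int), name (str), signup_date (ISO string), source (str)

def from_source_a(row):
    """Map a row from hypothetical source A (dates as DD/MM/YYYY)."""
    return {
        "customer_id": int(row["CUST_ID"]),
        "name": row["CUST_NAME"].strip(),
        "signup_date": datetime.strptime(row["SIGNUP"], "%d/%m/%Y").date().isoformat(),
        "source": "A",
    }

def from_source_b(row):
    """Map a row from hypothetical source B (dates as YYYYMMDD)."""
    return {
        "customer_id": int(row["id"]),
        "name": row["full_name"].strip(),
        "signup_date": datetime.strptime(row["created"], "%Y%m%d").date().isoformat(),
        "source": "B",
    }

def land_in_staging(staging, rows, mapper):
    """Append one source's extract to staging, in the common layout."""
    staging.extend(mapper(r) for r in rows)
    return staging

staging = []
# Source A becomes available first (e.g. at 1 AM) ...
land_in_staging(
    staging,
    [{"CUST_ID": "1", "CUST_NAME": " Ann ", "SIGNUP": "05/01/2024"}],
    from_source_a,
)
# ... and source B later (e.g. at 2 AM).
land_in_staging(
    staging,
    [{"id": "2", "full_name": "Bob", "created": "20240106"}],
    from_source_b,
)
```

After both loads, every row in `staging` has the same columns and the same date format, regardless of which source it came from or when it arrived.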
Answer / srinivas
I will add some points to the first answer.
The source systems send raw data: they simply send us whatever data they have.
In staging we cleanse the data, remove duplicates, and handle nulls, then load the data into the staging tables.
Then we apply the business logic and load the data into the dimension and fact tables.
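As a rough illustration of the cleansing, duplicate removal, and null handling mentioned above, here is a minimal Python sketch (not DataStage code). The column names, the default values, and the policy of keeping the first row seen for each key are all assumptions chosen for the example.

```python
def cleanse(rows, key="customer_id", defaults=None):
    """Trim strings, replace missing values with defaults, and drop duplicate keys."""
    defaults = defaults or {}
    seen = set()
    cleaned = []
    for row in rows:
        fixed = {}
        for col, val in row.items():
            if isinstance(val, str):
                val = val.strip()
                if val == "":
                    val = None  # treat empty strings as nulls
            if val is None:
                val = defaults.get(col)  # null handling: substitute a default if one is given
            fixed[col] = val
        if fixed.get(key) is None:
            continue  # assumption: rows without a key are not loaded
        if fixed[key] in seen:
            continue  # assumption: keep only the first row per key
        seen.add(fixed[key])
        cleaned.append(fixed)
    return cleaned

raw = [
    {"customer_id": 1, "name": " Ann ", "city": ""},
    {"customer_id": 1, "name": "Ann", "city": "Pune"},
    {"customer_id": 2, "name": "Bob", "city": None},
]
staged = cleanse(raw, defaults={"city": "UNKNOWN"})
```

With this keep-first policy, the second row for customer 1 is discarded even though its city is populated; a real job might instead prefer the most complete or most recent row.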
What are some differences between the 7.x and 8.x versions of DataStage?
If I have two tables table1 table2 1a 1a,b,c,d 1b 2a,b,c,d,e 1c 1d 2a 2b 2c 2d 2e how can I get the data the same as in the tables?
How can I implement SCD Type 1 and Type 2 in both server and parallel jobs?
field1 field2 field3 suresh , 10,324 , 355 , 1234 ram , 23,456 , 450 , 456 balu ,40,346,23 , 275, 5678 How do I remove the "," in the fields?
How can we do null handling in sequential files?
The input is 1,a,1000 2,b,2000 3,c,3000 and I want the output 1,a,3000 2,b,4000 3,c,5000. How can this be done using DataStage?
How can you join a flat file and Oracle as sources?
Is it possible for two users to access the same job at the same time in DataStage?
What is the use of the invocation ID?
What is the difference between the Change Capture stage and the Difference stage? What is the significance of each?
I have a source table with a column named CITY containing 100 records. I want the target table to receive the values that start with 'A' or 'B', and the remaining records to go to reject outputs. How can I achieve this in DataStage?
In a PX job I have passed 4 parameters, and when I run the same job in a sequence I don't want to use those parameters. Is this possible, and if yes, how?
I have source data from the UK and North America. How can I pass the data to two tables based on location?
How can we abort the job if the records of the source file contain the value 'VIJAY'?