Main Function of the Staging area in DWH ?
Answers were Sorted based on User's Feedback
Answer / guru avula
1.In a project source systems are differnet and also all
source systems are not availble in same time.for eg 1
source is availble at 1AM and 2n one at 2 AM etc
But our schedule jobs run at different time ,so we have to
pick the data from source system based on source availble
time and put into staging area.
2.All source systems table formats and column formats are
diffrent ,so we have to sync all .
For above two reasons we go for staging area
| Is This Answer Correct ? | 5 Yes | 1 No |
Answer / srinivas
I will add some points to first answer
Source system send the raw data what ever data they have simply they will send to us.
In staging we do the cleansing, remove-duplicate and null handling process and load the data into staging tables.
Then we applying business logic's and loading into dim and fact tables.
| Is This Answer Correct ? | 2 Yes | 1 No |
What is the roundrobin collector?
if a column contains data like ram,rakesh,madhan,suraj,pradeep,bhaskar then I want to place names separated by commas in another columns how can we do?
How rejected rows are managed in datastage?
disign the complex job in u r project?(they are aksing only complex job design and then data flow...)
if we using two sources having same meta data and how to check the data in two sources is same or not? and if the data is not same i want to abort the job ?how we can do this?
Is possible to create skid in dim,fact tables?
Source flat file contains src --- 1 2 ' ' ' 18 we had 3 targets T1 T2 T3 -- -- -- 1 4 7 2 5 8 3 6 9 10 13 16 11 14 17 12 15 18 How can i get? **Using only datastage, but not unix or any other. I am expecting the answer soon.. Thanks in advance.
I HAVE EMP TABLE, 4 COLS R THERE COL1,COL2,COL3,COL4 ID-- 101,102,103,104 SAL-- 1000,4000,2000,5000 DATE-- COLUMN. I WANT TO DISPLAY THE DATA PREVIOUS MONTH HIGEST SAL ?
EXPLAIN SCD
What is exact difference between Parallel Jobs and server Jobs..
What is the process of killing a job in datastage?
What are the different types of lookups in datastage?