how to design the change capture stage in(data stage
parallel jobs) type 2
Answers were Sorted based on User's Feedback
Answer / pooja
Let me just elaborate the earlier answer clearly.
1. Two input datasets are required for change data caputure
stage.
One is Old dataset
Second is New or updated dataset
2. Give in the 2 inputs to the change capture stage and the
target as a dataset.
3. Let the incoming data be sorted based on a key column(s)
for performance purpose in the change Caputure stage.
4. Upon executing the job, the data when viewed from the
dataset shows a new column added apart from the output
data. A change code column would be generated in the change
capture stage having values as 0, 1, 2, 3 which depicts the
changes on comparing the 2 input datasets such as copy(0),
Insert(1), Delete(2), Edit(3).
5. See what kind of data you need in the output target like
copy, insert, delete, edit.
6. To apply SCD Type 2 we require Start date and End date
columns.
7. The Change Capture Stage output is given to a
Transformer Stage, where 2 new columns are generated with
Effective Start Date and End Date.
8. If you need all Inserted or new data to be passed in to
a particular dataset then you need to specify an
appropriate condition in the Transformer Stage to the
outgoing link. Ex. Drop Output For insert=true
9. In the similar way other data can also be captured or a
Filter can also be used after the Transformer Stage to
filter the data into the targets based on the requirement.
Is This Answer Correct ? | 36 Yes | 4 No |
Answer / nidhi
1.for change capture stafe u need two input dataset one is
old one & second is the new or updated one.
2. incoming data should be sorted.
3. allow what kind of u need to check like new , delete,
change,copy.
4. if u wanna all new data should be passed to the outgoing
link then u need to specify Drop Output For insert=true.
5.there should be a change key or change key value, on the
basis of key data will be chaecked.
thanks
Is This Answer Correct ? | 16 Yes | 5 No |
How to remove duplicates in transformer stage? in parallel mode
options available in sequence job to run,validate?
which memory is used by lookup and join
what will happen if we allow duplicates in datastage lookup abort drop record 1st value of duplicate record none
What is configuration your file structure 2)I have two databases both are Oracle while loading data from source to target the job takes 30 min but I want to load less time how?
what is parameterset?
How to LOG 'unmatched Master' records and 'Reject Updates' in log files using MERGE stage?
What are the environmental settings for data stage,while working on parellel jobs?
I have a few records just I want to store each records tow times in target how?
I have a source like file it have Number of records and i want to load without first and last records in target?Datastage?
What is merge stage?
I have a file it contain 2 records like empname,company as Ram, Tcs and Ram, IBM. But i want empname, company1,company2 as Ram, TCS,IBM in the target. How?