How do you remove duplicates in the Transformer stage in parallel mode?
Answer Posted / kiran
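Partition the data by key, sort it on the same key, and select
the Unique option. Partitioning by key sends all records with the
same key to the same partition, so after the sort the duplicates
sit next to each other and are removed automatically.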
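To see why the partition, sort, unique order works in parallel mode, here is a rough Python sketch of the same logic. It is only an illustration, not DataStage code: the function name `dedupe_partitioned`, the `num_partitions` parameter, and the CRC32 hash are all assumptions made for the example.

```python
from itertools import groupby
from operator import itemgetter
from zlib import crc32


def dedupe_partitioned(rows, key, num_partitions=4):
    """Illustrative only: hash-partition by key, sort, keep one row per key."""
    # Step 1: hash-partition on the key, so every row with the same
    # key value ends up in the same partition.
    partitions = [[] for _ in range(num_partitions)]
    for row in rows:
        idx = crc32(str(row[key]).encode("utf-8")) % num_partitions
        partitions[idx].append(row)

    result = []
    for part in partitions:
        # Step 2: sort each partition on the key (Python's sort is stable,
        # so rows with equal keys keep their arrival order).
        part.sort(key=itemgetter(key))
        # Step 3: "unique" keeps the first row of each run of equal keys.
        for _, group in groupby(part, key=itemgetter(key)):
            result.append(next(group))
    return result


rows = [
    {"id": 1, "v": "a"},
    {"id": 2, "v": "b"},
    {"id": 1, "v": "c"},
    {"id": 3, "v": "d"},
    {"id": 2, "v": "e"},
]
deduped = dedupe_partitioned(rows, "id")
```

If the rows were spread across partitions without regard to the key (round robin, for example), two copies of the same key could land in different partitions and both would survive, which is why the key-based partitioning has to come first.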
How are complex jobs implemented in DataStage to improve performance?
1) What is the size of the fact table and dimension table? 2) How do you find the size of the fact table and dimension table? 3) How do you implement a surrogate key in the Transformer stage? 4) What is the configuration file path? 5) How many types of datasets are there? Explain. 6) What is the difference between development projects and migration projects? 7) How do you delete the header and footer of a sequential file? 8) How can you call the parameters in DataStage in a UNIX environment? 9) How much data are you getting daily? 10)
What is usage analysis in DataStage?
Can you explain the Kafka connector?
What is the difference between Datastage 7.5 and 7.0?
How do you reject records in a transformer?
Can anyone describe a difficult situation they have handled while creating DataStage jobs?
CHANGE CAPTURE
Is the value of a stage variable stored temporarily or permanently?
How can one find bugs in a job sequence?
What are .ctl (control) files? How does the Data Set stage get better performance from these files?
How can we improve the performance of DataStage jobs?
What is a DataStage macro?
What is data partitioning?
What are the important features of DataStage?