Out of 4 million records, only 3 million were loaded to the
target before the job aborted. How can I load only the
remaining 1 million (not yet loaded) records in the next run?
This is not a sequential job; it is a standalone parallel
job. What are the possibilities available in DataStage 8.1?
Answer / prasanna
Use the already loaded records in the target (TGT) as the reference
input to a Lookup stage, set the DROP option, and design the job
around that. You will get the required answer. Refer to the flow below:
source ----> lookup ----> tgt
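The lookup idea can be sketched outside DataStage in plain Python. This is only an illustration, assuming every record carries a unique key (the column name `id` and the helper `rows_not_yet_loaded` are made up for the example): the keys already present in the target act as the reference, and only source rows with no match are passed on for loading.

```python
def rows_not_yet_loaded(source_rows, target_keys, key="id"):
    """Return the source rows whose key is absent from the target."""
    # Build the reference set once so each membership test is O(1).
    loaded = set(target_keys)
    return [row for row in source_rows if row[key] not in loaded]


# Toy data standing in for the 4 million source / 3 million target rows.
source = [{"id": i, "val": f"row{i}"} for i in range(1, 5)]
target_keys = [1, 2, 3]  # keys read back from the partially loaded target

pending = rows_not_yet_loaded(source, target_keys)
```

Here `pending` holds only the rows that still need to go to the target on the next run.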
Answer / nish
There are plenty of options available;
just study the scenario carefully.
Source: 4 million records (doesn't change)
Target: 3 million already loaded
Option 1:
You just need to identify those 1 million records, pool them, and then load them to the target.
This is clearly a case for the Change Data Capture (CDC) stage.
Use the source as the Before table and the target as the After table.
Write those 1 million records, identified by change_code() (Deleted), to a file.
Move the contents of this file to the target.
Option 2: This scenario also hints at updating the target with the Merge stage. Use the DROP option to gather the records into a file and then update the target.
Option 3: Lookup, being a performance concern, should be your last alternative.
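The before/after comparison in Option 1 can be sketched in plain Python. This is not the CDC stage itself, only a rough model of its logic under the assumption that records have a unique key (the name `id`, the function `change_capture`, and the string labels are hypothetical): rows present in the Before input (source) but missing from the After input (target) come out flagged as deleted, and those are the ones still to be loaded.

```python
def change_capture(before, after, key="id"):
    """Compare two record sets by key and label each difference."""
    before_by_key = {row[key]: row for row in before}
    after_by_key = {row[key]: row for row in after}
    changes = []
    for k, row in before_by_key.items():
        if k not in after_by_key:
            # In Before but not in After: reported as a delete.
            changes.append(("delete", row))
        elif row != after_by_key[k]:
            # Same key, different values: reported as an edit.
            changes.append(("edit", after_by_key[k]))
    for k, row in after_by_key.items():
        if k not in before_by_key:
            # In After but not in Before: reported as an insert.
            changes.append(("insert", row))
    return changes


# Toy data: source is the Before input, the partial target is the After input.
source = [{"id": i, "val": f"row{i}"} for i in range(1, 5)]
target = source[:3]

missing = [row for code, row in change_capture(source, target) if code == "delete"]
```

Writing `missing` to a file and loading that file into the target corresponds to the last two steps of Option 1.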
Answer / venkatesh
First load the data using the Surrogate Key stage.
After that, use a Filter stage on that surrogate key,
and in the Lookup stage keep the DROP option.