1. How do you read multiple files from a Sequential File stage?
2. If a file doesn't arrive or doesn't exist for the Sequential File stage, how do you handle this?
3. What do you do before taking data from the source to the staging area?
4. I have a Remove Duplicates stage and a Transformer stage. What would you do to optimize the performance of the job?
Answers are sorted based on user feedback.
Answer / varma
Ans 1: We can set Read Method = “Specific File(s)” and give the full path of each file, one by one. Alternatively, you can set Read Method = “File Pattern” and specify a wildcard.
Ans 2: We can control this with the “Missing File Mode” option. The values are “Ok”, to skip the file and continue, and “Error”, to abort the job.
Ans 3: It is important to make sure the metadata matches the records. It is better to reject bad records and collect them on a reject link. This is controlled by the “Reject Mode” option, whose values are “Continue”, “Fail”, and “Output”. “Output” collects the rejected records on a reject link.
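The behaviour of these three options can be sketched outside DataStage. The following is plain Python, not DataStage code: the function names `resolve_files` and `read_rows`, the lowercase mode strings, and the "wrong column count means bad record" rule are all assumptions made for illustration. It only mimics what the answer describes: an explicit file list or a wildcard, skip-or-abort on a missing file, and drop, abort, or collect for bad records.

```python
import csv
import glob
from pathlib import Path


def resolve_files(specific_files=None, file_pattern=None, missing_file_mode="error"):
    """Return the input files to read (hypothetical helper, not a DataStage API).

    specific_files:    explicit paths, like listing each file one by one.
    file_pattern:      a wildcard, like the File Pattern read method.
    missing_file_mode: "ok" skips a missing file, "error" aborts.
    """
    if file_pattern is not None:
        return sorted(glob.glob(file_pattern))
    resolved = []
    for path in specific_files or []:
        if Path(path).is_file():
            resolved.append(path)
        elif missing_file_mode == "ok":
            continue  # skip the missing file and carry on
        else:
            raise FileNotFoundError(f"input file not found: {path}")
    return resolved


def read_rows(paths, expected_columns, reject_mode="continue"):
    """Read CSV rows; a row with the wrong column count is treated as bad.

    reject_mode: "continue" drops bad rows, "fail" aborts on the first one,
                 "output" collects them in a list standing in for a reject link.
    """
    good, rejects = [], []
    for path in paths:
        with open(path, newline="") as handle:
            for row in csv.reader(handle):
                if len(row) == expected_columns:
                    good.append(row)
                elif reject_mode == "fail":
                    raise ValueError(f"bad record in {path}: {row}")
                elif reject_mode == "output":
                    rejects.append(row)
                # reject_mode == "continue": the bad row is silently dropped
    return good, rejects
```

A call such as `read_rows(resolve_files(file_pattern="in_*.csv"), 2, reject_mode="output")` would return the good rows and the rejected rows separately; the pattern and column count here are made-up values.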
| Is This Answer Correct ? | 3 Yes | 0 No |
Answer / jayanthi
1) The read method should be set to File Pattern (for reading a single file it should be Specific File).
| Is This Answer Correct ? | 2 Yes | 0 No |
Answer / ashok
4. If the only requirement is removing duplicates, I will use the Remove Duplicates stage, because it gives better performance than the Transformer. If more logic has to be applied, I will use the Transformer.
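As a rough illustration of why a dedicated duplicate-removal step can be cheap, here is a minimal Python sketch, not DataStage code. It assumes the rows are already sorted on the key, so duplicates sit next to each other and one pass is enough. The function name `remove_duplicates`, the `retain` argument, and the sample rows are hypothetical.

```python
from itertools import groupby
from operator import itemgetter


def remove_duplicates(rows, key, retain="first"):
    """Keep one row per key value; rows must already be sorted on key."""
    result = []
    for _, group in groupby(rows, key=itemgetter(key)):
        group = list(group)
        # Keep either the first or the last row of each run of equal keys.
        result.append(group[0] if retain == "first" else group[-1])
    return result


# Hypothetical sample data.
rows = [
    {"company": "IBM", "location": "HYD"},
    {"company": "TCS", "location": "BAN"},
    {"company": "IBM", "location": "CHE"},
    {"company": "HCL", "location": "HYD"},
]

# Sort on the key first so that equal keys become adjacent (sorted() is stable).
rows_sorted = sorted(rows, key=itemgetter("company"))
unique_first = remove_duplicates(rows_sorted, "company")
unique_last = remove_duplicates(rows_sorted, "company", retain="last")
```

Any extra per-row logic would then run only on the de-duplicated rows, which is one common reason to remove duplicates before a heavier transformation step.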
| Is This Answer Correct ? | 1 Yes | 2 No |
Answer / nagaraz
1. The read method should be set to Specific File; you can specify multiple files.
| Is This Answer Correct ? | 0 Yes | 1 No |
How do you read a sequential file from job control?
What is PX?
What is the importance of the exception activity in DataStage?
What is the difference between a hash file and a sequential file?
What is the difference between server jobs and parallel jobs?
In Informatica, I can find the corresponding dependent mappings for a table. Likewise, can I find the dependent jobs, with all their information, by using the table name?
What is the use of the Row Generator stage?
How do you abort the job if it contains duplicates?
The source has 2 fields, COMPANY and LOCATION, with rows like this: IBM HYD, TCS BAN, IBM CHE, HCL HYD, TCS CHE, IBM BAN, HCL BAN, HCL CHE, and so on. I want to get output like this (Company, Loc, Count): TCS, HYD BAN CHE, 3; IBM, HYD BAN CHE, 3; HCL, HYD BAN CHE, 3. Please send me the answer to this question.
What is the command-line function to import and export DS jobs?
A flat file contains 200 records. I want to load the first 50 records the first time the job runs, the second 50 records the second time it runs, and so on. How can you develop the job? Please give the steps.
A job has only 2 stages: an input dataset and a target table. The job takes a very long time to load 50 million records. How can the performance of this job be improved?