What are the prerequisites for the Join stage?
1. The inputs are the left, right, and intermediate links (files).
2. It can perform inner, left outer, right outer, and full outer joins.
3. Inner, left outer, and right outer joins accept N inputs; a full outer join accepts only 2.
4. Inputs should be sorted and partitioned on the join key for better performance.
5. Removing duplicates is optional. The join reads its sorted inputs sequentially, which keeps I/O and page faults low.
6. Memory requirements are comparatively lower than for the Lookup stage.
7. The join key columns must have the same names in the primary and secondary datasets.
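Points 4, 5, and 7 can be illustrated with a small sort-merge join in plain Python. This is only a sketch of the general technique, not DataStage's implementation; the function name `sort_merge_join`, the sample column names, and the sample rows are all made up for illustration. It shows why both inputs must be sorted on a key column that has the same name on each side, and why duplicates do not need to be removed first.

```python
from itertools import groupby
from operator import itemgetter


def sort_merge_join(left, right, key, how="inner"):
    """Join two lists of dicts on `key`. `how` is 'inner' or 'left'.

    Hypothetical helper for illustration only.
    """
    get_key = itemgetter(key)
    # Both inputs are sorted on the join key, then grouped by key value.
    left_groups = groupby(sorted(left, key=get_key), key=get_key)
    right_groups = groupby(sorted(right, key=get_key), key=get_key)

    # Non-key columns of the right input, used to pad unmatched left rows.
    right_cols = {col for row in right for col in row if col != key}

    out = []
    r_key, r_rows = next(right_groups, (None, None))
    for l_key, l_rows in left_groups:
        # Advance the right side until its key catches up with the left key.
        while r_rows is not None and r_key < l_key:
            r_key, r_rows = next(right_groups, (None, None))
        if r_rows is not None and r_key == l_key:
            # Duplicate keys simply produce every left/right combination.
            matches = list(r_rows)
            for l_row in l_rows:
                for r_row in matches:
                    out.append({**l_row, **r_row})
            r_key, r_rows = next(right_groups, (None, None))
        elif how == "left":
            # Left outer join: keep the left row, fill right columns with None.
            for l_row in l_rows:
                out.append({**l_row, **{col: None for col in right_cols}})
    return out


# Sample data (invented): the key column is named "dept_id" on both sides.
employees = [
    {"dept_id": 20, "emp": "Ravi"},
    {"dept_id": 10, "emp": "Anu"},
    {"dept_id": 30, "emp": "Kim"},
]
departments = [
    {"dept_id": 10, "dept": "Sales"},
    {"dept_id": 30, "dept": "Ops"},
]

for row in sort_merge_join(employees, departments, "dept_id", how="left"):
    print(row)
```

Because each input is read once, in key order, only the rows for the current key value need to be held in memory. That is the intuition behind the lower memory requirement compared with a lookup, which keeps the whole reference table in memory.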
What is the difference between the Sequential File and File Set stages?
What is a folder? What are the different types of stages?
What is the main difference between a surrogate key and a primary key, in one word?
What is data partitioning?
Explain Quality stage?
What are the definitions of a data warehouse and a data mart?
1. How many people are part of your team? 2. Explain how you create jobs or the flow of a project. 3. Join stage vs. Lookup stage vs. Merge stage. 4. Summation scenario-based question: how do you find the sum of salary for a specific employee (explain the stages and flow of the job)? 5. Explain the Remove Duplicates stage. Can you sort in this stage? 6. SQL questions: joins, their types, and the difference between join and union. 7. Unix questions: how do you run a job, and how do you list all jobs in a project? 8. Explain environment variables. 9. SQL scenario: if you have 3 identical records in a table, e.g. 1, Ram, Xyz; 1, Ram, Xyz; 1, Ram, Xyz; delete only 2 of the records and keep only 1 using a DELETE query. How will you do this?
How do you delete the data in a dataset? What are the types of deletion for a dataset?
There are two schemas, x and y. Some data is in schema x and I want to use it in schema y. How can I do that? Please give some possibilities.
The source is a sequential file whose date column contains values in the formats dd-mm-yy, dddd-mmmm-yyyy, mm-dd-yy, yy-dd-mm, and yy-mm-dd. How do I pass only the yy-dd-mm values to the target?
Explain unit testing, system testing, and integration testing.
How do you run a Sequential File stage in parallel if the stage is used on the target side?