how can find maximum salary by using Remove duplicate stage?
Answers were Sorted based on User's Feedback
For RMD Stage we need Grouping Keys, Add one extra hard coded Column to each Input records in previous stage.
DUMMY=1
In RMD Stage,
HASH partition on DUMMY, Sort DESC on Salary
and select DUMMY as Key in properties tab.
"Duplicate to Retain"=First.
then you will get only Highest Salart only from RMD Stage
Is This Answer Correct ? | 5 Yes | 0 No |
Answer / narayana.3ds
R.D Stage remove the duplicates every one knows but u can
find max salary......
Give the key of salary order by descending other
option u can put=first
Is This Answer Correct ? | 5 Yes | 2 No |
Answer / phani kumar
By using Sort stage, sort the data in Descending order, then
use remove duplicates stage, set option as first occurrence
of records. Then we will get max sal record first.
Is This Answer Correct ? | 2 Yes | 1 No |
Answer / akila
In case an explicit sort stage is used like inteh reply
above, the sort will be done on emp_id(key field) and the
salary. But remove duplicate will be done only on emp_id. I
guess that this might result in a warning like 'sort and
remove duplicates keys do not match'.
Is This Answer Correct ? | 0 Yes | 0 No |
Parallel job contains more than 20 stages. I want to find out which stage is more performance incentive.
count number of deptno in a emp table?
my source is sequencial file and my target is dataset. i am running the job in two node configuration file. my source having 10 records how the data move to target?
Describe routines in datastage? Enlist various types of routines.
I have the following columns in the EMP table Empid,Empname,Sal,month(Sal),year(Sal) and DOB(let us say the dob is 15th-Jan-1981) Desing a job such that the output contains the following empname,year(sal),tot(sal) and current age i.e. whether 18yrs or so on
How to generate surrogate key without using surrogate key stage?
Sequential file i have one record,i want 100 records in target?How can we do that?Pls explain me and what stages are there?What logic?
what is materialized view used datastage?
How can we improve the performance in datastage?
How do you design PX Routines ?
wt is the diff b/w odbc and oracle stage
Source has 2 columns: USA,NewYork INDIA,MUMBAI INDIA,DELHI UDS,CHICAGO INDIA,PUNE i want data in target like below: INDIA,MUMBAI1 INDIA,DELHI2 INDIA,PUNE3 USA,NEWYORK1 USA,CHICAGO2