how can find maximum salary by using Remove duplicate stage?
Answers were Sorted based on User's Feedback
For RMD Stage we need Grouping Keys, Add one extra hard coded Column to each Input records in previous stage.
DUMMY=1
In RMD Stage,
HASH partition on DUMMY, Sort DESC on Salary
and select DUMMY as Key in properties tab.
"Duplicate to Retain"=First.
then you will get only Highest Salart only from RMD Stage
| Is This Answer Correct ? | 5 Yes | 0 No |
Answer / narayana.3ds
R.D Stage remove the duplicates every one knows but u can
find max salary......
Give the key of salary order by descending other
option u can put=first
| Is This Answer Correct ? | 5 Yes | 2 No |
Answer / phani kumar
By using Sort stage, sort the data in Descending order, then
use remove duplicates stage, set option as first occurrence
of records. Then we will get max sal record first.
| Is This Answer Correct ? | 2 Yes | 1 No |
Answer / akila
In case an explicit sort stage is used like inteh reply
above, the sort will be done on emp_id(key field) and the
salary. But remove duplicate will be done only on emp_id. I
guess that this might result in a warning like 'sort and
remove duplicates keys do not match'.
| Is This Answer Correct ? | 0 Yes | 0 No |
Please tell me What is difference between 8.0 and 8.1
Differentiate between Join, Merge and Lookup stage?
which r the connectors used in san?
Source have 1000 records and it can have three nodes ok but i want how many records are executed on each node?
how can or from where we can get reference data in scd type2 implementation?
how to call sequential generator in datastage?
Hi all, explain one complex business rule that you had in your project and how did you accomplish it using DS?
Can you explain engine tier in information server?
WAT TYPE OF PL/SQL QUERRYS USED IN DATASTAGE
HOW CAN WE ABORT THE JOB IF THE RECORDS OF SOURCE FILE CONTAINS VALUE AS 'VIJAY'?
if we using two sources having same meta data and how to check the data in two sources is same or not? and if the data is not same i want to abort the job ?how we can do this?
A job is having only 2 stages I/p dataset and target table.Job is taking very long time to load 50 million records.How to improve performance of this job.