I have a scenario with two columns (Empno, Ename) that contain duplicate records. How can I get the second duplicate record in DataStage?
Answer / reddy
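Source File/Table ---> Transformer ---> Target
The input must be sorted on the key column so that duplicates arrive on consecutive rows.
Transformer stage:
---------------------------
Take two stage variables. Stage variables are evaluated top to bottom, so the counter must be defined before the variable that stores the key; otherwise the comparison always sees the current row's own value.
sv1 --> If Empno = sv Then sv1 + 1 Else 1
sv --> Empno (drag and drop the column; use Ename instead if that is the column that identifies the duplicate)
In the output link constraint --> sv1 = 2
You will get the second duplicate record for each key.
Please correct me if I am wrong.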
Answer / pavani
This works when a key has n duplicates, provided the input is sorted on Empno.
Source ----> Transformer ----> Target
In the Transformer, define two stage variables in this order:
If Empno = stg_var2 Then stg_var1 + 1 Else 1 -------> stg_var1
Empno -------> stg_var2
The constraint for the target link is stg_var1 = 2.
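The stage-variable logic above can be sketched outside DataStage to check that it behaves as intended. This is only a minimal Python simulation, not DataStage code: the function name second_duplicates and the sample rows are made up for illustration, and the input is assumed to be sorted on Empno, as the Transformer would require.

```python
def second_duplicates(rows):
    """Return the second row of every Empno group.

    rows must already be sorted on Empno (assumption, as in the job).
    """
    stg_var2 = None  # key of the previous row
    stg_var1 = 0     # running count within the current key
    out = []
    for empno, ename in rows:
        # Counter first: compare against the key of the PREVIOUS row.
        stg_var1 = stg_var1 + 1 if empno == stg_var2 else 1
        # Then remember the current key for the next row.
        stg_var2 = empno
        # Output link constraint: stg_var1 = 2
        if stg_var1 == 2:
            out.append((empno, ename))
    return out


# Hypothetical sample data, sorted on Empno before processing.
rows = sorted([(10, "A"), (20, "B"), (10, "A"), (30, "C"), (20, "B"), (10, "A")])
result = second_duplicates(rows)
```

With this sample, Empno 30 occurs only once and is never emitted, while Empno 10 (three occurrences) contributes only its second row.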
Answer / narasimha rao k
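Use the Remove Duplicates stage in DataStage.
This stage lets you retain either the first or the last duplicate. Select Last. Note that this returns the second duplicate only when each key occurs exactly twice; with more occurrences it returns the last record, not the second.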
Answer / nish
Remove Duplicates stage -> retain last
Alternate method: if there are only two occurrences, Sort stage -> descending -> Remove Duplicates stage
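To see why "retain last" gives the second duplicate only when a key occurs exactly twice, here is a small Python sketch of that behaviour. It is an illustration, not the Remove Duplicates stage itself: the function name retain_last and the third column (an occurrence number) are hypothetical, added only so the rows can be told apart.

```python
from itertools import groupby
from operator import itemgetter


def retain_last(rows):
    """Keep the last row of each Empno group (first column is the key)."""
    # sorted() is stable, so rows keep their arrival order within each key.
    ordered = sorted(rows, key=itemgetter(0))
    return [list(group)[-1] for _, group in groupby(ordered, key=itemgetter(0))]


# Hypothetical sample: (Empno, Ename, occurrence number)
rows = [(10, "A", 1), (10, "A", 2), (10, "A", 3), (20, "B", 1), (20, "B", 2)]
last_rows = retain_last(rows)
```

For Empno 20 (two occurrences) the retained row is the second one, but for Empno 10 (three occurrences) it is the third, so the stage-variable counter is the safer choice when a key can repeat more than twice.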