souce file having the columns like
name company
krish IBM
pooja TCS
nandini WIPRO
krish IBM
pooja TCS
if first row will be repeat i want the result like this
name company count
krish IBM 1
pooja TCS 1
nandini WIPRO 1
krish IBM 2
pooja TCS 2
Answers were Sorted based on User's Feedback
Answer / disney
first sort the both name and compamy and then,
using stage variable in transformer:
curr= name:company
val=if curr <> prev then 1 else val+1
prev=curr
o.p = val
name,company,val
| Is This Answer Correct ? | 6 Yes | 2 No |
Answer / ankit gosain
Hi ALL,
Job Design:
SourceSeqFile--->SortStage--->Transformer--->TgtSeqFile
1. In Sort Stage, take two key, name & company and then go
to options and create a keyChange column.
2. In transformer stage, create a stage variable of integer
type (say Var1) and write in it's derivation:
if keyChange=1 then 1 else Var1+1
3. Now create a new column in tgt (say count) and in
transformer, assign that Var1 to the derivation of count.
4. Goto o/p tab of transformer and there sort the data on
count column.
You'll get the desired output.
If you have more queries, you can mail me on
ankitgosain@gmail.com
Cheers,
Ankit :)
| Is This Answer Correct ? | 3 Yes | 0 No |
Answer / shasank, pr3systems
use transforemr stage and give three stage variable with intial value as 0,
then give (if company=ibm then count=count+1 else stgvar1)
like give three conditions for remaing two companies in two stage variables)
and in column derivation give if company=ibm then stgvar1 else
if company=wipro then stgvar2 else if company=tcs then stgvar3 else 0(company not in list)
| Is This Answer Correct ? | 7 Yes | 5 No |
Answer / pushparaj
Just modified the last post with minor changes
First sort the both name and compamy and then,
using stage variable in transformer:
sv1= name:company
sv3=if sv1=sv2 then sv3+1 else sv3(initial value of sv3=1)
sv2=sv1(initial value of sv2 =XXXXXXX)
OP
name company count=sv3
| Is This Answer Correct ? | 3 Yes | 2 No |
Answer / srinivas
Use below stage variables
create 4 stage variables
STG1 =NAME:COMPANY
STG2=1
STG3= IF STG1:STG4 THEN STG3+1 ELSE STG2
STG4= STG1
create extra column for store this ranks.
After this stage use the sort stage to sort the data based on ranking column.
Then will get the exact output what we are looking.
krish IBM 1
pooja TCS 1
nandini WIPRO 1
krish IBM 2
pooja TCS 2
| Is This Answer Correct ? | 1 Yes | 0 No |
Answer / sankar
use aggregator stage take count(*) group by name
| Is This Answer Correct ? | 1 Yes | 0 No |
Answer / abc
we can use aggregator stage.. nad group on the basis of name and company...
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / anand
we can use aggregator stage to count the repeated records and WHERE AGGREGATION TYPE:COUNT ROWS
| Is This Answer Correct ? | 0 Yes | 0 No |
SeqFile---->Sort---->Trms---->Dataset
Sort_Stage: Key=Company (Or) Name
Trms
----------
SV=Company
SV1= If dslink.Company=SV Then SV+1 Else 1
Add (count) column in Transformer stage
Map SV1 stage variable to Count column (SV1=Count)
Will get desired output..
Plz correct me if am wrong....
| Is This Answer Correct ? | 0 Yes | 0 No |
Does datastage support slowly changing dimensions ?
Explain the functionality of your current project?
1 Answers IBM, Miles Software,
How to manage date conversion in Datastage?
How to display errors in datastage?
Hi guys, please design job for this, MY INPUT IS COMPANY,LOCATION IBM,CHENNAI IBM,HYDRABAD IBM,PUNE IBM,BANGLOORE TCS,CHENNAI TCS,MUMBAI TCS,BANGLOORE WIPRO,HYDRABAD WIPRO,CHENNAI HSBC,PUNE MY OUTPUT IS COMPANY,LOCATION,COUNT IBM,chennai,hydrabad,pune,banglore,4 TCS,chennai,mumbai,bangloore,3 WIPRO,hydrabad,chennai,2 HSBC,pune,1 Thanks
CAN ANY ONE TELL ME THE ARCHITECTURE OF DATASTAGE CLEARLY....
What is aggtorec restructure operator?
i have source like deptno,sal 1,2000 2,3000 3,4000 1,2300 4,5000 5,1100 i want target like target1 deptno,sal 1,2000 3,4000 4,5000 target2 2,3000 1,2300 5,1100 with out using transformerstage
What is a quality stage?
detail about sdlc
how to create document in datastage?
i have source data like empno,enmae 11 ,aa 12 ,bb i want output like empno,ename 11 ,aa 12 ,bb 11 ,aa 12 ,bb