The source file has columns like this:
name company
krish IBM
pooja TCS
nandini WIPRO
krish IBM
pooja TCS
If a row is repeated, I want the result like this:
name company count
krish IBM 1
pooja TCS 1
nandini WIPRO 1
krish IBM 2
pooja TCS 2


Answer / disney

First sort on both name and company, then
use stage variables in the Transformer, in this order:
curr = name : company
val = if curr <> prev then 1 else val + 1
prev = curr

Output columns:
name, company, val
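
The stage-variable logic above can be sketched outside DataStage, for example in Python. This is only an illustration under assumptions: the function name `running_count` and the in-memory list of tuples are hypothetical, and the Transformer's row-by-row evaluation is imitated with a plain loop.

```python
def running_count(rows):
    """Sort on (name, company), then number each repeat within its group."""
    out = []
    prev = None  # key of the previous row (plays the role of 'prev')
    val = 0      # running count (plays the role of 'val')
    for name, company in sorted(rows):
        curr = (name, company)  # plays the role of name:company
        val = 1 if curr != prev else val + 1
        prev = curr
        out.append((name, company, val))
    return out


# Sample rows taken from the question.
rows = [("krish", "IBM"), ("pooja", "TCS"), ("nandini", "WIPRO"),
        ("krish", "IBM"), ("pooja", "TCS")]
result = running_count(rows)
```

The rows come out grouped by key, so a further sort on the count is still needed to bring the first occurrences ahead of the repeats, as in the question.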


Answer / ankit gosain

Hi ALL,

Job Design:

SourceSeqFile--->SortStage--->Transformer--->TgtSeqFile

1. In the Sort stage, use two keys, name and company, then go
to Options and create a key change column (keyChange).
2. In the Transformer stage, create a stage variable of integer
type (say Var1) and write in its derivation:
if keyChange=1 then 1 else Var1+1
3. Now create a new column in the target (say count) and, in the
Transformer, assign Var1 to the derivation of count.
4. Go to the output tab of the Transformer and sort the data on the
count column.
You'll get the desired output.
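
The four steps above can be sketched in Python to see what they produce. This is a rough imitation, not DataStage code: the helpers `with_key_change` and `count_and_order` are hypothetical names, and the key change column is assumed to be 1 on the first row of each (name, company) group and 0 otherwise.

```python
def with_key_change(rows):
    """Imitate a sort that flags the first row of each key group with 1."""
    flagged = []
    prev = None
    for row in sorted(rows):
        flagged.append((row[0], row[1], 1 if row != prev else 0))
        prev = row
    return flagged


def count_and_order(rows):
    """Apply the Var1 derivation, then sort the output on the count."""
    var1 = 0
    counted = []
    for name, company, key_change in with_key_change(rows):
        var1 = 1 if key_change == 1 else var1 + 1
        counted.append((name, company, var1))
    # Step 4: sort on count. Python's sort is stable, so rows with the
    # same count keep their (name, company) order.
    return sorted(counted, key=lambda r: r[2])


# Sample rows taken from the question.
rows = [("krish", "IBM"), ("pooja", "TCS"), ("nandini", "WIPRO"),
        ("krish", "IBM"), ("pooja", "TCS")]
ordered = count_and_order(rows)
```

In this sketch all first occurrences come before the second occurrences, with rows inside each count ordered by name and company.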

If you have more queries, you can mail me on
ankitgosain@gmail.com

Cheers,
Ankit :)


Answer / shasank, pr3systems

Use a Transformer stage with three stage variables, each with an initial value of 0.
For the first, give the derivation (if company = 'IBM' then stgvar1 + 1 else stgvar1),
and give the same kind of condition for the remaining two companies in the other two stage variables.
In the column derivation, give: if company = 'IBM' then stgvar1 else
if company = 'WIPRO' then stgvar2 else if company = 'TCS' then stgvar3 else 0 (company not in list)
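
As a rough illustration of this per-company counter idea, here is a Python sketch. It swaps the three hard-coded stage variables for a dictionary keyed by company; the function name `count_per_company` and the default company list are assumptions made for the example.

```python
def count_per_company(rows, companies=("IBM", "WIPRO", "TCS")):
    """Keep one counter per listed company; unlisted companies get 0."""
    counters = {c: 0 for c in companies}  # each counter starts at 0
    out = []
    for name, company in rows:
        if company in counters:
            counters[company] += 1
            count = counters[company]
        else:
            count = 0  # company not in list
        out.append((name, company, count))
    return out


# Sample rows taken from the question, in their original order.
rows = [("krish", "IBM"), ("pooja", "TCS"), ("nandini", "WIPRO"),
        ("krish", "IBM"), ("pooja", "TCS")]
per_company = count_per_company(rows)
```

No sort is involved, so the input order is preserved. Note that the counter is keyed on company alone, so two different names in the same company would share one counter.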


Answer / pushparaj

I just modified the last post with minor changes.

First sort on both name and company, then
use stage variables in the Transformer:
sv1 = name : company
sv3 = if sv1 = sv2 then sv3 + 1 else 1 (initial value of sv3 = 1)
sv2 = sv1 (initial value of sv2 = XXXXXXX)

Output:

name, company, count = sv3


Answer / srinivas

Create four stage variables (the input must already be sorted on name and company so that repeated rows are adjacent):

STG1 = NAME:COMPANY

STG2 = 1

STG3 = IF STG1 = STG4 THEN STG3 + 1 ELSE STG2

STG4 = STG1

Create an extra column to store these ranks.


After this stage, use a Sort stage to sort the data on the ranking column.

Then we get exactly the output we are looking for:

krish IBM 1
pooja TCS 1
nandini WIPRO 1
krish IBM 2
pooja TCS 2


Answer / sankar

Use an Aggregator stage and take count(*) grouped by name.


Answer / abc

We can use an Aggregator stage and group on name and company.
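
What grouping on name and company yields can be sketched in Python, with `collections.Counter` standing in for the Aggregator stage. This is an assumption made for illustration, not DataStage code, and `group_counts` is a hypothetical helper name.

```python
from collections import Counter


def group_counts(rows):
    """Count how many times each (name, company) pair occurs."""
    return Counter(rows)


# Sample rows taken from the question.
rows = [("krish", "IBM"), ("pooja", "TCS"), ("nandini", "WIPRO"),
        ("krish", "IBM"), ("pooja", "TCS")]
totals = group_counts(rows)
```

In this sketch the grouping gives one total per distinct pair (three entries for the sample data), not a running number on each of the five input rows, so the per-row count in the question would still need a further step.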


Answer / anand

We can use an Aggregator stage to count the repeated records, with Aggregation Type = Count Rows.


Answer / reddymkl.dwh

SeqFile---->Sort---->Transformer---->Dataset

Sort stage: Key = Company (or Name)

Transformer
----------

SV1 = If dslink.Company = SV Then SV1 + 1 Else 1
SV = dslink.Company

Add a count column in the Transformer stage.
Map the SV1 stage variable to the count column (count = SV1).

This will give the desired output.

Please correct me if I am wrong.

