I have source file which contains duplicate data,my requirement is unique data

Categories | Companies | Placement Papers | Code Snippets | Certifications | Visa Questions

Post Questions | Post Answers | My Panel | Search | Topics | Errors

Categories >> Software >> Data Warehouse >> Data Stage
Suggest New Category

I have source file which contains duplicate data,my
requirement is unique data should pass to one file and
duplicate data should pass another file how?

Question Posted / ravik.datastage

7 Answers
23030 Views
CTS, I also Faced
E-Mail Answers

Answers were Sorted based on User's Feedback

I have source file which contains duplicate data,my requirement is unique data should pass to one f..

Answer / dilip anand k

Its Simple!!

All you have to do is link your source to a Sort Stage.
Sort the data and generate a Key Change column.
Key Change column = ‘1’ represents that the record is
unique while Key Change Column = ‘0’ represents the
duplicates.

Put a Filter stage and filter out the data into two
different outputs based on the generated Key Change Column.

Is This Answer Correct ?

21 Yes

5 No

I have source file which contains duplicate data,my requirement is unique data should pass to one f..

Answer / farzana kalluri

input output
1 T1 T2
2 4 1
2 6 2
1 7 3
3 4
4 5
3
5
5
6
7
for this

seq file---->Aggregate(key=id)---->filter---->2 targets

In aggregate use count rows...
in filter count=1 it goes to target1
if count=2 it goes to target2..

Is This Answer Correct ?

9 Yes

3 No

I have source file which contains duplicate data,my requirement is unique data should pass to one f..

Answer / ramachandra rao

After source use aggregator stage and use option aggregator
type is count and count the records after that use filter in
where clause count>1 ie duplicate records go to one target
and another where clause count=1 ie unique records go to
another target.

Is This Answer Correct ?

3 Yes

0 No

I have source file which contains duplicate data,my requirement is unique data should pass to one f..

Answer / sonali s

The above solution doesnt give required output. The requirement is as below:
Input:
A
B
B
C
D
D
D

Output should have 2 files as below.

File 1
A
C

File 2
B
B
D
D
D

Please provide solution for this

Is This Answer Correct ?

0 Yes

0 No

I have source file which contains duplicate data,my requirement is unique data should pass to one f..

Answer / purba

Input:
A
B
B
C
D
D
D

Required output:
A
B
C
D

Solution:
Seq file----->sort stage(create key change column for the I/p key row)
O/p:
A 1
B 1
B 0
C 1
D 1
D 0
D 0

Now take filter stage to filter for key column=0 & keycol=1
We get 2 outputs:
A. B
B. D
C. D
D

Is This Answer Correct ?

0 Yes

0 No

I have source file which contains duplicate data,my requirement is unique data should pass to one f..

Answer / riyazahamedmohamed

take two links using copystage, of your input file,one is your input file output, another one is for keychange column(using sort stage set the key change column to true) with filter "0" out of transformer, to the look up stage.set the lookup option to continue-reject.you will get the desired output.reject will capture unique records.output file will capture duplicate records.

Is This Answer Correct ?

0 Yes

0 No

I have source file which contains duplicate data,my requirement is unique data should pass to one f..

Answer / krishna

As per my knowledge
initially soure is in sequential stage anc take aggrigator
stage and select the grouping option and select which column
you want to group then go to option command and select
column for calculation and select the which column you want
to do the operation .in column for calculation w have seen
many options and select missing count column name and give
the column name for output.and add transformer stage with in
the transformer stage add constraints .and give the two outputs
if column name=1 then 1 else 0
if column name>=2 then 1 else 0
it will work

Is This Answer Correct ?

0 Yes

6 No

Post New Answer

More Data Stage Interview Questions

Hi guys, Please design a job for dis requirement with derivation(solution). my source table like dis. emp_no qualification 1 a 1 c 2 a 3 c 3 b To loaded to target like dis emp_no qualification 1 b 2 b 2 c 3 a my requirement is every employer have three qualifications i.e a,b and c. what qualification missed in source table that will be move to target systems. Hope u got it the requirement. Right Thanks.

Main Function of the Staging area in DWH ?

it is possible to load two tables data into one sequential file?if possible how?plz share with me?

How do you load 10 different sources with 10 different layouts to 10 different tables?

1 Answers TIAA CREF,

in source is like seq file in date column have dd-mm-yy dddd-mmmm-yyyy mm-dd-yy yy-dd-mm yy-mm-dd i want to display only yy-dd-mm date formats only in tgt?

2 Answers Wipro,

i have a small question for datastage, After the desinging (i.e., transformations and loading)part, what we can do?

How can we do null handling in sequential files

3 Answers Reliance,

Differentiate between Symmetric Multiprocessing and Massive Parallel Processing?

I have 2 jobs.I want to ru job B if job A has run 3 times.How can I achieve this through datastage

How to remove duplicates in transformer stage? in parallel mode

6 Answers Syntel, TCS,

in job of 30 one job is very slow due to this entire job is very slow how can u know which job is slow?

what is materialized view used datastage?

1 Answers HSBC,

For more Data Stage Interview Questions Click Here

Categories

Teradata (490)
Business Objects (875)
Cognos (1143)
Informatica (2428)
Crystal Enterprise Suite (30)
Actuate (46)
Ab Initio (442)
Data Stage (917)
SAS (1049)
Micro Strategy (282)
ETL (315)
OBIEE (92)
IBM Cognos TM1 (55)
Amazon Redshift (62)
Data Warehouse General (749)

Software Interview Questions :: Artificial Intelligence, Big Data, Python, PHP, DotNet, Java, Databases, Mobile Apps,...

Business Management Interview Questions:: Banking Finance, Business Administration, Funding, Hotel Management, Human Resources, IT Management, Industrial Management, Infrastructure Management, Marketing Sales, Operations Management, Personnel Management, Supply Chain Management,...

Engineering Interview Questions :: Aeronautical, Automobile, Bio, Chemical, Civil, Electrical, Electronics Communications, Industrial, Instrumentation, Marine, Mechanical, Mechatronics, Metallurgy, Power Plant,...

Visa Interview Questions :: USA Visa, UK Visa, Australia Visa, Canada Visa, Germany Visa, New Zealand Visa,...

Accounting Interview Questions | HR Interview Questions | Fashion & Modelling Interview Questions....

Copyright Policy | Terms of Service | Site Map | Contact Us

Copyright © 2005-2025 ALLInterview.com. All Rights Reserved.