1)How to Duplicate Records Delete in Sequential file?

Answers were Sorted based on User's Feedback



1)How to Duplicate Records Delete in Sequential file?..

Answer / naga

connect sequential file to dataset and use partition
technique and perform sort and select unique

Is This Answer Correct ?    19 Yes 6 No

1)How to Duplicate Records Delete in Sequential file?..

Answer / vinod upputuri

In the sequential File stage there is a option called filter.
there we can use UNIX command.

syntax for Remove duplicate: uniq or sort -u

Is This Answer Correct ?    11 Yes 1 No

1)How to Duplicate Records Delete in Sequential file?..

Answer / subhash

small correction to Vinod answer,
If we use UNIQ, it will remove duplicate if they are
consecutive only..
so, better to use
sort -u
or
sort file_name | uniq

Is This Answer Correct ?    9 Yes 0 No

1)How to Duplicate Records Delete in Sequential file?..

Answer / topper

Select the data set and press delete..

Is This Answer Correct ?    0 Yes 15 No

Post New Answer

More Data Stage Interview Questions

What modeling tool do you use?

6 Answers   HP,


Parallel job contains more than 20 stages. I want to find out which stage is more performance incentive.

1 Answers   IBM,


Differentiate between Symmetric Multiprocessing and Massive Parallel Processing?

0 Answers  


which r the connectors used in san?

0 Answers  


how to call sequential generator in datastage?

1 Answers   IBM,






how to run a sequential file stage in parallel if the stage is used on the TARGET side

0 Answers   Virtusa,


Define oconv () and iconv () functions in datastage?

0 Answers  


What is configuration your file structure 2)I have two databases both are Oracle while loading data from source to target the job takes 30 min but I want to load less time how?

1 Answers   Hexaware,


how does work server jobs?

1 Answers  


What are the unit test cases you used in your project?

1 Answers   CSC, HY,


how to design the change capture stage in(data stage parallel jobs) type 2

2 Answers   IBM,


Emp login_timestamp Logout_timestamp A,2019-02-01 02:24:15,2019-02-01 04:59:42 B,2019-03-29 14:43:30,2019-03-29 20:22:00 ABC,2019-03-29 12:43:00,2019-03-29 23:22:59 In the above calculate the duration of hours spent in office for each emp in datastage.

1 Answers  


Categories