Why we need datasets ratherthan sequential files?

Answers were Sorted based on User's Feedback



Why we need datasets ratherthan sequential files?..

Answer / sagar

When you use sequential file as Source,at the time of
compilation it will convert to native format from
ASCII.where as,when you go for using datasets... conversion
is not required.Also, by default sequenctila files we be
processed in sequenc only.sequential files can accommodate
up to 2GB only.sequentila files does not support NULL
values....all the above can me overcome using dataset
stage....but selection is depeneds on the
requirement....suppose if you want to capture rejected
data....in that case you need to use sequential file or
fileset stage....

Is This Answer Correct ?    22 Yes 4 No

Why we need datasets ratherthan sequential files?..

Answer / kiran

First we Know about Seq file and Dataset.

Seg file is used Extract the from flatfiles and load into
flatfiles and limit is 2GB

Dataset is a intermidiate stage and it has paralism when
load data into dataset and it inprove the performanece.

Is This Answer Correct ?    22 Yes 11 No

Why we need datasets ratherthan sequential files?..

Answer / chiranjeevi.a

To over come the limitation of sequential file,espesially
sequential file: By default reads and writes sequentially
>memory limit is up to 2GB
>conversion is needed
>sourse stores the data out side of datastage
Data file:
>parallel process
>memory is unlimited
>No conversion is needed while reading the data(its
basically in native format).
>stores data inside the repositary

Is This Answer Correct ?    11 Yes 2 No

Why we need datasets ratherthan sequential files?..

Answer / narayana

file set is nothing but collection of sequential files. if sourse database is greater than 2 GB than prefer to use file set.
Data set is internal stage in datastage.the extension of data set is .ds it never used to extract data from client location .it is used as intermediate stage between two tables.
Sequential : 1)it supports all formats like txt,csv,xls,etc.
2)it is used to extract data from flat files from client location.
Data set:1)it supports only .ds format
2)it will never extract data from flat files from client location.

Is This Answer Correct ?    4 Yes 2 No

Why we need datasets ratherthan sequential files?..

Answer / pavan

datasets get processed in our local system rather than at
the server so performance gets enhanced in this case as
server is not loaded.

Is This Answer Correct ?    0 Yes 13 No

Post New Answer

More Data Stage Interview Questions

i hav source like this . deptno,sal 1,2000 2,3000 3,4000 1,2300 4,5000 5,1100 i want target like this target1 1,2000 3,4000 4,5000 target2 2,3000 1,2300 5,1100 with out using transformer

2 Answers  


Why do we use exception activity in Datastage?

0 Answers  


i have a table col1 10 20 30 40 10 20 50 my requirement is how to retrive only duplicates like 10 10 20 20 like this how it's possible in SQL?

11 Answers   Cap Gemini,


Name the different sorting methods in datastage.

0 Answers  


Out of 4 mill records only 3 mill records are loaded to target and then job aborted. How to load only those 1 mill(not loaded records) for next run. This job is not sequential job, it is stand alone parallel job.What are the possibilities available in datastage8.1?

7 Answers   IBM,






Hi this madan, in data stage one file in Empno 12345678910 in a table, i want target is  Empno 1                                2 3 4 5 6 7 8 9 10                                                                                             

5 Answers   Tech Mahindra,


Describe stream connector?

0 Answers  


IS IT POSSIBLE TO USE DATASET AS A SOURCE FILE?

3 Answers   Infosys,


HOw Hash Partion Works Thank you in Advance i have doubts on Hash Partion TEch Could please give me the clear understandable notation example e_id,dept_no 1,10 2,10 3,20 4,20 5,30 6,40 i have TWo Nodes/Three Nodes My questions are: 1).if i select hash key as e_id how Hash partion will distribute the data in to two NOdes/three NOdes 2).if i select hash key as dept_no how Hash partion will distribute the data in to two NOdes/three NOdes sivakumar.katta7@gmail.com

5 Answers   FIS,


why dataset ?

2 Answers  


In which situations we can use normal and sparse lookup stages

4 Answers   IBM,


if we using two sources having same meta data and how to check the data in two sources is same or not? and if the data is not same i want to abort the job ?how we can do this?

1 Answers   IBM,


Categories