Why we need datasets ratherthan sequential files?
Answers were Sorted based on User's Feedback
Answer / sagar
When you use sequential file as Source,at the time of
compilation it will convert to native format from
ASCII.where as,when you go for using datasets... conversion
is not required.Also, by default sequenctila files we be
processed in sequenc only.sequential files can accommodate
up to 2GB only.sequentila files does not support NULL
values....all the above can me overcome using dataset
stage....but selection is depeneds on the
requirement....suppose if you want to capture rejected
data....in that case you need to use sequential file or
fileset stage....
Is This Answer Correct ? | 22 Yes | 4 No |
Answer / kiran
First we Know about Seq file and Dataset.
Seg file is used Extract the from flatfiles and load into
flatfiles and limit is 2GB
Dataset is a intermidiate stage and it has paralism when
load data into dataset and it inprove the performanece.
Is This Answer Correct ? | 22 Yes | 11 No |
Answer / chiranjeevi.a
To over come the limitation of sequential file,espesially
sequential file: By default reads and writes sequentially
>memory limit is up to 2GB
>conversion is needed
>sourse stores the data out side of datastage
Data file:
>parallel process
>memory is unlimited
>No conversion is needed while reading the data(its
basically in native format).
>stores data inside the repositary
Is This Answer Correct ? | 11 Yes | 2 No |
Answer / narayana
file set is nothing but collection of sequential files. if sourse database is greater than 2 GB than prefer to use file set.
Data set is internal stage in datastage.the extension of data set is .ds it never used to extract data from client location .it is used as intermediate stage between two tables.
Sequential : 1)it supports all formats like txt,csv,xls,etc.
2)it is used to extract data from flat files from client location.
Data set:1)it supports only .ds format
2)it will never extract data from flat files from client location.
Is This Answer Correct ? | 4 Yes | 2 No |
Answer / pavan
datasets get processed in our local system rather than at
the server so performance gets enhanced in this case as
server is not loaded.
Is This Answer Correct ? | 0 Yes | 13 No |
How to Convert a string function to date function by using only sequential file stage with out using other stages ?
how to abort the job its matain duplicates?
Can you explain tagbatch restructure operator?
Hi guys, Design job sequence, we have 3 sources, in that 1st source in abort then only run the remaining sources.. How please design the job. Thanks.
What is the surrogate key? what is the use of surrogate key? how to Create surrogate key Generator in scd2 in 8.5?
CAN WE DECLARE DATASET SET WITH OTHER THAN .DS EXTENTION OR NOT IF YES WHAT IS THE DIFFERENCE U FIND?IF NOT WHY?
what is advantages of snowflake when it is used?
how to use self join using datastage ? can u tell me using stage how can we implemnet the self join
Describe link sort?
IN SEQUENTIAL FILE,I HAVE DATA LIKE THIS EID,ENAME 11,AA 11,CC 22,DD 33,EE 22,AA 22,BB 22,CC 11,BB THEN i SELECT perform sort ON eid,uncheck both unique,stable AND I CHOOSE hash SORTING.WHAT IS THE OUTPUT I CAN GET.what happend if i select UNIQUE,STABLE. PLEASE EXPLAIN HOW THE RECORDS DISPLAY AT TARGET.
What is apt_config in datastage?
col1 123 abc 234 def jkl 768 opq 567 789 but i want two targetss target1 contains only numeric values and target2 contains only alphabet values like trg1 123 234 768 567 789 trg2 abc def jkl opq