What is the difference between the Sequential File and File Set stages?
Answer / aparna kanduri
A file set can be stored across multiple Unix files as flat files.
A file set consists of a descriptor file and individual raw data
files. The number of raw data files depends on the configuration file.
Some file systems impose a limit on the size of a single file,
such as 2 GB. Distributing the data over nodes prevents
overruns.
In such cases a file set is more useful than a sequential file.
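
To make the descriptor-plus-data-files idea concrete, here is a minimal Python sketch. It is not how DataStage actually stores a file set: the JSON descriptor layout, the round-robin distribution, and the names write_file_set and read_file_set are all assumptions made for illustration only.

```python
import json
import os
import tempfile


def write_file_set(records, num_nodes, directory, name="demo"):
    """Spread records round-robin over one data file per node and
    write a descriptor file that lists those data files.

    Hypothetical layout: the descriptor here is plain JSON, which is
    NOT the real DataStage descriptor format.
    """
    data_paths = [
        os.path.join(directory, f"{name}.part{i}.dat") for i in range(num_nodes)
    ]
    handles = [open(p, "w", encoding="utf-8", newline="\n") for p in data_paths]
    try:
        for index, record in enumerate(records):
            # Record k goes to data file k mod num_nodes.
            handles[index % num_nodes].write(record + "\n")
    finally:
        for handle in handles:
            handle.close()

    descriptor_path = os.path.join(directory, f"{name}.fs")
    with open(descriptor_path, "w", encoding="utf-8") as descriptor:
        json.dump({"data_files": data_paths}, descriptor)
    return descriptor_path


def read_file_set(descriptor_path):
    """Read every data file named in the descriptor, one file after another."""
    with open(descriptor_path, encoding="utf-8") as descriptor:
        data_paths = json.load(descriptor)["data_files"]
    records = []
    for path in data_paths:
        with open(path, encoding="utf-8") as data:
            records.extend(line.rstrip("\n") for line in data)
    return records
```

Because no single data file holds all the records, each file stays smaller than the whole data set, which is the point of spreading data over nodes when the file system caps the size of one file.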
Answer / san
Sequential File stage:
----------------------
By default the Sequential File stage runs in sequential mode.
You can make it read in parallel by setting the option
"number of readers per node" to a value greater than 1.
The Sequential File stage reads data from, or writes data to,
flat files such as .txt and .dat files. A single sequential file
is subject to the 2 GB file size limit.
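
As a rough illustration of what several readers on one file can look like, the sketch below splits a file into byte ranges and lets each reader take the lines that start inside its range. This is only a guess at the general idea, not DataStage's implementation; read_range, parallel_read, and the use of a thread pool are assumptions for the example.

```python
import os
import tempfile
from concurrent.futures import ThreadPoolExecutor


def read_range(path, start, end):
    """Return the lines whose first byte lies in the range [start, end)."""
    lines = []
    with open(path, "rb") as f:
        if start > 0:
            # Step back one byte and read to the next newline, so that
            # a line cut by the range boundary is left to the previous reader.
            f.seek(start - 1)
            f.readline()
        while f.tell() < end:
            line = f.readline()
            if not line:
                break
            lines.append(line.rstrip(b"\n").decode("utf-8"))
    return lines


def parallel_read(path, readers):
    """Read one file with several readers, each owning one byte range."""
    size = os.path.getsize(path)
    bounds = [size * i // readers for i in range(readers + 1)]
    ranges = list(zip(bounds, bounds[1:]))
    with ThreadPoolExecutor(max_workers=readers) as pool:
        # pool.map keeps the order of the ranges, so line order is preserved.
        chunks = pool.map(lambda r: read_range(path, r[0], r[1]), ranges)
        return [line for chunk in chunks for line in chunk]
```

With readers set to 1 this reduces to an ordinary sequential read. With a larger value every line is still read exactly once, just by different readers.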
File Set stage:
---------------
By default the File Set stage runs in parallel mode. It can store
more than 2 GB of data. A file set holds two types of information:
i) the file descriptor, which points to the metadata and the data locations
ii) the data itself, split across multiple files if your
configuration file defines more than one node
Answer / venugopal [patni]
The Sequential File stage is used to read data sequentially.
It can be configured to execute in either parallel or
sequential mode. We can't perform lookups using a sequential
file.
The File Set stage is used to import a list of exported files.
It executes only in parallel mode.
Its main advantage relates to the 2 GB limit on the size of a
file: we need to distribute files among the nodes to
prevent overruns.