What is the difference between lookup and sparse lookup?



What is the difference between lookup and sparse lookup?..

Answer / subhash

Normal lookup will provide data for an in-memory look up
whereas Sparse lookup will access the database directly.
Normal might provide poor performance when the reference
data is huge as it has to load large data into memory.
In such scenarios a join stage may work better(It will put
the data onto an internal dataset file for each link,
and then do the join based on the data provided). If your
stream/source data is small then a sparse lookup is
suggested
as a row is retrieved from the database for each input row
instead of the full reference data being brought into
memory.
Sparse lookup sends individual SQL statement for every
incoming row (If stream/source data is huge you can imagine
the number
of times it has to hit DB and hence the down side on
performance). It can be used when you want to get the next
sequence
number from your database (Again expensive overhead on your
job as noted before). Also note that sparse lookup is only
available for DB2 and Oracle. Normal Lookup stage can have
multiple reference link but Sparse can only have one
reference
link. When normal is used it is a good practice to choose
less volume data as reference data.

Is This Answer Correct ?    15 Yes 1 No

Post New Answer

More Data Stage Interview Questions

in aggregator , how can i get the sum in readable format

4 Answers   IBM,


in sequtial file 2 columns avaliable, i want only one column load the target. for this we can do by modify and copy stage. But here when using modify stage (in property drop column1) until it is ok. if target is data set How to view the data. with out using data management. what is the reason for this. if any body know this answer plz tel me. thanks.

1 Answers   IBM,


1.what is materialized data? 2.how to view the materialized data?

0 Answers   HCL, IBM,


what is the difference between lookup stage reject link and merge stage reject link in datastage Parallel jobs?

9 Answers   HCL,


What is aggtorec restructure operator?

0 Answers  






Hi guys, Design job sequence, we have 3 sources, in that 1st source in abort then only run the remaining sources.. How please design the job. Thanks.

2 Answers   IBM,


what are fact tables and dimension tables? give example assuming one table.

3 Answers   IBM,


how to load meta data

2 Answers   IBM,


what is sparse lookup?

7 Answers   IBM,


What are the features of datastage flow designer?

0 Answers  


what is the differeces between hash and modulus partition methods

4 Answers   TCS,


How can you write parallel routines in datastage PX?

0 Answers  


Categories