What is Skew Factor? Explain.

Answer Posted / sat!sh

Skew Factor describes how unevenly a table's rows are distributed
among the AMPs.

Generally, with a Non-Unique PI we get duplicate values. The more
duplicate values we get, the more rows share the same row hash,
and all rows with the same row hash go to the same AMP. This
makes the data distribution unequal.
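
As a rough illustration of the point above, here is a small Python sketch. It is not Teradata's real hashing: `zlib.crc32` modulo a made-up `NUM_AMPS` stands in for the row hash and hash map, and the PI values are invented. It only shows that identical PI values always land on the same AMP, so a heavily duplicated value piles up on one AMP.

```python
import zlib
from collections import Counter

NUM_AMPS = 4  # hypothetical system size, chosen only for illustration


def amp_for(pi_value: str, num_amps: int = NUM_AMPS) -> int:
    """Stand-in for row hashing: the same PI value always maps to the same AMP.

    Assumption: crc32 modulo the AMP count replaces Teradata's actual
    row hash / hash bucket / hash map lookup.
    """
    return zlib.crc32(pi_value.encode("utf-8")) % num_amps


def rows_per_amp(pi_values, num_amps: int = NUM_AMPS):
    """Count how many rows each AMP would receive."""
    counts = Counter(amp_for(v, num_amps) for v in pi_values)
    return [counts.get(amp, 0) for amp in range(num_amps)]


# Nearly unique PI values: rows spread over all AMPs.
unique_like = [f"cust{i}" for i in range(10_000)]

# Non-unique PI where 90% of the rows share one value: they all hit one AMP.
duplicated = ["N/A"] * 9_000 + [f"cust{i}" for i in range(1_000)]

even = rows_per_amp(unique_like)
skewed = rows_per_amp(duplicated)

print("near-unique PI :", even)
print("duplicated PI  :", skewed)
```

With the duplicated list, whichever AMP receives "N/A" holds at least 9,000 of the 10,000 rows, no matter how many AMPs there are.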

One AMP stores more data and the other AMPs store less. When we
access the full table, the AMP that holds more data takes longer
and keeps the other AMPs waiting, which wastes processing
capacity.

In this situation (unequal distribution of data) the Skew Factor
is high.

For this type of table we should avoid full table scans.

Ex:
AMP0             AMP1
1,000,000 (10%)  9,000,000 (90%)

In this situation the skew is very high: AMP1 holds 90% of the rows.
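
To put numbers on a 10%/90% split across two AMPs, here is a minimal Python sketch. The row counts are assumed for illustration. The answer does not give a formula; the `skew_factor` function uses one commonly quoted convention, (1 - average per AMP / maximum per AMP) * 100, which is an assumption here, and `largest_amp_share` simply reports the share held by the fullest AMP.

```python
def largest_amp_share(rows_per_amp):
    """Percentage of all rows held by the fullest AMP."""
    total = sum(rows_per_amp)
    if total == 0:
        raise ValueError("table has no rows")
    return 100.0 * max(rows_per_amp) / total


def skew_factor(rows_per_amp):
    """Skew as (1 - avg/max) * 100.

    Assumption: this is one common convention, not a formula taken
    from the answer above. 0 means perfectly even distribution.
    """
    biggest = max(rows_per_amp)
    if biggest == 0:
        raise ValueError("table has no rows")
    average = sum(rows_per_amp) / len(rows_per_amp)
    return 100.0 * (1.0 - average / biggest)


# Assumed row counts for a two-AMP system with a 10% / 90% split.
counts = [1_000_000, 9_000_000]

print(f"largest AMP share: {largest_amp_share(counts):.1f}%")
print(f"skew factor      : {skew_factor(counts):.1f}%")
```

The two measures differ: the fullest AMP holds 90% of the rows, while the (1 - avg/max) convention gives a lower figure for the same data, so it helps to say which one is meant when quoting a skew percentage.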



