Answer Posted / sat!sh
The data distribution of table among AMPs is called Skew Factor
Generally For Non-Unique PI we get duplicate values so the
more duplicate vales we get more the data have same rowhash
so all the same data will come to same amp, it makes data
distribution inequality,
One amp will store more data and other amp stores less
amount of data, when we are accessing full table, The amp
which is having more data will take longer time and makes
other amps waiting which leads processing wastage
In this situation (unequal distribution of data)we get Skew
Factor High
For this type of tables we should avoid full table scans
ex:
AMP0 AMP1
10000(10%) 9000000(90%)
in this situation skew factor is very high 90%
| Is This Answer Correct ? | 79 Yes | 3 No |
Post New Answer View All Answers
Difference between star and snowflake schemas?
What are the string manipulation operators and functions associated with teradata?
How many codd's rules are satisfied by teradata database?
Which is faster fastload or multiload?
If I wanted to run a TPump job only once per day - basically working on a file that is produced once per day - how would you set up the parameters for that sort of job ?
Explain the term 'columns' related to relational database management system?
Explain the term 'database' related to relational database management system?
What are the available join types in teradata?
What are the uses of bynets in multi-node systems?
What happens in a conflict? How do you handle that?
Explain fallback in teradata?
What are default access rights in teradata? What explicit right can be given to a user?
What are the scenarios in which full table scans occurs?
How to view every column and the columns contained in indexes in teradata?
What are the multiload utility limitations?