Golgappa.net | Golgappa.org | BagIndia.net | BodyIndia.Com | CabIndia.net | CarsBikes.net | CarsBikes.org | CashIndia.net | ConsumerIndia.net | CookingIndia.net | DataIndia.net | DealIndia.net | EmailIndia.net | FirstTablet.com | FirstTourist.com | ForsaleIndia.net | IndiaBody.Com | IndiaCab.net | IndiaCash.net | IndiaModel.net | KidForum.net | OfficeIndia.net | PaysIndia.com | RestaurantIndia.net | RestaurantsIndia.net | SaleForum.net | SellForum.net | SoldIndia.com | StarIndia.net | TomatoCab.com | TomatoCabs.com | TownIndia.com
Interested to Buy Any Domain ? << Click Here >> for more details...


Explain about Skew Factor?

Answers were Sorted based on User's Feedback



Explain about Skew Factor?..

Answer / sat!sh

The data distribution of table among AMPs is called Skew Factor

Generally For Non-Unique PI we get duplicate values so the
more duplicate vales we get more the data have same rowhash
so all the same data will come to same amp, it makes data
distribution inequality,

One amp will store more data and other amp stores less
amount of data, when we are accessing full table, The amp
which is having more data will take longer time and makes
other amps waiting which leads processing wastage

In this situation (unequal distribution of data)we get Skew
Factor High

For this type of tables we should avoid full table scans

ex:
AMP0 AMP1
10000(10%) 9000000(90%)

in this situation skew factor is very high 90%

Is This Answer Correct ?    79 Yes 3 No

Explain about Skew Factor?..

Answer / sricharan

It is a number.It tells you how the data is distributed
among Processors or Amps.Skew factor varies about 0-100.
If the data is distributed evenly among the processor the
skew factor is ZERO whether it is smaller or large table.
Skew factor doesn't depends on size of table but it only
depends on distribution of data.

If the skewfactor is more we have to access the table on
only primary index columns but not whole table.

Is This Answer Correct ?    26 Yes 0 No

Explain about Skew Factor?..

Answer / yuvaevergreen

Skew Factor is the indication of how evenly the data is
spread across the AMPS.

A skew factor of 0 indicates that the data is perfectly
distributed across all the AMPS.

Is This Answer Correct ?    17 Yes 1 No

Explain about Skew Factor?..

Answer / navaneeth reddy

Skew factor is distribution of rows of a table among the
available no.of AMP's.
If your table has a chance of using unique primary index,it
is always better to use UPI which ensures the skew factor
around 0%.
If there is no chance of having unique values column in a
table choose a column as PI(primary index) which has less
duplicate values which inturn results in less skew factor.
That is the data will be distributed almost(not exactly
equal percentage) equally to all AMP's.

Is This Answer Correct ?    12 Yes 0 No

Explain about Skew Factor?..

Answer / yuvaevergreen

Below query can be used to find the distribution by amps.
SELECT HASHAMP(HASHBUCKET(HASHROW(index or column)))
,COUNT(*)
FROM TABLENAME GROUP BY 1 ORDER BY 2 DESC;

Is This Answer Correct ?    3 Yes 0 No

Explain about Skew Factor?..

Answer / srinu

The Table is having too many Duplicate rows, The Skew table
will hadle the duplicate rows up to limitations,Once its
cross the no of Rows it will effect on Performence.The Dba
people will the SKEW tables.

I think this Answer helps u

Is This Answer Correct ?    10 Yes 29 No

Post New Answer

More Teradata Interview Questions

If a Node is busy what are the steps you can take to avoid ?

0 Answers   Teradata,


How a Referential integrity is handled in Teradata?

5 Answers  


What are the 5 phases in a multiload utility?

0 Answers  


can i call router is a passive transformation

2 Answers  


I WANT TO LEARN TERA-DATA ,SO CAN ANY BODY PLZ REFER WHAT ARE THE TOPICS I HAVE TO GO THROUGH, TO GET ASAP JOB ,SO PLZ REFER ME WHERE I CAN GET NICE COACHING ON TERADATA.

13 Answers  


Where we use PPI in real time??? What is the disadvantages of PPI?

3 Answers   IBM, Mphasis,


Let us say there is a file that consists of 100 records out of which we need to skip the first and the last 20 records. What will the code snippet?

0 Answers  


What is called partitioned primary index (ppi)?

0 Answers  


Can we load a Multi set table using MLOAD?

2 Answers  


What are the components provided on node?

0 Answers  


Highlight the differences between Primary Key and Primary Index.

0 Answers  


Which is more efficient group by or distinct to find duplicates?

0 Answers  


Categories