Answer Posted / manoj
When developing a dimensional model, we often encounter miscellaneous flags and indicators. These flags do not logically belong to the core dimension tables.
A junk dimension is grouping of low cardinality flags and indicators. This junk dimension helps in avoiding cluttered design of data warehouse. Provides an easy way to access the dimensions from a single point of entry and improves the performance of sql queries.
Example: For example, assume that there are two dimension tables (gender and marital status). The data of these two tables are shown below:
Code:
Table: Gender
Id Gender_status
----------------
1 Male
2 Female
Table: Marital Status
Id Marital_Status
----------------
1 Single
2 Married
Here both the dimensions have low cardinality flags. This will cause maintenance of two tables and decrease performance of sql queries.
We can combine these two dimensions into a single table by cross joining and can maintain a single dimension table. The result of cross join is shown below:
Code:
id gender mrg_status
--------------------
1 Male Single
2 Male Married
3 Female Single
4 Female Married
This new dimension table is called a junk dimension. This will improve the manageability and improves the sql queries performance.
| Is This Answer Correct ? | 12 Yes | 0 No |
Post New Answer View All Answers
What is the difference between star and snowflake schemas?
What is Analysis Services?
What is factless fact tables?
What are the steps involved in designing a fact table?
Suppose you are filtering the rows using a filter transformation only the rows meet the condition pass to the target. Tell me where the rows will go that does not meetthe condition.
How many clustered indexes can you create for a table in dwh? In case of truncate and delete command what happens to table, which has unique id.
What is data warehouse architecture?
Explain what are fact, dimension, and measure?
How to allow a dynamic selection of a column for a measure in a chart,without using variable?
What is data analysis?
Why facts table is useful in representing the data?
Explain why are oltp database designs not generally a good idea for a data warehouse?
What is the very basic difference between data warehouse and operational databases?
What are the different types of scd's used in data warehousing?
How to provide security in frame work manager for a query subject?