what is a junk dimension ?
Answers were Sorted based on User's Feedback
Answer / vijay
A "junk" dimension is a collection of random transactional
codes, flags and/or text attributes that are unrelated to
any particular dimension.
The junk dimension is simply a structure that provides a
convenient place to store the junk attributes. A good
example would be a trade fact in a company that brokers
equity trades.
The fact would contain several metrics (principal amount,
net amount, price per share, commission, margin amount,
etc.) and would be related to several dimensions such as
account, date, rep, office, exchange, etc. This fact would
also contain several codes and flags that were related to
the transaction rather than any of the dimensions ... such
as origin code (that indicates whether the trade was
initiated with a phone call or via the Web), a reinvest
flag (that indicates whether or not this trade as was the
result of the reinvestment of a dividend payout) and a
comment field for storing special instructions from the
customer.
These three attributes would normally be
removed from the fact table and stored in a junk
dimension ... perhaps called the trade dimension. In this
way, the number of indexes on the fact table would be
reduced, and performance (not to mention ease of use) would
be enhanced. Hope this helps.
Is This Answer Correct ? | 57 Yes | 0 No |
Answer / aparna
A junk dimension is a convenient grouping of flags and
indicators. It's helpful, but not absolutely
required, if there's a positive correlation among the
values. The benefits of a junk dimension
include: ? Provide a recognizable, user-intuitive location
for related codes, indicators and their
descriptors in a dimensional framework.
? Clean up a cluttered design that already has too many
dimensions. There might be five
or more indicators that could be collapsed into a single 4-
byte integer surrogate key in
the fact table.
? Provide a smaller, quicker point of entry for queries
compared to performance from
constraining directly on these attributes in the fact
table. If your database supports bitmapped
indices, this potential benefit may be irrelevant, although
the others are still
valid.
Is This Answer Correct ? | 20 Yes | 3 No |
Answer / srinu
junk dim is convient for group of flags and attributes to
get them out of a fact table inti useful dimension framework
Is This Answer Correct ? | 15 Yes | 5 No |
Answer / pullareddy palle
junk dimensions are the dimensions which can not participate
in mapping data flow(not useful). like boolean, flags
Is This Answer Correct ? | 7 Yes | 0 No |
Answer / arpita
A dimension which cannot be used to indicate the key
performance indicators is known as junk dimension.....
eg: Address,Phone number,Faxno,Description......etc...
Is This Answer Correct ? | 7 Yes | 3 No |
Answer / mohan
junk dimention is a dimention table which holds only flags
or status where dimension field values are like True/False,
open/Close,complete/incomplete/inprogress, etc.
Is This Answer Correct ? | 4 Yes | 0 No |
Answer / ramesh
what is junk dimension? Give an example
When developing a dimensional model, we often encounter miscellaneous flags and indicators. These flags do not logically belong to the core dimension tables.
A junk dimension is grouping of low cardinality flags and indicators. This junk dimension helps in avoiding cluttered design of data warehouse. Provides an easy way to access the dimensions from a single point of entry and improves the performance of sql queries.
Example: For example, assume that there are two dimension tables (gender and marital status). The data of these two tables are shown below:
Code:
Table: Gender
Id Gender_status
----------------
1 Male
2 Female
Table: Marital Status
Id Marital_Status
----------------
1 Single
2 Married
Here both the dimensions have low cardinality flags. This will cause maintenance of two tables and decrease performance of sql queries.
We can combine these two dimensions into a single table by cross joining and can maintain a single dimension table. The result of cross join is shown below:
Code:
id gender mrg_status
--------------------
1 Male Single
2 Male Married
3 Female Single
4 Female Married
This new dimension table is called a junk dimension. This will improve the manageability and improves the sql queries performance.
Is This Answer Correct ? | 4 Yes | 0 No |
Answer / vamshi
In data warehouse design, frequently we run into a situation where there are yes/no indicator fields in the source system. However, if u keep all those indicator fields in the fact table, not only do we need to build many small dimension tables, but the amount of information stored in the fact table also increases tremendously,
Junk dimension is the way to solve this problem. In a junk dimension, we combine these indicator fields into a single dimension. This way, we'll only need to build a single dimension table, and the number of fields in the fact table, as well as the size of the fact table, can be decreased
The content in the junk dimension table is the combination of all possible values of the individual indicator fields
Is This Answer Correct ? | 1 Yes | 0 No |
Answer / prakash m
A convenient grouping of random flags and attributes to get
them out of the fact table
Retain only useful fields
Remove fields that make no sense at all
Remove fields that are inconsistently filled
Remove fields that are of operational interest only
Design similar to demographics; maximum unique
combinations, assign integer key, plug into fact
Create new combination (insert new dimension record) at ETL
run-time
E.g. Yes/No Flags in old retail transaction data
Is This Answer Correct ? | 2 Yes | 2 No |
What is an active transformation?
what is the main advantage of unconnected lookup
without using emailtask how will send a mail from informatica?
i have a table like empid ename year month sal 1 x 98 jan 500 1 x 98 feb 500 1 x 98 mar 500 1 x 99 jan 600 1 x 99 feb 600 2 y 98 jan 600 2 y 98 feb 600 2 y 98 mar 600 2 y 99 jan 700 2 y 99 jan 700 and so on i want to find out totsal for every emp on year wise plz help me
How can you recover the session in sequential batches?
i have a data in my source as a flat files what test i have to perform the next step can any body help to me
How many transformations can be used in mapplets.
Explain pushdown optimization $pushdownconfig parameter - informatica
if i hv 6 table as source table but can i make 12 as a dim table and 6 as fact table belongs to that src table.
Enlist the tasks for which source qualifier transformation is used.
How to join three sources using joiner?
Without source how to insert record to target?