HOw Hash Partion Works
Thank you in Advance
i have doubts on Hash Partion TEch Could please give me the
clear understandable notation
example
e_id,dept_no
1,10
2,10
3,20
4,20
5,30
6,40
i have TWo Nodes/Three Nodes
My questions are:
1).if i select hash key as e_id how Hash partion will
distribute the data in to two NOdes/three NOdes
2).if i select hash key as dept_no how Hash partion will
distribute the data in to two NOdes/three NOdes
sivakumar.katta7@gmail.com
Answers were Sorted based on User's Feedback
Well The basic idea is:
Same key column values are given to the same node.
Hence
1.(e_id)
node1 node2 node1 node2 node3
1,10 2,10 1,10 2,10 3,20
3,20 4,20 4,20 5,30 6,40
5,30 6,40
2. (dept_no)
node1 node2 node1 node2 node3
1,10 3,20 1,10 3,20 5,30
2,10 4,20 2,10 4,20 6,40
5,30 6,40
(This is where pranay went wrong 6,40 will go to node3
instead of node1.)
Is This Answer Correct ? | 1 Yes | 0 No |
Consider you r having two nodes node 1 and node and u
selected e_id as hash key then
for two nodes for three nodes
node 1 node 2 node 1 node 2 node 3
1,10 2,10 1,10 2,10 3,20
3,20 4,20 4,20 5,30 6,40
5,30 6,40
if u selected dept_no as hash id then
for two node for three node
node 1 node 2 node1 node2 node3
1,10 3,20 1,10 3,20 5,30
2,10 4,20 2,10 4,20
5,30 6,40 6,40
Is This Answer Correct ? | 6 Yes | 6 No |
i don't know y some one ticked my ans as wrong, please give me explanation, n correct ans if i'm not correct. don't tick blindly as no.
Harikrishna, ur ans is correct if it is 4 node configuration and dept_no is key column, read the question properly he asked (1) 2 or 3 node and key column as e_id
(2) 2 or 3 node and key column is dept_id
Is This Answer Correct ? | 0 Yes | 0 No |
Hi Pranay,
Sorry I got confused with your answer thats Y i messed it up
Is This Answer Correct ? | 0 Yes | 0 No |
If U make Dept_no as Key Then data will be as below:
node 1 node2 node3 node4
1,10 3,20 5,30 6,40
2,10 4,20
Is This Answer Correct ? | 0 Yes | 1 No |
1.How do u handle NULL in sequential stage. 2.Difference between switch stage and filter stage.
can we half project in parallel jobs and half project in server jobs?
i have the source from Uk,north america how can i pass the data two tables based on the locations
hi.... am facing typical problem in every interview " I need some critical scenarios faced in real time" plz help me guys
What is use Array size in datastage
what should be ensure to run the sequence job so that if its get aborted in 10th job before 9job should get succeeded?
i have 4 jobs i want run 1job should run on 1node and 2job runon 2node and.... how to make it possible?
i have input like this Column 1 ,column 2 3,a 4,b 5.c i want output aaa bbbb ccccc Ple help any one?
what is the unix script to run the job? Please mention commands which we use often?
What is meta stage?
How can you join flat file, oracle as a sources?
Explain usage analysis in datastage?