how to remove duplicates in transformer stage by using
stage variables?one example?
Answers were Sorted based on User's Feedback
Answer / ds
In Stage variable:
stage_variable3 <map> stage_variable1
if column=stage_variable1 than 0 else 1 <map>
stage_variable2
column <map> stage_variable3
Put stage_variable2 as constrain to target stage.
Is This Answer Correct ? | 12 Yes | 2 No |
Answer / venu
if you want to remove duplicates in transformer stage
use one of the partition technic hash partition you can
easily remove duplicatess
Is This Answer Correct ? | 8 Yes | 2 No |
Answer / peeyush sehgal
sv1=inputlink
sv2=if inputlink=sv3 then 1 else 0
sv3=sv1
Is This Answer Correct ? | 0 Yes | 2 No |
Answer / prasad
take two stage variables
sV1: Input_column
sV2: if Input_column = sV1 then 0 else 1
and put 'sV1=1' as constraint
Plz correct me, If am wrong.....
Is This Answer Correct ? | 4 Yes | 7 No |
Answer / amit
using hash partition technique, we can bring duplicate data(based on key columns) in one partition. Then in stage constraints filter out data with setting @inrownum = 1.
This will remove duplicate in transformer stage.
Is This Answer Correct ? | 0 Yes | 7 No |
Answer / subodh
duplication of transformer stage is removed b7y using a
call by referance and call by value , using we create one
object and no other duplication is done
Is This Answer Correct ? | 1 Yes | 14 No |
i hav source like this . deptno,sal 1,2000 2,3000 3,4000 1,2300 4,5000 5,1100 i want target like this target1 1,2000 3,4000 4,5000 target2 2,3000 1,2300 5,1100 with out using transformer
source 1123445 I WANT OUTPUT AS DUPLICATES TO TARGET1 LIKE TARGET1 1144 NON-DUPLICATES TO TARGET2 TARGET2 235
what is usage of datastage with materialized views
On which Dimension Table you implemented SCD Type in your Project
how to clear the source file? 2) diff b/t odbc & Oracle?what is node? 3) explain one complex job u face in ur project
What r the existing server jobs in parallalism?
Hi every one, I am Suneel. I/p o/p --- ----- suneel suneel suneel suneel suneel suneel how it will get. Please design job with explain. Thanks.
how can we perform the 2nd time extraction of client database without accepting the data which is already loaded in first time extraction
what are .ctl(control files) files ? how the dataset stage have better performance by this files?
souce file having the columns like name company krish IBM pooja TCS nandini WIPRO krish IBM pooja TCS if first row will be repeat i want the result like this name company count krish IBM 1 pooja TCS 1 nandini WIPRO 1 krish IBM 2 pooja TCS 2
1)How will u implement SCD2 by using surrogate key. 2)What are the disadvantages with surrogate key. 3)How will you handle nulls in your project for the varchar, integer data types. 4)Can I use two fact tables in star schema. 5)3 jobs are running on the 2 nodes after I added one more node so can I compile those jobs to run on three nodes.
DB2 connector> transformer > sequential file Data will be exported into a csv format in a sequential file. This file will be send in a email using a sequence job. Problem here is, how to avoid sending a blank csv file? When I ran the job there are chances that it might return zero records but in the sequence job csv file is going blank. how can I avoid this? thanks