How will you remove duplicate records from a flat file
without using a Sorter?
Answers were Sorted based on User's Feedback
Answer / rkumar.etl
Use an Aggregator transformation, group by all ports, and
create one port for checking the count, then pass the results
to the target tables accordingly.
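The group-by-all-ports idea can be illustrated outside Informatica with a small Python sketch. This is only an analogy under stated assumptions, not the Aggregator itself: the sample data, the column names, and the helper name dedupe_rows are all hypothetical. Each full row acts as the group key, and the count plays the role of the extra count port.

```python
import csv
import io
from collections import Counter

# Hypothetical flat-file content; in practice this would be read from disk.
SAMPLE = "id,name\n1,asha\n2,ravi\n1,asha\n3,meena\n2,ravi\n"


def dedupe_rows(rows):
    """Group by every column: one output row per distinct input row.

    Returns the distinct rows in first-seen order plus a count per row,
    similar in spirit to a count port on a group-by-all aggregation.
    """
    counts = Counter()
    for row in rows:
        counts[tuple(row)] += 1  # the whole row is the group key
    unique_rows = list(counts)   # dict order keeps first-seen order
    return unique_rows, counts


reader = csv.reader(io.StringIO(SAMPLE))
header = next(reader)            # skip the header line
unique_rows, counts = dedupe_rows(reader)

for row in unique_rows:
    print(row, "count =", counts[row])
```

The count makes it possible to route rows downstream, for example sending rows with a count above 1 to a separate duplicates target.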
| Is This Answer Correct ? | 24 Yes | 3 No |
Answer / sathish
Use an Aggregator transformation after the Source Qualifier and select all ports in the group-by clause; it will remove the duplicates.
| Is This Answer Correct ? | 5 Yes | 2 No |
Answer / ajit kumar pradhan
If it is a relational source, open the Source Qualifier
properties and check the Distinct option.
If it is a flat file, that option will not work.
In that case, click on the target definition, open the Targets menu, and click
Generate/Execute. In the dialog box that appears,
check the primary key check box. This avoids duplicate rows.
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / sathish
Put a primary key on the target; it will reject duplicate rows.
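A minimal sketch of how a primary key on the target rejects duplicates, using an in-memory SQLite table as a stand-in for the target. The table name tgt, its columns, and the helper load_with_primary_key are hypothetical, and a real Informatica session would write rejected rows to its reject file rather than a Python list.

```python
import sqlite3


def load_with_primary_key(rows):
    """Insert rows one by one; the primary key rejects repeated ids."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE tgt (id INTEGER PRIMARY KEY, name TEXT)")
    rejected = []
    for row in rows:
        try:
            conn.execute("INSERT INTO tgt (id, name) VALUES (?, ?)", row)
        except sqlite3.IntegrityError:
            # Key already present: the database refuses the row.
            rejected.append(row)
    conn.commit()
    stored = conn.execute("SELECT id, name FROM tgt ORDER BY id").fetchall()
    conn.close()
    return stored, rejected


stored, rejected = load_with_primary_key(
    [(1, "asha"), (2, "ravi"), (1, "asha"), (3, "meena")]
)
print("stored:", stored)
print("rejected:", rejected)
```

Note that the key only compares the key column, so a row with an existing id but different values in other columns would also be rejected, which is not the same as removing exact duplicates.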
| Is This Answer Correct ? | 1 Yes | 4 No |
Answer / amedela chandra sekhar
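Source definition ---> Source Qualifier ---> Expression transformation ---> Target.
Add two ports in the Expression transformation:
one variable port, v_seq, with the expression seq+1;
one output port, o_seq, with the expression v_seq.
Connect the required ports to the target.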
| Is This Answer Correct ? | 1 Yes | 4 No |
We can use the 'Select Distinct' option in the Source
Qualifier.
Can anybody please tell me whether this works or not?
| Is This Answer Correct ? | 5 Yes | 23 No |
In staging we merge the data and remove inconsistent data. In that type of situation, what will you do, and what type of functions can you use?
Code page compatibility: where are all the changes made to handle the scenario?
Without an Update Strategy, how do you insert and update?
What are the differences between service-based and product-based?
How do you split a table in half? For example, if there are 1000 records, 500 go to one table and 500 to a second table. If there are N tables, how do you split the records?
What is meant by grouping of the condition column in a Lookup transformation?
Q. We load the table on a daily basis using incremental loading. A person, Rahul, had a salary of 10000, so if I check before the run, the salary is 10000. Today there is an update that the salary is 15k, but that will be known only after the load. The broad criterion is that we do not want to show downstream teams partially updated data. What approach is needed as an ETL developer?
What is the difference between cached and uncached lookups?
Can anyone please distinguish between an inner join and an outer join?
What are the Designer tools for creating transformations?
How can we create an index after completion of the load process?
How can we eliminate duplicate values from a lookup without overriding the SQL?