How to eliminate duplicates in FF and oracle both
Answers were Sorted based on User's Feedback
Answer / ravindra.15aug
In RDBMS we can write sql queary in Source qualifer by
using rowid, in sorter by using select distinct option, In
aggregator by using groupby, in Rank by using groupby.
In FF level also we can do except sql queary in source
qualifier.
| Is This Answer Correct ? | 9 Yes | 0 No |
Answer / akash
In addition to the above answers I will add one more option.
We can use expression to check whether a record is
repeating or not and set an output port as flag. For first
occurence of any record this flag is 1. Use router after
this with condition "i_FLAG = 1". This output will give
only unique records. Default output will give only
duplicate records.
Also we can use a filter after the above expression with
the condition "i_FLAG = 1"
| Is This Answer Correct ? | 3 Yes | 0 No |
Answer / priyank
For FF, place an Aggregator transformation, do a group by
on all the columns, it will remove the duplicates from the
flat file.
In Oracle, use distinct in SQ sql override to remove the
duplicates.
| Is This Answer Correct ? | 1 Yes | 0 No |
enterprise datawarehouse your project phase by phase explain?
HOW TO YOU LOAD TIME DIMENSION
performance wise which one is better in joiner and lookup transformation?why?explain clearly?
while for 100 records in source table loaded sucessfully in trgt table . assume ,session will take 10min or 5 min to successfully succeeded. then 100 million records r there in source how much time will take by session to succeeded. there no fail ok.trgt table will load 100 million records with out any errors . don't tell perfect time . assume your self how much time to succeeded?
Can you access a repository created in previous version of Informatica?
In a mapping i have three dimensions. If i want to pass a same surrogate key value to all the three dimensions by using one sequence generator is possible?If the mapping is containing single flow? And in the same case if the mapping is contaning 3 flows for the three dimensions then by using one sequence generator can we populate surrogate key (same value) to all the three dimensions?
every DWH must have time dimension so now what is the use of the time dimension how we can calculate sales for one month,half-yr'ly,and year'ly?how we are doing this using time dimension.
How to do unit testing in informatica?
How can we get multiple output values using an unconnected lookup transformation?
have you developed documents in your project? and what documents we develop in realtime?
what is confirmed fact?
what are surrogate keys?