How do you eliminate duplicates in both flat file (FF) and Oracle sources?
Answer / ravindra.15aug
For a relational (RDBMS) source, duplicates can be removed with a SQL
query in the Source Qualifier that uses ROWID, with the Distinct option
in a Sorter, or with a group by in an Aggregator or a Rank transformation.
For a flat file (FF) source the same options apply, except for the SQL
query in the Source Qualifier.
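As a rough illustration of the ROWID approach, the sketch below runs a "keep the smallest row id per group" query. It is only a stand-in: it uses Python's built-in sqlite3 module, whose implicit rowid plays the role of Oracle's ROWID here, and the table emp with columns id and name is hypothetical. The query text in an actual Source Qualifier override may need to differ.

```python
import sqlite3

# Hypothetical table and columns; SQLite's implicit rowid stands in for Oracle's ROWID.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (id INTEGER, name TEXT)")
conn.executemany(
    "INSERT INTO emp (id, name) VALUES (?, ?)",
    [(1, "x"), (1, "x"), (2, "a"), (2, "a"), (3, "c")],
)

# Keep only the row with the smallest rowid in each group of identical rows.
DEDUP_SQL = """
SELECT id, name
FROM emp
WHERE rowid IN (SELECT MIN(rowid) FROM emp GROUP BY id, name)
ORDER BY id, name
"""
unique_rows = conn.execute(DEDUP_SQL).fetchall()
print(unique_rows)
```

Grouping on every column treats two rows as duplicates only when they match completely; grouping on fewer columns would deduplicate on a key instead.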
Answer / akash
In addition to the answers above, here is one more option.
Use an Expression transformation to check whether a record is
repeating, and set an output port as a flag. For the first
occurrence of any record the flag is 1. Place a Router after
it with the condition "i_FLAG = 1". That group outputs only
the unique records; the default group outputs only the
duplicate records.
Alternatively, if only the unique records are needed, use a Filter
after the Expression with the same condition, "i_FLAG = 1".
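The flag logic can be sketched outside Informatica as follows. This is a plain-Python imitation, not mapping code: the function name flag_first_occurrences is hypothetical, and it assumes the rows are sorted first so that duplicates arrive next to each other, with the variable previous playing the part of a port that remembers the prior row.

```python
def flag_first_occurrences(rows):
    """Return (row, flag) pairs; flag is 1 for the first occurrence of a row, else 0."""
    flagged = []
    previous = None  # remembers the prior row, like a variable port would
    for row in sorted(rows):  # assumption: sorting makes duplicates adjacent
        flag = 0 if row == previous else 1
        flagged.append((row, flag))
        previous = row
    return flagged


rows = [(2, "a"), (1, "x"), (1, "x"), (3, "c"), (2, "a")]
flagged = flag_first_occurrences(rows)

# Router-style split: one group for flag = 1, everything else to the default group.
unique_group = [row for row, flag in flagged if flag == 1]
default_group = [row for row, flag in flagged if flag != 1]
print(unique_group)
print(default_group)
```

Keeping only unique_group corresponds to the Filter variant; keeping both lists corresponds to the Router variant, where the duplicates remain available for auditing.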
Answer / priyank
For a flat file, add an Aggregator transformation and group by
all the columns; this removes the duplicates from the flat file.
For Oracle, use DISTINCT in the Source Qualifier (SQ) SQL override
to remove the duplicates.
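To see why grouping by all the columns removes duplicates from a flat file, here is a small Python sketch that mimics the idea (it does not reproduce the Aggregator itself). The file content in FLAT_FILE is made-up sample data, and each record's full set of column values is used as the group key, so identical records collapse into one group.

```python
import csv
import io

# Made-up sample flat file with a header row and two duplicated records.
FLAT_FILE = "id,name\n1,x\n1,x\n2,a\n3,c\n2,a\n"

reader = csv.reader(io.StringIO(FLAT_FILE))
header = next(reader)  # skip the header row

groups = {}
for record in reader:
    # Group key = every column, so only fully identical records share a group.
    groups[tuple(record)] = record

deduplicated = list(groups.values())
print(deduplicated)
```

For the Oracle side, the equivalent effect comes from a query of the form SELECT DISTINCT over the selected columns in the SQL override.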