How to eliminate duplicates in both flat files (FF) and Oracle?
Answer / ravindra.15aug
For an RDBMS source, we can write a SQL query in the Source Qualifier
using ROWID, use the Distinct option in a Sorter, group by in an
Aggregator, or group by in a Rank.
For a flat file (FF) source, all of these work except the SQL query in
the Source Qualifier.
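The ROWID idea can be sketched roughly as below. This is only an illustration: it uses Python's built-in sqlite3 module as a stand-in for Oracle (SQLite also exposes a rowid for each row), and the table name emp and its columns are hypothetical. The query shape, keeping the row with MIN(rowid) in each group of identical values, is the kind of statement that could go into a Source Qualifier SQL override, but the Oracle side is not verified here.

```python
import sqlite3

def dedupe_by_rowid(rows):
    """Return rows with duplicates removed, keeping the first inserted copy."""
    conn = sqlite3.connect(":memory:")
    # Hypothetical table; it stands in for an Oracle source table.
    conn.execute("CREATE TABLE emp (id INTEGER, name TEXT)")
    conn.executemany("INSERT INTO emp VALUES (?, ?)", rows)
    # Keep only the row with the smallest rowid in each group of identical values.
    query = """
        SELECT id, name
        FROM emp
        WHERE rowid IN (SELECT MIN(rowid) FROM emp GROUP BY id, name)
        ORDER BY rowid
    """
    result = conn.execute(query).fetchall()
    conn.close()
    return result

unique_rows = dedupe_by_rowid([(1, "x"), (1, "x"), (2, "a"), (1, "y"), (2, "a")])
print(unique_rows)
```

Grouping on every column means two rows count as duplicates only when all of their values match.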
Answer / akash
In addition to the above answers, I will add one more option.
We can use an Expression transformation to check whether a record is
repeating and set an output port as a flag. For the first
occurrence of any record this flag is 1. Use a Router after
this with the condition "i_FLAG = 1". That group will output
only unique records; the default group will output only
duplicate records.
Alternatively, if the duplicates are not needed, we can use a Filter
after the above Expression with the condition "i_FLAG = 1".
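The flag-and-route logic described here can be sketched in plain Python as an analogy; it is not Informatica expression syntax. The answer does not say how the Expression detects a repeat, so the set of already-seen records below is an assumption, and the function names are hypothetical.

```python
def flag_records(records):
    """Yield (record, flag): flag is 1 for a first occurrence, 0 for a repeat."""
    seen = set()  # assumption: repeats are detected by remembering earlier records
    for record in records:
        key = tuple(record)
        flag = 0 if key in seen else 1
        seen.add(key)
        yield record, flag

def route(records):
    """Split records like the Router: flag == 1 group versus the default group."""
    unique, duplicates = [], []
    for record, flag in flag_records(records):
        (unique if flag == 1 else duplicates).append(record)
    return unique, duplicates

unique, duplicates = route([("1", "x"), ("1", "x"), ("2", "a"), ("1", "x")])
print(unique)
print(duplicates)
```

Keeping only the first list corresponds to the Filter variant, where the repeated records are simply dropped.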
Answer / priyank
For a flat file (FF), place an Aggregator transformation and group by
all the columns; this removes the duplicates from the flat file.
In Oracle, use DISTINCT in the Source Qualifier (SQ) SQL override to
remove the duplicates.
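As a rough analogy for grouping a flat file on all of its columns, the sketch below reads delimited text and keeps one row per distinct combination of values. The function name, the comma delimiter, and the sample data are hypothetical, and a real Aggregator may return the groups in a different order.

```python
import csv
import io

def dedupe_flat_file(text, delimiter=","):
    """Group on all columns and keep one row per group, in first-seen order."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    groups = {}
    for row in reader:
        # The whole row is the group key, as when every port is a group-by port.
        groups.setdefault(tuple(row), row)
    return list(groups.values())

sample = "1,x\n1,x\n2,a\n1,y\n2,a\n"
rows = dedupe_flat_file(sample)
print(rows)
```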