Eliminating of duplicate records without using dynamic lookups
Answers were Sorted based on User's Feedback
Answer / pooja ahmed
1. you can you use sorter transformation .... in that you
select distinct option
2. use agg trans .... in that go for group by
usi8ng above metyhod you can remove duplicate rec...
Is This Answer Correct ? | 6 Yes | 0 No |
Answer / bsgsr
duplicate records can be eliminated the following ways:
1) by using select distinct option
2) by over riding sql with group by port on the column
which gets duplicate values ( P_key column)
3) by connecting the source to an expression transformation
and flagging the duplicate records to another target by
writing an expression in the exp editor
Is This Answer Correct ? | 3 Yes | 0 No |
Answer / raghav
with out distinct we can do it in 3 possible ways.
1. Using Group by function which will act like a distinct in
TD13.0 database
2. using SET table concept, like first we create the multi
set table and insert the duplicate rows into them and then
create the set table and use the insertselcet concept, in
this case only distinct rows will be inserted.
3. use rowid, derived table or sub query concept then we can
do this
Is This Answer Correct ? | 1 Yes | 0 No |
Do we need an ETL tool? When do we go for the tools in the market?
Assume u have a 24CPU machine with 24GB RAM, suggest how u would like to configure Informatica ,like number of concurrent sessions, RAM requirements etc,max partitions that u would permit per mapping.
can Informatica be used as a cleansing tool? If yes, give examples of transformations that can implement a data cleansing routine.
can u please tell me about ........ how can explain my project before interviewer my projects are insurence domain & sales domain?
What is factless fact schema?
what are the concerns of OLTP and DSS systems?
Identifying the bottlenecks in various components of Informatica and resolving them.
how do u set partition points in the mapping?
what is session partitioning?
how do u handle performance issues in Informatica . where can u monitor the performance ?
WHY CANT WE SEE THE DATA IN DATASET
Hi can anyone write query for the below input and output Table 1 col 1 100 Null 200 300 Table 2 col1 Null 400 700 800 o/p 200 700 800 2)sql col 1 contains 23x-5-45 26x-12-45 28x-10-50 o/p middlevalue should be greater than 10