What is data merging, data cleansing and sampling?
Answers were Sorted based on User's Feedback
Answer / rekha
DATAMERGING : IT IS THE PROCESS OF INTEGRATING THA DATA
WITH SIMLIAR SOURCE ,STRUCTURE AND TYPE
DATA CLEANSING : IT IS THE PROCESS OF IDENTIFING AND
CHANGING THE INCONSISTANCIES AND INACCURACIES
DATA SAMPLING : IT IS A PROCESS , ORBITARILY READING THE
DATA FROM GROP OF RECORDS
Is This Answer Correct ? | 10 Yes | 0 No |
Answer / dev
Data Cleansing: A two step process of detection and
correction of errors in a data set.
Is This Answer Correct ? | 11 Yes | 4 No |
Answer / amsarasu
data merging :multiple detailes values are summarised into
single summaeised value.
data cleansing:to eliminate the inconsistant data
sampling:it is the process ,orbitarly reading the data from
group of records.
Is This Answer Correct ? | 6 Yes | 2 No |
Answer / sudheer
The main thing Merging of data is nothing but integrating from multiple source systems. It is in 2 types
1.Horizontal merging(Join)
2.Vertical Merging(Union)
Is This Answer Correct ? | 3 Yes | 0 No |
Answer / sridhar
Data Merging:The process of standing the Structure Of The table(table name,column name,column type.
Is This Answer Correct ? | 0 Yes | 0 No |
Answer / sarat
datacleaging:it is the process of identifying and changing
inconsistency and inacquries
datamerging:it is process of integreated multiple
inputsource into singleoutput with similar srtucture and
datatype
Is This Answer Correct ? | 1 Yes | 2 No |
What is target load order?
What is up date strategy and what are the options for update strategy?
what is diffrence b/w joner and union transfermation
Workflow is long running due to long running sql query so when we refer the query plan it tells the issue is due to partition of the db table. How to handle this?
How to load the source table into flat file target(with columns) in informatica?
Can anyone give some input on "Additional Concurrent Pipelines for Lookup Cache Creation" ? I know that this property is used to build caches in a mapping concurently. But which values should I set into this ( i.e. 1 or 2 or 3 or something else ) for concurrent cache building ?
How to do the error handling of if ur source is flatfiles?
which one is better performance wise joiner or look up
What is active and passive transformation?
following scenario empsal table i want who exist one lakshs sal above monthwise? ` empsal empid monthyear sal 1 jan2008 1000 2 march2009 50000 3 april2009 4000 4 feb2009 100000 5 jul2009 600000 6 dec 2008 90000
3. Suppose Seq Gen is supplying a increamental value to a column of a table, suppose, table's column value reaches to maximum value, then what will happen, will the session fail? If it is the situation, then what should be done so that we can stop this kind of situation in advance?
What are the different options used to configure the sequential batches?