how to cleansing data
Answers were Sorted based on User's Feedback
Answer / navin
Data cleansing means converting non unique data format into unique format .This is performed in Transformer stage.
| Is This Answer Correct ? | 6 Yes | 1 No |
Answer / satyanarayana
In this removes the unwanted data(Bad records OR NULL
Values) and find the inconsistent data and make it
consistent data.
Example:
LOc
---
Hyd
Hyderabad
hyde
After Cleansing
Loc
---
Hyderabad
Hyderabad
Hyderabad
| Is This Answer Correct ? | 4 Yes | 1 No |
Answer / usha
Data cleansing means removing unwanted spaces.
By using LTRim,Rtrim functions we can remove unwanted space
| Is This Answer Correct ? | 3 Yes | 1 No |
Answer / krish
it is process of correcting the inconsitency data and make consitent format
| Is This Answer Correct ? | 1 Yes | 0 No |
Answer / venkatesh k
Data cleansing means performing all the de-dupe rules according to the requirements and make your data unique.For cleansing operations mainly we will use transformer,sort stage,aggregator and look up.
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / b.rambabu
data cleansing is a process of identifing the the data
inconsistency and inaccuracies
ex:
data inaccuracy:
hyd
Hydrabad
after
hydrabad
hydrabad
data inconsistency
10.78
10.23465
after
10.27
10.23
| Is This Answer Correct ? | 1 Yes | 2 No |
i have a project manager round on sat this week can you post what are the main question i have to check.if you have any idea regular question on project pls send me. thanks in advance
how can we join one oracle & flat files ?
What are routines in datastage?
How can we improve the performance in datastage?
How to remove duplicates in transformer stage? in parallel mode
If the job aborted in a sequencer, how can we start that from the previews successful job.
What are the primary usages of datastage tool?
How u implement the slowly changing dimensions if my source table is consisting of cid,cname,add,phno,email but i need to capture the changes for first three columns how u implement?
if i have two tables table1 table2 1a 1a,b,c,d 1b 2a,b,c,d,e 1c 1d 2a 2b 2c 2d 2e how can i get data as same as in tables? how can i implement scd typ1 and type2 in both server and in parallel? field1 field2 field3 suresh , 10,324 , 355 , 1234 ram , 23,456 , 450 , 456 balu ,40,346,23 , 275, 5678 how to remove the duplicate rows,inthe fields?
disign the complex job in u r project?(they are aksing only complex job design and then data flow...)
Hi guys, Please design a job for dis requirement with derivation(solution). my source table like dis. emp_no qualification 1 a 1 c 2 a 3 c 3 b To loaded to target like dis emp_no qualification 1 b 2 b 2 c 3 a my requirement is every employer have three qualifications i.e a,b and c. what qualification missed in source table that will be move to target systems. Hope u got it the requirement. Right Thanks.
Which algorithm you used for your hashfile?