how to cleansing data
Answers were Sorted based on User's Feedback
Answer / navin
Data cleansing means converting non unique data format into unique format .This is performed in Transformer stage.
| Is This Answer Correct ? | 6 Yes | 1 No |
Answer / satyanarayana
In this removes the unwanted data(Bad records OR NULL
Values) and find the inconsistent data and make it
consistent data.
Example:
LOc
---
Hyd
Hyderabad
hyde
After Cleansing
Loc
---
Hyderabad
Hyderabad
Hyderabad
| Is This Answer Correct ? | 4 Yes | 1 No |
Answer / usha
Data cleansing means removing unwanted spaces.
By using LTRim,Rtrim functions we can remove unwanted space
| Is This Answer Correct ? | 3 Yes | 1 No |
Answer / krish
it is process of correcting the inconsitency data and make consitent format
| Is This Answer Correct ? | 1 Yes | 0 No |
Answer / venkatesh k
Data cleansing means performing all the de-dupe rules according to the requirements and make your data unique.For cleansing operations mainly we will use transformer,sort stage,aggregator and look up.
| Is This Answer Correct ? | 0 Yes | 0 No |
Answer / b.rambabu
data cleansing is a process of identifing the the data
inconsistency and inaccuracies
ex:
data inaccuracy:
hyd
Hydrabad
after
hydrabad
hydrabad
data inconsistency
10.78
10.23465
after
10.27
10.23
| Is This Answer Correct ? | 1 Yes | 2 No |
Why we need datasets ratherthan sequential files?
What is a quality stage in datastage tool?
How can we run same job in 1 day 2 times
I have 2 Files like fileA fileB Output1 Output2 Output3 1 6 1 6 11 2 7 2 7 12 3 8 3 8 13 4 9 4 9 14 5 10 5 10 15 6 11 7 12 8 13 9 14 10 15 please let know
What are the main differences you have observed between 7.x and 8.x version of datastage?
How to create a file using vi editor? 2)how to delete a file in vi editor? 3)How to connect the server datastage to unix? what r the command lines we r using? 4)30 jobs r runnig in unix i want to find out my job. how to do this? give me command?
Emp login_timestamp Logout_timestamp A,2019-02-01 02:24:15,2019-02-01 04:59:42 B,2019-03-29 14:43:30,2019-03-29 20:22:00 ABC,2019-03-29 12:43:00,2019-03-29 23:22:59 In the above calculate the duration of hours spent in office for each emp in datastage.
by using dsjob..we can run only one job at a time?how can u run multiple jobs at a time in unix?
What is the diff between sort performed at sort stage and the stream sort performed at the input of few stages in DS Enterprise edition?
What is a datastage job?
1.what is materialized data? 2.how to view the materialized data?
Give an idea of system variables.