How would you delete duplicate observations?
Answers were Sorted based on User's Feedback
Answer / poornima
we can delete duplicate observations by using nodup or
nodupkey option in the proc sort
Example :-
Proc sort data=datasetname nodup;
run;
| Is This Answer Correct ? | 33 Yes | 12 No |
Answer / padmasri
In 3 ways we can delete the duplicate records.
1.procedure proc sort
in proc sort there are two ways to delete duplicate
observations:
* nodupkey
*noduprec
2.first. and last.
3.proc sql
these 3 ways we can delet the duplicate records in sas.
| Is This Answer Correct ? | 16 Yes | 0 No |
Answer / srinivas
there 3 options to delete duplicate obs
1. nodup
2. nodupkey
3.noduprec
if a entire record is duplicated in sense we use nodup of
nodup rec in proc sort procedure.
proc sort data=dsn nodup/noduprec;
by var;
run;
if a variable is repeated not a entire record . this time
we use nodupkey
proc sort data=dsn nodupkey;
by var;
run;
ex; in a dataset empid is repeated then use this syn. and
the empid is used in by var statement.
| Is This Answer Correct ? | 19 Yes | 5 No |
Answer / vijay
There are several ways to do this. However the easliest
code-wise is to use PROC SORT. For example:
PROC SORT DATA=mydata NODUPKEY;
BY variable;
RUN;
| Is This Answer Correct ? | 17 Yes | 7 No |
There are two ways of deleting the records from the dataset
with the help of PROC SORT.
1. Using NODUP/NODUPRECS
2. Using NODUPKEY
The first option deletes the records only if all the
variables values are repeated in the subsequent records.
The second options deletes the records only if the value of
the BY variables given in the BY clause are repeated in the
subsequent records.
| Is This Answer Correct ? | 4 Yes | 1 No |
I have 2 data sets A & B. Both have a variable called Age in them, each of them specifying a different functionality. In my program I use bot these data sets. How do I specify which Age variable I want to use?
What is the differnce between SDTM 3.1.2 to 3.1.1 version
In the following DATA step, what is needed for ‘fraction’ to print to the log? data _null_; x=1/3; if x=.3333 then put ‘fraction’; run;
What is the one statement to set the criteria of data that can be coded in any step?
What is the command used to find missing values?
what is the diff. b/w proc means and proc summary?
how can u join the two tables without using proc sql Joins and nested queries ?
In proc transpose and data step with arrays which one you pick?
0 Answers Accenture, Quintiles,
I need help in merging two different datasets. I am merging by date and I want to propagate observations from one dataset to the corresponding dates. One dataset has a unique date for each day of the month, while the other dataset has same date for different patient visits. For example I want to spread an observation on the 31DEC2008 from one dataset to several observations with the same date on a second dataset for all the patients who visited on that date. I have tried to merge the two and the result is not what I wanted. Instead I get a dataset whereby all the dates have missing values where observations from the first datset should have spread.
If you use a symput in a data step, when and where can you use the macro variable? : sas-macro
how to debug and test the sas program? : Sas-administrator
what is chi-square test? have you used that?
1 Answers Accenture, Quintiles,