how do we get duplicate observations in a separate dataset?
Answers were Sorted based on User's Feedback
Answer / bitla
ex:
data dup1 dup2;
set original;
by dup_var;
if first.dup_var then output dup1 else output dup2;
run;
| Is This Answer Correct ? | 11 Yes | 0 No |
Answer / mallikarjuna reddy.vanna
use DUPOUT option in proc sort statement.
| Is This Answer Correct ? | 12 Yes | 1 No |
Answer / natrajboga
use the dupout= and nodupkey options in proc sort and
followed by BY statemet with list of vars
proc sort data=xxx dupout=dup_xxx nodupkey;
by var1;
run;
/* see the log window */
proc print data=dup_xxx;
run;
| Is This Answer Correct ? | 8 Yes | 0 No |
Answer / vipin choudhary
Proc sort data = indata;
by name;
run;
Data outdata;
set indata;
by name;
if first.name and last.name then delete;
run;
proc print data = outdata;
run;
or else you can use the dupout option in proc sort
| Is This Answer Correct ? | 4 Yes | 2 No |
First sort them in descending order and then using first.var
separate all the first observations into a new dataset and
the remaining into another. So unique observations from each
group
will come into one dataset and the other duplicate
observations will enter into another dataset.
| Is This Answer Correct ? | 2 Yes | 1 No |
proc sql;
create table dup_obs as (
select * from <lib>.<dsn>
group by <dup var>,<list of other vars>
having count(*)>1 )
quit;
| Is This Answer Correct ? | 4 Yes | 3 No |
what are methods to identify duplicate observations?
Where do the database management systems store data and how do u import them.
how we can create a FLAG datasets? Ex:-ID age_group no_persons 1 to 10 10 to 20 3 11 to 20 21 to 30 7 21 to 3o 31 to 40 5
What are the data types that sas contain?
how to debug and test the sas program? : Sas-administrator
Name statements that function at both compile and execution time.
Mention few capabilities of sas framework.
I Here clinical SAS training by Ramesh Azmeera is good. Is it confirm ?
is there any differnce between proc means and proc summary?
To what type of programms have you used scratch macros?
What is the command used to find missing values?
What is by-group processing?