The first task is to identify missing data and recode it. Here are the steps:
In this step, you will use the proc means procedure to check for missing, minimum and maximum values of continuous variables, and the proc freq procedure to look at the frequency distribution of categorical variables in your master analytic dataset. The output from these procedures provides the number and frequency of missing values for each variable listed in the procedure statement.
Typically, proc means is used for continuous variables, and proc freq is used for categorical variables. In the following example, we provide proc means and proc freq procedures on the same set of variables without distinguishing continuous and categorical variables. If you perform a proc freq on a continuous variable with many values, the output could be extensive.
Statements | Explanation |
---|---|
data =demo1_nh1 N nmiss min max ; | Use the proc means procedure to determine the number of missing observations (nmiss), minimum values (min), and maximum values (max) for the selected variables. |
where N1BM0101 >= Span class="teal">20 |