How to Define Formats and Label Variables

Name, Define, and Apply Custom Formats

To create custom formats for your dataset, you will need to use the PROC FORMAT procedure.  Using the VALUE statement, you first assign a name to a format. Then, you use descriptive text to define the values of the format. Note that all assigned text names for the values must be surrounded by single quotation marks in order to be applied properly.

The sample code, which comes from the “CVXMSTR” program, below shows how to name and define a custom format. This example uses the format CVDFITLVF. (Note that you can assign any name you choose, so long as it meets the SAS specifications for a valid format name.  See a SAS manual for more information.) This format defines values 1 through 3, with each value representing the level of cardiovascular fitness.

Sample Code

proc format ;
 value CVDFITLVF
        1 = "Low"
        2 = "Moderate" ;
        3 = "High" ;
run ;

After you have named and defined a format, apply it to selected variables using the FORMAT statement in the data step of your code. Applying a format to a variable allows you to determine how the values will look in the output (e.g., Cardiovascular Fitness Level group 1 will be represented by the text “Low”). When assigning formats to variables, note that format names always come directly after variable names and MUST end with a period.

 

Apply Labels to Variables

Variables are given a text description using a LABEL statement. One way to do this is by using a SAS data step, as shown below in the sample code from the “CVXMSTR” program. User-defined labels should always be surrounded by single- or double-quotation marks.

Sample Code

data cvx;
 set cvx;
 label CVDFITLV = "Cardiovascular Fitness Level" ;
run ;