There are three parts of the data documentation for each data file that you will consult to finish gathering background information on your variables, the
The codebook portion lists all the variables in the data file. Use it to determine what the values associated with a variable mean. It includes the
The Codebook and Frequencies are contained in the Data Documentation file.
Use the data file documentation to determine if the collection or measurement is appropriate for your analysis. The data file documentation outlines
For example, in the previous task you noted that there were two data files with cholesterol variables — Total Cholesterol and HDL Cholesterol (LBXTC) and Standard Biochemistry Profile (LBXSCH). Reading the Analytic Notes in the data file documentation, you will find that the LBXTC variable in Total Cholesterol and HDL Cholesterol is the appropriate variable to use.
The frequency files for each data file contains the frequency count for each item in the data file and can be used to verify the sample size for an particular data item. The frequency counts are found with the codebook in the documentation file.