Dataset. The term dataset or "data set" is used in specific ways in different
contexts.
ICPSR defines a dataset as "a collection of data records" and
uses this term to encompass a file or group of
files associated with one part of a study. Files
associated with a dataset might include a data file, a machine-readable
codebook, SPSS control cards,
and other files related to the data file. Examples: the files associated
with California might be considered one dataset in the 1990 Census of
Population and Housing STF 1A study; likewise, the files associated with
the First Congress, House of Representatives, might be considered one
dataset in the study "Congressional Roll Call Voting Records."
SAS. In the SAS statistical software, a SAS "data set" is the
internal representation of data. Raw data read by SAS command
statements are converted into a SAS Data Set before SAS can use them.
SAS Data Sets have specific filename extensions for different operating
systems; e.g., a SAS 6.12 Data Set created in Unix has the filename
extension ".ssd01" and in Windows, ".sd2".
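
The extension convention above can be sketched as a small lookup. This is only an illustration in Python, not part of SAS: the function name sas_612_platform and the table SAS_612_EXTENSIONS are hypothetical, and the table holds only the two SAS 6.12 extensions named in this entry.

```python
from pathlib import PurePath

# Only the two SAS 6.12 extensions mentioned in this entry are listed;
# other SAS versions and platforms are deliberately left out.
SAS_612_EXTENSIONS = {
    ".ssd01": "Unix",
    ".sd2": "Windows",
}


def sas_612_platform(filename):
    """Return the platform implied by a SAS 6.12 Data Set extension, or None."""
    suffix = PurePath(filename).suffix.lower()
    return SAS_612_EXTENSIONS.get(suffix)


if __name__ == "__main__":
    for name in ("survey.ssd01", "survey.sd2", "codebook.txt"):
        print(name, sas_612_platform(name))
```

A filename with any other extension returns None, which avoids guessing about formats this entry does not describe.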