UK DATA ARCHIVE: IMPORTANT STUDY INFORMATION

Study Number 5760 - Growing Up in Scotland: Sweeps 1 to 5, 2005-2010


NEW EDITION INFORMATION

The first edition of the study (December 2007) contained data and documentation for Sweep 1, 2005. The second edition (January 2008) contained revised data for Sweep 1 with some syntax corrections and amended documentation. The third edition (April 2008) contained new versions of the data files with two additional variables (DaPSU, DaStrat) which can be used for taking account of design effects when producing standard errors. For the fourth edition (October 2008) data and documentation for Sweep 2, 2006-2007 were added to the study. For the fifth edition (December 2008), data from Sweep 1 data were revised. Variable DaHGnp01 in the child cohort data was updated, and some new derived variables (highest qualification for respondent and partner) were added to both Sweep 1 files. The documentation remained unchanged. For the sixth edition (August 2009) data and documentation for Sweep 3, 2007-2008 were added to the study. For the seventh edition (January 2010), Body Mass Index (BMI) variables were added to both the birth and cohort Sweep 2 datasets. For the eighth edition (July 2010), data and documentation for Sweep 4, conducted in 2008-2009, have been added to the study. For the ninth edition (November 2011), data and documentation from Sweep 5, conducted in 2009-2010, were added to the study. For the 10th edition (January 2012), updated data and documentation for sweeps 1-4 were added to the study. The various updates and amendments made are described in the documentation.

DATA PROCESSING NOTES


Data Archive Processing Standards

The data were processed to the UK Data Archive's A standard. A rigorous and comprehensive series of checks was carried out to ensure the quality of the data and documentation.�Firstly, checks were made that the number of cases and variables matched the depositor's records. Secondly, checks were made that all variables had variable labels and all nominal (categorical) variables had value labels. Where possible, either with reference to the documentation and/or in communication with the depositor, absent labels were created. Thirdly, logical checks were performed to ensure that nominal (categorical) variables had values within the range defined (either by value labels or in the depositor's documentation). Lastly, any data or documentation that breached confidentiality rules were altered or suppressed to preserve anonymity.

All notable and/or outstanding problems discovered are detailed under the 'Data and documentation problems' heading below.

Data and documentation problems

Variable 'DaCman01' on the Sweep 1 birth cohort data file has no label for value 20.

Data conversion information

From January 2003 onwards, almost all data conversions have been performed using software developed by the UK Data Archive. This enables standardisation of the conversion methods and ensures optimal data quality. In addition to its own data processing/conversion code, this software uses the SPSS and StatTransfer command processors to perform certain format translations. Although data conversion is automated, all data files are also subject to visual inspection by a member of the Archive�s Data Services team.

With some format conversions, data, and more especially internal metadata (i.e. variable labels, value labels, missing value definitions, data type information), will inevitably be lost or truncated owing to the differential limits of the proprietary formats. A UK Data Archive Data Dictionary file (generally in Rich Text Format (RTF)) is usually provided for each data file, enabling viewing and searching of the internal metadata as it existed in the originating format. These files are called: [data file name]_UKDA_Data_Dictionary.rtf

Important information about the data format supplied

The links below provide important information about the Archive's data supply formats. Some of this information is specific to the ingest format of the data, i.e. the format in which the Archive received the data from the depositor. The ingest format for this study was SPSS

Please follow the appropriate link below to see information on your chosen supply (download) format.

SPSS (*.sav)

STATA (*.dta)
Tab-delimited text (*.tab)
MS Excel (*.xls/*.xslx)
SAS (*.sas7bdat and *.sas)
MS Access (*.mdb/*.mdbx)

Conversion of documentation formats

The documentation supplied with Archive studies is usually converted to Adobe Portable Document Format (PDF), with documents bookmarked to aid navigation. The vast majority of PDF files are generated from MS Word, RTF, Excel or plain text (.txt) source files, though PDF documentation for older studies in the collection may have been created from scanned paper documents. Occasionally, some documentation cannot be usefully converted to PDF (e.g. MS Excel files with wide worksheets) and this is usually supplied in the original or a more appropriate format.