ESDS Government FAQ
Questions:
1. What are the large-scale
government surveys?
2. Who can obtain survey
data?
3. How much will it
cost?
4. How do I gain access
to the survey data?
5. What formats are the
data available in?
6. Are there any restrictions
on the use of the survey data?
7. What information
is available in the surveys?
8. Can I order publications
and obtain statistics?
9. What can I expect
of the ESDS Government Data Service?
10. Who can use ESDS Government
services?
11. How do I get help
if I am stuck?
12. Where can I get
the large-scale government survey questionnaires from?
13. What are missing
values?
14. Should I include
missing values in my analysis?
15. Why am I not getting
the same numbers in my analysis as the published results?
16. What is the ILO
definition of unemployment?
17. Which years
were missed from the GHS?
18. Why do we weight
data?
19. Why are response
rates decreasing on surveys?
20. Who are the non-responders
in surveys?
21. How do I use
SPSS and STATA?
22. Are there are
training courses on how to use SPSS and STATA?
23. What is the
Continuous Population Survey?
24. Can I use the
data for teaching?
25. I want to carry
out a time-series analysis but some of the variables have changed over time.
Are there any variables that are consistent over time?
26. What is the
Integrated Household Survey? (formerly Continuous Population Survey)
27. How can
I find out who has been researching using a specific survey?
28. What are
the ESDS Government user meetings and are they for me?
29. What is a special licence and special licence data?
30. What is the new GHS(L)?
31. When did the QLFS shift from seasonal quarters to calendar quarters?
32. How do I use the weights in the 2004 ethnic boost dataset if combining the general population and ethnic boost files?
33. Can I use multiple imputation to deal with missing data in the Health Survey for England?
34. How do I use the Health Survey for England individual weights if I’m pooling the data from the 1999 and 2004 datasets (ethnic boost years)?
35. Where do I get statistics on...?
FAQs from individual surveys
Labour
Force Survey FAQs
Northern
Ireland Labour Force Survey FAQs
General
Household Survey FAQs
Continuous
Household Survey FAQs
Family
Expenditure Survey FAQs
Northern
Ireland Family Expenditure Survey FAQs
National
Food Survey/Expenditure and Food Survey FAQs
Family
Resources Survey FAQs
Omnibus
Survey FAQs
Survey
of English Housing FAQs
Health
Survey for England FAQs
Welsh
Health Survey FAQs
Scottish
Health Survey FAQs
British
Crime Survey FAQs
Scottish
Crime Survey FAQs
British
Social Attitudes FAQs
Scottish
Social Attitudes FAQs
Northern
Ireland Life and Times Survey FAQS
Young
People’s Social Attitudes Survey FAQs
National
Travel Survey FAQs
Time
Use Survey FAQs
1. What
are the large-scale government surveys?
The large-scale government surveys
are large datasets collected either by the Office
for National Statistics or other agencies,
such as the National Centre for Social Research.
The surveys are principally designed for government information purposes but
are available to researchers for secondary analysis. They are key data sources
for social scientists and include surveys such as the Labour Force Survey, General
Household Survey, Family Resources Survey and Expenditure and Food Survey. A
full list of the surveys covered by the ESDS Government Data Service is available
from the survey pages
of the ESDS Government web site. You can find full descriptions of the surveys,
together with documentation, on the web site - you will need to search for the
survey on the homepage or follow the link from Major Studies to the survey you
are interested in. There are other large-scale government surveys which are
not covered by ESDS Government but are available from the general ESDS
service.
2. Who can
obtain survey data?
Researchers, students and teachers
from all fields and types of organisation may register with the Economic and
Social Data Service (ESDS) and obtain survey data. However, some datasets may
have restrictions on access. For example, commercial usage may be restricted,
permission may be required from the depositor or publications may need to be
vetted by the sponsoring organisation.
3. How
much will it cost?
All users requiring data for non-commercial purposes
can download data free of charge. Where data is required for commercial purposes
there is a fee - see Charges
information
4. How do
I gain access to the survey data or explore it online?
To access survey data, all users must register with the Economic and
Social Data Service (ESDS). Information about obtaining data from the ESDS can
be found on Accessing
data page.
Registered users can download/order
a large and increasing number of datasets direct from the web site (usually
in SPSS, STATA or tab-delimited formats) via its online catalogue record and
via the download/order section of the Major
Studies web pages.
An increasing number of datasets are
also available to most registered users in the Nesstar
system, which will allow you to explore the data online and to do basic exploratory
analysis before choosing to download all, or a subset, of the data.
5. What
formats are the data available in?
Most datasets are available in SPSS,
STATA and tab delimited formats (suitable for use in Excel) and can also be
converted to other formats such as SAS. Other systems, including Nesstar,
may also provide additional formats, such as Statistica, DIF (suitable for use
in Excel), NSDStat and Dbase.
6. Are there
restrictions on the use of the survey data?
Restrictions on the use of the surveys
are outlined in the End
User Licence that all users sign
when registering for an ESDS account. In particular there is a fundamental restriction
on the confidentiality of data. Users should not attempt to use the data to
deliberately compromise the confidentiality of individuals or households and
are required to abide by the current Data Protection Act. This agreement also
covers requirements for citation of publications and safeguarding of data. Sharing
Data with colleagues, other researchers
or students must always be authorised by ESDS. Certain datasets may also require
depositor permission. Some datasets which pose a higher risk of disclosure require
a Special Licence (SL). This licence has additional conditions. For example,
the SL requires the signature(s) of the researcher(s) and the institution with
responsibility for the researcher. It also needs the explicit permission of
the data owner to release the data to the researcher(s). See FAQ 29 for more
detail on special licences.
7. What
information is available in the surveys?
A wide variety of information is available
from the surveys. Most surveys cover a specific topic, for example the Labour
Force Survey (LFS) collects detailed data on economic activity and the Health
Survey for England (HSE) collects information about the nation’s health
and the number of people with specified health conditions among other things.
In addition, each survey asks a number of core questions such as sex, age, ethnic
group, housing tenure. The ESDS
Government and Survey Question Bank
web sites give full descriptions of the surveys. There is also work on the harmonisation
of concepts and questions by ONS
and on derived variables over
time by ESDS Government.
8. Can
I order publications and obtain statistics?
The Economic and Social Data Service
(ESDS) is not able to supply copies of publications (other than User Guides
accompanying the data). However, the ESDS Government web pages contain a searchable
publications database
which cite the ESDS Government surveys. You can search on keywords such as 'employment'
and can specify whether you want to search on all the supported surveys or specific
surveys. Results can be ordered by date or author. In addition, within each
specific survey page the green box titled ‘links’ generally provides
links to recent reports for that survey.
ESDS does not typically supply tailor-made
statistics, although it will undertake certain consultancies at cost. A number
of datasets are also available to most registered users to analyse, subset and
download online via the Nesstar
service, where basic frequency counts are freely available to all users.
9. What
can I expect of the ESDS Government Data Service?
ESDS Government, led by CCSR,
aims to exchange information between users and producers, enlarge the user base
and provide user support. This includes:
- a JISCmail list for parties interested in the large government surveys,
for discussion and to receive updates. To join the list, you simply need to
either go to the ESDS-govsurveys
JISCmail web site or alternatively
you can send an email to listserv@jiscmail.ac.uk
with the following text: subscribe esds-govsurveys firstname secondname
- web-based Frequently Asked Questions
- a twice-yearly newsletter
- products that provide easy routes through the complex documentation of the
government surveys such as introductory guides and indexes of common variables
- workshops, conferences and courses on: the government datasets and their
potential; key topics of interest; specific statistical packages, particularly
SPSS and STATA; methods of statistical analysis; some of the unique aspects
of the datasets; topic-related online course materials
- a range of teaching datasets
- a searchable database of publications which cite the government surveys
Further information, including the
list of datasets supported by the service, can be found on the ESDS
web site.
10. Who
can use ESDS Government services?
ESDS Government is funded by the ESRC
and the Joint Information Systems Committee to support UK academic users of
the large-scale government surveys. However, non-academic users and overseas
academics have access to all our materials, such as the SPSS guide, the newsletter,
all other online documents and are very welcome to attend the workshops and
events we organise. ESDS Government will also provide quick answers to helpdesk
queries from non-academics and overseas academics. However, lengthy helpdesk
queries requiring one-to-one attention can only be provided to UK academics.
11. How
do I get help if I am stuck?
You can email us at govsurveys@esds.ac.uk
or telephone us on +44 (0)161-275-1980.
12. Where
can I get the large-scale government survey questionnaires from?
Questionnaires and other survey documentation
can be found on the web site (you do not have to be registered to use this facility).
The Survey Question Bank
based at the University of Surrey also provides free online questionnaire and
related material, e.g. diaries, for most of the large-scale government surveys
from 1991 onwards. Often questionnaires can be found in the back of published
reports. If you cannot find the questionnaire you require email us at govsurveys@esds.ac.uk
or telephone us on +44 (0)161-275-1980.
13.
What are missing values?
Missing values are categories in variables
for which there are not valid responses. Missing values are specified for a
number of reasons and usually have specific codes within the large-scale government
surveys. Data can be specified as missing for the following reasons:
- the question does not apply to the respondent (DNA). For example someone
who said that they were a non-smoker would not be asked the question ‘How
many cigarettes do you smoke on average per week?’. DNA is often coded –9
or 99
- the respondent has simply not given an answer (NA). This is often coded
–8 or 98
- other reasons such as proxy response, refused entire section and so on.
These are often coded as other values such as –6 and –7
- you should check the survey documentation rather than assuming that the
codes given above are correct.
14.
Should I include missing values in my analysis?
You should never include missing values
coded as DNA (see FAQ 12 above) because the question is not relevant to these
cases. You may or may not want to include other missing values such as NA (see
FAQ 12 above). Care should be taken when excluding missing values if they account
for a large proportion of the answers because it could lead to the misinterpretation
of results. The fact that a large proportion of respondents did not answer the
question may be an interesting finding in itself. For example:
Have
you ever used an illicit drug? |
Response
|
N |
Missing
answers included (%) |
Missing
answers excluded (%) |
Yes |
50 |
10%
|
25%
|
No |
150 |
30%
|
75%
|
NA |
300 |
60%
|
- |
If missing answers are included then
you report that 60% of respondents chose not to answer this question. However,
if missing answers are excluded you report that 75% of respondents have never
used an illicit drug. These two findings have very different implications.
15.
Why am I not getting the same numbers in my analysis as the published results?
You may not always get exactly the
same numbers as the published results because of small things you may have done
differently in your data preparation. However, one would expect your results
to be very similar to the published results. If your findings are significantly
different there could be a number of explanations:
• you may not be comparing the same variables
• you may not have dealt with missing values in the same way as the
published results
• there may be a filter variable to be applied (a filter variable is
used where analysis is performed on a subset of the data, for example, the
analysis may have been run selected ages, such as the working aged population
only).
• there may be a weighting variable to be applied.
16.
What is the ILO definition of unemployment?
Most of the large-scale government
surveys use definitions of employment, unemployment and economic activity and
inactivity that have been internationally agreed by the ILO (International Labour
Organisation). To be unemployed, according to the ILO definition you must be:
not in employment but available to start within two weeks, and have either looked
for work in the last 4 weeks or waiting to start a new job. ILO employment includes
those who are in paid work either as an employee or self-employed and those
on a government-supported training scheme. For more information see ONS
harmonisation web site .
17.
Which years were missed from the GHS?
The General Household Survey has been
carried out annually since 1971, with the exception of two years: 1997 (when
it was being reviewed) and 1999 (when it was being re-developed). In 2005 the
GHS(L) was introduced – see FAQ 30 for more information.
18.
Why do we weight data?
It is common for social surveys to
use some form of weighting; this is for a number of reasons:
to correct for bias caused by
particular groups in the population not responding to the survey. This is
called non-response weighting and often uses figures from
the census to provide a correct population distribution, for example in the
2001 General Household Survey
to remove bias caused by the
use of different selection probabilities. For example, in surveys where only
one adult per household is interviewed, those living in households with more
than one adult will have a lower probability (or less of a chance) of being
selected than those adults living on their own. The 2000 British Crime Survey
and the 2001 British Social Attitudes Survey both use this type of weighting
to make sure that the proportions
of people in each age group, sex and region, for example, are the same in
the sample as they are in the whole population. This is called calibration
weighting (or population based weighting)
to gross the survey to population
totals, so you can say how many people in the UK watch TV every day for example,
rather than just what proportion (or %). This is called grossing
and is used in the Labour Force Survey.
In all such cases, the samples do not
accurately reflect the general population. The purpose of weighting is to adjust
the sample so that the sample profile on key variables reflects that of the
population. Weighting involves statistically increasing or decreasing the numbers
of cases with particular characteristics so that the proportion of cases in
the sample is adjusted to the population proportion. For more detailed information
on weighting please refer to the Weighting
the Social Surveys guide on the ESDS Government web site .
19.
Why are response rates decreasing on surveys?
Non-contact with the sampled respondent
and refusal to participate are the two reasons why response rates have fallen
across surveys in recent years. Tactics used by survey organisations, such as
calling more frequently and at times of the day when people are more likely
to be at home, help to reduce non-contact rates. However, the increase in refusal
to participate is more difficult. Much of the increase in refusals seems to
be attributable to increased reluctance among members of the public to take
part in surveys. These general attitudes are hard to change. Instead survey
organisations concentrate on the aspects they have some control over, namely
how they approach individuals, how interviewers introduce themselves on the
doorstep, and how individuals might be persuaded to participate.
20. Who
are the non-responders in surveys?
In practice, relatively little is
known about the characteristics of non-responders. On most surveys the characteristics
of non-respondents are ‘inferred’ by comparing the survey profile with the population
profile derived from official statistics such as the Census. But, in other than
census years, this tends to restrict us to comparisons of the age and sex profile.
These comparisons typically demonstrate that young men are disproportionately
missing from surveys; what they don’t show is whether the missing young men
have different characteristics to the responding young men. Other studies have
shown that surveys typically under-represent those with lower educational qualifications,
adults without dependent children, and those living in London. Knowing about
the characteristics of non-respondents allows survey researchers to reduce bias
by applying non-response weights to the data. See FAQ 17 for more details on
non-response weights.
21.
How do I use SPSS and STATA?
ESDS Government has written introductory
guides to using SPSS and STATA. These can be found on the resources
page of the ESDS web site.
22.
Are there are training courses on how to use SPSS and STATA?
The Cathie Marsh Centre for Census
and Survey Research (CCSR) at the University of Manchester regularly hold short
course on SPSS and STATA at varying levels (introduction, advanced etc). The
CCSR web site
holds the information on the short courses, including how to apply for a bursary.
23.
What is the Continuous Population Survey?
The Continuous Population Survey (CPS)
is now known as the Integrated Household Survey (IHS). See FAQ 26 for details.
24.
Can I use the data for teaching?
The datasets can be useful resources
for teaching. You can use the datasets for hands-on exercises in methods courses
such as questionnaire design or statistics ESDS Government have created a number
of teaching datasets for this
purpose using the Labour Force Survey, British Crime Survey, Health Survey for
England and the General Household Survey (social capital teaching set). If you
wish to use one of the teaching datasets or any of the ESDS Government datasets
with your students as part of a taught course you should read the information
on Sharing Data on the ESDS web
site.
You could also use the datasets as research design examples in methods lectures
or alternatively you may wish to use the data for substantive courses, for example
if you wanted to show that certain ethnic minorities have high unemployment
rates you could use the data yourself to illustrate this or use existing exemplars
from published research.
25.
I want to carry out a time-series analysis but some of the variables have changed
over time. Are there any variables that are consistent over time?
ESDS Government has carried out some
work to document information about the consistency of key variables over time
on the General Household Survey and Labour Force Survey, and to provide syntax
to derive consistent variables. The initial work is available from the ESDS
Government web site and other variables
will be added from time to time. Additionally, a policy of harmonisation since
the early 1990s has greatly increased the consistency of definitions and question
wording across key variables (but not all variables) on the main government
surveys - for more information see the ONS
harmonisation web site.
26.
What is the Integrated Household Survey? (formerly Continuous Population Survey)
Fieldwork for the Integrated Household Survey (IHS) (formerly the Continuous
Population Survey (CPS)) is proposed to begin in January 2008. The design involves
the integration of the household surveys on which the Office for National Statistics
(ONS) leads into one IHS (CPS). The surveys included for integration are:
• Labour Force Survey (LFS) and associated boosts (APS)
• General Household Survey (GHS)
• Expenditure and Food Survey (EFS)
• Omnibus Survey (OMN)
Two consultation meetings on the IHS (CPS) took place in January 2006 and March
2007 at the Royal Statistical Society, London. These meetings provided opportunities
to find out the latest developments from ONS. They also provided opportunities
to ensure that users needs were heard and considered. The content of the core
outputs and each interview combination were discussed among other things. Slides
from the meetings are available on the Past
Events web pages. More information about the IHS is available from the IHS
page on the ESDS Government web site.
27.
How can I find out who has been researching using a specific survey?
To find out who has recently registered to use a survey, the first place to
check is the ‘Registered Uses’ box within each specific survey page
on the ESDS Government web site. When registering with ESDS for
the use of data there is an option to make registration of that usage, including
details of any project surrounding it, widely available.
It may also be useful to do a search on the specific survey name in the ESDS
Government publications database.
Alternatively, if you have a particular
research area/question and are interested in making contact with other researchers
it may be useful to join a JISC mailing list, such as the ESDS
Government JISCmail list. Other than the ESDS Government list, there are
also the Labour
Market Statistics User Group JISCmail list, Health
Statistics User Group JISCmail List and British Crime Survey JISCmail list.
User meetings on a specific survey or research topic can also be a helpful way
to find out current research examples, see ESDS Government events
for forthcoming user survey/topic meetings and past
events for presentation sides from previous events. For more information
on user meetings see FAQ 28.
28. What are the ESDS Government user meetings
and are they for me?
ESDS Government organises a number of user meetings each year, these are:
• General
Household Survey User Meeting
• Labour
Force Survey User Meeting
• Health Surveys User Meeting
• Crime
Surveys User Meeting
• Family Resources Survey User Meeting
These are days which allow users to meet data producers. The days take a fairly
standard format in which the morning consists of presentations by the data producers
and provides information on news and developments on the surveys, while the
afternoon is led by users in researchers describe new or recent work being done
with the data. The meetings are friendly, informal and open to all users or
potential users of the surveys. Previous programmes and slides are available
from the past events
section on the ESDS Government web site.
These meetings are part of a wider range of events organised by ESDS Government,
see the events web page for
more information. If you would like to be kept informed of all the latest events,
please subscribe to the ESDS Government
JISCmail list.
29.
What is a special licence and special licence data?
When you require access to social survey microdata, for example, the data held
at the UK Data Archive (UKDA), this requires registration, agreement to the
End User Licence (EUL) and the registration of usage details.
A Special Licence (SL) is used for special licence data; this is data which
pose a higher risk of disclosure meaning the licence has additional conditions.
For example, the SL requires the signature(s) of the researcher(s) and the institution
with responsibility for the researcher. It also needs the explicit permission
of the data owner to release the data to the researcher(s).
SL applicants are screened by UKDA and the data depositors. Data are only provided
to those researchers who can justify their need for the data for statistical
research purposes and agree to meet the terms of the SL.
The first ONS social surveys to trial this new access initiative are the Annual
Population Survey (APS) and the QLFS (from March-May 2005), but it is planned
to extend the special licences to other surveys in due course.
The QLFS special licence version contains the same content as the QLFS end user
file but has additional geographical information at local authority level. A
list of the variables available in each file is available in an Excel file on
the ESDS Government web site.
The source for this FAQ originates from Special
Licence: new arrangements for access to social survey microdata on the ESDS
web site.
30. What is the new GHS(L)?
The 2005-2006 GHS fieldwork is the first to be undertaken under a new longitudinal survey design. The new design is also accompanied by a slight change in substantive emphasis, which leans towards a greater range of questions on social exclusion. The changes to the GHS aim to satisfy new EU requirements to produce comparable data from a Survey of Income and Living Conditions (EU-SILC). For more information go to the slides from the 2006 GHS User Meeting.
31. When did the QLFS shift from seasonal quarters to calendar quarters?
In accordance with EU regulations, the LFS moved from seasonal (spring, summer, autumn, winter) quarters to calendar quarters (January-March, April-June, July-September, October-December) in 2006. The last seasonal quarter dataset issued was the Quarterly Labour Force Survey, December 2005 - February 2006 (SN 5356) and the first calendar quarter dataset was the Quarterly Labour Force Survey, January - March, 2006 (SN 5369). Users should note that there is some overlap between these two datasets. ONS have produced a limited series of historical LFS datasets on a calendar-quarterly basis. This will allow users to make meaningful comparisons of labour market statistics from the LFS microdata over time. Further information on the seasonal to calendar quarter change and its impact on LFS data may be found in the following online article:
Madouros, V. (2006) Impact of the LFS switch from seasonal to calendar quarters: an overview of the switch of the LFS to calendar quarters and the potential effects of this change on users, London: ONS.
32. How do I use the weights in the 2004 ethnic boost dataset if combining the general population and ethnic boost files?
ESDS Government have written a detailed briefing note on how to use the 2004 Health Survey for England (HSE) weights.
33. Can I use multiple imputation to deal with missing data in the Health Survey for England?
Multiple Imputation (MI) for missing data in the HSE can be carried out using the ice and mim commands in STATA. The results from the MI can then be compared with the results before MI is applied.
However, the ice and mim commands in STATA cannot be carried out in conjunction with the STATA svyset commands (the svyset commands adjust the analyses to take account of the complex survey design). As the HSE has a complex survey design it is important that you use the svyset commands for analyses. Using the svyset commands is likely to have a larger effect upon the results of analyses than using MI. For more information go to the Missing Data web site.
34. How do I use the Health Survey for England individual weights if I’m pooling the data from the 1999 and 2004 datasets (ethnic boost years)?
The 1999 and 2004 Health Surveys for England (HSE) both contained ethnic boost samples in order to increase the number of ethnic minority respondents in the survey.
It is possible to pool the 1999 and 2004 datasets to further increase the number of ethnic minority respondents for your analyses. When pooling the 2 datasets you need to ensure that you use the weights from 1999 and 2004 in the correct way.
First you need to prepare the 2004 weights, see the detailed briefing note on how to use the 2004 Health Survey for England (HSE) weights.
Next, you need to prepare the 1999 weights which are less complex than the 2004 weights.
For the 1999 dataset you need to follow the same procedure as 2004 in terms of dropping the ethnic minority cases from the general population dataset, merging the 2 files together and adjusting the weight so that the weighted total equals the actual total. All of this is illustrated in the detailed briefing note on how to use the 2004 Health Survey for England (HSE) weights. You can check the population is in the correct proportion for 1999 (e.g. the white group reflects the correct proportion of the population) by comparing with the 2001 Census estimates.
You should then scale the weights for both years (i.e. divide each year by the mean weight). Scaling the weights for both years means you're not giving prominence to any one year - both years have a mean weight of one. You then simply give the new scaled weight in each year the same name (e.g. wt_intnew) and then combine the datasets and use the new scaled weight (e.g. wt_intnew) as described in the detailed briefing note on how to use the 2004 Health Survey for England (HSE) weights.
ESDS Government has also written some general guidance notes for users who want to pool the 1999 and 2004 HSE datasets.
35. Where do I get statistics on...?
Unfortunately ESDS Government do not have the resources to produce bespoke analyses. Our role is to support users who are undertaking analysis of major government survey series held at the UK Data Archive.
No software?
The Nesstar system contains many datasets in a user friendly online format. Before you are registered you can browse documentation including frequencies of variables. Once registered, you can create tables and graphs for all or a subset of cases and weight these.
Do you want published statistics?
There are an increasing number of web sites providing access to statistics on the sort of social topics covered by ESDS Government. These include:
Additionally, many surveys result in published reports. These results are typically available in the ESDS Government resources section for the survey series in question. So, for example, you can find links to the published reports of the General Household Survey on the GHS resources web page.
Links to the published reports of the Health Survey for England can be found on the HSE resources web page.
Updates
The FAQs are regularly updated; the
latest version is always available in HTML from the ESDS
Government web pages.
This version was last modified on
5 March 2010
ESDS
Government home