| Home | ESDS | A-Z index | Site map | Contact | Login | Search: 
ESDS Government logo - link to ESDS Government home page

ESDS Government


Labour Force Survey Frequently Asked Questions

Questions:

What population is covered by the LFS?
Where can I obtain statistics and tables from the LFS?
Before I order, how do I find out what questions/variables are included?
When did the LFS move from seasonal to calendar quarters?
Does the LFS include information at the person level?
What format is the data available in?
Can I use the LFS for teaching purposes?
What are the differences between the various Labour Force Survey datasets?
What is the relationship between the APS, the IHS and the LFS?
Do I need to use weights to analyse the LFS?
What were the 2007 and 2009 reweighting projects for and which datasets do they affect?
I want to produce yearly (not quarterly) representative results. How can I do this?
Can I use the LFS to analyse change over time?
Do the longitudinal LFS datasets contain all the variables in the quarterly LFS datasets?
How can I link the longitudinal, quarterly and household editions of the LFS?
What is the lowest geographical area for which LFS data is available?
Why is there a combined missing value category ‘-10’ in some of the household datasets?
What are the access restrictions to LFS data under the End User License due to privacy protection reasons?
Are age (continuous) and SOC occupation (4 digits) variables available in the QLFS dataset?
When will the transition to the new SOC 2010 classification of occupations happen for the LFS?
I cannot find what I am looking for in the documentation for pre 1990 editions of the LFS. What can I do about it?
What should I do about missing variables in Labour Force Survey Calendar Datasets?
What are the ID variables on the Quarterly Labour Force Survey datasets
Is there a more general FAQ?

What population is covered by the LFS?

The LFS is intended to be representative of the total UK population. The survey covers all adults in private households. It also includes those living in NHS accommodation and young people (aged 16-24) in student halls of residence or similar institutions. Analyses requiring representative data should be used with weighted data.

Where can I obtain statistics and tables from the LFS?

Latest key trends for the previous three months are published monthly here by the ONS. Detailed results are published quarterly in the Labour Market Statistical Bulletin (available here). More in-depth analyses on topics covered by the LFS are published monthly in the Economic & Labour Market Review (formerly Labour Market Trends). Hard copies are available from good academic libraries or from The Stationery Office. The current Economic & Labour Market Review is also available in PDF format from the National Statistics web site. The archive contains all issues since 2007.

Before I order, how do I find out what questions/variables are included?

Variable lists and PDF user guides (including questionnaires) are freely available via the Doc column on the LFS catalogue page (you will need to select a flavour of the LFS first). A list of the variables in the QLFS Special Licence Dataset is available in this Excel file. A variable search tool is also available which allows users to find out whether a specific topic or question is covered in each of the LFS surveys (or any of the other ESDS Government surveys). The variable search tool is currently under development. Please report any error to govsurveys@esds.ac.uk.

The Social Surveys Data Service at ONS can also provide customized LFS data analysis and/or tabulations (including dialup access) according to customer requirements. Data runs are costed according to the specifications of individual requests, however a minimum fee of of £135 incl. VAT applies. Requests or enquiries should be addressed to Lfs.dataservice@ons.gsi.gov.uk (Tel: +44 (0)1633 655 732).

NOMIS is a database of labour statistics run on behalf of ONS by the University of Durham. Tel: +44(0)191 334 2680, email: support@nomisweb.co.uk.

When did the LFS move from seasonal to calendar quarters?

In accordance with EU regulations, the LFS moved from seasonal (spring, summer, autumn, winter) quarters to calendar quarters (January-March, April-June, July-September, October-December) in 2006. The last seasonal quarter dataset issued was the Quarterly Labour Force Survey, December 2005 - February 2006 (SN 5356) and the first calendar quarter dataset was the Quarterly Labour Force Survey, January - March, 2006 (SN 5369). Users should note that there is some overlap between these two datasets. ONS have produced a limited series of historical LFS datasets on a calendar-quarterly basis. This will allow users to make meaningful comparisons of labour market statistics from the LFS microdata over time. Further information on the seasonal to calendar quarter change and its impact on LFS data may be found in the following online article:

Madouros, V. (2006) Impact of the LFS switch from seasonal to calendar quarters: an overview of the switch of the LFS to calendar quarters and the potential effects of this change on users, London: ONS.

Does the LFS include information at the person level?

Yes. The Quarterly Labour Force Survey and the 2 quarter and 5 quarter longitudinal datasets only contain information at the person level. Users should download the Quarterly Labour Force Survey Household dataset for analysis at the household level.

What format is the data available in?

Most years of the LFS are available in SPSS, STATA, SAS and Tab-Delimited format. Most of these datasets can also be used with R. Although one can also use Excel to analyse the data, we do not recommend it: users might find it difficult to have to constantly refer to the codebook while handling the data, or to control for missing data.

Can I use the LFS for teaching purposes?

Yes. The LFS teaching dataset (2002) gives a subset of data drawn from the UK Labour Force Survey, containing data from all four quarters of the 2002/3 LFS, for respondents aged 16-65 and resident in the UK (n=63,559). For ease of use within a teaching context, the dataset is restricted to a subset of 58 key (mainly individual level) variables. A Quarterly Labour Force Survey, June - August, 2005: Ethnicity Teaching Dataset is also available.

What are the differences between the various Labour Force Survey datasets?

(Quarterly) Labour Force Survey
The Labour Force Survey is a unique source of information using international definitions of employment and unemployment and economic inactivity, together with a wide range of related topics such as occupation, training, hours of work and personal characteristics of household members aged 16 and over. (Q)LFS is the main dataset including results from all questionnaire items and is available at the level of the individual with government office region being the lowest geographical variable for analysis. It is available as an annual dataset from 1975 (excluding 1976, 1978, 1980 and 1982) until 1991. From June-August 1992 it is available as a quarterly dataset and can be accessed here.

QLFS Special Licence Datasets
From the March-May 2005 quarter, a Special Licence version of the QLFS data is also available in addition to the version of the QLFS data made available under the standard End User Licence. This contains extra variables, including more detailed geographical information, and therefore is subject to more restrictive access conditions. Prospective users of the Special Licence edition will need to complete the Approved Researcher form and demonstrate to the data owners exactly why they need access to the extra variables, in order to get permission to use that version. Most users should order the standard version of the data. In order to help users choose the correct dataset, a Special Licence Access section has been added to the dataset title page.

QLFS Local Area Data
Available only between 1992 and 2006, the data provided aggregate estimates of the following subjects for Local Authority Districts and Training and Enterprise Council areas (not at the individual level):

• employment (by age groups)
• employees
• self-employed
• ILO unemployed (by age groups)
• economically active (by age groups)
• economically inactive (by age groups)
• industry Sectors (by Standard Industrial Classification codes)
• occupations (by Standard Occupational Classification codes)
• ethnic minorities
• full-time education
• qualifications
• job related training

Due to perceived confidentiality issues, this series was discontinued in 2006. Users interested in producing analysis for Local Authority Districts should consider using the Special Licence version of the quarterly datasets and derive their own estimates.

QLFS Household Datasets
The LFS household datasets are produced twice a year (April-June and October-December) from the corresponding quarter's individual-level LFS data. They include a number of new derived variables at household and family unit level only, to facilitate the analysis of the economic activity patterns of whole households. For 1992 to 1995, the household datasets include adjusted variables for household and family type, which are adjusted for several inconsistencies and discontinuities during this period.
It is recommended that the existing individual-level (Q)LFS datasets continue to be used for any analysis at individual level and that these household datasets be used for any analysis involving household or family level data. The lowest geographical variables for analysis are government office region and a county level indicator.

Two- and Five-quarters LFS Longitudinal Datasets
Any respondent to the LFS is interviewed at five consecutive quarters. Data on each individual has now been linked together to provide longitudinal information. This longitudinal data consists of two types of linked datasets, created using the weighting method to adjust for non-response bias. The two-quarter datasets link data from two consecutive waves, while the five-quarter datasets link across a whole year (for example summer 1999 to summer 2000 inclusive) and contain data from all five waves. A full series of the data has been produced going back to winter 1992. The longitudinal datasets include a subset of the most commonly used variables from the QLFS, covering the main areas of the survey.

Eurostat Datasets
From its origin in 1973, the UK Labour Force Surveys were conducted under an initiative of the European Union and were further expanded in order to provide comparable labour market data between member states. They remain regulated by European law; the latest EU LFS regulation was passed in 1998. EU regulations require only a core set of the variables available in the LFS to be present, as well as nomenclatures used (ISCO occupations / NACE industry codes). As a result, LFS Eurostat datasets only contain a subset of the variables accessible in the quarterly LFS. The list is available here (ADD HYPERLINK).
In addition, occasional ad-hoc modules are also included in the LFS as a result of Eurostat requirements, such as the 2003 Regular National Education System, the 2006 Transition from Work into Retirement, or the 2007 Accidents at Work and Work-Related Health Problems modules. More information on the background of the EU LFS and the variable list may be found here (ADD HYPERLINK).

What is the relationship between the APS, the IHS and the LFS?

The Annual Population Survey is made of several components, one of which is the waves 1 and 5 of the Labour Force Survey, together with the Welsh, Scottish Labour Force Surveys, and the English Local Labour Force Survey. Sample size is much bigger than the LFS, the APS uses only a subset of the LFS variables.
Together with several other governments surveys the Annual Population Survey is part of the Integrated Household Survey. As a result, the IHS includes a much broader set of variables than either the LFS or the APS.

Do I need to use weights to analyse the LFS?

Yes. The Quarterly LFS (QLFS) datasets have two weights (Pwt07 and Piwt07). Pwt07 is the weight for individual data - this compensates for non-response and grosses to population estimates. Piwt07 is the weight for income data - this weights so that that the weight of a sub-group corresponds to that sub-group's size in the population and also weights to give estimates of the number of people in certain groups. This is restricted to employees' earnings: other income data are not (yet) weighted.

The QLFS household datasets contain individual level data for households, but have been designed for household analyses. They have one weight to gross to population estimates. The weight is the same for all household members. The 2009 weighting variable is called phhwt09.

The QLFS longitudinal datasets (2-quarter and 5-quarter) contain one weight to compensate for attrition and to produce population estimates. The weighting variable is called LGWT

What were the 2007 and 2009 reweighting projects for and which datasets do they affect?

Reweighted Labour Force Survey (LFS) microdata were published by the Office for National Statistics (ONS) in 2008 and 2009. All LFS datasets prior to July-September 2006 have been reweighted to the 2007/08 population estimates whereas previously they were based on population totals published in 2003. LFS data from from July-September 2006 were reweighted using population estimates for 2009. Adjustments to weights are carried out regularly as more recent population estimates become available. It is likely that there will be further changes to LFS weights in the future. Further information about the 2009 and 2007 reweighting projects is included in the documentation for the updated datasets, and can also be found on the ESDS web site.

I want to produce yearly (not quarterly) representative results. How can I do this?

The easiest way to produce yearly analyses with the LFS is to append the four calendar quarterly datasets for a given year into one, then only retain respondents who were contacted for the first time at each quarter (THISW=1). For most application keeping weights as they are should be OK. However, for analyses that do not involve estimating population counts users can also center the weights around the middles of each quarter by dividing each individual respondent' weight by the mean weight for that quarter.

Can I use the LFS to analyse change over time?

Yes. The LFS contains both panel and (short term) repeated cross sections elements. For more information see the ESDS Government Analysing change over time guide.

Do the longitudinal LFS datasets contain all the variables in the quarterly LFS datasets?

No. Because of the resources involved in production and the size of the resultant datasets, the longitudinal datasets include only a subset of the full LFS variable set. This subset has been agreed in consultation with users and represents the most important and commonly used variables covering the main areas of the survey.

How can I link the longitudinal, quarterly and household editions of the LFS?

Users who have access to the Special License quarterly datasets and who want to add in extra variables to the longitudinal data can do so by creating the PERSID variable that uniquely identifies each individual and simply merging the variables on. Instructions on how to create the persid variable using SPSS can be found in the User Guide for Two- and Five-Quarter LFS Longitudinal Datasets. If using STATA then it is important to ensure that PERSID is in double format when creating this variable (see syntax below):

gen double persid= (quota*10000000000)+(week*100000000)+ ///(w1yr*10000000)
+(qrtr*1000000)+(add*10000)+(hhld*100)+recno

What is the lowest geographical area for which LFS data are available?

The quarterly LFS (QLFS) datasets are available for Government Office regions (North East, North West, Merseyside, Yorkshire and Humberside, East Midlands, West Midlands, Eastern, London, South East, South West, Wales, Scotland and Northern Ireland), with additional identifiers for Inner and Outer London, the West Midlands Metropolitan County, Greater Manchester and Merseyside, South, West Yorkshire, and Strathclyde in Scotland.

Datasets containing lower geographical level are available under a Special License which requires accreditation by the UK Statistics Authority as an Approved Researcher. There are three such datasets based on the LFS that allow users to obtain finer geographical detail: The QLFS Special Licence access data distinguishes Unitary / Local Authorities in England and Wales. It also includes an urban/rural indicator. Alternatively, the Annual Population Survey (APS) combines results from a number of sources including the Labour Force Survey (LFS) (waves 1 and 5) contains data for unitary/local authorities in the UK. Its large sample size make it ideal for detailed geographical analysis

Finally between 1992 and 2006, the QLFS Local Area Data provided data aggregated by area for Counties/Scottish Councils, Local Authority Districts (LADs), Training and Enterprise Councils (TECs), Local Enterprise Companies (LECs), Government Offices (GOs) and Training, Enterprise and Education Directorate (TEED) regions.

Why is there a combined missing value category ‘-10’ in some of the household datasets?

All missing values in the data have been set to one '-10' “No answer/Does not apply” category instead of the previous separate '-8' “No answer” and '-9' “Does not apply” categories. The ONS has introduced a new imputation process for the LFS household datasets and it has thus been necessary to code the missing values into one new combined category ('-10'), to avoid over-complication. The '-10' categorisation is in line with the Annual Population Survey household series. The change also applies to LFS household data back to April-June 1997 (currently being updated at the UK Data Archive), to ensure continuity for analytical purposes. The change only applies to the household data, and there are no current plans to extend it to other LFS series.

What are the access restrictions to LFS data under the End User License due to privacy protection reasons?

Due to the perceived risk of identifying respondents to the LFS, codes for geography levels smaller than Government Office Regions are only available in the Special License version of the LFS. Additional restrictions have recently been put in place by the ONS on the End-User License datasets: it is no longer possible to link quarterly and longitudinal editions, by computing unique identifiers. Users interested in doing so need to apply for a Special Licence.

Are age (continuous) and SOC occupation (4 digits) available in the QLFS dataset?

Yes. The ONS has reversed its previous decision to remove AGE and SC2KMMJ from the End User License LFS datasets.

When will the transition to the new SOC 2010 classification of occupations take place in the LFS?

Fieldwork for the LFS with the new ONS' 2010 Standard Occupational Classification has begun in January 2011. The first dataset that will include the SOC 2010 variables is the January-March 2011 edition. It is due to be released in May 2011.

I cannot find what I am looking for in the documentation for pre 1990 editions of the LFS. What can I do about it?

Early editions of the LFS documentation did not offer the same level of detail as it does now. Variable names and definitions changed several times, and the description of pre 1985 classifications of occupations, education subjects, or industries is not always well documented. If you don't find the information you are looking for please write to ESDS government Help Desk.

What should I do about missing variables in Labour Force Survey Calendar Datasets?

The LFS moved from seasonal to calendar quarters in 2006 and the seasonal datasets have been removed from ESDS and replaced by the calendar datasets. As a result a number of variables have been dropped from the datasets. A page which lists each calendar dataset with the corresponding seasonal datasets is now available, together with more information about the changes.

Unfortunately the LFS User Guides so far do not reflect the changes. As a result you will find that certain variables noted in the User Guides as being present in the datasets are not actually in the datasets.

What are the ID variables on the Quarterly Labour Force Survey datasets

a. End-User Licence (EUL) files

QLFS January - March 2011 (Study Number 6782)
The JM11 QLFS data files contain new ID variables/numbers which are designed to improve the confidentiality of respondents, which has some impact on the way in which the data can be used. The new identifiers are:

CASENOP - ‘Case Identifier - pseudoanonymised’
HSERIALP - ‘Number uniquely identifies a household - pseudoanonymised’

These ID numbers are designed to permit users to link household members together, but not to link across waves to create their own longitudinal datasets (users who wish to do this should use the LFS longitudinal files). The household identifier (hserialp) is an identifier which starts at 1 and simply counts up. Common values of this variable across waves no longer indicate that the case is from the same household. The numbers are no longer based on administrative data, and many of these administrative variables have been removed for reasons of confidentiality. Accordingly, previous guidance on how to generate person and household identifiers from these administrative variables has been superceded. It should still be possible to combine datasets in order to increase sample size as variables indicating which wave of the survey the respondent is in and when the respondent entered are still present. However, it is no longer possible to check that individual cases have not been duplicated as a result of failures in the merge process. It is still possible to use the data to identify household membership using the hserialp variable. However, users are reminded that when undertaking household analysis the most suitable data for household level analysis or analyses which involve household context are the household LFS files. All future QLFS datasets will contain these two new identifier variables. A variable called QUOTAP is also available which is the ‘stint number where interview took place - pseudoanonymised’

QLFS April-June 2009-October-December 2010 - these data files contain a new ID number (IDREF) which preceded the new ID variables for the January-March 2011 dataset (see above). IDREF was designed to improve the confidentiality of respondents, which has some impact on the way in which the data can be used. IDREF does not allow linkage between quarters (nor do Casenop and HserialP – see notes above). Please note that there is no household identifier on these datasets.

QLFS 2001-2009 - there are no changes to the ID variables in these QLFS datafiles – they should contain the original QLFS identifier variables (caseno, add, quota etc).

b. Special Licence (SL)
All Special Licence QLFS datasets should still contain the original ID variables (caseno, add, quota etc) so they are not affected by the above changes.

Is there a more general FAQ?

There is a generic ESDS Government FAQ for all surveys available.

 



ESDS Home Page > Government > Lfs > Faq
_
  Page last updated 7 December 2011
© Copyright 2003 - 2012 Universities of Essex and Manchester. All rights reserved.
Contact   |    Copyright and Disclaimer    |    Accessibility |   Send us comments on this page
Link to University of Essex Link to University of Manchester Link to JISC Link to ESRC