During this project we have generated many resources which we feel are of wider benefit to
occupational researchers. These can be downloaded from this page. At present these resources
are in Stata format. We are currently working on producing the resources in other formats.
Most of these resources can be found Geode - Grid Enabled Occupational
Data Environment. GEODE serves to collate and distribute occupational resources for the benefit of social
science research. For a richer source of occupational research resources, visit
Dave Griffiths or
Paul Lambert if you have any questions or comments regarding these files.
This .dta file adds HIS-CAM scores to
Microclass conversion files
These resources enable occupational data to be converted into the
Microclass scheme, as documented in:
Jonsson, Jan O., David B. Grusky, Matthew Di Carlo, Reinhard Pollak, and Mary C. Brinton. 2009.
"Microclass Mobility: Social Reproduction in Four
Countries", American Journal of Sociology, Volume 114, Number 4.
This .dta file translates from ISCO-08 to microclasses
This .dta file translates from ISCO-88 to microclasses
This .dta file translates from ISCO-68 to microclasses
This .dta file translates from the 3 digit HISCO scheme to microclasses
This .dta file translates from the 2 digit HISCO scheme to microclasses
This file adds labels to the microclass variable generated through other resources from this project.
Stata .do files for analysing occupational structure
This file predicts an expected number of marriage pairings between members of occupational
unit groups, based on the numbers of male and females in each job. The actual number of
pairings, and the value of over/under-representation is also calculated. If this data is
used to drop marriage pairings under certain thresholds of frequency or value of
over/under-representation, a text outsheet of the male and female occupation can be
saved as a matrix for use in Pajek and other social network analysis software.
This file splits occupational unit groups into graduate members and non-graduate members
by adding 10,000 to the code for graduates.
This file obtains the percentage of graduates within an occupational unit group amongst
different age groups from Ipums_international data.
This enables researchers to construct
their own versions of the SOC(HE) scheme, created by Peter Eilas and Kate Purcell in
SOC (HE): A classification of occupations for studying the graduate labour market,
Researching Graduate Careers Seven Years On research paper no.6, Peter Elias and Kate Purcell,
March 2004. (Available at
This file groups individuals from
into the following age-groups.
- Under 25
- 25 to 34
- 45 to 55
- 55 to 64
- 65 and over
IPUMS International occupational labels
These files enable occupation labels to be automatically generated for surveys found within
The command files for IPUMS can become confused if you are downloading multiple samples. For instance, if you download the 1991 and 2001 UK data in the same download, the value "111", in the variable "occ", refers to "authors, writer, journalists", for the 1991 data and "corporate managers and senior officals", in the 2001 data. These files enable the correct labels to be easily attached.
These files can be run from their online address, or downloaded to a computer. To run them automatically online, the command syntax is :
The country is always the two digit internet domain version. The year is always four digit. We hope this standard will enable researchers to easily obtain labels if working on multiple datasets.
Costa Rica 1984
Costa Rica 2000
Kyrgyz Republic 1999
Puerto Rica 1970
Puerto Rica 1980
Puerto Rica 1990
Puerto Rica 2000
Puerto Rica 2005
St Lucia 1991
United Kingdom 1991
United Kingdom 2001
South Africa 1996
South Africa 2001
South Africa 2007
- Chile 1992 - These labels are our own construction to adjust between the 3-digit occ variable in the data and the 4-digit labels on the IPUMS website. There are a few missing codes, namely 331, 335, 799 and 899.
- Kenya 1989 – Codes 7818 –7832 are labelled as "undocumented",.
- Switzerland 1970 – 2000 – No label is available for "29", in all datasets, and "798", in 1970.
The following countries and time-points have occ data available on IPUMS but we are yet to generate the occ_labels .do file. We intend to make them available online soon to enable this resource to be comprehensive.
Dave Griffiths if you have any questions or comments regarding this files.
- Canada 1971; 1981; 1991; 2001
- Israel 1972; 1983; 1995
- Italy 2001
- Nepal 2001
- Netherlands 1960; 1971; 2001
- Palestine 1997
- Slovenia 2002
- Tanzania 1988; 2002