2016 submissions

Linkage of records from the 2011 survey for the Programme for the International Assessment of Adult Competencies (PIAAC), the 2011 Census of Population, the 2011 National Household Survey (NHS) and the Immigrant Landing file. (001-2016)

Linkage of records from the 2011 survey for the Programme for the International Assessment of Adult Competencies (PIAAC), the 2011 Census of Population, the 2011 National Household Survey (NHS) and the Immigrant Landing file. (001-2016)

Purpose: Given that the linguistic practices of official language minorities in the labour market and their communities affect their cultural and economic vitality, linking PIAAC data with 2011 NHS and 2011 Census data will provide information on each member of a PIAAC respondent's household, which cannot be done at this time. This will help us gain a better understanding of the relationship between the characteristics of household members and the level of skills measured in the PIAAC. As well, the census and NHS include questions that were not asked in the PIAAC survey (e.g., questions to derive the first official language spoken and a question on languages used regularly at work), hence the importance of linking data from different sources.

Description: The PIAAC survey is part of a series of international surveys that have been conducted since the mid-1980s to measure the various dimensions of adult literacy, numeracy and problem-solving skills. The 2011 Census and 2011 NHS contain information on respondents' first official language spoken and on the members of the respondents' household. The Immigrant Landing file also includes data on the knowledge of official languages among immigrants upon arrival in Canada.

Records from the PIAAC, 2011 Census, 2011 NHS and the Immigrant Landing file are linked using a (deterministic hierarchical) record linkage program.

Only PIAAC survey respondents and members of their households will be retained for this record linkage.

Output: Only estimates that present aggregate data in accordance with the confidentiality provisions of the Statistics Act will be published outside Statistics Canada. Products derived from the linkage between PIAAC, the census, the NHS and the Immigrant Landing file will be in the form of cross-tabulations, charts, geographical maps and results of multivariate logistic and linear regressions.

The results of the linkage, including the variables used to link the records, such as personal identifiers and information for measuring linkage quality, will be destroyed by March 31, 2016 at the latest, or as soon as they are no longer required. All files will be stored on a server in a secure location. Access to the linkage results is restricted to Statistics Canada employees and deemed employees of Statistics Canada whose work assignment requires this access.

Evaluation of the quality of addresses on CCTB and IDENT files and cleaning of the T1FF to exclude duplicates and optimize the linkage. (007-2016)

Evaluation of the quality of addresses on CCTB and IDENT files and cleaning of the T1FF to exclude duplicates and optimize the linkage. (007-2016)

Purpose: The Demographic Estimates Program (DEP) is responsible for the production of international and internal migration estimates. The Household Survey Methods Division (HSMD) offers methodological services to the Demography Division (DEM) in order to improve the population estimates.

Preliminary interprovincial migration is estimated using monthly data from the Canadian Childcare Tax Benefit (CCTB). Migrants are identified by comparing the addresses (postal codes) of CCTB recipients for two consecutive time periods. For this reason, the quality of CCTB addresses are of the upmost importance. Also, because we are mainly interested in migration, it is important that the addresses are updated as soon as possible. The IDENT file is now available quarterly at Statistics Canada. This source offers the possibility of evaluating the quality of CCTB addresses for quarterly time periods by looking specifically at individuals who changed their address on one file or the other in order to verify the consistency of the change.

Interprovincial and subprovincial final estimates are calculated using tax data from the T1FF. As for preliminary estimates, interprovincial and subprovincial migrants are identified by comparing the addresses of tax filers and their dependents for two consecutive years. Filers eligible for that comparison are those who appear on the T1FF for the two years that are being compared. Some individuals change their social insurance number (SIN) or change from a dependent identification number (DIN) to a SIN over time. As a result, these changes create duplicates on the T1FF and make the linkage between the two compared files more difficult. The linkage of the T1FF to auxiliary sources allows us to track individuals whose identification numbers changed over time, and to clean the files to improve their representiveness of the population.

Description: Because of these issues, evaluations were done to 1) assess the quality of CCTB addresses by comparing them with those of the IDENT files and 2) understand the impact of the cleaning of the T1FF identification numbers, which removes duplicates and ensures a common number is kept for the two studied years.

Output: The linkage files between CCTB and IDENT will be kept until April 1st, 2018 (at the latest). After that date, they will be destroyed.

The linkage files between T1FF and the auxiliary sources used for the cleaning would be used in production, which means they could be required for any production-related revisions. For that reason, it is recommended to keep these files.

For these two record linkage projects, aggregated and non-confidential results will be presented internally and externally. They could also be released in reports or scientific papers (conferences and scientific journals).

Agriculture-Population Linkage – A secondary use of file 011-1987. (012-2016)

Agriculture-Population Linkage – A secondary use of file 011-1987. (012-2016)

Purpose: To add a secondary use of the administrative data by another program within the division.

Description: The Census of Agriculture is linked to the Census of Population to create a comprehensive database that includes socio-economic characteristics of farm operators. This activity allows for the production of cross-classified tables displaying farm characteristics and farm operator's socio-economic data. This activity has been ongoing since the 1971 censuses.

Output: The Canadian Agricultural Financial Statistics Section of the Agriculture Division will use the aggregated data from the Ag-Pop linkage to calculate the income-in-kind estimate for the Net Farm Income as part of the requirements for the System of National Accounts.

Linking command organization from the Department of National Defence (DND) Human Resources (HR) management file. (016-2016)

Linking command organization from the Department of National Defence (DND) Human Resources (HR) management file. (016-2016)

Purpose: To link respondent's survey data to their "command organization" from the DND HR management file to ensure that this information will be as accurate as possible.

Description: Each respondent's survey data will be linked to their "command organization" from the DND HR management file (the frame file) using survey access codes (SAC). The command organization will then be kept on the final micro file; the SACs will be removed.

Output: The linked variable (command organization) will be used in the data analysis and included on disseminated aggregate data files. It will be included on the final micro data file which will be delivered to the Research Data Centres. This variable on the final micro data file will be retained indeterminately.

Long-term health-related benefits of greenness: results from a Canadian national cohort study. Secondary use of 049-2012 and 045-2015. (020-2016)

Long-term health-related benefits of greenness: results from a Canadian national cohort study. Secondary use of 049-2012 and 045-2015. (020-2016)

Purpose: The purpose of this project is to examine the benefit of long-term exposure to greenness and green spaces to the health of Canadians in a large, national cohort study.

Description: The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence. Also included in the analysis is the Canadian Census Health and Environment Cohort (CanCHEC) 2001.

The study addresses several research questions:

  1. Is long-term exposure to greenness associated with reduced risk of dying from cause-specific mortality (e.g., non-accidental causes, diseases of the respiratory system, cardiovascular causes) among Canadian adults?
  2. Is long-term exposure to greenness associated with the incidence of selected cancers (e.g., lung, breast, prostate) among Canadian adults?
  3. Do associations between exposure to greenness and health outcomes vary by geographic region or by demographic/socioeconomic characteristics?
  4. Do exposures to local (e.g., neighbourhood-level) or regional (e.g., citywide) estimates of greenness have different associations with health outcomes.

Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.

2015 General Social Survey on Time Use: linking tax data from the T1 Personal master File, T1 Family File and T4 Summary and Supplementary file. (021-2016)

2015 General Social Survey on Time Use: linking tax data from the T1 Personal master File, T1 Family File and T4 Summary and Supplementary file. (021-2016)

Purpose: The General Social Survey (GSS) program, established in 1985, conducts telephone surveys from a sample selected across the 10 provinces (excluding the Territories). The GSS is recognized for its regular collection of cross-sectional data that allows for trend analysis, and its capacity to test and develop new concepts that address emerging issues. Each year the GSS focuses on a different topic, such as family, victimization, social support and aging, and time use. A specific topic is usually repeated approximately every 5 years. The 2015 GSS will focus on Time Use.

The 2015 GSS on Time Use is the fifth iteration of a series of surveys which began in 1986.

By linking the 2015 GSS on Time Use responses to personal tax files of respondents, and the tax files

of all household members, more accurate income (personal and household), will be obtained for respondents. At the same time, response burden will be minimized, and collection and data processing costs will be reduced.

Description: The 2015 GSS on Time Use is a sample based survey with a cross-sectional design. Telephone and /or Internet surveys are conducted through computer assisted telephone or Electronic Questionnaire (EQ) interviews from a sample selected across the 10 Canadian provinces.

By linking data, we are aiming to obtain better quality data for income (personal and household).

Questions relating to income show rather high non-response rates, the incomes reported by respondents are usually rough estimates and donor imputation is used for partial and item non-response.

The information collected during the 2015 GSS on Time Use will be linked to the personal tax records (T1, T1FF or T4) of respondents, and tax records of all household members.

Respondents will be notified of the planned linkage before and during the survey. Any respondents who object to the linkage of their data will have their objections recorded, and no linkage to their tax data will take place.

Output: The availability of the 2015 GSS on Time Use analytical data file will be announced in The Daily. The analysis file will be made available to Statistics Canada researchers, and to deemed employees at the Statistics Canada Research Data Centres. All data will remain confidential and protected under the Statistics Act.

Along with the availability announcement of the analytical data file (in The Daily), only non-confidential aggregate statistics will be released.

Linkage of data from the current cycle to the previous cycle for the Farm Financial Survey. (022-2016)

Linkage of data from the current cycle to the previous cycle for the Farm Financial Survey. (022-2016)

Purpose: The FFS is seeking approval to link micro-level data from the current survey cycle with corresponding data from the previous survey cycle. The objective of the proposed linkage is for AAFC internal research, to enable more in-depth analysis of the sector investments patterns.

FFS data are the only available source of farm-level data to link farm characteristic, farm financial and Business Risk Management (BRM) program data. BRM is a major policy initiative by Federal, Provincial and Territorial (FPT) governments. The record linkage will allow for more targeted analysis to support discussions by Federal-Provincial and Territorial Governments during the development of a new policy framework.

Description:Where possible, data from the Farm Financial Survey from the current reference year and the previous reference year will be linked at the record level. The records linked will only be for those respondents who have agreed to share their data with Agriculture and Agri-Food Canada.

Output: Linkage results will be used to produce analytical reports for Agriculture and Agri-Food Canada research purposes. Any use of the data will conform to the confidentiality provisions of the Statistics Act.

The linked file used to produce the analytical reports will be retained for the duration of the applicable data sharing agreement(s). The current data sharing agreement expires on March 31, 2022.

Linkage of the Annual Capital and Repair Expenditures Survey (CAPEX) 2000 to 2012 to the Linkable File Environment (LFE) (027-2016)

Linkage of the Annual Capital and Repair Expenditures Survey (CAPEX) 2000 to 2012 to the Linkable File Environment (LFE) (027-2016)

Purpose: Following the conversion of the Annual Capital and Repair Expenditures (CAPEX 2000 to 2012) from establishment level to the firm level, the linkage of CAPEX estimates with other firm level databases will be used for future research projects both by ISTD at STC and ISED. Thus, the project benefits both STC and ISED in the sense that it fills an important void with respect to studying the role of intangible assets on innovation (multifactor productivity) at the firm level.

Description: This is a request to link the Annual Capital and Repair Expenditures Survey (CAPEX) 2000 to 2012 to the Linkable File Environment (LFE)

Output: The linkable CAPEX dataset will be housed at Statistics Canada's Centre for Special Business Project (CSBP). When a research project is formally approved by Statistics Canada, Statistics Canada's Centre for Special Business Projects will extract a researcher database from the LFE which will contain data for the variables that are listed in the research proposal for the population that has been specified.

Canadian Cancer Treatment Linkage Project (028-2016)

Canadian Cancer Treatment Linkage Project (028-2016)

Purpose: The CCR is a dynamic database of all Canadian residents, alive or dead, who have been diagnosed with cancer since 1992. While the CCR contains current data on cancer incidence in Canada, it does not presently provide any surgical or other cancer treatment information through either base collection or record linkage. As a result, there is no way to assess or monitor cancer management approaches within Canada. This project seeks to take advantage of two key opportunities – the availability of Canadian health administrative data on surgical treatment and the creation of the new record linkage environment (SDLE) at Statistics Canada – to create a linked file that adds surgical treatment data to the CCR. This project will result in the creation of a national (except Quebec) linked database reporting surgical treatments for cancers in a standard and comparable way across Canada, thereby extending the relevance of the CCR.

Description: The SDLE Production team will link the CCR, DAD, NACRS and CMDB files to the Derived Record Depository (DRD) using a probabilistic linkage methodology for the CCR and CMDB and a deterministic linkage methodology for the DAD and NACRS files. Once the linked files are produced, project staff will undertake validation of the linked dataset. The first validation will be completed as part of the linkage process in the SDLE, whereby each linkage will be examined for its overall linkage rate. A second validation will focus on determining the analytical robustness of the linked file to ensure that the linked data can be used to answer analytic questions (i.e., fitness for use). Upon completion of the validation and documentation, a subset of the linked file will be placed in up to three RDCs as a pilot for use by external researchers who have an approved project proposal. The full linked file will be retained at Statistics Canada for a period of 10 years, after which its status will be revaluated.

Output: The full linkage file will be composed of:

  • all the variables available in the CCR file;
  • all the variables from linked DAD records;
  • all the variables from linked NACRS records; and
  • the date of death, cause(s) of death, and locality of death variables for linked Canadian Mortality Database (CMDB) records.

The file will not contain any direct identifiers such as name, address, telephone number, health information number or patient number. The subset of the file for use by analysts will only be accessible through the RDC system, following review of user proposals, and will be in accordance with the rules of the pilot study.

Socioeconomic variations in violence-related deaths in Canada: 1991-2011 (031-2016)

Socioeconomic variations in violence-related deaths in Canada: 1991-2011 (031-2016)

Purpose: This research will detail temporal trends, and method of death; and assess socio-economic status as a risk factor, testing the social disadvantage theory in relation to violent deaths.

Description: The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence.

This is an epidemiological study on violence-related mortality in Canada from a public health perspective. This research aims to investigate the patterns of violence-related mortality over the 1991-2011 period, considering sex, age, geography and various individual socioeconomic indicators.

A descriptive analysis will be conducted to measure the relationship between violence-related death and some socioeconomic indicators. Fitted models will be used to assess time-trend and main effects of each socioeconomic indicator on violence-related mortality risk. The study will also estimate whether the violent mortality slopes differ across socioeconomic groups of the population.

Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.

Official language acquisition among immigrants to Quebec: Its determinants, and its association with earnings (032-2016)

Official language acquisition among immigrants to Quebec: Its determinants, and its association with earnings (032-2016)

Description: This is a proposal for the secondary use of linkage file 001-2016, which includes data from the Programme for the International Assessment of Adult Competencies (PIAAC 2013), the 2011 Census of the Population, the National Household Survey, and the Immigrant Landing File.

Purpose: The proposed research will be ongoing over the course of the coming year, and will culminate in the publication of several research papers focusing on immigrants and knowledge of official languages. There are firm plans for two papers to be completed by March 31, 2017, and a third paper is tentatively planned for early next fiscal year.

These papers will benefit from the fact that this linked dataset contains information on knowledge of official languages at landing, as well as information on current knowledge of official languages. This means that we have information on knowledge of official languages at two points in time, which we can compare in order to derive a measure of official language acquisition. The dataset also has very detailed information on various skills and skill use from PIAAC.

The first paper will investigate the determinants of official language acquisition among immigrants in Quebec, with special emphasis on the Montreal CMA, and the second will see how the acquisition of an official language impacts immigrant earnings in the Montreal CMA.

Output: The linked analysis file will remain on Statistics Canada premises. Access to the analysis file will be restricted to Statistics Canada staff and deemed employees whose assigned work requires such access.

Only aggregate statistics that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Results of this study will be published through analytical reports from the Immigration, Language and Ethnocultural Statistics Program.

The linked analysis file, stripped of identifiers, has been approved for retention until March 31, 2018, or until no longer required, at which time it will be destroyed.

The linkage key file will be retained until March 31, 2017, or until no longer required, at which time it will be destroyed.

Linking Data from the 2012 Survey of Financial Security to income tax Records file (T1) (034-2016)

Linking Data from the 2012 Survey of Financial Security to income tax Records file (T1) (034-2016)

Purpose: The purpose of this linkage is to obtain income data and reduce respondent burden, interviewer time and collection costs for the 2016 Survey of Financial Security.

Description: The 2016 Survey of Financial Security database and the 2015 T1 File will be linked using the address, city, date of birth, first name, surname, sex, province, NYSIIS and SNDX code for surname, postal code, marital status, telephone number and first initial. This information will be removed from the linked file as soon as the linkage is completed, and stored separately. Access to these files will be restricted to Statistics Canada employees whose assigned work activities require access.

Output: No information containing personal identifiers will be released outside of Statistics Canada from this linkage activity. Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The Survey of Financial Security will be conducted from September 1st to December 1st 2016 and therefore, the linkage would be undertaken in 2016/2017.

Air Pollution Study: Linkage of 1991 Census of Population, Canadian Mortality Database and Canadian Cancer Database Follow-Up Study (037-2016)

Air Pollution Study: Linkage of 1991 Census of Population, Canadian Mortality Database and Canadian Cancer Database Follow-Up Study (037-2016)

Purpose: To assess the impact of long-term exposure to air pollution on human health, with the objective to inform the development of Canada-wide standards for key criteria pollutants. Linkage of separate sources of information is an important way in which Statistics Canada can meet identified data gaps on environmental data related to human exposure to air pollution. For example, Canada-wide standards for annual averages of either fine particulate matter or ozone have not been developed, largely due to lack of evidence from the Canadian population and uncertainties about the applicability of risk estimates generated in other countries to Canada.

The specific objectives of this study are: to determine whether deaths from all causes, from ischaemic heart disease, from cardiopulmonary disease, from respiratory cancer, and from all cancers combined are associated with long-term exposure to ambient air pollutants; to determine the air pollution risks for cancer incidence and the risks for specific cancer types; and, to examine the relationship of cancer incidence and causes of death to socio-demographic and neighbourhood characteristics over a 22-year period.

Description: For a previous approved record linkage (reference number 012-2001), a sample of 2.7 million Canadians was selected from respondents to the 1991 Census of Population long-form questionnaires and their Census information was linked to the 1991 Health and Activity Limitations Survey, the 1990 and 1991 Tax Summary Files and the 1991 to 2001 Canadian Mortality Database, for the development of indicators on health.

The current project will extend and expand the linked information on this 1991 Census sample, as follows:

  • linkage to an additional 30 years of the T1 Personal Master files, that is, from 1981 to 2012;
  • linkage to an additional 10 years of the Canadian Mortality Database, up to 2011; and
  • a new linkage to the 1969 to 2011 Canadian Cancer Database, for the period 1969 to 2011.

The linked files will contain only those data items required to conduct the study. Personal identifiers, such as name and social insurance number, will be used only for linkage purposes, then removed from the linked microdata file. Only a sample of individuals who completed the 1991 Census of Population long-form questionnaires are included on the file.

Output: All access to the linked microdata file will be restricted to Statistics Canada staff whose work activities require access. Only aggregate data that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Availability of the linked file will be announced in The Daily. Major findings will be used to create research papers for publication in peer-reviewed journals (including Statistics Canada's Health Reports) and presentation at workshops and conferences.

The linked file, stripped of personal identifiers, will be retained until no longer required, at which time the file will be destroyed.

On-going linkage of justice data: policing, courts, corrections (038-2016)

On-going linkage of justice data: policing, courts, corrections (038-2016)

Purpose: To link data from the Uniform Crime Reporting (UCR2) Survey, the Integrated Criminal Court Survey (ICCS) and the Integrated Correctional Services Survey (ICSS) on an on-going basis for statistical and research purposes. Crime in Canada is a dynamic and complex social phenomenon; understanding the responses to crime and the administration of justice requires on-going research in order to provide information for evidence-based policy and program development.

Description: The Canadian Centre for Justice Statistics intends to increase both the frequency and volume of linked data incorporated within its core business activities including publishing results in Juristat articles. The information garnered from linking justice data files provides a more comprehensive look at the justice system as an integrated system, which allows for increased understanding of the nature and extent of crime and criminality in Canada.

The linkage will use records from three micro data surveys including those collected under the Uniform Crime Reporting (UCR2) Survey, the Integrated Criminal Court Survey (ICCS) and the Integrated Correctional Services Survey (ICSS).

Output: Only non-confidential aggregate statistics and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Access to linking keys and linked analysis file will be restricted to Statistics Canada employees whose assigned work activities require such access.

2016 General Social Survey on Canadians at Work and Home: Linking to tax data from the T1 Personal Master File, T1FF Family File, and T4 Summary and Supplementary File (045-2016)

2016 General Social Survey on Canadians at Work and Home: Linking to tax data from the T1 Personal Master File, T1FF Family File, and T4 Summary and Supplementary File (045-2016)

Purpose: Linking the 2016 GSS responses to the personal tax files of respondents and their household members provides more accurate income data for both persons and households. In addition, linking reduces respondent burden, and lowers collection and data processing costs.

Description: The 2016 GSS is a sample-based survey with a cross-sectional design. Collection is conducted through computer assisted telephone interviews (CATI) and/or self-completed Electronic Questionnaires (EQ) from a sample of Canadians residing in the 10 provinces.

Over the years, GSS questions relating to income have consistently shown high non-response rates. Many respondents who did provide income figures had given rough estimates rather than exact amounts. Donor imputation has been used for partial and item non-response.

The information collected from the 2016 GSS will be linked to the personal tax records (T1, T1FF or T4) of respondents and the tax records of all household members.

Linking to the tax files will ensure better quality data, lower respondent burden and decreased costs.

Respondents will be notified of the planned linkage before and during the survey. Those who object to the linkage will have their concerns recorded and no linkage to their tax data will take place.

Output: The availability of the 2016 GSS Analytical Data File will be announced in The Daily. The file will be made available to Statistics Canada researchers and to deemed employees at the Statistics Canada Research Data Centres. All data will remain confidential and protected under the Statistics Act.

Along with the availability announcement of the Analytical Data File, non-confidential aggregate statistics will also be released either in table and/or article format.

Canadian Forces Cancer and Mortality Study II, 1976 to 2015 (046-2016)

Canadian Forces Cancer and Mortality Study II, 1976 to 2015 (046-2016)

Purpose: The purpose of the Canadian Forces Cancer and Mortality Study II is to address major gaps in the health surveillance of its members. The study's general objectives are to describe the mortality and cancer experience of Canadian Forces personnel (serving and released) and to inform:

  • Health promotion and health protection policies and programs for serving personnel
  • Programs that deliver care for veterans (released) and their families after leaving military service.

The Canadian Forces (CF) are tasked with protecting Canada and its citizens from threats to security. CF members may be involved in combat, peace-keeping and observer missions, post-conflict peace building and humanitarian assistance. The very nature of these operations can pose unusual and uncommon exposures with known and unknown risks to CF personnel. Adverse outcomes, including death, may be immediate or delayed. In order to identify risks, DND and VAC must be able to conduct on-going analysis and interpretation of health information for CF personnel during and after their active military service period.

DND and VAC do not currently have access to complete information on mortality and cancer outcomes of serving and retired CF personnel.

Description: DND will provide Statistics Canada with a list of approximately 329,000 personnel who enrolled on or after January 1, 1976 and have served or are still serving with the Canadian Forces at any point in the period from January 1, 1976 to December 31, 2015.

The records of this CF cohort will be linked to the following files maintained by Statistics Canada: the 1976 to 2012 Vital Statistics Death Database (VSDD) with annual updates to done when 2013, 2014 and 2015 VSDD becomes available, and to the 1969 to 20134 Canadian Cancer Registry.

A random Statistics Canada-generated unique identifier will be attached to each record in the CF cohort, as well as to each record in the output file generated by the mortality linkage, and the output file generated from the cancer linkage. In addition, Statistics Canada will attach the unique identifier to each record in a DND cohort work history file. This will enable linkage of the output files with the DND files by Statistics Canada or DND.

Output: Only aggregate tabular statistics that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. A report on specific mortality trends and a description of the linkage process will be produced.

As deemed employees, DND and VAC will be provided access to a copy of the mortality and cancer analysis files, without names or other direct personal identifiers, within the Federal Research Data Centre. The files, which will be merged with DND's work history file, will be used for statistical and research purposes only.

The linked analysis files and linkage key files will be retained by Statistics Canada for at least 15 years, that is, until December 31, 2031, or until they are no longer required, at which point they will be destroyed. Access to the linkage key files will be restricted to Statistics Canada employees whose work activities require access.

Air Pollution Study: Linkage of 1996 Census of Population, T1 Personal Master Files, Mortality and Canadian Cancer Registry (047-2016)

Air Pollution Study: Linkage of 1996 Census of Population, T1 Personal Master Files, Mortality and Canadian Cancer Registry (047-2016)

Purpose: To assess the impact of long-term exposure to air pollution on human health, with the objective to inform the development of Canada-wide standards for key criteria pollutants. Linkage of separate sources of information is an important way in which Statistics Canada can meet identified data gaps on environmental data related to human exposure to air pollution.

The specific objectives of this study are: to determine whether non-accidental deaths and cancers are associated with long-term low exposure to ambient air pollutants.

Description: A sample of approximately 3.4 million Canadians was selected from respondents to the 1996 Census of Population long-form questionnaires and their Census information will be linked to the T1 Personal Master File (1981 to 2016), the Amalgamated Mortality Database (1996 to 2016) and the Canadian Cancer Registry (1992 to 2016). Air pollution data (e.g. fine particulate matter (PM2.5), nitrogen dioxide (NO2), ozone (O3)) will be spatially integrated to these files.

The linked files will contain only those data items required to conduct the study. Personal identifiers, such as name and social insurance number, will be used only for linkage purposes, then removed from the linked analytical microdata file. Only a sample of individuals who completed the 1996 Census of Population long-form questionnaires were included on the file.

Output: All access to the linked microdata file will be restricted to Statistics Canada personnel (including Statistics Canada deemed employees) whose work activities require access. Only aggregate data that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Major findings will be used to create research papers for publication in peer-reviewed journals and presentation at workshops and conferences.

The linked file, stripped of personal identifiers, will be retained until no longer required, at which time the file will be destroyed.

Modelling the health status of the Quebec population and their use of health care services: A subsequent use of linkage 037-2016 (050-2016)

Modelling the health status of the Quebec population and their use of health care services: A subsequent use of linkage 037-2016 (050-2016)

Subject: The objective of this study is to project the health status of the Quebec population and their use of health care services.

Description: The 1991 Canadian Census cohort mortality and cancer follow-up database is produced using probabilistic record linkage. Approximately 2.7 million individuals aged 25 years and older, who were enumerated in the 1991 Census long-form questionnaire, were followed for mortality, cancer incidence and annual place of residence.

The purpose of this study is to project the health status of the Quebec population and their use of health care services between 2010 and 2050 using an approach similar to the Future Elderly Model (FEM) dynamic microsimulation model (RAND Corporation 2008), as well as data from the National Population Health Survey (NPHS), the Canadian Community Health Survey (CCHS) and the 1991 Canadian Census cohort mortality and cancer follow-up database. This model takes into account various dimensions of population health, making it possible to consider a range of health status and health-care-use scenarios to ultimately assess their impact on health-care spending.

Output: Only aggregate data and analysis that conform to the confidentiality provisions of the Statistics Act will be released outside Statistics Canada's research data centres, in the form of presentations or peer-reviewed journal articles.

The impact of socio-economic status on cancer risk using individual-level assessment: a subsequent use of linkage 037-2016 (051-2016)

The impact of socio-economic status on cancer risk using individual-level assessment: a subsequent use of linkage 037-2016 (051-2016)

Purpose: To evaluate the concordance of socio-economic status (SES) information collected by the long-form census and SES information obtained by residential area. To quantify the impact of low SES on the incidence of cancer, severity of cancer at diagnosis, and mortality from cancer using individual long-form census and residential area data. And, to compare the performance of SES assessment using information from the long-form census with that of residential-area data with respect to cancer incidence, morbidity and mortality rates.

Description: The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence.

Risk stratification is an extremely useful method to help identify individuals in a population who are at a higher risk of disease, whose diagnosis may be delayed, and who do not receive timely or optimal treatment, thus suffer more devastating disease outcomes. The link between low SES and higher incidence and prevalence of cancer, greater disease burden, less timely access to treatment, and higher mortality is prevalent in Canada. However, despite studies that have established an association between low SES and cancer incidence and mortality, challenges remain in assessing and quantifying this association largely because of a paucity of accurate individual-level SES data sources that can be linked to cancer registry data.

The Canadian census mortality follow-up study is a unique source of information that could help close this gap. Linking cancer registry and mortality data with census databases provides an irreplaceable opportunity to identify individuals with low SES and to better understand the association between SES and cancer incidence and mortality rates.

Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.

Environmentally Adjusted Multifactor Productivity (EAMFP) (055-2016)

Environmentally Adjusted Multifactor Productivity (EAMFP) (055-2016)

Purpose: The purpose of this project is to improve Canadians' understanding of the relationship between the economy and the environment, and thereby permit more effective economic and environmental policy-making. In particular, by estimating environmentally adjusted multifactor productivity (EAMFP), this project would extend the existing work at Statistics Canada to include natural capital in its productivity estimates. An EAMFP measure would provide a more accurate and complete assessment of Canada's economic and productivity performance that could eventually be integrated into the System of National Accounts.

Description: This project will link greenhouse gas emissions (GHG) data from Environment and Climate Change Canada's (ECCC) Greenhouse Gas Emissions Reporting Program (GHGRP) and pollutant release data from ECCC's National Pollutant Release Inventory (NPRI) to Statistics Canada's data on economic activity. These Statistics Canada databases include: the Business Register (BR); Annual Survey of Manufactures and Logging; Oil and Gas Extraction and; the Census of Mines, Quarries and Sand Pits from 2000 to the most recent period. The linkage will create a database that relates releases of environmentally harmful by-products to the economic activities that create them. It will facilitate the creation of estimates of EAFMP measures for industries that contribute a large majority of Canada's GHG emissions and pollutants, measures of productivity growth that take into account the environmental consequences of production. It will also facilitate research and analysis that informs environmental and economic policies.

Output: Outputs would include a research paper containing tabular and multivariate regressions results conforming to the confidentiality provisions of the Statistics Act will be published. The linked files and identifiers will be kept indefinitely by Statistics Canada.

Assignment of historical environmental exposures to health datasets for the purpose of environmental health research (056-2016).

Assignment of historical environmental exposures to health datasets for the purpose of environmental health research (056-2016).

Purpose: To attach historical environmental exposures to health datasets for the purpose of environmental health research.

Description: Our health is influenced by the environment we live in. Although datasets often record a respondent's place of residence at time of collection, many do not record respondent's previous residence(s). Statistic Canada data holdings, specifically place of residence reported on tax administrative files, can help fill this data gap. Through individual level tax files, it is possible to reconstruct a person's "residential" history beginning in 1981. Datasets will be linked to the Derived Record Depository in order to obtain historical place of residence beginning in 1981. Historical environmental exposures will be attached to these datasets using postal code-environmental exposure correspondence files and or postal code representative points.

Output: The creation of enhanced datasets that included historical environmental exposures. These datasets will be part of a pan-national research program focused on addressing key issues related to the impact of the urban environment on health.

Linkage of the McMaster 2014 School Mental Health Survey (SMHS) aggregate scores to the 2014 Ontario Child Health Study (OCHS) (059-2016)

Linkage of the McMaster 2014 School Mental Health Survey (SMHS) aggregate scores to the 2014 Ontario Child Health Study (OCHS) (059-2016)

Purpose: The main objective of this linkage is to combine aggregate scores of a school climate study conducted by McMaster in various Ontario schools called School Mental Health Survey (SMHS) with data collected in the 2014OCHS. The linkage will allow for a more complete portrait of children's mental health in Ontario.

Description: Aggregate scores from McMaster's School Mental Health Survey will be matched for each respondent using the school identifier (which identifies a school uniquely in the sample). The final output will be a micro data file containing both OCHS and SMHS data.

Output: The aggregate scores obtained from McMaster's School Mental Health Survey are combined with parent (guardian) reported data on the 2014 Ontario Child Study. The data are processed and prepared for dissemination. The linked data set will become the master file and will be available through the Research Data Centres without personal identifiers and only accessed by deemed employees of Statistics Canada. Any data being removed from the Research Data Centres will be vetted to ensure that individuals or schools cannot be identified.

The linkage will also be used to create a share file for the Ontario Ministry of Health and Long Term Care which will be under the provisions of a data sharing agreement. The Ministry of Health is funding the McMaster researchers for the OCHS. This information is very important to understand the health of children and youth and the purpose is to have up to date information about children's mental health in Ontario.

By linking data from this survey with data on the use of children's health services, response burden was reduced and they will be able to examine whether or not these services are being used effectively and have access to administrative information about visits to the doctor and program participation.

Characteristics of renter's households from Canada Mortgage and Housing Corporation's Rental file, 2010 to 2014 T1 Family File and 2011 National Household Survey Linkage (060-2016)

Characteristics of renter's households from Canada Mortgage and Housing Corporation's Rental file, 2010 to 2014 T1 Family File and 2011 National Household Survey Linkage (060-2016)

Purpose: The purpose of this record linkage is to generate information on the users of different types of housing accommodations in Canada.

Statistics Canada will use the linked data to create a portrait of renter's households by different type of rental initiation for the Canada Mortgage and Housing Corporation (CMHC). CMHC will use it to help understand the relationship between household's socio-economic characteristics, housing characteristics, housing conditions and access to affordable housing in Canada. This will help inform the development of the strategy to re-establish the federal government's role in supporting affordable housing mandated by the current government.

The resulting data will fill a data gap with high quality information while avoiding additional respondent burden.

Description: The data linkage will be made for all respondents to the 2011 National Household Survey and the 2010 to 2014 T1 Family File.

Selected variables from the 2011 National Household Survey database and the 2010 to 2014 T1 Family File will be linked to data from Canada Mortgage and Housing Corporation's Rental Building Survey.

Output: Only non-confidential aggregated data tables, which conform to the confidentiality provisions of the Statistics Act, will be released outside of Statistics Canada. The linked files (without any personal identifiers) will also be placed in the Federal Research Data Centre for the sole use by CMHC's deemed employees of Statistics Canada.

Update of the 2006 Census of Population to the Discharge Abstract Database and the Vital Statistics Death Database (formerly the Canadian Mortality Database) (063-2016)

Update of the 2006 Census of Population to the Discharge Abstract Database and the Vital Statistics Death Database (formerly the Canadian Mortality Database) (063-2016)

Purpose: The purpose of the proposed record linkage project is to link 2006 Census of Population to the Discharge Abstract Database (DAD) and to the Vital Statistics Death Database (VSDD). These linked data will enhance the ability to better understand patterns of health outcomes (i.e. hospital, mortality) for key sub populations identified in the Census including immigrants, ethnic groups, Aboriginal peoples and those in lower socio-economic groups.

Description: The following files will be linked: Census of the Population 2006 (2b long-form), Discharge Abstract Database (DAD) (1999/2000 to 2013/2014), Vital Statistics Death Database – (VSDD) (2006-2011); Amalgamated Mortality Database (AMDB) and T1 Historical Personal Master File (T1H).

This linkage project builds on previous linkage activities and will be conducted using the linkage keys of following approved linkages which have involved linkage via tax records: Linkage of the Census of Population 2006 to the Discharge Abstract Database, the Canadian Mortality Database and the Landing File for Purposes of the Longitudinal Health and Administrative Data (LHAD) Initiative (#088-2014) and Changes in work and earnings following health shocks (082-2015).

Output: Methodological and analytical findings resulting from these linked data will be used to prepare research papers for publication in analytical reports, peer-reviewed scientific journals, CANSIM, for presentation at conferences, workshops and meetings. The linkable data files will also be used to develop tabular data and indicators for release on Statistics Canada's website. Only non-confidential aggregate statistics that will not result in the identification of an individual person, business or organization will be released outside of Statistics Canada.

A Linkage of the Future to Discover Pilot Project and the T1 Family File (065-2016)

A Linkage of the Future to Discover Pilot Project and the T1 Family File (065-2016)

Purpose: The purpose of the Future to Discover (FTD) Pilot Project - T1 Family File (T1FF) linkage study is to analyze the long-term post-secondary employment and earnings impacts of interventions for participants in the FTD Project in order to inform future programs and policies. Studies produced through this linkage have the potential to address important information gaps that will inform policy and program development related to overcoming barriers to post-secondary education, especially for those from lower income families, namely, lack of career clarity, misinformation about post-secondary education including its costs and benefits and lack of financial resources.

Description: The Future to Discover (FTD) Pilot Project data file, 2004-2011, will be linked to the 2007-2014 T1 Family File (T1FF). The FTD Pilot Project data file consists of survey and administrative data for approximately 5,400 New Brunswick and Manitoba participants in the Future to Discover Project. The SDLE Production team will link the FTD Project's participants to the Derived Record Depository (DRD) using deterministic and probabilistic linkage methodologies. The Tl Personal Master Files of 1981 to 2014 have also been linked to the DRD and will provide the SINs of the cohort members in order to link to the T1FF and create outcome files from 2007-2014.

Output: Only non-confidential aggregate statistics and analytical output that will not result in the identification of an individual person, business or organization will be released outside of Statistics Canada. SRDC will access the data through the Federal Research Data Centre and will prepare and submit final reports to the provincial governments of New Brunswick and Manitoba and to the Higher Education Quality Council of Ontario. These final reports will also be published on SRDC's website. Finally, the results will be disseminated and discussed at academic conferences, policy conferences and workshops, as well as at meetings with provincial education departments and educators. The linked analysis file will not contain any personal identifiers.

Labour market and business-related outcomes for the participants of the Pathways to Education Canada program (067-2016)

Labour market and business-related outcomes for the participants of the Pathways to Education Canada program (067-2016)

Purpose: The results from this linkage proposal will help identify and assess the impact(s) that the Pathways to Education Canada (Pathways) program has on the labour-market and business-related outcomes of its participants. The results will help determine if Pathways is achieving the goals it sets out to accomplish and help determine if the federal government's recent investment into the program is producing the benefits expected and whether any continued investment should be considered in the future by the federal government.

Description: This project will link Pathways' participants data and Toronto District School Board (TDSB) education data to the T1 Family File (T1FF, containing information on the incomes of all Canadians), and the Canadian Employer-Employee Dynamics Database (CEEDD). CEEDD is a link between various tax files including the T1 Historical Personal Master file, the T1 Financial Declaration file, the T1 Business Declaration file, the T2 Incorporated Business files, the T4 Statement of Remuneration Paid, the Payroll Deduction Accounts (PD7), and the Longitudinal Immigration Database (IMDB). The linkage will provide the opportunity to compare the labour market and business-related outcomes of Pathways program participants with non-participants and assess the impact(s) that Pathways has on participants. The analysis will examine whether the students who went through the Pathways program have an advantage transitioning into and persisting in the labour market due to their participation. Linkages would occur for approximately 150,000 students that entered Grade 9 in a TDSB high school in the years from 2000 to 2008.

Output: Only non-confidential aggregate statistical outputs and analyses that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The information will be presented in the form of tables of regression results and summary statistics related to the project's goal of evaluation the Pathways program. Results will also be shared with the evaluation group within ESDC to support government officials in conducting a robust as part of the departmental evaluation plan for the Pathways program.

Linkage of Natural Sciences and Engineering Research Council (NSERC) variables 2001 to 2013 to the Linkable File Environment (LFE). (068-2016)

Linkage of Natural Sciences and Engineering Research Council (NSERC) variables 2001 to 2013 to the Linkable File Environment (LFE). (068-2016)

Purpose: The linkage of NSERC variables with other variables from the Linkable File Environment (LFE) will be used to create a customize dataset for a research study entitled NSERC Grants and Firm Performance. This study is a collaborative work between NSERC and Innovation, Science and Economic Development Canada (ISEDC). The aim of this study is to support the evaluation of NSERC financing programs by producing objective statistical measures of its economic impact on the performance of Canadian enterprise.

Description: This is a request to link new variables provided by NSERC for the period 2001 to 2013 to the Linkable File Environment (LFE). For this project, NSERC-firms are the same as the previously approved NSERC project (056-2015). Additional variables provided by NSERC will be linked to each NSERC firm. In order to measure the effectiveness and the impact of NSERC financing services, a comparison group of non-NSERC client firms with similar characteristics will be selected. A series of hypotheses, formulated by the NSERC, will be tested using a model.

Output: The linked NSERC variables will be housed at Statistics Canada's Centre for Special Business Project (CSBP). It is the CSBP that will prepare a database from the LFE, which will contain data for the variables that are listed in the previously approved research proposal.

Linkable File Environment (LFE) (070-2016)

Linkable File Environment (LFE) (070-2016)

Purpose: Policy makers continue to have a keen interest in the role of SMEs in the economy. A Linkable File Environment (LFE) will be created that will serve the needs of policy researchers. Research requests will be accommodated according to demand, pending approval of the project proposals. The LFE will be used to construct and extract annually an SME Database, which will allow Statistics Canada to meet our obligations for deliverables to Industry Canada and the overall research community.

Description: The LFE will include the following databases: 1. Business Register (BR); Longitudinal Employment Analysis Program (LEAP); General Index of Financial Information (GIFI); Exporter Register; Importer Register; Research and Development in Canadian Industry (RDCI); Value of Foreign Direct Investment; Canadian Direct Investment Abroad; Trade in Services; PD7; Goods and Services Tax (GST); and T1 and T4 tax data. 2. Surveys: Survey of Innovation and Business Strategy; Surveys of Innovation; Survey of Electronic Commerce Technology; Survey of Advanced Technology; Survey of Commercialization of Innovation; Survey on Financing Small and Medium Enterprises and Survey of Employment and Payroll Hours. 3. External databases: Patents (Canadian Intellectual Property Office) and Venture Capital (Thompson Financial Venture Capital Database).

Output: The outputs released outside of Statistics Canada will be aggregate or modeled and tabular statistics and analyses that conform to the confidentiality provisions of the Statistics Act.

Longitudinal Census of Agriculture to Agriculture Taxation Data (071-2016)

Longitudinal Census of Agriculture to Agriculture Taxation Data (071-2016)

Purpose: The objective is to assess the productivity of farms producing grains and oilseeds and compare results with other OECD countries. Agricultural policy and program performance measurement requires a good understanding of the productivity of Canadian farms not only internal, but also vis-à-vis its international competitors. The Organisation for Economic Co-operation and Development (OECD) Estimation of Changes in Productivity of Field Crop Farms project will significantly increase this knowledge, as it develops a common methodology of measuring total factor productivity increases in Canada and other countries. This international benchmarking would provide the baseline for performance measurement. Results may produce insight on policies related to investment in research and innovation, regulatory burden, or other policy conditions that may contribute to increasing the competitiveness of the sector.

Description: The Agricultural Operation Entity Number (AGOPID) will be used as the key identifier to the Census of Agriculture to Census farm variables (2001, 2006 and 2011). The questionnaire number (STMPNO) will be used to link the Agricultural Operator variables to the farm variables on the longitudinal linked file. The farm id unique number (FARMID) from the final Agriculture taxation data files will be used to match the taxation data with the farm variables to create the final linked file.

Output: Non-confidential aggregates and econometric coefficients will be disseminated outside Statistics Canada to the OECD analysts under the Farm Level Analysis Network for the preparation of the international report on farm productivity which will compare results for several member countries. A shorter version is planned to be published using Statistics Canada publication Vista on the Agri-Food Industry and the Farm Community (cat. 21-004-XIE).

The linked file will be retained until no longer required, up to September 2019, at which time it will be destroyed.

Record-linkage Amendment to: Canadian Employer-Employee Dynamics Database (CEEDD) 015-2013 (072-2016)

Record-linkage Amendment to: Canadian Employer-Employee Dynamics Database (CEEDD) 015-2013 (072-2016)

Purpose: The activities and economic outcomes of workers are shaped in many ways by the firms or organizations in which they are employed, and conversely, worker characteristics have implications for firm performance. Together, individual-level and firm-level data that are integrated to facilitate a more comprehensive understanding of labour market processes and economic outcomes than is possible using either type of data in isolation.

The CEEDD will be a multi-purpose file capable of supporting research on many issues. These include projects on business start-ups and job creation, with particular emphasis on the role of immigrant entrepreneurs; the distribution of immigrants across business enterprises and how this differs from the distribution of Canadian-born workers; how workforce aging is playing out within business enterprises, including its effect on labour productivity; local labour market information, including hiring rates, separation rates, layoff rates, and aggregate turnover rates within sub-provincial regions and the impacts of organizational changes, such as mergers and acquisitions, on individual-level outcomes.

Description: Information at the level of the business-enterprise will be drawn from the National Accounts Longitudinal Micro data File (NALMF) while individual- and job-level data will be drawn from T1 files and T1 family files, the T4 Statement of Remuneration Paid file, the Record of Employment (ROE) file, the Longitudinal Immigration Data Base (IMDB), and the Temporary Foreign Work file, for the years 1997 onward. All linkages will be done on a deterministic basis using Business Numbers (BNs) and/or Social Insurance Numbers (SINs).

Business Numbers and SINs will be transformed into unique personal identifiers that will remain on the business-level and individual-level files in a scrambled form. The use of scrambled identifiers will allow users to differentiate units in the cross-sectional data, and to follow them longitudinally over time. Postal code information will be used to create aggregated geography variables and then removed from the files.

All BNs, SINS and other identifiers that may be on the files such as names and addresses (including postal codes) will be removed from the linked analytical files and stored in a separate location accessible only to indeterminate Statistics Canada employees who job duties require them to access this information.

Output: The outputs of the proposed database will include two components. One is labour market indicators (e.g. hiring, separations, job creation/destruction) at national, provincial and sub-provincial levels of geography. This information will conform to the confidentiality provisions of the Statistics Act. The other is the longitudinal files that will serve the internal and external researchers. Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The linked file and linking key file will be retained by Statistics Canada until no longer required, up to, March 31, 2026, at which time they will be destroyed.

The linked files will be hosted by the Center of Data Development and Economic Research (CDER) of Economic Analysis Division. External researchers will be able to access the linked data on Statistics Canada premises, under the Policy on the Use of Deemed Employees and current MOUs with the CDER. Synthetic files will be created for external researchers for direct access, while the original files will only be accessed via batch mode with no viewing function. Research studies will be published in Statistics Canada's Research Paper Series as well as in academic journals. Research studies will also be presented at professional conferences.

Employment, Health & Well-Being Database (EHWD): Capitalizing on processing linkages between the CCHS and administrative data (075-2016)

Employment, Health & Well-Being Database (EHWD): Capitalizing on processing linkages between the CCHS and administrative data (075-2016)

Purpose: The objective of this initiative is to create a linked database that will support research on health, subjective well-being, and labour market outcomes. The proposed linkage will combine data on health, health behaviours, retirement experiences, subjective well-being and socio-demographic characteristics from several cycles of the Canadian Community Health Survey (CCHS) with data from various taxation- and employment-based administrative files. The resulting analytical file will support research on inequalities in health outcomes and consequences for labour market outcomes, retirement transitions, well-being and mobility, among others.

Description: This project builds on a linkage between the CCHS and tax data developed as part of the administrative processing of record linkage #030-2012 which combined various health-related databases. This linkage key enables a link between CCHS and various taxation- and employment-based administrative files, specifically the T1 Family File, T1 Personal Master File, T4 Summary File, T4E Statement of EI Benefits Received, EI Status Vector File, Record of Employment, and Longitudinal Employment Analysis Program. These data combine unique cross-sectional information on health, health behaviours, subjective well-being and retrospective self-reported retirement behaviour with longitudinal data on income and employment.

Output: Methodological and analytical findings resulting from these linked data will be used to prepare research papers for publication. The linked file will also be used to develop tabular data and indicators for release on Statistics Canada's website. Only non-confidential aggregate statistics that will not result in the identification of an individual person, business or organization will be released outside of Statistics Canada.

Characteristics and performance of businesses partnering with the Centres of Excellence for Commercialization and Research (CECR) (077-2016)

Characteristics and performance of businesses partnering with the Centres of Excellence for Commercialization and Research (CECR) (077-2016)

Purpose: This linkage proposal will help identify the characteristics of businesses partnering with the Centres of Excellence for Commercialization and Research (CECR), and assess the performance of these firms over time. The results produced by this linkage will support the federal government in its evaluation of the CECR program, and help policy makers develop the program.

Description: This project will link the CECR performance data from fiscal years 2010/2011 to 2014/2015 with three files maintained by Statistics Canada: the Business Register (BR), the National Accounts Longitudinal Microdata File (NALMF), and the Survey of Innovation and Business Strategy (SIBS). The NALMF contains administrative data from Corporate Income Tax (T2), Statements of Remuneration Paid (T4), and Payroll Deduction Accounts (PD7). It covers the period from 2001 to 2015. The SIBS contains information on innovation activities and business strategy. The linkage will use data from the 2009 and 2012 SIBS surveys.

The linkage will provide the opportunity to compare the characteristics of firms participating in CECR centres to firms in the general population, and to more narrowly defined segments of firms operating in Canada (for example, within the same sector or province). The linkage will also allow the assessment of firm performance following a partnering with a CECR centre, using a variety of performance measures (survival, revenue growth, employment growth, R&D expenditures, etc.).

Output: Only non-confidential aggregate statistical outputs and analyses that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The information will be presented in the form of tables of regression results and summary statistics related to the project's goal of evaluating the CECR program. Results will be shared with the evaluation group within the National Resources and Engineering Research Council of Canada (NSERC).

Ontario Child Health Study Linkage with Data extracted from the Ontario Ministry of Education. (078-2016)

Ontario Child Health Study Linkage with Data extracted from the Ontario Ministry of Education. (078-2016)

Purpose: Administrative information from the Ontario Ministry of Education will be linked to the 2014 Ontario Child Health Study (OCHS). The purpose of this linkage is to obtain accurate data on educational achievement, special education requirements, program type and school behaviour. This information will provide researchers valuable information that will be used to study the relationships between overall health, mental health and academic performance.

Description: Administrative education data extracted from the Ontario Ministry of Education will be linked for each respondent using a unique StatCan identifier found on the returned file to the OCHS survey data and added to the final micro data . The final output will be a micro data file containing both OCHS and administrative education data.

Output: The administrative education data obtained from the Ontario Ministry of Education are combined with parent (guardian) reported data on the 2014 Ontario Child Health Study. The data are processed and prepared for dissemination. The linked data set will become the master file and will be available through the Research Data Centres without personal identifiers and only accessed by deemed employees of Statistics Canada. Any data being removed from the Research Data Centres will be vetted to ensure that individuals cannot be identified.

By linking data from this Ministry with data from OCHS, response burden was reduced and will allow an examination of the relationships between overall health, mental health and academic performance.

A Linkage of the B.C. Performance Indicator Reporting System Data (PIRS) and the T1 Family File (T1FF) (079-2016)

A Linkage of the B.C. Performance Indicator Reporting System Data (PIRS) and the T1 Family File (T1FF) (079-2016)

Purpose: The main objective of this study is to examine the labour market outcomes of individual, especially youth, who are not in employment, education, or training (NEET) before and after participating in a provincially-sponsored training program and attempt to assess the effectiveness of this training program in the province of British Columbia. There are many job training programs targeted at youth who are NEET in Canada, but there has not been a systematic evaluation of these training programs or the individual outcomes. This study proposes to fill this gap by examining the labour market outcomes of NEET youth post-training in the province of British Columbia who took part in a job training program, namely the Labour Market Agreement (LMA) program. The analysis of the linked data will help identify the skills and/or education workers may be lacking that hinder their success in the labour market. This will inform how this program can be tailored to meet the training needs of students and improve their labour market outcomes.

Description: Using the protocol of the Social Data Linkage Environment (SDLE), the SDLE Production team will link the B.C. Performance Indicator Reporting System Data (PIRS) participants to the Derived Record Depository (DRD) using probabilistic linkage methodologies. The T1 Personal Master Files (T1 PMF) of 1981 to 2012 have also been linked to the DRD and will provide the SINs of the cohort members in order to link to the T1 Family File (T1FF). The following variables will be used for the linkage: first name(s), surname(s), date of birth, sex, and last known postal code. The resulting associations of the cohort record identifiers and the Tl PMF record identifiers will be stored in the SDLE Key Registry. The Key Registry does not contain personal identifiers. The SDLE Analysis and Data Development team will use the keys stored in the Key Registry to link the SINs from the Tl PMFs to the cohort record identifiers and then, using the SIN, add the cohort record identifier to the TIFF records with the corresponding SIN. The SIN will then be removed from the final tax outcome files. The final outcome files will not contain any personal identifiers.

Output: The results of the analysis of the linked data will be presented in a written research report, and possibly a presentation, delivered to the Ministry of Jobs, Tourism, Skills, and Training of British Columbia. All results will be non-confidential aggregate statistics and analytical output that will prevent any identification of an individual person, business or organization outside of Statistics Canada.

Profiles of repeated contact with the Saskatchewan criminal justice system: Linking Saskatchewan re-contact data with income, health and immigration data (081-2016)

Profiles of repeated contact with the Saskatchewan criminal justice system: Linking Saskatchewan re-contact data with income, health and immigration data (081-2016)

Purpose: To establish baseline information about those who have been identified as either one time, repeat or chronic users of the justice system. Specifically, the study will seek to identify the factors associated with repeated contact with the justice system and will generate information about their involvement with Saskatchewan's other social or human services—namely health, education and those in other social/community sectors. This study will link justice data with other administrative records including income, health and immigration data in order to identify the risk and protective factors associated with repeated contact with the justice system and to generate information about involvement with Saskatchewan's other human and social services, including health and education.

The study will provide a more complete understanding of the factors associated with repeat contact with the justice system. It will also assist policymakers and justice administrators in developing crime control strategies – those aimed at prevention as well as reduction in re-offending by addressing key questions about repeat contact with the criminal justice system and chronic offending in Canada.

Description: The linkage will bring together records from the Phase 2 re-contact Saskatchewan justice cohort, with other social domain data, including the T1 Family File, the National Ambulatory Care Reporting System, the Discharge Abstract Database, the Vital Statistics Death Database and the Immigrant Landing file. The information will be used to generate aggregate information on the factors related to repeated contact with the justice system.

Output: Only non-confidential aggregate statistics and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The file will not contain any direct identifiers such as name, address, telephone number or social insurance number. Access to linking keys and linked analysis files will be restricted to Statistics Canada employees whose assigned work activities require such access.

High-level and non-confidential findings may be reported in the form of presentations to various National Justice Statistics Initiative partners.

Statistics Canada will retain the linked analysis file until no longer required, up to, March 31, 2027, at which time its status will be re-evaluated.

Linkage of the Canadian Community Health Survey (CCHS) annual component to the Immigration Landing File (ILF) and the Longitudinal Immigration Database (IMDB) (084-2016)

Linkage of the Canadian Community Health Survey (CCHS) annual component to the Immigration Landing File (ILF) and the Longitudinal Immigration Database (IMDB) (084-2016)

Purpose: The purpose of this project is to address important data gaps related to information on the health, social and economic outcomes of immigrants to Canada. The linked data will be used by Immigration, Refugee and Citizenship Canada (IRCC) to conduct research on these outcomes for immigrants in different classes and categories and comparing these outcomes to those of the Canada-born population.

Description: Data from the annual Canadian Community Health Survey (CCHS) from cycle 2.1 until 2014 and from the CCHS Mental Health surveys (cycles 1.2 and 5.2) will be linked to the following databases: 1980 to 2014 Immigrant Landing File (ILF) and the 1982 to 2013 Longitudinal Immigration Database (IMDB). Only those CCHS respondents who agreed to link and share their survey information with Health Canada, The Public Health Agency of Canada and the Provincial Ministries of Health will be included in the project. The CCHS annual component provides comprehensive information regarding the health status/outcomes, health care utilization, social activity, social support, and other social-demographic characteristics of the Canadian population. The CCHS mental health surveys include information related to mental disorders, use of mental health services and mental health determinants. The ILF is an administrative database, produced by IRCC and includes demographic, immigration program and personal information from the landing record of individuals. The IMDB is a database combining immigration and taxation records. The IMDB provides information on the economic behaviour of the immigrant tax filer population in Canada. The final analysis file will not contain direct personal identifiers.

Output: The linked files will at all times remain on Statistics Canada premises. Only non-confidential aggregate data that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The linked data will be used by the Research and Evaluation Branch of IRCC as deemed employees of Statistics Canada in the Federal Research Data Centre (FRDC) for policy research and analysis purposes. In addition, the linked data files will be made available for use by deemed employees of Statistics Canada through the Research Data Centres (RDCs) as per the Subsequent Use of Linked Data provision in the Directive on Record Linkage. Research papers based on analyses of the linked data could be presented internally, at conferences and/or submitted for publication in peer-reviewed journals.

The linked analysis files, stripped of direct personal identifiers, will be retained until no longer required by Statistics Canada, up to December 31, 2021, at which time they will be destroyed. The corresponding linkage key files housed at the Statistics Canada Head Office will also be retained until no longer required by Statistics Canada, up to December 31, 2021, at which time they too will be destroyed.

Internal Linkage of New Brunswick Registry File and creation of the New Brunswick Smoking Record Linkage Environment (NBSRLE) (085-2016)

Internal Linkage of New Brunswick Registry File and creation of the New Brunswick Smoking Record Linkage Environment (NBSRLE) (085-2016)

Purpose: The province of New Brunswick has commenced litigation against several tobacco companies in the Court of Queen's Bench of New Brunswick pursuant to the Tobacco Damages and Health Care Costs Recovery Act, SNB 2006, c.T-7.5, which enables the province to seek compensation from the tobacco companies for tobacco related health care costs. To support both parties in this litigation, Statistics Canada has signed an agreement concerning access to confidential personal information of residents of New Brunswick with the province of New Brunswick and Imperial Tobacco Canada, JTI-MacDonald Corp., B.A.T. Industries p.l.c., British American Tobacco (Investments) Limited, R.J. Reynolds Tobacco Company, and R.J. Reynolds Tobacco International Inc.

Statistics Canada will create a record linkage environment for the New Brunswick Tobacco Project called the New Brunswick Smoking Record Linkage Environment (NBSRLE). This environment will be used to link Statistics Canada survey data to New Brunswick administrative health data using established record linkage techniques. The NBSRLE will enable Statistics Canada to provide access to linked and unlinked data within the Research Data Centers (RDC) to deemed employees, which will ensure the confidentiality of the personal information contained in the source data sets.

The purpose of this linkage is to build the administrative data component of the NBSRLE. The New Brunswick Registry File will be loaded to the NBSRLE as the Record Depository and the New Brunswick administrative databases will be linked to that Depository. The record linkage keys will be retained separately in the NBSRLE Key Registry.

Statistics Canada has the responsibility for securely storing and processing data files and for the production of analysis files needed to carry out approved research studies. NBSRLE research projects will involve the use of linked records, and in accordance with Statistics Canada's Directive on Record Linkage, approval by the Chief Statistician will be required for each new linkage related to this project.

Description: The NBSRLE Record Depository (RD) will be created from the New Brunswick Medicare Decision Support System Individual Registry File. The RD will be composed of the surnames, given names, date of birth, sex, postal code, health insurance number, resident id and postal code effective dates for health insurance card holders in New Brunswick. Each individual in the RD will be assigned an anonymous NBSRLE identifier, which has no value outside of the NBSRLE.

Linkage of the RD to New Brunswick health administrative databases will be performed in a dedicated NBSRLE. To ensure a high level of data security and privacy, the association of Statistics Canada-generated identification numbers from the RD and the administrative database Record Identifiers will be stored in a separate NBSRLE Key Registry, thus avoiding the need to store health administrative data with personal identifiers.

For analytical studies, the NBSRLE Identifier will replace health administrative data identifiers and access to linkable files will require prior linkage approval from Statistics Canada's Executive Management Board.

Access to the NBSRLE Key Registry will be restricted to the Statistics Canada employees responsible for its development and maintenance and those responsible for the creation of linked analysis data files. The Key Registry will contain linkage keys to permit linkage for approved studies for this project. The administrative files included as part of this linkage application are as follows:

  • New Brunswick Medicare Decision Support System Individual Registry File
  • New Brunswick Discharge Abstract Database
  • New Brunswick Prescription Drug Program Database
  • New Brunswick Physician Billing Database
  • New Brunswick Extramural Hospital Database
  • New Brunswick Nursing Homes Database
  • New Brunswick Vital Statistics Death Event Database

Output: There are no linked analysis files created as part of this record linkage. The NBSRLE Record Depository and Key Registry will be used exclusively to support the development of research files for the New Brunswick Tobacco Project and no identifiable personal information will be released from the NBSRLE. Statistics Canada will retain the RD and Key Registry files until December 31, 2031.

Research projects will be approved on a study-by-study basis. These may only be carried out as part of a research agenda initiated by the designated representatives from the province of New Brunswick or the Defendants under contract with Statistics Canada. A summary of each approved study will be posted on the Statistics Canada web site.

The Cost of a Case of Cancer in New Brunswick; Subsequent Use of 037-2016 (086-2016)

The Cost of a Case of Cancer in New Brunswick; Subsequent Use of 037-2016 (086-2016)

Purpose: The purpose of this project is to estimate the cost burden of the most common types of occupational cancers (i.e., lung, colon, rectum, prostate, breast, brain, bladder, ureter, kidney, testicular, esophageal, non-Hodgkin's, skin, multiple myeloma, or leukemia) from the point of view of the New Brunswick Workers' Compensation Board.

Description: The researchers will assemble data from multiple sources and conduct much of the research outside the RDC. The Canadian Cancer Registry and the 1991 Canadian Census Cohort: mortality & cancer follow-up will be analysed in the RDC. Aggregate estimates will be obtained from these sources.

The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence. This file has been updated and renamed 1991 CanCHEC and has been linked to the CCR from 1992 to 2010 and mortality from 1991 to 2011. Confidential microdata from the Canadian Cancer Registry is also required.

Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.

Applying Causal Inference Methods in the Evaluation of the Association between Fine Particulate Matter (PM2.5) and Cause-Specific Mortality in the Canadian Census Health and Environment Cohort (CanCHEC); Subsequent Use of Linkage 037-2016 (087-2016)

Applying Causal Inference Methods in the Evaluation of the Association between Fine Particulate Matter (PM2.5) and Cause-Specific Mortality in the Canadian Census Health and Environment Cohort (CanCHEC); Subsequent Use of Linkage 037-2016 (087-2016)

Purpose: The purpose of this project is to estimate the potential benefit of plausible interventions to reduce PM2.5 in Canada on all-cause and cause-specific mortality. The study will address two principle questions:

  1. What is the potential reduction in excess mortality due to any cause by a reduction in PM2.5 to levels which can be feasibly achieved across Canada, in provinces and territories, and in metropolitan areas? Guidance on specific policy intervention scenarios will be provided by Health Canada.
  2. What are the important geographic variations along which estimated population benefits differ?

Description: The researchers are quantifying the effects of neighborhood level air pollution on the Canadian population using the 1991 CanCHEC data. Descriptive statistics will be conducted and then estimations of the years of life saved by interventions to reduce PM2.5 levels. Fine particulate matter (PM2.5) is a general term for all small particles found in air measuring equal to or less than 2.5 μm in aerodynamic diameter. It is a complex mixture whose constituents vary in size, shape, density, surface area, and chemical composition (Health Canada and Environment Canada 1999; US EPA 2009).

The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence. This file has been updated and renamed 1991 CanCHEC and has been linked to the CCR from 1992 to 2010 and mortality from 1991 to 2011.

Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.

Education Longitudinal Linkage Platform (ELLP): Creation of a record linkage platform to allow development of key education indicators and analysis related to postsecondary education and apprenticeship programs (089-2016)

Education Longitudinal Linkage Platform (ELLP): Creation of a record linkage platform to allow development of key education indicators and analysis related to postsecondary education and apprenticeship programs (089-2016)

Purpose: Longitudinal data are needed for the development of key, Pan-Canadian, longitudinal indicators and analysis related to postsecondary education and apprenticeship programs. These outputs will lead to a better understanding of student pathways through postsecondary education and training including completion rates and outcomes. They will be useful for education and labour market policy and planning and fill gaps in current knowledge.

Administrative data files from the Postsecondary Student Information System (PSIS), the Registered Apprenticeship Information System (RAIS) and the T1 Family Files (T1FF) will be used to create a linkage platform for relating longitudinal education information and other data sources listed below.

The linkage platform will permit use of the longitudinal administrative data while protecting the privacy of individuals.

Description: The target population for the linkage platform and education indicator development comprises individuals who were enrolled in postsecondary institutions (PSIS) or registered in apprenticeship programs or as trade qualifiers (RAIS), at some time since 2008. Data for selected jurisdictions will go back as far as 2004.

Anonymized linking keys will be associated to the records of analytical variables from the data source files and all personal identifiers will be removed. A registry of these linking keys will be created. To protect the sensitivity of the information, the registry of keys and the personal identifiers required for updating the linkage platform will be stored in separate files in a separate location accessible only to the few Statistics Canada employees whose job duties require access. The registry of keys will be used to create customized, linked files that merge variables from the different data sources for creating longitudinal education indicators and for analytical purposes. These customized, linked files will not include the data source linking keys or personal identifiers.

Data sources used to construct the linkage platform or that will be linked for analytical purposes include:

  • Postsecondary Student Information System (PSIS) annual pan-Canadian records beginning with 2008-09 and ongoing, and records for selected jurisdictions for 2004-05, 2005-06, 2006-07, 2007-08;
  • Registered Apprenticeship Information System (RAIS) annual pan-Canadian records beginning with 2008 and ongoing, and records for selected jurisdictions for 2002 to 2007;
  • Selected, tax-related, administrative and concordance files needed to establish and validate record matches between the annual PSIS and RAIS data files.
  • The T1 Family Files (T1FF), beginning with 1993 and ongoing and other tax files to supplement this information including variables from the T1, the T1H, the T4, the T4E and possibly the T2202A files, beginning with 1997 or later and ongoing;
  • National Apprenticeship survey, 2015 and ongoing;
  • National Graduate Survey, beginning with 2013 (graduates of 2009/2010) and ongoing;
  • The Alberta Graduate Outcomes Survey and Employer Satisfaction Survey, beginning with 2004 or the first year available after that and ongoing;
  • Additional files from the Alberta data systems that are used to report PSIS and RAIS data to Statistics Canada and files from selected postsecondary institutions, beginning with 2004 and ongoing;
  • Administrative data from Alberta relating to the kindergarten to grade 12 history of those attending and those not attending postsecondary education, and data from the Alberta Post-Secondary Application System (APAS) both beginning with 1999 or the first year available after that and ongoing;
  • The Immigration, Refugees and Citizenship Canada Landing File; the Census; the National Household Survey (NHS); the Canadian Employer-Employee Dynamics Database (CEEDD); and the Longitudinal Apprentices and Trade Qualifiers Database; all beginning with 2004 or the first year available after that and ongoing, except for CEEDD which will begin with 2001;
  • Data on student and apprenticeship financial aid, loans, bursaries and grants from provinces, territories, postsecondary institutions and/or Employment and Social Development Canada (ESDC), beginning with 2003 and ongoing and data on education savings programs beginning with 1998 and ongoing;
  • The Longitudinal Immigration Database (IMDB), and the Immigration, Refugees and Citizenship Canada Temporary Residents (TR) file, beginning with 2004 and ongoing;
  • Youth in Transition Survey (YITS) (all cycles and cohorts); Programme for International Student Assessment (PISA) (2000, 2003, 2006, 2009, 2012);
  • Program for the International Assessment of Adult Competencies (PIAAC) (International Study of Adults) (2012 and ongoing every 10 years);
  • Labour Force Survey, beginning with January 2004 and ongoing;
  • Employment Insurance Program data from 1997 and ongoing;
  • Social Data Linkage Environment (SDLE), from 2004 and ongoing;
  • British Columbia's public postsecondary institutions will provide additional variables to Statistics Canada, through BC Stats, to supplement a current research project. Reference years begin with 2008 and are ongoing.

New years of data will be added as needed when they become available.

The eventual production of the ELLP within the Social Data Linkage Environment will be explored.

Output: Analytical data linked using this linkage platform will be used to prepare indicators, tables, analytical reports and research papers for publication, for presentation at conferences, workshops and meetings and to fill cost-recovery requests for clients. They will also be used to provide insights for improving education data collection and data quality.

Only non-confidential aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act or as permitted by the Statistics Act will be released outside of Statistics Canada (including cost-recovery clients). A discretionary disclosure approval has been granted to allow the PSIS program to release aggregated enrolment and graduation information at the postsecondary institution level for institutions that have signed a waiver covering the specific PSIS release period.

Examination of the long-term community adjustment of offenders: Linking Federal correctional data with income, health and immigration data. (090-2016)

Examination of the long-term community adjustment of offenders: Linking Federal correctional data with income, health and immigration data. (090-2016)

Purpose: The purpose of the proposed study is to establish baseline information about the re-integration of former offenders. The study will examine 10 to 15 year life trajectories of offenders in the Correctional Service of Canada (CSC) based on experiences in prison (e.g., time served, release status) and post-release indicators of economic activity (e.g., reported income, taxes, social assistance), health (e.g., mortality), and social stability (geographic mobility, marital status). Although the outcomes for many federal offenders are expected to be worse than for the general population, it is expected that a significant proportion of ex-CSC inmates will successfully reintegrate as law abiding citizens. This study will estimate that proportion, and identify predictors of successful reintegration over time.

By generating information on the factors associated with successful community re-integration, the study will assist policymakers and justice administrators in developing crime control strategies – those aimed at prevention as well as reduction in re-offending in Canada.

Description: The linkage will bring together records from the CSC's corrections cohort, with other social domain data, including the T1 Family File, the National Ambulatory Care Reporting System, the Discharge Abstract Database, the Vital Statistics Death Database, the Canadian Cancer Registry and the Immigrant Landing file. The data will be used to generate aggregate information on offender's re-integration into society and the risk and protective factors associated with ex-offenders becoming law-abiding and contributing members of the community.

Output: Only non-confidential aggregate statistics and analyses conforming to the

confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The file will not contain any direct identifiers such as name, address, telephone number or social insurance number. Access to linking keys and linked analysis file will be restricted to Statistics Canada employees and deemed employees whose assigned work activities require such access.

Statistics Canada will retain the linked analysis file until no longer required, up to, March 31, 2027, at which time its status will be re-evaluated.

Immigration, education, ethnocultural diversity and future composition of the population and labour force of high-immigration countries. Linkage 092-2016, a subsequent use of 037-2016 (092-2016)

Immigration, education, ethnocultural diversity and future composition of the population and labour force of high-immigration countries. Linkage 092-2016, a subsequent use of 037-2016 (092-2016)

Purpose: The main objective is to analyze the demographic and socioeconomic behaviour of ethnocultural groups in six countries with low fertility and high immigration, using various data sources.

Description: At the Research Data Centres (RDCs), researchers will have access to the files of the 2011 National Household Survey, certain cycles of the General Social Survey (5, 10, 15, 20 and 25), the National Graduates Survey and the 1991 Canadian Census Health and Environment Cohort (CanCHEC).

The 1991 Canadian Census Cohort mortality and cancer follow-up database is produced using probabilistic record linkage. Approximately 2.7 million individuals aged 25 and older, who were enumerated in the 1991 Census long form, were followed for mortality, incidence of cancer and annual place of residence. This file was updated and renamed the 1991 CanCHEC and was linked to Canadian Cancer Registry (CCR) data from 1992 to 2010 and to mortality data from 1991 to 2011.

The ultimate objective of this project is to develop a microsimulation model that produces population projections beyond age and sex to guide decision-makers in their decisions with respect to immigration policy, social cohesion, labour market needs and training, including language training.

Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside Statistics Canada RDCs, in the form of peer-reviewed journal articles and presentations.

2016 Canadian Community Health Survey Annual Component (CCHS) Linkage to Tax Data (093-2016)

2016 Canadian Community Health Survey Annual Component (CCHS) Linkage to Tax Data (093-2016)

Purpose: The purpose of this linkage is to reduce respondent burden while improving data quality. This reduces the overall survey time which will reduce collection costs.

Description: HSD is planning to link the 2016 CCHS survey data to existing tax files to collect income information.

The first step is to determine if tax data are available for the CCHS 2016 sampled households. When this information is available, respondents will be given a linkage statement which includes a specific reference to linking to tax data. They will have the opportunity to refuse the linkage. For those respondents that refuse to link, a set of income questions will then be asked. For households where there is no tax data available, the income questions will be asked followed by the linkage statement.

After collection, the second step will be to link the 2016 CCHS data to the most recent available tax files (generally a two year lag from the collection year) to collect the income information for those respondents who did not refuse to link.

Given the CCHS sample is drawn from two frames (Canadian Child Tax Benefit file for respondents aged 12-17 and the Labour Force Survey (LFS) area frame for those 18+), there will be slightly different approaches to the two step linking strategy. For those aged 18 years or older, the sample records will for the most part contain an ARUID. Prior to collection of the CCHS Annual 2016, the ARUIDS for the selected sample will be linked to the 2014 or 2015 IDENT_ARUID file using ARUID and then linked to the most recent tax data available at the time of collection to identify cases that do not have 2015 tax data. So for the 2016 CCHS master data file, this will be either 2014 or 2015 T1 Personal Master File(T1). Cases that do not have 2014 tax data will be asked income questions as a back-up measure to provide income data. All respondents will also be given the tax linkage statements. For all those who did not object to the tax linkage (regardless of whether they were also asked income questions) we will attempt data linkage in the following manner:

  1. Link the ARUID to the 2015 IDENT_ARUID then use this to link to the 2015 T1, T1FF or T4 to obtain tax data.
  2. If a link is not found for 2015 then link to the 2014 IDENT_ARUID and use that link to find the 2014 T1, T1FF, T4 tax data.

The T1 enhanced file database (for 2014 and subsequent years) which contains ARUID may be used to facilitate the ARUID linkage instead of linking through the IDENT_ARUID file. Personal information such as name, date of birth and gender, or contact information such as telephone number or postal code may be used to verify the links (through ARUID), or improve linkage rates. This will utilize the Linkage Control File (LCF), as well as the SINSIN (Greenberg) file. When linking to the T1FF we will use the XVARS file to get linking information.

For the 12-17 year old selected respondents, records can be linked through the SIN number of the parents to identify those without 2014 T1, T1FF or T4 data. Those without the 2014 T1 data will be asked the income questions as a back-up measure. All respondents will also be asked the tax data linkage statement. For those who did not object to the tax linkage (regardless of whether they are asked the income questions) we will attempt linkage as follows:

  1. If the child still lives with the recipient (parent/guardian) then link the SIN of the parent to the 2015 T1,T1FF or T4 to obtain the most recent tax data.
  2. If a link is not found for the 2015 T1 or T1FF and the child still lives with the recipient (parent/guardian) then use the SIN to link to the 2014 T1,T1FF or T4 to obtain tax data.

If the child no longer lives with the recipient (parent/guardian) then linking through contact information such as name, address or phone number may be attempted.

Personal information such as name, date of birth and gender, or contact information such as telephone number or postal code may be used to verify the links (through ARUID), or improve linkage rates. This will utilize the Linkage Control File (LCF), as well as the SINSIN (Greenberg) file. When linking to the T1FF we will use the XVARS file to get linking information.

Output: The release of data from the 2017 Canadian Community Health Survey will be announced in The Daily. The master and share files are expected to be made available June 2018 to provincial and territorial ministries of health, the "Institut de la statistique du Québec" (Quebec respondents only), and approved researchers in partner departments (Health Canada and the Public Health Agency of Canada) through share agreements, as well as to Statistics Canada researchers, and to deemed employees at the Statistics Canada Research Data Centres. Only information for which an informed consent was obtained from the respondent will be included in the share files. Rapid response files (master and share) will be released throughout the year if content is collected. All data will remain confidential and protected under the Statistics Act. There will be no personal identifiers on either data file.

The 2016 output files are similar to 2017, but the linked tax data will not be on the share file. The first release of linked data will be the rapid response master file (2016 data) in March of 2017.Income tax data will only be on the share file starting with the release of the 2017 reference year data.

Along with the availability announcement of the analytical data file (in The Daily), only non-confidential aggregate statistics will be released.

Characteristics of persons linked and not-linked to personal income tax and benefits records, using the 2011 Census and National Household Survey (NHS) linkage to income information from personal income tax and benefits records (097-2016)

Characteristics of persons linked and not-linked to personal income tax and benefits records, using the 2011 Census and National Household Survey (NHS) linkage to income information from personal income tax and benefits records (097-2016)

Purpose: The purpose of this record linkage is to create a portrait of people that are linked and not-linked to tax data.

Canada Revenue Agency (CRA) will use the non-confidential aggregate tables to better understand the characteristics of persons that may not be receiving benefits because the persons are not found in CRA's administrative databases. This information will be used for program improvements such as targeted outreach to increase the number of recipients of benefits.

Canadians will benefit from this analysis that aims to inform the development of program improvements geared towards increasing awareness and the number of recipients of benefits programs.

Description: All respondents to the 2011 Census of Population have been linked with the 2011 National Household Survey (NHS) and several income tax files and benefits records.

Respondents' information on types and amounts of benefits received, and the type of administrative databases in which they are present, have been extracted from various income tax files and benefits records and added to their responses from the 2011 Census and 2011 NHS records.

Output: Only non-confidential aggregated data tables, which conform to the confidentiality provisions of the Statistics Act, will be released outside of Statistics Canada.

Socioeconomic variations in violence-related deaths in Canada: 1991-2011, a subsequent use of 037-2016 (098-2016)

Socioeconomic variations in violence-related deaths in Canada: 1991-2011, a subsequent use of 037-2016 (098-2016)

Purpose: This research will detail temporal trends, and method of death; and assess socio-economic status as a risk factor, testing the social disadvantage theory in relation to violent deaths.

Description: The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence.

This is an epidemiological study on violence-related mortality in Canada from a public health perspective. This research aims to investigate the patterns of violence-related mortality over the 1991-2011 period, considering sex, age, geography and various individual socioeconomic indicators.

A descriptive analysis will be conducted to measure the relationship between violence-related death and some socioeconomic indicators. Fitted models will be used to assess time-trend and main effects of each socioeconomic indicator on violence-related mortality risk. The study will also estimate whether the violent mortality slopes differ across socioeconomic groups of the population.

Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.

Canadian Income Survey: Linkage to Income Data Files (099-2016)

Canadian Income Survey: Linkage to Income Data Files (099-2016)

Purpose: The purpose of this linkage is to obtain income data and reduce respondent burden, interviewer time and collection costs for the Canadian Income Survey. The linkage allows obtaining information on income variables without burdening respondents with detailed questions about their income.

Description: The Canadian Income Survey database and the T1, T1IDENT, T5007 and CCTB files will be linked using the address, city, date of birth, first name, surname, sex, province, social insurance number, codes for surname, postal code, marital status, telephone number and first initial. This information will be removed from the linked file as soon as the linkage is completed, and stored separately. Access to these files will be restricted to Statistics Canada employees whose assigned work activities require access.

Output: Information containing personal identifiers will not be released outside of Statistics Canada from this linkage activity. Only non-confidential aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.

Linkage of the Discharge Abstract Database (DAD), National Ambulatory Care Reporting System (NACRS), and the Ontario Mental Health Reporting System (OMHRS) to the Canadian Vital Statistics Death Database (CVSD) (100-2016)

Linkage of the Discharge Abstract Database (DAD), National Ambulatory Care Reporting System (NACRS), and the Ontario Mental Health Reporting System (OMHRS) to the Canadian Vital Statistics Death Database (CVSD) (100-2016)

Purpose: The purpose of the linkage is to provide Canadians with a more comprehensive understanding of health care system interventions and their outcomes that are currently not possible using existing individual databases. The creation of this linked database will allow for the creation and public dissemination of:

  • Performance indicators on short/long-term mortality in and out of hospital following hospitalization and procedures, including patients with mental health conditions
  • Indicator studies on transitions of care and patient journey across the health care continuum
  • Survival and outcome analysis for new and existing measures (e.g. one-five year total mortality rate, survival analysis for specific patient populations)

Description: On behalf of the Canadian Institute for Health Information (CIHI), Statistics Canada will be linking the DAD and NACRS data from 2000/01 to 2014/15, and the OMHRS data from 2005/06 to 2014/15 to the CVSD from 2000 to 2012. The resulting analytical files will contain all the records from the DAD, NACRS and OMHRS databases as well as selected variables from the CVSD. Personal identifiers will be removed from the analytical files and replaced by a random Statistics Canada-generated unique identifier.

Output: All data and analytical products to be released outside of Statistics Canada will conform to the confidentiality provisions of the Statistics Act.

The linked analysis files and linkage key files will be retained by Statistics Canada for 5 years after they are made available to CIHI or until no longer required, at which point they will be destroyed. Access to the linkage key files will be restricted to Statistics Canada employees whose work activities require access.

2017 Aboriginal Peoples Survey Linkage to the 2016 Census of Population (102-2016)

2017 Aboriginal Peoples Survey Linkage to the 2016 Census of Population (102-2016)

Purpose: The Aboriginal Peoples Survey (APS) is a national post-censal survey of Aboriginal peoples (First Nations peoples living off reserve, Métis and Inuit) in Canada. The survey provides valuable data on the social and economic conditions of Aboriginal people 15 years of age and over. Data from the APS inform policy and programming activities aimed at improving the well-being of Aboriginal peoples in Canada.

Linking the 2017 APS and the 2016 Census of Population will allow methodologists to derive weights for the APS. As well, an APS-Census linkage will enrich the analytical potential of the 2017 APS microdata file by allowing data users to analyse APS data with reference to person-level, family-level, and household-level information collected in the Census. Census data complement the findings of the APS, providing information on topics that were either beyond the scope of the APS or which were explored in the survey in only a very limited way in order to reduce response burden.

Description: Responses to the 2017 Aboriginal Peoples Survey and 2016 Census of Population will be matched for each respondent using the variables frame_id (which identifies a household uniquely in Canada) and persnr (which identifies a person uniquely within the household). This linkage will result in the APS-Census linked file. This composite file will be used to create an analytical file which will become the base from which other products will be developed.

Output: Only aggregate statistical estimates that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Linked information from the 2017 Aboriginal Peoples Survey and 2016 Census of Population will be used in analytical articles and other data products released from the 2017 Aboriginal Peoples Survey. All products containing linked data will be disseminated in accordance with Statistics Canada's policies, guidelines, and standards.

The APS analytical file, including linked Census records, will be created and retained indefinitely by Social and Aboriginal Statistics Division (SASD). The analytical file will not contain any personal identifiers.

2016 Census in Fort McMurray: 2015 Regional Municipality of Wood Buffalo (RMWB) Census Linkage with the RMWB Contact for Addresses of Damaged Property provided by the Government of Alberta Office of Statistics and Information (OSI) (104-2016)

2016 Census in Fort McMurray: 2015 Regional Municipality of Wood Buffalo (RMWB) Census Linkage with the RMWB Contact for Addresses of Damaged Property provided by the Government of Alberta Office of Statistics and Information (OSI) (104-2016)

Purpose: Following the outstanding collaboration of the Government of Alberta and local authorities of the Regional Municipality of Wood Buffalo, Statistics Canada has accepted a mandate from the Government of Alberta OSI, to gather information on people that were living in dwellings identified in the list of Damaged Properties.

Description: Damaged Properties data will be linked by address to the 2015 RMWB Census and a final micro data file will be created with both the Damaged Properties and the 2015 RMWB Census data. The final output will be the dissemination of statistical tables for various levels of geographies and variables.

Output: The 2015 RMWB Census data obtained with the addresses are combined with Damaged Properties data. The data are processed and prepared for dissemination. The linked data set will become the master file and will be available through the Statistical Registers and Geography Division (SRGD) with address identifiers with access restricted to Statistics Canada employees and deemed employees with a working requirement.

By linking data from Damaged Properties file and the 2015 RMWB Census, the OSI will be in a position to better asses the population that were living in the damaged properties.

Linkage of simple establishment-enterprises from the Survey of Regulatory Compliance 2011 to the Linkable File Environment 2001 to 2014. (105-2016)

Linkage of simple establishment-enterprises from the Survey of Regulatory Compliance 2011 to the Linkable File Environment 2001 to 2014. (105-2016)

Purpose: The project will support research on the impact of regulatory compliance costs on business performance which is an issue of policy interest to Innovation, Science and Economic Development Canada (ISED)

Description: The Innovation, Science and Economic Development Canada (ISED) project consists in linking simple establishment enterprises (excluding complex ones) of the Survey of Regulatory Compliance Cost 2011 to the Linkable File Environment (LFE). The LFE contains 21 Statistics Canada administrative and survey datasets that can be used to create research datasets with variables from any of the datasets that are available. Once done, a research dataset will be produced and the database will be sent to Statistics Canada's Canadian Centre for Data Development and Economic Research where access for the client will be arranged. The project will examine the impact of regulatory compliance costs on business performance. The final linked database will include all of the variables from the 2011 survey and a list selected variables from the General Index of Financial Information (GIFI) from 2011 to 2014 and the Business Register from 2011 to 2014 for simple business units only. The survey weights will be dropped from the final research database.

Output: Only non-confidential aggregate statistical outputs and analysis that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. These will be in the form of regression analysis results relating to the impact of regulatory burden at firm level by exploring firm-level variations in performance and regulatory burden using the Survey of Regulatory Compliance Costs developed by ISED in partnership with Statistics Canada. ISED will prepare a working paper using the modeled output that has been vetted for confidentiality.

Feasibility Study to Address Statistical Data Gaps in the Measurement of Social Enterprises (106-2016)

Feasibility Study to Address Statistical Data Gaps in the Measurement of Social Enterprises (106-2016)

Purpose: The purpose of the feasibility study is to develop statistics indicators to distinguish social enterprises from other types of organizations (non-social enterprises).

Description: Employment and Social Development Canada (ESDC) has requested the Centre for Special Business Projects (CSPB) to carry out work to help understand the key characteristics of social enterprises with a view to developing relevant metrics which can address existing data gaps in the measurement of social enterprises activity. ESDC will deliver lists of "known social enterprises" which will linked to the Linkable File Environment. Common potential statistical indicators of these known social enterprises will be identified and tested using Business Register and tax data variables.

Output: Only non-confidential aggregate statistical outputs that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. These will be in the form of descriptive summary tables presenting the results of testing. A methodology report will be prepared, explaining the file matching processes and the constraints and key issues related to the quality of the data. Recommendations on possible future work to address data gaps for social enterprises will also be prepared.

Linking Health Administrative Data to the Immigrant Landing File to understand the health outcomes of immigrants (107-2016)

Linking Health Administrative Data to the Immigrant Landing File to understand the health outcomes of immigrants (107-2016)

Purpose: The purpose of the proposed record linkage project is to link the Immigrant Landing File (ILF) to health administrative data including hospital, mortality and cancer records to support a program of research on immigrant health. These linked data will greatly enhance the ability to better understand patterns of health outcomes for immigrants overall, as well as by immigration categories (e.g. family class, refugees and economic class) and source country.

Description: The following files will be linked using the Social Data Linkage Environment (SDLE): Immigrant Landing File (1980-2013), Discharge Abstract Database (DAD) (1994/95 to 2014/15), National Ambulatory Care Reporting System (NACRS) (2002 to 2014) Ontario Mental Health Reporting System (OMHRS) (2005 to 2014/15), Canadian Vital Statistics: Death Database (CVSD) (1980 to most recent); Canadian Cancer Registry (CCR), T1 Personal Master File (T1PMF) (1981 to most recent), and Canadian Child Tax Benefits - ldent Files (CCTB-Ident) (2010 onwards). Tax files are used to obtain postal code information to confirm residency. No income information will be used.

Outputs: The linked file will at all times remain on Statistics Canada premises and will be made available in the Research Data Centres upon request. Access to the linked microdata will be restricted to Statistics Canada staff and deemed employees whose work activities require access. Access by deemed employees will be subject to Statistics Canada legal and policy requirements, in particular, the Directive on the Use of Deemed Employees and the Subsequent Use provision. HAD will be responsible for providing access to the linked data and fulfilling requirements for the subsequent use if necessary.

Methodological and analytical findings resulting from these linked data will be used to prepare research papers for publication in analytical reports, peer-reviewed scientific journals, CANSIM, for presentation at conferences, workshops and meetings. The linkable data files will also be used to develop tabular data and indicators for release on Statistics Canada's website. Only non-confidential aggregate statistics that will not result in the identification of an individual person, business or organization will be released outside of Statistics Canada.

Assignment of historical environmental exposures to health datasets for the purpose of environmental health research (108-2016)

Assignment of historical environmental exposures to health datasets for the purpose of environmental health research (108-2016)

Purpose: The purpose of this initiative is to attach historical environmental exposure data to external and internal health datasets in order to facilitate environment health research. Specifically, this initiative will address an important data gap in environmental health research, being the lack of data on where respondents have previously resided, as identified by CIHR. This record linkage project undertaken by Statistics Canada will attach environment exposure data to external and internal datasets by linking them to the DRD in order to attach historical places of residences (e.g. postal codes). These postal codes will then be used to attach environmental data (e.g. particulate matter, walkability, noise, ozone etc.) to external and internal datasets.

Description: Our health is influenced by the environment we live in. Although datasets often record a respondent's place of residence at time of collection, many do not record respondent's previous residence(s). Statistics Canada data holdings, specifically place of residence reported on tax administrative files, can help address this data gap. Through individual level tax files, it is possible to reconstruct a person's "residential" history beginning in 1981. Datasets will be linked to the Derived Record Depository in order to obtain historical place of residence beginning in 1981. Historical environmental exposures will be attached to these datasets using postal code-environmental exposure correspondence files and/or postal code representative points.

Output: The creation of enhanced datasets that include historical environmental exposures. These datasets will be part of a pan-national research program focused on addressing key issues related to the impact of the urban environment on health.

Use of three pre-existing linkages for analysis and publication purposes: linkages between (1) the Indian Register and the 2011 Census/National Household Survey, (2) the Indian Register, tax data and vital statistics data, and (3) tax data and the 2011 Census (111-2016)

Use of three pre-existing linkages for analysis and publication purposes: linkages between (1) the Indian Register and the 2011 Census/National Household Survey, (2) the Indian Register, tax data and vital statistics data, and (3) tax data and the 2011 Census (111-2016)

Purpose: The purpose of this request is to disseminate the aggregate results from previously approved linkages between the 2011 Census/National Household Survey (NHS) and the Indian Register, between the Indian Register, tax data and vital statistics data, and finally between tax data and the 2011 Census. These results would first be released as part of the new Indian Register population projections provided to Indigenous and Northern Affairs Canada (INAC). The request is also to obtain authorization to disseminate the results of an analysis of the registered Indian population residing in collective dwellings. Lastly, the request is to seek authorization to disseminate the results of a fertility analysis among Aboriginal peoples, funded by INAC, which would include data from the linkages.

Publishing the projection results will provide more knowledge about the population in the Indian Register and about the quality of the data available. Information from the linkages will also offer greater transparency for Statistics Canada regarding the methodology of its projections.

In the case of the collective dwelling analysis, knowledge of the representation of this population in certain types of institutions will inform decision-makers and planners and guide future research, in particular.

As for the fertility analysis, the data would provide better knowledge of a key element of the demography of the population in the Indian Register, but one that is understudied because of a lack of available data.

Description: The first file consists of a linkage between the Indian Register and the 2011 Census/National Household Survey. The second consists of linkages between tax data, vital statistics data and the Indian Register. The third file comprises linkages between tax data and the 2011 Census. These linked databases were created during previously approved projects.

Output: First, results from these linkages would be disseminated in two reports presenting new Indian Register projections produced by microsimulation using the Demosim model. The first projection report would describe the methodology and the second would present an analysis of the results. These reports would be provided to a federal client (Indigenous and Northern Affairs Canada) and could be published thereafter. The institutional population data would be published in two reports: a methodology report and an analytical report. The fertility data would be published in an analytical report which, after being provided to INAC, would be submitted for publication in a scientific journal. The data from these three projects would not be available in the research data centres.

In all cases, the data would be disseminated only in aggregate form in accordance with the confidentiality provisions of the Statistics Act.

Date modified: