2017 Canadian Survey on Disability linkage to the 2016 Census of Population (001-2017)
Purpose: The Canadian Survey on Disability (CSD) is a national survey of persons whose every day activities may be limited because of a condition or health-related problem. The 2017 CSD is a post-censal survey which uses 2016 Census of Population as a sampling frame to identify its target population.
The collection of the data on persons with disabilities is required by the Government of Canada to fulfill various policy and program commitments including the United Nations Convention on the Rights of Persons with Disabilities. Information from the survey is essential for the effective development and operation of federal, provincial, territorial and local government programs such as employment equity.
Linking the 2017 CSD and the 2016 Census of Population will allow methodologists to derive weights for the CSD. As well, a CSD-Census linkage will enrich the analytical potential of the 2017 CSD microdata file by allowing users to analyse CSD data with reference to person-level, family-level and household-level information collected in the Census. A sample of persons without a disability is also drawn from the Census file (these people were not covered by the CSD) and reweighted to be representative of the whole population without a disability. These records are added to the CSD analytical file to allow computation of disability rates and comparisons of persons with and without a disability using Census information (labour force status, income, education level, housing information, etc.)
Census data complement the findings of the CSD and a linkage between the two will help meet the requirements of external researchers who are seeking to expand the potential of the existing CSD data file.
Output: Only aggregate statistical estimates that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Linked information from the 2017 CSD and 2016 Census of Population will be used in analytical articles and other data products released from the 2017 CSD. All products containing linked data will be disseminated in accordance with Statistics Canada's policies, guidelines, and standards.
The CSD analytical file, including linked Census records, will be created and retained indefinitely by Social and Aboriginal Statistics Division (SASD). The analytical file will not contain any personal identifiers.
Standards Council of Canada: Size & Characteristics of the Canadian Conformity Assessment Industry (005-2017)
Purpose: The proposed research will provide the first overview of Canada's conformity assessment industry, identifying the size and characteristics of the Canadian conformity assessment industry.
Description: A list of conformity assessment bodies provided by Standards Council of Canada (SCC) from the 2004 to 2014 period will be linked to data from National Accounts Longitudinal Microdata File (data from the Business Register, Corporate tax data-T2 tax database, PD7 and T4). The SCC firm's records will be linked probabilistically and manually through the Business Register the using name and address. This is a one-time linkage.
Output: Only non-confidential aggregate statistical outputs and analyses that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The information will be presented in the form of aggregate tabulations providing information on Size & Characteristics of the Canadian Conformity Assessment Industry.
Mortality-Air Pollution associations in Low Exposure environments (MAPLE) - Identifying the shape of the association between long-term exposure to low levels of ambient air pollution and the risk of mortality: an extension of the Canadian Census Health and Environment Cohort using innovative data linkage and exposure methodology. Linkage 06-2017, a subsequent use of 037-2016
Purpose: The purpose of this project is to examine risks for mortality associated with long-term exposure to fine particulate matter (PM2.5).
Description: The researchers examine the shape of association between long-term exposure to ambient concentrations of PM2.5 and cause-specific mortality in three large, population-based Canadian cohorts that are linked to mortality and tax records until 2011. Using several exposure-time windows, they characterize the shape of the concentration-mortality association using newly developed variable coefficient hazard regression models. They examine the sensitivity of the shape of the association to age, sex, socio-economic position (income, education, occupation), behavioral (smoking, obesity, diet, alcohol), and contextual (% recent immigrants, % <high school, % low income) risk factors. Both relative and additive risk models are examined.
The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence. This file has been updated and renamed 1991 CanCHEC and has been linked to the CCR from 1992 to 2010 and mortality from 1991 to 2011. The 2001 CanCHEC is also being used for this research as is CCHS Cycle 1.1 (2000-2001), Cycle 2.1 (2003), Cycle 3.1(2005), 2007 (12 month file), 2008 (12 month file), 2007-2008 (24 month combined file).
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles
Linkage of the Longitudinal Administrative Databank (LAD) with T776 Statement of Real Estate Rentals (009-2017)
Purpose: The objective of this research project is to provide an understanding of residential rental income trends in Canada.
The linked data will be used by the Canada Mortgage and Housing Corporation (CMHC) to create a portrait of rental revenues. They will use it to help understand the relationship between the physical location of the rental properties and the tax filers' address, as well as the relationship between total income, interest expenses and rental income overtime.
This information will be used by CMHC to better understand the relationship between rental income trends and differences in rising house prices across major urban centers, and to consider policy tools to keep home ownership within reach for more Canadians.
The resulting data will fill a data gap with high quality information while avoiding additional respondent burden.
Description: Statistics Canada's Longitudinal Administrative Databank (LAD) is a longitudinal linked filed comprising of a 20% sample of the annual T1 Family File (T1FF) and the Longitudinal Immigration Data Base (IMDB).
Data from the T776 Statement of Real Estate Rentals administrative data files from Canada Revenue Agency (CRA) will be linked to the LAD for all reference years 2000 to 2014. Rental income and expense variables from the T776 will be added to the LAD. Address information will be used to create non-confidential geography variables and then removed from the files.
Output: Only aggregate data that conform to the confidentiality provisions of the Statistics Act will be released to outside analysts. All manipulation of micro data will be done internally by Statistics Canada and tabulated results will be screened prior to release outside the Agency.
Access to the linked file will be available through Federal Research Data Centre where the output will be examined to ensure that it meets the confidentiality requirements prior to releasing the output to the researchers.
Title: Foreign ownership of residential property – Sharing with other organizations (012-2017)
Purpose: In Budget 2016 the federal government announced $ 500,000 to Statistics Canada "to develop methods for gathering data on purchases of Canadian housing by foreign homebuyers". Statistics Canada received approval to conduct record linkages to assess the feasibility of carrying out this project using data already available at Statistics Canada. The objective of this application is to obtain permission to share results (aggregate and non-confidential) with Statistics Canada partners for this project, including Finance Canada, Canada Mortgage and Housing Corporation, and the Bank of Canada for their feedback and support in relation to the strategy proposed by Statistics Canada.
Description: Record linkages were done for all persons identified on the test files received by Statistics Canada. The methodology put in place made it possible to match these data with different files currently available at Statistics Canada, including census data and tax data.
Output: Record linkages provided results to assess the feasibility of developing a Housing Statistics Framework based on various data currently available at Statistics Canada. Only aggregate and non-confidential data will be shared with external partners.
Linkage of crop insurance files to G-SUF and the Census of Agriculture database (018-2017)
Purpose: This linkage project will partly use land parcel data derived from provincial crop insurance files to create the Census Geographic Component (CGC) database. In the CGC database, land areas and commodities assigned to the headquarters location of a farm that operates in more than one location are divided into all locations where that farm operates. The CGC database complements the Census of Agriculture database by providing a more accurate representation of where farming activities are taking place in Canada.
The CGC database provides valuable data for evaluating the relationship between agriculture and the environment. The database also provides fundamental inputs into various government departments programs thereby enabling them to fulfil their mandates and develop relevant policies about agriculture and environment. The database also helps Canada to fulfil its international reporting obligations on climate change.
Description: Crop insurance files from Quebec, Manitoba, Saskatchewan and Alberta will be linked to G-SUF (a monthly snapshot of Business Register) and the Census of Agriculture database to create the CGC database.
The generalised record linkage program uses the business name, telephone, postal code and business number (when available) to link the crop Insurance files to a unique key identifier from the Business Register. The linked crop insurance file will be assigned geography codes in order to identify farms that operate in multiple geographies. The proportions of land derived from the crop insurance will be used to allocate the land and commodities reported by that farm on the Census of Agriculture at the parcel level.
Output: The micro-data will remain within Statistics Canada and will not be shared with clients. Non-confidential aggregate outputs will be disseminated to external and internal clients as custom data requests, as needed, and under the provisions of Statistics Canada End User Licence.
All personal identifiers and linking keys will be removed from the linked file. The linking keys and personal identifiers will be kept in a separate location until no longer required, up to December 31, 2022, at which time they will be destroyed. All files are password protected and are accessible to only Statistics Canada employees with a work-related need for access. All files are kept on a server in a secured area.
Census of Agriculture Linkage to Ontario Goat Dairy Registry (019-2017)
Purpose: The objective is to assess the characteristics of farms engaged in commercial goat dairy activities in the Province of Ontario. Agricultural business, policy and program decisions require a good understanding of specialized agricultural sectors such as this one. Results will produce insight on the current state of this evolving sector and provide a basis upon which to structure business decisions and policies that may contribute to increasing the competitiveness of the sector.
Description: The Ontario Dairy Goat Registry – an administrative data set, will be used to identify a subpopulation of the 2016 Census of Agriculture based on a link made between contact and address information from the Registry and respondent, operator business and address information from the Business Register / 2016 Census of Agriculture. The linkage will be used to define the appropriate subset of the 2016 Census of Agriculture for analysis. Direct identifiers will be removed from the linked file.
Output: Non-confidential aggregates and analysis will be disseminated outside Statistics Canada. The report is planned to be published using Statistics Canada publication Canadian Agriculture at a Glance, and an announcement will be made in The Daily.
The linked analytical file will be retained until no longer required, up to September 30, 2019, at which time it will be destroyed.
Life After Service Income Study: Linkage of a cohort of former Canadian Armed Forces members to tax information (020-2017)
Purpose: To assess the economic outcomes of former Canadian Armed Forces (CAF) members after their release to civilian life. There are currently an estimated 600,400 CAF Veterans (regular and reserve force) and about 95,000 CAF members (68,000 regular force and 27,000 reserve force). Only about 12.1% of this population is currently receiving benefits and services from Veterans Affairs Canada (VAC). Transition outcomes are of interest to both Department of National Defence (DND) and VAC. This linkage will assist DND and VAC in identifying gaps in programs and services, evaluating existing programs and developing new re-integration and rehabilitation programs that meet the needs of military personnel as they transition to civilian life.
In addition, the New Veterans' Charter, which represents the most significant overhaul of rehabilitation and reintegration programs and services for veterans since the Second World War, was implemented by VAC in 2006 and both departments require more information on the impact of the new charter.
Description: Statistics Canada will provide indicators on the income and employment of members released from the CAF from 1998 to 2015. Economic adjustment will be measured through the production of statistical tables from personal income tax data (T1 Family File (T1FF)) linked to a cohort of approximately 110,000 former CAF regular and reserve members whose personal identifiers and other key pre- and post- release status variables will be supplied by DND/VAC.
Output: Statistics Canada will prepare a set of analytical tables on the pre- and post-release incomes and employment of veterans. Only aggregate data that conform to the confidentiality provisions of the Statistics Act will be released to VAC from the DND/VAC cohort linkage to T1FF. Information will be provided in tabular form. The client will not have direct access to the linked file. All access to this linked file will be restricted to employees of Statistics Canada whose work activities require access.
Atlantic Canada Opportunities Agency (ACOA): Economic Impact – 2004-2014. (022-2017)
Purpose: To support the evaluation of the Atlantic Canada Opportunities Agency (ACOA) financing services program, by producing objective measures of its economic impact on the performance of small and medium-sized enterprises (SMEs). Key performance indicators, and value-added measures such as sales, profits, firm survival rate, export and employment, will be calculated for ACOA client businesses and for comparable non-client businesses.
Description: A list of firms that were ACOA clients in the period 2004 to 2014 will be linked to the Business Register to obtain the Business Number and Statistical Enterprise Number, to facilitate linkage to payroll and tax data. In order to measure the effectiveness and the impact of ACOA financing services, a comparison group of non-ACOA client firms with similar characteristics will be selected.
Records of ACOA clients and the businesses in the comparison group will be linked to the Payroll Deduction Account (PD7), T2 Corporate Tax data, the General Index of Financial Information (GIFI), the Chart of Accounts database and Research and Development in Canadian Industry (RDCI) for the period 2004 to 2012. The records will be linked using the Business Number and Statistical Enterprise Number. The resulting linked analysis file will enable longitudinal analysis of each cohort. The characteristics of the matched and un-matched businesses will also be compared. This is a one-time linkage.
Output: Only non-confidential aggregate statistical outputs and analysis that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. These will be in the form of separate summary tables of regression analysis results relating to the economic impact of ACOA's financing services, in addition to profiling tables. A methodology report will be prepared, explaining the file matching processes and constraints and key issues related to the quality of the data. An analytic report will be produced by Statistics Canada.
Linkage of the 2016 Life After Service Survey with the 2016 Life After Service Survey sample file. (023-2017)
Purpose: The main objective of this data linkage is to add the Strata and Environment variables from the 2016 LASS sample file to the 2016 LASS survey data to allow analysis of survey indicators such as physical and mental health by military rank and environment (Air, Land, and Sea).
Description: The Strata and Environment variables from the 2016 LASS sample file will be added to the 2016 LASS survey data by linking the two files using the sample id. The addition of the Strata and Environment variables will help identify groups of veterans by rank and type of military personnel who are most vulnerable and have the most difficulty transitioning to civilian life.
Output: The Strata and Environment variables from the 2016 LASS sample file will be added to the 2016 LASS survey data by linking the two files using the sample id. The addition of the Strata and Environment variables will help identify groups of veterans by rank and type of military personnel who are most vulnerable and have the most difficulty transitioning to civilian life.
Linkage of the British Columbia Medical Services Client Registry File and creation of the British Columbia Smoking Record Linkage Environment (BCSRLE) (024-2017)
Purpose: The province of British Columbia has commenced litigation against several tobacco companies in the Supreme Court of British Columbia pursuant to the Tobacco Damages and Health Care Costs Recovery Act, SBC 2000, c. 30. To provide some litigants with access to certain data for use in the litigation, Statistics Canada has signed an agreement concerning access to confidential personal information of residents of British Columbia with the province of British Columbia and Imperial Tobacco Canada, JTI-MacDonald Corp., B.A.T. Industries p.l.c., British American Tobacco (Investments) Limited, R.J. Reynolds Tobacco Company, and R.J. Reynolds Tobacco International Inc.
Statistics Canada will create a record linkage environment for the British Columbia Tobacco Project called the British Columbia Smoking Record Linkage Environment (BCSRLE). This environment will be used to link Statistics Canada survey data to British Columbia administrative health data using established record linkage techniques. The BCSRLE will enable Statistics Canada to provide access to linked and unlinked data within the Research Data Centers (RDC) to deemed employees of Statistics Canada, which will ensure the confidentiality of the personal information contained in the source data sets.
The purpose of this linkage is to build the BCSRLE. The British Columbia Medical Services Client Registry File will be loaded to the BCSRLE as the Record Depository (RD) and the Canadian Community Health Survey (CCHS) share-link files and six British Columbia administrative databases will be linked to that Depository. The record linkage keys will be retained separately in the BCSRLE Key Registry.
Statistics Canada has the responsibility for securely storing and processing data files and for the production of analysis files needed to carry out approved research studies. BCSRLE research projects will involve the use of linked records, and in accordance with Statistics Canada's Directive on Record Linkage, approval by the Chief Statistician will be required for each new linkage related to this project.
Description: The BCSRLE Record Depository will be created from the British Columbia Medical Services Client Registry File. The RD will be composed of the surnames, given names, date of birth, sex, address, personal health number, and resident id for health insurance card holders in British Columbia. Each individual in the RD will be assigned an anonymous BCSRLE identifier, which has no value outside of the BCSRLE.
Linkage of the RD to British Columbia health administrative databases and the CCHS share-link files will be performed in the BCSRLE. The CCHS respondents who consented to link their survey results and who consented to share their survey response information with the provincial health ministry while residing in the province of New Brunswick will be included in the linkage to the RD. An anonymous BCSRLE identifier will be assigned to each respondent, and the resulting linkage keys will be stored in the BCSRLE Key Registry.
To ensure a high level of data security and privacy, the association of Statistics Canada-generated identification numbers from the RD, CCHS Record Identifiers and the administrative database Record Identifiers will be stored in a separate BCSRLE Key Registry, thus avoiding the need to store health administrative data with personal identifiers.
For analytical studies, the BCSRLE Identifier will replace health administrative data identifiers and access to linkable files will require prior linkage approval from Statistics Canada's Executive Management Board.
Access to the BCSRLE Key Registry will be restricted to the Statistics Canada employees responsible for its development and maintenance and those responsible for the creation of linked analysis files. The Key Registry will contain linkage keys to permit linkage for approved studies for this project. The administrative files included as part of this linkage application are as follows:
- British Columbia Medical Services Client Registry File
- British Columbia Discharge Abstract Database
- British Columbia Home and Community Care Minimum Reporting Requirements, including the British Columbia Continuing Care data
- British Columbia Medical Services Plan Payment Information
- British Columbia Pharmacare (Claims)
- British Columbia Pharmacare (Dispensing)
- British Columbia Registration and Premium Billing
Output: There are no linked analysis files created as part of this record linkage. The BCSRLE Record Depository and Key Registry will be used exclusively to support the development of research files for the British Columbia Tobacco Project and no identifiable personal information will be released from the BCSRLE. Statistics Canada will retain the RD and Key Registry files until no longer required, up to, December 31, 2032, at which time they will be destroyed.
Output files that contain the anonymous BCSRLE identifier and analysis variables for the CCHS share-link files and British Columbia administrative datasets linked to the BCSRLE will be placed in the RDC identified by the litigants.
Research projects to access and link the BCSRLE the output files will be approved on a study-by-study basis. These may only be carried out as part of a research agenda initiated by the designated representatives from the province of British Columbia or the Signing Defendants under contract with Statistics Canada. A summary of each approved study will be posted on the Statistics Canada web site.
Alberta Social Assistance Demonstration Project (026-2017)
Purpose: As part of ongoing consultation between Statistics Canada and the provinces and territories, a linked data file comprised of Alberta Social Assistance data and federal taxation- and employment-based administrative data will be created. The file will be used to examine the employment histories and tax filing behaviours of social assistance recipients in Alberta, thereby demonstrating how the analytical value of provincial data can be leverage through record linkage.
Description: The linked data file is comprised of individuals who were social assistance clients in Alberta in 2012, 2013 and/or 2014. Information on their socio-demographic characteristics and types and values of income supports received will be drawn from provincial data; information on their incidence of tax filing and use of tax credits will be drawn from the T1 Family File, T1 Personal Master File, T1 Historical File and Immigrant Landing File; and information on their employment histories in the years prior to receiving social assistance will be drawn from the T4 file, the Record of Employment file, and the Longitudinal Immigration Data file.
SINs will be transformed into unique identifiers that will remain on the linkable files to facilitate longitudinal analyses. All Business Numbers (BNs), Social Insurance Numbers (SINs) and personal identifiers will be removed from the analytical files and stored in a separate location accessible only to Statistics Canada employees whose job duties require them to access this information.
Output: Methodological and analytical findings resulting from these linked data will be used to prepare a PowerPoint deck for presentation and circulation at workshops and meetings.
Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The linked data file will not be made available in the RDCs as it is only intended for demonstration purposes rather than broader analytical use.
Title: Linkage of Agriculture and Agri-Food Canada businesses from 2008 to 2015 to the Linkable File Environment of Statistics Canada (029-2017)
Purpose: To support the evaluation of AAFC financing services programs by producing objective measures of its economic impact on the performance of enterprises. Key performance indicators and value-added measures, such as sales and employment, will be calculated for AAFC client businesses and for comparable non-client businesses.
Description: A list of firms of businesses provided by AAFC for the period 2008 to 2015 will be linked to the Linkable File Environment (LFE) to obtain the Statistical Enterprise Number, to facilitate linkage to payroll and tax data. In order to measure the effectiveness and the impact of AAFC financing services, a comparison group of non-AAFC client firms in the same sector of activity will be selected.
Records of AAFC clients and the businesses in the comparison group will be linked to the Payroll Deduction Account (PD7), T2 Corporate Tax data, the General Index of Financial Information (GIFI) for the period 2008 to 2015. The records will be linked using the Business Number and Statistical Enterprise Number. The characteristics of the matched and un-matched businesses will also be compared. This is a one-time linkage.
Output: This will be in the form of profiling tables comparing the economic performance of businesses that received AAFC financing support as compared to the non-supported businesses. The linked AAFC list of businesses will be housed at Statistics Canada's Centre for Special Business Project (CSBP).
Only non-confidential aggregate statistical outputs and analysis that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.
Disparities in the distribution of deaths by age according to level of education in Canada. Subsequent use of 037-2016 (030-2017)
Purpose: The purpose of this project is to look specifically at the role of level of education in the new trends in old-age mortality and longevity differences by sex. The study will focus on the following research questions: (1) Can certain socioeconomic characteristics, particularly level of education, explain the differences in modal age at death (most frequent age) and its dispersion at old ages for mortality in general? (2) Will modal age at death be significantly different depending on education level? and (3) What will the scope be for dispersion of life expectancies beyond modal age at death by level of education?
Description: For each level of education and sex, smoothed density functions are estimated using P-splines. The Canadian Cancer Registry (CCR) and the 1991 Canadian Census cohort mortality and cancer follow-up database will be analyzed at the research data centres (RDC). The aggregate estimates will come from these sources.
The 1991 Canadian Census cohort mortality and cancer follow-up database is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated with the long-form questionnaire during the 1991 Census, were followed for mortality, cancer and place of residence. This file was updated and renamed CanCHEC, and was then linked to the Canadian Cancer Registry (CCR) from 1992 to 2010 and to the mortality data from 1991 to 2011. Confidential microdata from the CCR are also required.
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside Statistics Canada's RDCs, in the form of peer-reviewed journal articles and presentations.
Partnership for Health System Improvement for Cancer Control: Feasibility 17-PRGSSH-MTL-4766-S002; Subsequent Use of 037-2016 (033-2017)
Purpose: This project aims to conduct a feasibility assessment for the development of future projects related to applied health services and policy research to improve cancer control and cancer-related interventions in Quebec.
Description: The 1991 Canadian Census Cohort: mortality & cancer follow-up is a probabilistic linked database. Approximately 2.7 million individuals aged 25 or older, who were enumerated by the 1991 long-form census, were followed for mortality, cancer, and annual place of residence. This file has been updated and renamed 1991 CanCHEC and has been linked to the CCR from 1992 to 2010 and mortality from 1991 to 2011. Confidential microdata from the Canadian Cancer Registry is also required.
The analyses also utilizes data from the 1992-2013 Canadian Cancer Registry (CCR), 2001 census, 2006 census, and the 2011 National Household Survey (NHS).
Prevalence estimates of the overall incidence of various cancers (e.g. lung, breast, colorectal, cervical, etc.) across different socio-demographic characteristics (e.g. age at diagnosis, sex, income, education, etc.) and across geographic regions pertinent to policy evaluation (e.g. the assessment of organized colorectal cancer screening pilot programs in several regions in Quebec, etc.) will be obtained, and multivariate semi-parametric Cox proportional hazard modeling will be used to estimate the overall 5-year survival rates for the various cancers (adjusting for factors such as stage, age at diagnosis, sex, income, education, etc.).
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.
Understanding the Effects of a Time-Limited Earnings Supplement: A Long-Term Analysis of the Self-Sufficiency Project (034-2017)
Purpose: The objective of this initiative is to provide new evidence on the long-term impacts of experimental interventions designed to help social assistance recipients achieve a permanent break from welfare dependency. The initiative will link Statistics Canada survey data from the Self-Sufficiency Project (SSP) with administrative data files. The linked files will be used to study long-term outcomes like income and employment of SSP participants and those of individuals in a control group.
Description: This linkage will combine survey data from the SSP with information on demographic, income and earnings characteristics drawn from the T1 Family File, T1 Personal Master File, and T1 Historical Personal Master File; information on employment and EI benefits received drawn from the T4 Summary File, T4E Statement of EI Benefits Received, the EI Status Vector File, the Record of Employment file, and the Longitudinal Employment Analysis Program file; and information on deaths among the individuals in the study sample from the Canadian Mortality Database.
Social Insurance Numbers (SINs) will be transformed into unique identifiers that will remain on the linkable files to facilitate longitudinal analyses. All Business Numbers (BNs), SINs and personal identifiers will be removed from the analytical files and stored in a separate location accessible only to Statistics Canada employees whose job duties require them to access this information.
Output: Methodological and analytical findings resulting from these linked data will be used to prepare research papers for publication in analytical reports, peer-reviewed scientific journals, CANSIM, for presentation at conferences, workshops and meetings. Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.
Estimating Rates of Hip and Knee Replacement: A Comparison between Aboriginal and non-Aboriginal Populations - Subsequent Use of 043-2012 (036-2017)
Purpose: This project has two principal questions:
1. To compare knee and hip replacement rates between the Aboriginal and non-Aboriginal communities.
2. To compare knee and hip replacement rates across provinces/regions for Aboriginal and non-Aboriginal populations
Description: The proposed analysis plan will include basic descriptive statistics with comparisons between Aboriginals and non-Aboriginals. Multivariate analysis will be carried out to compare joint replacement rates while controlling for covariates which are known to influence the likelihood of having joint replacement surgery. The probability of undergoing joint replacement surgery will be modeled using mixed logistic regression with both fixed effect (patient characteristics) and random effect (provincial indicators) variables included to account for the non-independence of observations within each health care system. This method will also allow for the inclusion of aggregate provincial characteristics which may affect the probability of joint replacement and vary by province.
The 2006 Census (long-form) was linked to the Discharge Abstract Database (DAD) for 2006/07-2008/09 to support analysis of hospital use by sub-populations and socio-economic status of Canadians. Approximately 4.652 million long-form respondents were eligible for linkage to the DAD. Approximately 5.3% of Census respondents were linked to at least one DAD record between 2006/07 and 2008/09. The dataset for the proposed work will be derived from the 2006 Census long-form data and will consist of all individuals who were 18 years or older in 2006.
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Actwill be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.
Income inequalities and potentially avoidable hospitalizations for chronic diseases in a context of universal health coverage; Subsequent Use of 043-2012 (037-2017)
Purpose: The objective of this analysis is to strengthen the evidence base on the social determinants of potentially avoidable hospitalizations for high-burden chronic diseases to help inform health and social policy and planning decisions.
Description: The study will focus on hospital stays for selected chronic ambulatory care sensitive conditions (ACSC) – that is, conditions for which the need for hospital admission can be largely prevented or reduced through appropriate ambulatory care – including diabetes, chronic obstructive pulmonary disease, asthma, heart failure, hypertension and angina. The analysis will take a two-pronged approach, looking at individual and community measures of income inequality and risk of potentially avoidable hospitalization. The concentration of neighbourhood-level income inequalities will be calculated, and comparisons made between individual and neighbourhood measures. The outcome of interest is hospitalization for chronic ACSC, as an indicator of the quality of ambulatory care services, effectiveness of health system investments and better population health outcomes.
The 2006 Census (long-form) was linked to the Discharge Abstract Database (DAD) for 2006/07-2008/09 to support analysis of hospital use by sub-populations and socio-economic status of Canadians. More than 4.6 million long-form respondents were eligible for linkage to the DAD. Approximately 5.3% of Census respondents were linked to at least one DAD record between 2006/07 and 2008/09.
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.
Comparing healthcare utilization and associated expenditures by clinical service type for indigenous and non-indigenous peoples living in Canada from 2006 to 2009 - Subsequent Use of 043-2012 (038-2017)
Purpose: The purpose of this project is to examine the patterns of hospital service utilization of indigenous peoples living in Canada and compare it to utilization by non-indigenous people.
Description: This project will explore these patterns over time (years 2006 to 2009), and across service types. Analyses will involve descriptive analysis examining variations in utilization patterns across clinical service type and by geographic location, conditional on indigenous versus non-indigenous identity, and logistic regressions to estimate the likelihood of high-frequency hospital utilization for indigenous and non-indigenous peoples. Multi-level and fixed effects modeling will be used to further explore the relationships among independent individual and community-level variables and the outcome variables, and across time.
The 2006 Census (long-form) was linked to the Discharge Abstract Database (DAD) for 2006/07-2008/09 to support analysis of hospital use by sub-populations and socio-economic status of Canadians. Approximately 4.652 million long-form respondents were eligible for linkage to the DAD. Approximately 5.3% of Census respondents were linked to at least one DAD record between 2006/07 and 2008/09.
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.
Social Class Indicators and Hospitalization in Canada; Subsequent Use of 043-2012 (039-2017)
Purpose: This study will attempt to examine the links between indicators of social class and health care utilization, specifically hospitalization.
Description: The researchers will first examine descriptive statistics looking at the rates of hospitalization overall and by cause-specific hospitalizations (e.g., seizures, chronic obstructive pulmonary disease, asthma, heart failure, hypertension, angina, and diabetes) by education, income and occupation (standardizing for gender and age), and then examine the correlation between different indicators of social class and hospitalization overall. Logistic regression will be used to examine the joint effects of the social class indicators on hospitalization overall and by main cause, controlling for gender and age.
The 2006 Census (long-form) was linked to the Discharge Abstract Database (DAD) for 2006/07-2008/09 to support analysis of hospital use by sub-populations and socio-economic status of Canadians. Approximately 4.652 million long-form respondents were eligible for linkage to the DAD. Approximately 5.3% of Census respondents were linked to at least one DAD record between 2006/07 and 2008/09.
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.
Investigating the contribution of socio-economic factors to hospitalization for chronic disease using linked survey and hospital administrative data: A focus on immigrant and Indigenous populations; Subsequent Use of 043-2012 (040-2017)
Purpose: One purpose of this project is to examine hospitalization for five conditions -- CVD, COPD , diabetes, cancer, and mental disorders -- and the contribution of socio-economic characteristics (e.g., educational attainment, income) to risk of hospitalization, stratified by age and sex. A second purpose is to identify two populations of particular interest: (1) Indigenous peoples, who are subject to higher rates of diabetes and other chronic conditions and (2) recent immigrants. The rates of hospitalization for both of these populations have already been assessed using the Census- Discharge Abstract Database (DAD) linked file, however the role of socio-economic disadvantage as a potential factor explaining these differential rates of hospitalization within these populations has not been examined.
Description: This project will employ various descriptive and regression-based statistical techniques, and will compute crude and age-standardized hospitalization rates per 100,000 population related to the five chronic conditions for the two focal populations and, when a sufficient number of events is observed, for subgroups within these populations. To test the statistical models derived from the proposed research questions, a standard multivariate logistic regression technique as well as structural equation modelling (SEM) techniques for more complex relationships will be used.
The 2006 Census (long-form) was linked to the DAD for 2006/07-2008/09 to support analysis of hospital use by sub-populations and socio-economic status of Canadians. Approximately 4.652 million long-form respondents were eligible for linkage to the DAD. Approximately 5.3% of Census respondents were linked to at least one DAD record between 2006/07 and 2008/09.
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations
Hospitalisations and Readmissions for Ambulatory Care Sensitive Conditions in Immigrants and non-Immigrants; Subsequent Use of 043-2012 (041-2017)
Purpose: The purpose of this project is to examine hospitalisation and readmissions for ambulatory care sensitive conditions (ACSC) by immigration status in Canada. The study will address two principal research questions: (1) How does the immigration status affect the risk of a hospitalisation for an ACSC, and (2) For people who had a hospitalisation for an ACSC, how does the risk of a readmission vary by immigration status?
Description: Descriptive statistics will be computed and then the risk of an ACSC hospitalization will be examined using a logistic regression model (adjusting for characteristics of the individual).
The 2006 Census (long-form) was linked to the Discharge Abstract Database (DAD) for 2006/07-2008/09 to support analysis of hospital use by sub-populations and socio-economic status of Canadians. Approximately 4.652 million long-form respondents were eligible for linkage to the DAD. Approximately 5.3% of Census respondents were linked to at least one DAD record between 2006/07 and 2008/09
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.
Characteristics and Performance of Firms that Propose or File for Bankruptcy in Canada (042-2017)
Purpose: To study the effect of firm financing on the economic performance of firms.
Description: Bank of Canada would like to link data of all firms that proposed or filed for bankruptcy over the 2000 to 2014 period to Statistics Canada data sources. The data on insolvency will be obtained from the Office of the Superintendent of Bankruptcy Canada (OSB). It will include the firms that initiate an insolvency process, the reason for these, and the outcomes.
Output: Security Measures for the linking keys, identifiers and linked analysis file will conform to statistics Canada's policies and standards. The linked file will be retained in accordance with the Directive on the Management of Statistical Microdata Files, until no longer required, at which time they will be destroyed. . All direct business identifiers will be removed from the analysis file once linkage is complete
Characteristics and Performance of Firms that Propose or File for Bankruptcy in Canada (043-2017)
Purpose: To study the determination of exchange rates and their effects on trade and inflation.
Description: The Bank of Canada would like to link data on imports, retail, wholesale trade and firm characteristics to determine how the enterprise-level performance (sales, employment, productivity, profitability) responded to variations in the Canada-US real exchange rate during the period 2002-2014. The project will compare the performance of importing enterprises versus non-importers, as well as the performance of enterprises in the retail and wholesale sectors.
Output: Security Measures for the linking keys, identifiers and linked analysis file will conform to statistics Canada's policies and standards. The linked file will be retained in accordance with the Directive on the Management of Statistical Microdata Files, until no longer required, at which time it will be destroyed.. All direct business identifiers will be removed from the analysis file once linkage is complete.
Performance of Ontario Ministry of International Trade (ON-MIT) Assisted Enterprises (045-2017)
Purpose: This linkage will allow the ON-MIT to assess the performance of firms that receive assistance from ON-MIT relative to firms that do not receive any assistance in the same region, industry, and firm size category.
Description: The Ontario Ministry of International Trade (ON-MIT) provides assistance to exporters through the Ontario's International Trade Action Plan. In order to evaluate the impact of the assistance, the ON-MIT would like to link data of firms assisted by ON-MIT to Statistics Canada data, to compare the performance of the firms assisted by ON-MIT to the performance of similar Ontario companies that do not receive assistance. Statistics Canada files to be linked are the National Accounts Longitudinal Microdata file (NALMF), the Trade by Exporter Characteristics (TEC) data and the Business Registry (BR), from 2010 to 2015. The data on the 2013 assisted firms will be obtained from the Ontario Ministry of International Trade.
Output: Only non-confidential aggregate statistical outputs and analyses that conform to the confidentiality provisions of the Statistics Actwill be released outside of Statistics Canada. The information will be presented in the form of aggregate tabulations providing information on ON-MIT assisted firm.
Health Care Cost Mobility in Canada, Demographics, Mobility and other Predictors; Subsequent Use of 043-2012 (049-2017)
Purpose: Previous research has shown that a careful examination of health care cost across socioeconomic and demographic groups and over time is needed. This project will examine healthcare cost mobility at the national level (except for Quebec), as well as examining how healthcare spending changes across the distribution of spending.
This study will build on existing literature by adding estimates of costs and applying new econometric techniques that examine costs across the conditional cost distribution rather than just at the conditional mean. The detailed Canadian data and its large sample will also permit the identification of which procedures result in persistent costs.
Description: The researchers will begin by examining differences in healthcare spending by demographic groups (e.g., age, gender, ethnicity, income, family structure, education level, immigration status, and disability) to determine if there are some provinces spending more on certain health care procedures, as well as examining the effect of distance to a hospital on health care use and costs. The researchers will also condition on and examine variation in items and events such as weather, stock markets, timing of income receipt, secular and non-secular holidays and sporting events. Regression methods will be used to quantify the changes in healthcare spending across the distribution of spending.
The 2006 Census (long-form) was linked to the Discharge Abstract Database (DAD) for 2006/07-2008/09 to support analysis of hospital use by sub-populations and socio-economic status of Canadians. Approximately 4.652 million long-form respondents were eligible for linkage to the DAD. Approximately 5.3% of Census respondents were linked to at least one DAD record between 2006/07 and 2008/09. Local daily weather information from the Government of Canada Historical Climate data as well as daily local air quality information will be merged to the file as will daily stock market indicators.
Output: Only aggregate data and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada Research Data Centres, in the form of peer reviewed journal articles and presentations.
Historical Data Linkage of the 2011 General Social Survey (GSS) and Tax Records. (052-2017)
Purpose: Data from the linkage of previous tax records to the 2011 GSS will allow analysis projects to examine the relationship between family transitions and socioeconomic well-being.
Product: Details on products planned to be released.
There are currently two research projects planned on the topic of socioeconomic well-being of women: Women's Family Trajectories and Economic Well-being a Gender Analysis of the Work-Family Interface in Canada. The results are to be released in a publication. Other projects could be examined from the linked database once it becomes available to Statistics Canada Research Data Centre network.
Creation of the Research Analysis Files Using the New Brunswick Smoking Record Linkage Environment (053-2017)
Purpose: Statistics Canada has created a record linkage environment for the New Brunswick Tobacco Litigation Project called the New Brunswick Smoking Record Linkage Environment (NBSRLE). This environment will be used to link Statistics Canada survey data to New Brunswick administrative health data using a common unique, anonymized identifier, created as part of Record Linkage 085-2016 and Record Linkage 016-2017.
The purpose of this linkage is to build analytical research files from the linkable survey and administrative files placed in the RDC for the New Brunswick Tobacco Litigation Project.
Description: Personal identifiers were removed and a unique anonymous identifier was assigned to the administrative records for each person during the development of the linkable files (RL 085-2016) for this project. Common linking keys based on the identifiers have been created for the share-link CCHS and NPHS and de-identified linkable versions of the administrative files and the survey files were placed in the RDC environment (RL 016-2017).
This project is to create analysis files from the de-identified files to create the necessary statistical models and tables to be presented during the litigation processes. There are three areas of study proposed by this record linkage project. First, researchers will address the question of the comparability of smoking attributable fraction (SAF) estimates derived from the Public Use Microdata File of the CCHS/NPHS to estimates derived from the share-link file of the CCHS/NPHS. Second, this project seeks to extend the estimation model to additional aspects of health care utilization (i.e. beyond hospital and physician care) and to examine the comparability of SAFs derived from self-report versus 'administrative' measures of health care utilization. Third, this file may be used to substantiate or refute evidence presented during the litigation process.
Output: Linked analysis files, statistical models, tabulations, and other analytical products will be created as part of this record linkage. Output files will contain the anonymous NBSRLE identifier and analysis variables for the share-link CCHS, share-link NPHS and New Brunswick administrative datasets previously linked to the NBSRLE (record linkage number RL-085-2016) will be used in the RDCs identified by the Province of New Brunswick.
No identifiable personal information will be released from the NBSRLE.
Research projects to access and link the NBSRLE output files in the RDC will be approved on a study-by-study basis. These may only be carried out as part of a research agenda initiated by the designated representatives (litigant experts) from the province of New Brunswick or the Defendants under contract with Statistics Canada. A summary of each approved study will be posted on the Statistics Canada web site.
End-of-life health care use for children with life-threatening conditions: A national population based study (054-2017)
Purpose: This linkage of the Canadian Vital Statistics Deaths database to the Discharge Abstract Database (DAD) and National Ambulatory Care Reporting System (NACRS) will be used to examine patterns of health care use over the last year of life in a national cohort of children aged 0 to 19 years old who died in Canada since 2008. The focus of the study is to examine hospital visits, both inpatient and outpatient, over the period up until death to qualify differences in care based on the characteristics of the patient. This would not be possible using existing individual databases. This study will provide a better understanding of the intensity and types of care delivered and the location of death in respect to children. This information is valuable for provincial, territorial and federal stakeholders for informing policy and decision-making related to the provision of health care services for children.
Output: The linked datasets will be available to deemed employees to use in a Research Data Center (RDC). All data and analytical products to be released outside of Statistics Canada and the RDC will conform to the confidentiality provisions of the Statistics Act. These file will be retained in accordance with the Directive on the Management of Statistical Microdata Files.
Canadian Census Health and Environment Cohort (CanCHEC): linkage of the 2006 Census of Population to Mortality, Canadian Cancer Registry, Discharge Abstract Database, and annual place of residence. (055-2017)
Purpose: To create a large population-based dataset that will allow for the examination of mortality, cancer, hosptilizations, and environmental exposures by taking into account ethnocultural and socioeconomic factors (as measured on the census) and historical exposure (using annual place of residence to attach environmental exposures). In conjunction with previous census linked cohorts, these data files will provide clear and authoritative evidence on whether health inequalities are changing (increasing/decreasing) or remaining constant over time.
Output: All access to the linked microdata file will be restricted to Statistics Canada personnel (including Statistics Canada deemed employees) whose work activities require access. Only aggregate data that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Major findings will be used to create research papers for publication in peer-reviewed journals and presentations at workshops and conferences. An analytical dataset will be placed in the Research Data Centres (RDCs). Access will be granted following standard RDC approval processes and procedures.
Horizontal Innovation and Clean Technology Review (057-2017)
Purpose: The federal government Budget 2017 made the commitment to "initiate a horizontal review of all federal government programs across all departments with the objective of simplifying programming and better aligning resources to improve the effectiveness of innovation programs." The Treasury Board Secretariat (TBS) was tasked with carrying the horizontal review of all federal innovation and clean technology programs across all departments with the purpose of simplifying programming and better aligning resources to improve the effectiveness of innovation programs. Statistics Canada will contribute to the review by collecting administrative data from federal departments on the recipients of federal government support and linking to this data to Statistics Canada's administrative data in the Linkable File Environment.
Output: Statistics Canada will contribute to the review through the provision of descriptive statistics tables and econometric analysis to the Treasury Board.
Labour Market Implications of Mergers and Acquisitions (M&As). (058-2017)
Purpose: The study will provide the first evaluation of the post-merger impact on workforce and overall welfare effect of M&As. This will inform policy making related to desirability of employment protection and the design of its framework.
The files will link an external M&A dataset from Thomas Reuters SDC database to Statistics Canada's CEEDD. The study is expected to start on September 30, 2017 and to be completed on December 31, 2019.
Output: Security measures for the linking keys, identifiers and linked analysis file will conform to Statistics Canada's policies and standards. All direct business and employee identifiers will be removed from the analysis file once linkage is complete. External researchers can only access the linked data in the Canadian Centre for Date Development and Economic Research, on Statistics Canada premises, and under the Directive on the Use of Deemed Employees. The linked file will be retained in accordance with the Directive on the Management of Statistical Microdata Files, until no longer required, at which time they will be destroyed.
Longitudinal and International Study of Adults: Linkage to Personal Tax, Business Register, Pension Plans in Canada, T1 Family Files and Longitudinal Immigration Database for past and current Census family members of survey respondents. (059-2017)
Purpose: The Longitudinal and International Study of Adults (LISA) is a voluntary, multi-topic, longitudinal, socioeconomic survey of households collected every other year. The survey was designed to meet the key policy data needs of Employment and Social Development Canada (ESDC) in the domains of education and training, family health, income and employment. For most people these outcomes are determined in the broader family context, through intra-household decision-making, relationship formation and dissolution, and inter-generational transmission mechanisms. These channels will be better understood by linking LISA respondents with their past and current family members' income, employment and pension trajectories. Combining variables collected by the LISA with administrative family information will fill important gaps in existing Canadian data.
Canadians will benefit from research that can support them in making good choices and inform improvements in the government services provided to them. It will supply information on subjects of relevance to Canadians, such as the importance of different personal and financial insurance mechanisms in the face of partnership dissolution; the effect of parental outcomes on own income trajectory, in childhood but also possibly in adulthood; and the intergenerational transmission of income and wealth inequality. Finally, the linkage will complement the data collected by the survey, improving data quality and providing a long panel suitable for complex analyses, while minimizing response burden and respondent fatigue.
Output: The output of the linkage will be a linked analysis file containing the LISA-linked family members, as well as files containing past and current tax, employer and pension plan data. Only aggregate statistics and analyses that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.
Access to the linking keys and files containing personal identifiers will be restricted to Statistics Canada employees whose assigned work activities require such access. These files will be retained until they are no longer required for the processing of the LISA survey, at which time they will be destroyed. The linked analysis file will be retained until no longer required in accordance with the Directive on the Management of Statistical Microdata Files.
The availability of the linked analysis file will be announced in The Daily. The analysis file will be made available to Statistics Canada researchers, and to deemed employees at the Statistics Canada Research Data Centres. Research reports will be generated on topics such as family welfare, the financial security of seniors, and the economic outcomes of workers with varying educational and skill levels.
Linkage of the Canadian Cancer Registry (CCR) to obtain death information. (061-2017)
Purpose: The purpose of the linkage is to add death information to the Canadian Cancer Registry (CCR) with the aim to broaden the analytical use of the file by facilitating epidemiological research on cause of death and to permit the calculation of survival and prevalence rates. The linked file will also improve the completeness of the registry since not all deaths with an underlying cause of cancer (e.g., those found at the time of death) may be included on the CCR but could be found on death records.
Output: Statistics Canada will create a de-identified linked analytical file that will include the CCR variables and selected analytical variables from the CVSD and date of death from the T1PMF. It will not include any personal identifiers. The analytical file will be available to deemed employees to use in a Research Data Center (RDC) and to share partners that have the necessary agreements in place. In addition, a separate file that includes select identifiers will be created and provided to the Provincial and Territorial Cancer Registrars to update their patient registries. This file with identifiers will only contain data for which written approval to disclose has been obtained. Statistics Canada will also use the analytical file to create CANSIM tables and cancer indicators, including cancer survival and prevalence rates, as well as be used for other research and analytical products.
Linkage of the Natural Resources Canada (NRCan) mill list with Statistics Canada's establishment identification number (062-2017)
Purpose: The purpose of this linkage is to improve the use of microdata from the Annual Survey of Manufacturing and Logging Industries (ASML) already obtained from Statistics Canada under a section 12 data sharing agreement. The linkage of an existing NRCan list of companies to Statistics Canada identifiers will allow for more reliable results in research being conducted by the client. The enhanced NRCan database will be used to improve existing NRCan models on supply chains and industry structure.
Output: NRCan analysts use the Forestry database to respond to information requests from departments and industry. The deliverables generally include modelling and economic analysis related to the forestry sector including regional trends, industry structure and supply chains.
Bail and Remand in Ontario: Linking Ontario bail and remand data with Census (2006) data. (063-2017)
Purpose: The purpose of this project is to build the capacity for evidence-based structured data surrounding remand and bail in the province of Ontario. This work is being conducted for the Ministry of Community Safety and Correctional Services to assist them in identifying current and critical issues, assess program integrity and derive well-informed and measureable solutions to remand/bail in the province, while at the same time, maintaining their overarching mandate to both protect and enhance public safety, and to strike an appropriate balance between the rights of individuals and the rights of the community at large.
The data generated by the project will assist the Ministry of Community Safety and Correctional Services in identifying a set of concrete recommendations for province-wide efficient improvements to the criminal justice system. These improvements are aimed at reducing systemic issues and unnecessary time involving remand, police detention, and court processes; developing effective community alternatives to police detention, court reviews, and remand; and, reducing the rate of new charges and breaches while accused are awaiting trial in the community, while at the safe time ensuring the public safety of Ontario's communities.
Output: An analytical report, to be prepared by the Ministry of Community Safety and Correctional Services (MCSCS) in collaboration with Statistics Canada, will be disseminated by the MCSCS.
Pesticide Sector in Canada: An examination of Pest Control Product Registrants (064-2017)
Purpose: By linking The Pest Management Regulatory Agency's administrative data to the National Accounts Longitudinal Microdata File (NALMF) 2005-2015 and Business Register 2005-2015, this study will examine:
- The economic and business characteristics for pesticide registrants, in terms of company size, revenue, expenditure, organizational type, number of employees, registered Pest Control Products (PCPs) by sector and type, as well as number and sales of PCPs.
- Explore the relationship between sales of PCP and registrants in terms of economic and business characteristics and the potential economic impact of changes in regulation.
Output: This record linkage will conform to Statistics Canada's policies and standards and business identifiers will be removed from the analysis file once linkage is complete. Under the Directive on the Use of Deemed Employees, external researchers can access the analysis file in the Canadian Centre for Date Development and Economic Research, at Statistics Canada's headquarter. Only non-confidential analysis that will not result in the identification of an individual, business or organization will be released outside of Statistics Canada. In accordance with the Directive on the Management of Statistical Microdata Files, the analysis file will be retained until no longer required, at which time it will be destroyed.
Tax filing patterns of Canada's overall population and subpopulations over the 2010-2015 period, using the 2011 National Household Survey (NHS) and personal income tax records. (065-2017)
Purpose: The purpose of this record linkage is to create a portrait of tax filing patterns, using linkage rates as a proxy, over 2010-2015 for Canada's populations and subpopulations, including the on- and off-reserve populations and the populations identifying as Aboriginal and non-Aboriginal.
Canada Revenue Agency (CRA) and Employment and Social Development Canada (ESDC) will use the non-confidential aggregate estimates to better understand the characteristics of persons with low filing incidence. This information will be used for program improvements such as targeted outreach to increase the uptake of benefits like the Canada Child Benefit.
This analysis will benefit Canadians by informing the development of program improvements aimed at increasing the number of recipients of benefits programs.
Output: Only non-confidential aggregated data estimates, which conform to the confidentiality provisions of the Statistics Act, will be released outside of Statistics Canada. Confidentiality rules for NHS and Census would also be applied to all products before release.
Multimorbidity and material and social deprivation among high users of in-hospital health care services. (066-2017)
Purpose: The purpose of this project is to create a linked dataset that will allow researchers to analyse the socio-economic and health characteristics of high users of in-hospital health care services as compared to a control cohort drawn from the general population of in-hospital health care service users. To accomplish this the Discharge Abstract Database (DAD) will be linked, within the social data linkage environment (SDLE), to various administrative and survey databases, including the 2006 Census long form, the 2011 Census, the 2011 National Household Survey, theT1 Family File (T1FF), the Ontario Mental Health Reporting System (OMHRS), and the National Ambulatory Care Reporting System (NACRS).
This linkage will provide health care providers and stakeholders with information that will help inform policy and decision-making regarding the provision of services to high users of in-hospital services. It will also provide the public and health care providers with information that will foster a better understanding of the socio-economic and health characteristics affecting this group of high users. This information can be used to support the development and implementation of evidence based interventions for this population.
Output: The linked data file will be available to deemed employees of Statistics Canada within the Research Data Centres (RDC). All data and analytical products to be released outside of Statistics Canada and the RDC will conform to the confidentiality provisions of the Statistics Act. The linked file will be retained in accordance with the Directive on the Management of Statistical Microdata Files.
Administrative Personal Income Masterfile (APIM)/Fichier administratif principal du revenu personnel (FAPRP). (067-2017)
Purpose: The purpose of this project is to create the Administrative Personal Income Masterfile (APIM), a comprehensive individual annual income database that is created using administrative files obtained from Canada Revenue Agency (CRA). The file will be created annually, in a centralized fashion, for use by Statistics Canada programs that produce income estimates. The use of this file will ensure coherent income concepts across Statistics Canada and reduce response burden for Statistics Canada respondents.
Description: The Administrative Personal Income Masterfile (APIM) combines Canada Revenue Agency's (CRA) information available through different data sources using auxiliary files produced within Statistics Canada.
The APIM contains general income information such as total income as well as detailed income components that were traditionally collected through questionnaires. Even though most of the components are directly sourced from CRA's files, some non-taxable income components such as provincial senior supplements and provincial tax credits are not available through CRA's files and are imputed using information for the respective rules and regulations.
The APIM is an ongoing project and will first be released for 2014 income data. Income data for prior and following years will be available subsequently.
Output: An annual cross-sectional database containing income information for all persons that can be found within the administrative files obtained from Canada Revenue Agency (CRA).
Only non-confidential aggregated data and analytical products, which conform to the confidentiality provisions of the Statistics Act and Privacy Act, will be released outside of Statistics Canada.
Linkage of socioeconomic and treatment data to the Canadian Cancer Registry. (068-2017)
Purpose: The Canadian Cancer Registry (CCR) will be linked to the Discharge Abstract Database (DAD), the National Ambulatory Care Reporting System (NACRS), the Canadian Vital Statistics Death database (CVSD), the T1 Family File (T1FF), the Longitudinal Immigration Database (IMDB) and the 2016 Census long from. This linkage will support research on the relationship between cancer diagnoses, surgical treatment(s), and socioeconomic factors which is not currently possible using a single dataset. The information will help support policy and decision-making for the various stakeholders involved in the delivery of cancer care in Canada.
Output: The linked data will be available to deemed employees to use in a Research Data Center (RDC). All data and analytical products to be released outside of Statistics Canada will conform to the confidentiality provisions of the Statistics Act. The linked data and linkage keys will be retained by Statistics Canada in accordance with the Directive on the Management of Statistical Microdata Files.
Development of the Employment Insurance Status Vector (EISV) microdata for longitudinal analysis (069-2017)
Purpose: The purpose of this microdata record linkage request is to obtain authorization (a) to include an anonymous person identifier variable in the Employment Insurance Status Vector administrative data, and (b) for researchers to use the anonymous person identifier variable to construct longitudinal histories for EI claimants/recipients.
Statistical analyses of longitudinal histories of EI claimants/recipients will fill an important knowledge gap regarding the frequency and duration of use of regular, maternity, parental and other special EI benefits over the last 20 years. An improved understanding of the subpopulations at risk of EI benefit recidivism could assist in the development of programs and policies to improve labour market attachment for these workers.
Output: Authorized Statistics Canada employees may use the EISV microdata for research purposes, and the results may be published by Statistics Canada or other approved organizations.
Within the RDCs, each research team authorized to use the EISV microdata will be required to produce specific statistical output, under the terms of the microdata research contract. Results from projects conducted in the RDCs are usually presented at academic conferences and/or published in academic journals, theses and dissertations.
No portion of the EISV microdata will be disseminated outside of Statistics Canada premises, including the RDCs. Results from statistical analyses of the EISV microdata will be vetted prior to dissemination to ensure that they meet minimum requirements for preserving EI claimant/recipient confidentiality.
A comparative and longitudinal study of socioeconomic and health outcomes for Canadian Veterans and non-Veterans (071-2017)
Purpose: The purpose of this linkage is to conduct a longitudinal, comparative analysis of Canadian Armed Forces Veterans and civilians to determine how military service influences socioeconomic and health outcomes for these populations.
Output: An analytical report, intended for publication in one of Statistics Canada's publications, will be generated.
Canadian Census Health and Environment Cohorts (CanCHEC): linkages of the 2011 and 2016 Censuses of Population and 2011 National Household Survey to Mortality, Cancer, Hospital Administrative Files, and T1 Family File (072-2017)
Purpose: To create two large population-based datasets that will allow for the examination of mortality, cancer, and hospitalizations, and environmental exposures by taking into account ethnocultural and socioeconomic factors (as measured on the census and T1 files) and historical exposure (using annual place of residence to attach environmental exposures). In conjunction with previous census linked cohorts, these data files will provide clear and authoritative evidence on whether health inequalities are changing (increasing/decreasing) or remaining constant over time.
Output: All access to the linked microdata file will be restricted to Statistics Canada personnel (including Statistics Canada deemed employees) whose work activities require access. Only aggregate data that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Specifically, output will follow previously approved census linked confidentiality rules and guidelines and other vetting rules specific to the 2011 and 2016 Censuses and 2011 National Household Survey and other datasets used in this linkage. Access will be granted following standard RDC approval processes and procedures. Major findings will be used to create research papers for publication in peer-reviewed journals and presentations at workshops and conferences. Analytical datasets will be placed in the Research Data Centres (RDCs) and respect data access guidelines (e.g. exclusion of variables) currently in effect for the source datasets (e.g. census, vital statistics, tax files etc).
The impact of proposed government taxation on income sprinkling on corporation owners and their family members (075-2017)
Purpose: By linking the Longitudinal Administrative Data (LAD), T1 family files (T1FF), T2 corporation tax files as well as T4 employment remuneration files and T5 partnership tax files, between 2012 and 2014, this study will examine:
- The current states of income sprinkling between CCPC owners and their family members.
- The impact of proposed CCPC taxation on government tax revenue.
- The distribution of characteristics for CCPC owners and their family members who are likely affected by the proposed taxation
Output: This record linkage will conform to Statistics Canada's policies and standards and business identifiers will be removed from the analysis file once linkage is complete. Under the Directive on the Use of Deemed Employees, external researchers can access the analysis file in the Canadian Centre for Data Development and Economic Research, at Statistics Canada's headquarter. Only non-confidential analysis that will not result in the identification of an individual, business or organization will be released outside of Statistics Canada. In accordance with the Directive on the Management of Statistical Microdata Files, the analysis file will be retained until no longer required, at which time it will be destroyed.
Summary for the Annual Report to Parliament on Privacy and the Statistics Canada Website: Linkage of the Employment Insurance Status Vector (EISV) microdata with the Labour Force Survey (LFS). (081-2017)
The purpose of the project is to link the Employment Insurance Status Vector data with the Labour Force Survey within the Social Data Linkage environment to: 1) assess whether the two datasets have a sufficient population that overlaps, knowing that both datasets have relatively large populations and that they both have data on the unemployed and their characteristics; 2) give insight into the usefulness of linking the two datasets to examine the impact of the EI program and its changes on labour supply behaviours as a mean to evaluate the program, provide evidence for policy development, and assess the feasibility of replacing part of the content of the EI Coverage Survey with administrative data.
The assessment results of the linkage will be shared with Employment and Social Development Canada (ESDC). There will be no dissemination of research findings during the first phase of the project. However, should the linkage results be significant, the linked files will be made available in all RDCs for other research projects.
Canadian Housing Statistics Program (CHSP) (082-2017)
Purpose: In 2017, Statistics Canada was mandated to develop a comprehensive housing database that integrates social, economic and financial information to produce official housing statistics. The Property Register (PR) will provide up-to-date data on the stock (number of properties) and the flows (purchases and sales) of housing, and deliver comprehensive data and statistics to provide policy makers with valuable information to monitor and analyze the Canadian housing market.
Output: The first release is scheduled for December 19th, 2017 for Toronto Census Metropolitan Area (CMA) and Vancouver CMA. The current plan is to have quarterly releases. Other CMAs and provincial level will be released beginning in 2018.
The first release will include statistical information on the following variables/concepts:
- Residential property type
- Residential assessment value
- Residency status
- Living area
- Period of construction
- Condominium status
- Number of owners
Additional variables will be added beginning in 2018.
In cases where the data custodian outlines specific terms and conditions for access use, dissemination and/or retention/deletion of the data files that differ from those stated above (source files, analytical files or Research Data Centre files), the data custodian's terms and retention periods will be applied.
Social Insurance Number (SIN) linkage and tax-filing rates for families with children using the 2016 Census and linkage to the SIN registry (083-2017)
Purpose: The purpose is to obtain information for Employment and Social Development Canada (ESDC) on possible barriers to families receiving the Canada Learning Bond (CLB) and the Canada Education Savings Grant (CESG). In order to receive these grants, parents must have a SIN for their child and they must file taxes. This project will provide non-confidential aggregate statistics on the percentage of children who have a SIN, the percentage of parents who file taxes, and the characteristics of the families who do not have these and therefore do not receive the grants. ESDC will use this information for program improvements such as targeted outreach to encourage families to file taxes and get SINs for their children, and to increase the uptake of the CLB and CESG.
Output: Only non-confidential aggregated tables, conforming to the confidentiality provisions of the Statistics Act, will be released outside of Statistics Canada.
Use of pre-existing linkages and linkage between these pre-existing linkages for analysis, data quality evaluation, projections and publication (085-2017)
Purpose: The purpose of this request is, first, to seek authorization to use pre-existing linkages (i.e., linkages between the following files: (1) the 2011 Census/National Household Survey, the 2016 Census, T1 Family Files, the Immigrant Landing File, the 2011 Indian Register, etc. (see the files included in linkage request #085-2015); (2) the 2006 Census and the 2011 Census/National Household Survey; (3) the 2012 Aboriginal Peoples Survey and the 2011 Census/National Household Survey; (4) the 2017 Aboriginal Peoples Survey and the 2016 Census; and, (5) the 2011 Indian Register and the 2011 Census/National Household Survey) as well as the linkage of these linkages for analysis purposes (1) as part of the Demosim project to produce parameters in order to develop plausible population projection scenarios for various groups of the Canadian population, (2) to assess the quality of the linkages used to prepare the Canadian Statistical Demographic Database (CSDD) as well as its content, (3) as part of projects aimed at gaining a better understanding of Aboriginal and immigrant populations, and (4) for all projects that require use of these data sources, whether or not they are defined in the appendix to this document.
Second, this request seeks authorization to document the analyses and to publish this documentation either as a technical document or as scientific papers (publications to be determined). This authorization request covers the projects listed in the appendix to this document as well as other projects (undefined at this time) that might require these data sources.
Output: Under the Demosim project, the data produced would be used to prepare projection assumptions and parameters. The data used in preparing these assumptions and parameters, produced at an aggregate level, would be released, along with the methods used to produce them, in the form of technical documentation and/or scientific articles.
The results of the analyses that could be conducted (whether defined at this time or not) may also be published in the form of reports, analytical articles or presentations.
In every case, the data would only be disseminated in their aggregate form, in accordance with the confidentiality provisions of the Statistics Act. The linked files will be stored in a secure directory on one of the Demography Division servers. Only employees whose work requires it will have access to this directory for the entire life of the files.