2014 General Social Survey on Victimization: linking tax data from the T1 Personal File and T4 Summary and Supplementary file (075-2013, 040-2015)
Purpose: The General Social Survey (GSS) program, established in 1985, conducts telephone surveys from a sample selected across the 10 provinces. Population in Yukon, Northwest Territories and Nunavut are not usually part of the targeted GSS population with the exception of cycles on victimization. The GSS is recognized for its regular collection of cross-sectional data that allows for trend analysis, and its capacity to test and develop new concepts that address emerging issues. Each year the GSS focuses on a different topic, such as family, victimization, social support and aging, and time use. A specific topic is usually repeated approximately every 5 years. The 2014 GSS which will focus on Victimization is the sixth iteration.
This survey is an important source of information to better understand how safe people feel, what they think of the justice system and their experiences of crime.
By linking the 2014 GSS on victimization responses to personal tax files of respondents, and the tax files of all household members, more accurate income (personal and household) information will be obtained for respondents. At the same time, response burden will be minimized, and collection, data processing, and testing costs will be reduced.
Description: The 2014 GSS on Victimization is a sample based survey with a cross-sectional design. Telephone surveys are conducted through computer assisted telephone interviews from a sample selected across the 10 Canadian provinces and interviews are conducted through a mix of computer assisted telephone interviews and computer assisted personal interviews in the territories. By linking data, we are aiming to obtain better quality data for income (personal and household).
Questions relating to income show rather high non-response rates, the incomes reported by respondents are usually rough estimates. Linking will allow getting such information without having to ask questions.
The information collected during the 2014 GSS on Victimization will be linked to the personal tax records (T1 or T4) of respondents, and tax records of all household members. Household information (address, postal code, and telephone number), respondent's information (social insurance number, surname, name, date of birth/age, sex) and information on other members of the household (surname, name, age, sex and relationship to respondent) will be key variables for the linkage.
Respondents will be notified of the planned linkage before and during the survey. Any respondents who object to the linkage of their data will have their objections recorded, and no linkage to their tax data will take place.
Output: The availability of the 2014 GSS on Victimization analytical aggregated data file will be announced in The Daily. The analysis file containing only aggregated data created using confidentiality procedures as required by Statistics Canada's directives will be made available to Statistics Canada researchers, and to deemed employees at the Statistics Canada Research Data Centres. All data will remain confidential and protected under the Statistics Act.
Along with the availability announcement of the analytical data file (in The Daily), only non-confidential aggregate statistics will be released.
Evaluation of the Canada Child Tax Benefit (CCTB) file for the production of interprovincial migration estimates. (027-2013)
Purpose: Preliminary interprovincial migration is estimated using a mathematical model based on Canada Child Tax Benefit (CCTB) data. This fiscal program, administered by the Canada Revenue Agency (CRA), helps families with the cost of raising children. Interprovincial migrants are defined as children whose parents' address changes (change of province) between two points in time. This is why it is critical that the CCTB addresses be of good quality, in other words, that they actually represent the child's place of residence and that they are quickly updated after a move.
The objective of linkage is therefore to assess the quality of the CCTB addresses by matching the data in that file with the 2011 Census data and the National Household Survey data. It ties in with both the evaluation of the quality of population estimates and the development of new methods.
Population estimates are the cornerstone of statistical measurement of the population. They are also used to calculate income transfers and cost-sharing programs between the different levels of government and to weight a number of Statistics Canada surveys. Furthermore, interprovincial migration is one of the components that best explains inaccuracies in the population estimates. Because linkage can evaluate the quality of the CCTB addresses, it is part of the population estimate quality evaluation process.
Description: The linked files are the 2011 CCTB monthly data, the 2011 Census data and the 2011 NHS data. Linkage was done deterministically by successive waves based on the following variables: last name, given name, date of birth, sex. The address, postal code and telephone number were used to evaluate the linkage quality. The linking key between the census and the NHS is the key created by the census team using the variables frame_id and persnr.
Output: The linked files, identifiers and linking keys will be kept until no longer required, up to, April 1, 2017, at which time they will be destroyed. Non-confidential aggregated results will be used in a presentation to the 2013 Methodology Symposium and in the symposium's written proceedings.
Analysis of the change in Aboriginal identity reporting (026-2013)
(Note that this will entail a secondary use of an approved linkage (026-2013). - R. Cunningham)
Purpose: The increased number of people reporting Aboriginal identity is a known phenomenon not only in Canada (Guimond, Robitaille and Sénécal 2003; Guimond 1999, 2003; Guimond, Kerr and Beaujot 2004; Siggner 2003; Caron-Malenfant et al. 2014) but also in the United States (Passel 1996, 1997; Eschbach 1993). This phenomenon necessarily leads to the conclusion that people change their reported status from one census to another. By combining census data with data from the Aboriginal Peoples Survey (APS) for the same individuals, it is possible to better understand this phenomenon by analyzing the factors associated with the fact that people change their self-reported identification over a very short period, specifically the time between the Census and the APS. It is even more important to understand it because studies on Aboriginal people depend on these responses and consider them to be "error-free".
We analyzed the change in Aboriginal identity reporting—inconsistencies in identifications by survey—by linking data from the 2006 Census with data from the 2006 APS, which enabled us to determine the scope of the phenomenon. In Canada, a significant proportion of the people living off reserve, who identified as First Nations or as Métis on the APS, did not report the same identity on the Census. In Quebec and Eastern Canada, the proportion is even greater.
Therefore, the purpose of the second phase of this study is to further our understanding of Aboriginal identification and, more specifically, the individual characteristics and contexts (province, region, municipality, among others) that are linked to the change in responses. In this phase, the 2011 data will be analyzed and the new results will be compared with the previous ones. The new results will also be used to review the results of analyses carried out using the identification criteria from only the Census or the APS to examine the situation of Aboriginal people.
Description: We would like to have access to a file containing the identity variables from the 2011National Household Survey (NHS), in addition to the 2012 APS data, for the same individuals. These variables (Aboriginal ancestry (Indian, Métis or Inuit), Aboriginal identity, registered Indian status, band membership, band identification) will enable us to validate the analyses of the 2006 data to determine whether the phenomenon is still as strong and has the same determining factors. Furthermore, we require access to the variable indicating the APS respondent's rank in the household. We need to know whether the APS respondent is Person 1 on the NHS—and therefore whether this person likely reported the first identification—or whether it was reported by a third party. We will do profile and classification analyses to validate the eight classes found in the 2006 analysis, then we will replicate the ordinary/logistic and multi‑level regression analyses (taking into account community of residence).
Output: Only aggregate or modelled statistics and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.
Statistics Canada will retain the linked files for five years, until August 30, 2019, or until they are no longer required, at which time the said files will be destroyed.
Life After Service Income Study: Linkage of a cohort of former Canadian Armed Forces members to tax information
Purpose: To assess the economic outcomes of former Canadian Armed Forces (CAF) members after their release to civilian life. There are currently an estimated 594,500 CAF Veterans (regular and reserve force) and about 95,000 CAF members (68,000 regular force and 27,000 reserve force). Only about 11% of this population is currently receiving benefits and services from Veterans Affairs Canada (VAC). Transition outcomes are of interest to both Department of National Defence (DND) and VAC. This linkage will assist DND and VAC in identifying gaps in programs and services, evaluating existing programs and developing new re-integration and rehabilitation programs that meet the needs of military personnel as they transition to civilian life.
In addition, the New Veterans' Charter, which represents the most significant overhaul of rehabilitation and reintegration programs and services for veterans since the Second World War, was implemented by VAC in 2006 and both departments require more information on the impact of the new charter.
Description: Statistics Canada will provide indicators on the income of members released from the CAF from 1998 to 2010. Economic adjustment will be measured through the production of statistical tables from personal income tax data (T1 Family File (T1FF)) linked to a cohort of approximately 85,500 former CAF regular and reserve members whose personal identifiers and other key pre- and post- release status variables will be supplied by DND/VAC.
Output: Statistics Canada will prepare a set of analytical tables on the pre- and post-release incomes of veterans. Only aggregate data that conform to the confidentiality provisions of the Statistics Act will be released to VAC from the DND/VAC cohort linkage to T1FF. Information will be provided in tabular form. The client will not have direct access to the linked file. All access to this linked file will be restricted to employees of Statistics Canada whose work activities require access.
Linkage of the 2008 General Social Survey (GSS) Cycle 22, Social Networks, and the Longitudinal Immigrant Database
Purpose: A record linkage between the 2008 General Social Survey (GSS) on Social Networks and the Citizenship and Immigration Canada (CIC) Longitudinal Immigrant Database (IMD) would permit analysis of social outcomes (such as civic participation) of immigrants to Canada by entrance characteristics such as admission category (e.g. refugee, family class, etc.). The results from this record linkage would be used by Citizenship and Immigration Canada to support and evaluate immigrant policies and programs.
Description: The General Social Survey (GSS) on Social Networks provides detailed information on the social and civic integration of immigrants and ethno-cultural minorities into Canadian society. The Longitudinal Immigrant Database (IMDB) provides information on immigrants to Canada from 1980–2011 such as admission category and Low Income Measures.
The record linkage between the Landing File and the 2008 General Social Survey employed a hierarchical deterministic record linkage program developed by HSMD for the IMDB.
Only GSS respondents will be maintained for this record linkage.
Output: Only aggregate statistical estimates that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Outputs for the Longitudinal Immigrant Database (IMDB) and the General Social Survey (GSS) on Social Networks linkage will include a collection of cross-tabulations between these two sources.
The linkage results including variables used to perform the record linkage such as personal identifiers and information used to measure the linkage quality will be destroyed by March 31, 2014, or sooner if no longer required. All files will be kept on a server in a secure area. Access to these files is restricted to Statistics Canada employees whose assigned work activities require such access.
Re-contact with the Justice system
Purpose: To determine the types of unique information required to create and support high quality indicators of re-contact within the policing sector of justice. Whereas contact is defined as a documented official intervention (e.g. charge) against a person by a criminal justice agency/organization, a re-contact is defined as a subsequent contact signifying a new, official intervention by the agency/organization during a specified follow-up period.
The proposed project will attempt to establish baseline metrics on re-contact with policing sector and can serve as a comparison group for the previous policing re-contact evaluation undertaken in record linkage 065-2012. It will also provide the potential to be able to track emerging patterns of re-contact which may appear to be unique within a jurisdiction at a local level yet are more systematic in nature when evaluated at a higher level (e.g. national) of analysis.
Description: The proposed project consists of one record linkage activity which will be used to support the development of re-contact indicators within the policing sector of justice.
The linkage will use records collected under the Uniform Crime Reporting Survey (UCR2), in addition to supplemental personal identifiers provided by the Waterloo Regional Police Service for the years January 1, 2005 to December 31, 2011.
Output: Only aggregate statistics and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Access to linking keys and linked analysis file will be restricted to Statistics Canada employees whose assigned work activities require such access.
High-level findings may be reported in the form of presentations to various National Justice Statistics Initiative partners.
Statistics Canada will retain the linked analysis files until March 31, 2017 or sooner if no longer required, at which time the linked analysis files will be destroyed.
Study of the Gross Flows Into and Out of Particular Industries in New Brunswick, 2000 to 2011
Purpose: To estimate the supply of workers in the New Brunswick labour market, particularly the replacement demand, that is, the workers needed to replace those who are no longer employed in the province. The focus will be on three groups: stayers (workers employed in the same industry for one or more years), leavers (workers who leave the labour market or industry) and entrants (workers entering the labour market, including those returning to the workforce).
Description: The annual T1 Family File (T1FF) records of tax-filers who lived in New Brunswick for at least one year from 2000 to 2011 will be selected for this research study. The T1FF records will be linked over this period using Social Insurance Numbers (SIN) to produce a longitudinal analysis file. All direct identifiers, including the SINs, will be removed from the analysis file following completion of the linkage.
The linkage will be produced by Statistics Canada staff on the agency's premises.
Output: Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Research findings will be used to enhance the labour market policies in New Brunswick. To support on-going analysis, the linked analysis file will be retained at Statistics Canada until April 30, 2016, or sooner if no longer required, at which time it will be destroyed. Access to the linked analysis file will be by Statistics Canada employees or deemed employees whose assigned work activities require such access.
Amendment: Data linkage to examine pathways of Information and Communication Technology (ICT) students through post-secondary education (PSE) and into the labour market, 1998-2011 (057-2013)
Purpose: The main objective of this project is to link the University of Ottawa students administrative data from 1998 to 2011, to the tax data (using the T1 Family File) of the corresponding years. This amendment is to add some aspects of job quality, namely union coverage and pension plan coverage.
The linkage will enable the tracking of post-schooling earning trajectories and acknowledge the presence of some job quality indicators of the jobs of university students in the ICT field of study and also of those with some ICT courses but not in an ICT field of study and compare these with the trajectories of non-ICT students. It will also allow, more generally, to look at student trajectories by various fields of study and to be able to make comparisons between various subgroups.
Description: Records of students from the University of Ottawa will be linked to the T1 Family File (T1FF) over a 14 year period (from 1998 to 2011). The data linkage will be done in two stages:
In the first stage, University of Ottawa will send a file containing the student identification variables as well as a pseudo-identification variable for each student. The linkage will be done with the T1 Family File containing an identification number and a selection of variables to conduct the research. Once the linkage is finalized, the student identification variables will be destroyed except the pseudo-identification variable from the University of Ottawa.
In Stage 2, the University will provide a file with the pseudo-identification variable and the student information. This second file will be linked to the reduced T1FF file from stage 1. The pseudo-identification variable from the University will be destroyed once the linkage is finalized and will not be part of the linked file.
A synthetic file will be created containing the same variables as the linked file but a noise will be introduced into the data. This file will reside at Statistics Canada and will be used by the researchers to plan their tables and models. A Statistics Canada employee will run their programs on the real linked file, apply the vetting rules and transmit the results to the researchers.
The record linkage will be done by Statistics Canada personnel. Individuals' tax data for all available years will be included in the linked file.
Output: Aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released to users. The linked file will be retained at Statistics Canada until no longer required, up to, December 31, 2018, after which time it will be destroyed.
The Effectiveness of the Department of Foreign Affairs and International Trade and Development's Trade Commissioner Services (072-2013)
Purpose: This project assesses the effectiveness of the services offered by Department of Foreign Affairs and International Trade and Development's (DFAITD) trade commissioners. The analysis will provide DFAITD with improved information on the economic impact of these services on the export performance, performance and survival of Canadian firms. It will allow DFAITD to determine whether the impacts of the services are consistent with the objectives set out for it by the Government of Canada.
Description: A list of firms in the DFAITD's International Business Development dashboard from 1999 to 2012 will be linked to data from National Accounts Longitudinal Microdata File (data from the Business Register, T2 Corporate tax, PD7, T4, Export Register, Import Register). The firms in the IBD dashboard will be linked probabilistically using name and address and then subjected to a manual review. This is a one-time linkage.
Output: Only non-confidential aggregate statistical outputs and analyses that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.
The Importance of Governance Structures on Firm Performance in the Portfolio of Firms Receiving Support from the Business Development Bank of Canada (071-2013)
Purpose: This project examines the impact of having a board of directors, an advisory board or both on firm performance. The analysis produced in this project will improve the understanding of whether, when and how certain governance structures improve firm performance, and enable the Business Development Bank of Canada (BDC) to better fulfill its mandate of providing services complementary to those available at other financial institutions with the goal of encouraging the development of small and medium-sized businesses in Canada.
Description: A list of firms in the BDC portfolio in the 2000 to 2010 period will be linked to data from National Accounts Longitudinal Microdata File (data from the Business Register, Corporate tax data-T2 tax database, PD7 and T4). The BDC firms records will be linked probabilistically using name and address. This is a one-time linkage.
Output: Only non-confidential aggregate statistical outputs and analyses that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The information will be presented in the form of tables of regression results and summary statistics related to the project's goal of ascertaining the impact of having an advisory board or board of directors.
The linked file will be retained until December 2018. All direct business identifiers will be removed from the analysis file once linkage is complete, and placed in a separate linkage key file. The linked file and the linkage key file will be retained until December 2018, or sooner if no longer required, at which time it will be destroyed.
Linking Data from the 2012 Survey of Financial Security to income tax Records file (T1)
Purpose: The purpose of this linkage is to obtain income data and reduce respondent burden, interviewer time and collection costs for the 2012 Survey of Financial Security. The linkage allows obtaining information on income variables without burdening respondents with detailed questions about their income. The income data is important for the Survey of Financial Security and allows analysis of the relationship between income and wealth. This survey is the only Statistics Canada survey that releases information on these topics.
According to the directive on Informing Survey Respondents, the 2012 Survey of Financial Security informs all respondents of this linkage through the inclusion of the statement within the survey collection vehicle and on the STC website. The statement reads:
" In order to reduce the length of the interview, and enhance the information provided in this survey, Statistics Canada will combine this information with information from personal tax data. The information we obtain will be used for statistical purposes only, and will be kept confidential."
Description: The 2012 Survey of Financial Security databaseand the 2011 T1 File will be linked using the address, city, date of birth, first name, surname, sex, province, NYSIIS and SNDX code for surname, postal code, marital status, telephone number and first initial. This information will be removed from the linked file as soon as the linkage is completed, and stored separately. Access to these files will be restricted to Statistics Canada employees whose assigned work activities require access.
Output: No information containing personal identifiers would be released outside of Statistics Canada from this linkage activity. Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The Survey of Financial Security is an occasional survey to be conducted from September to November 2012 and therefore, the linkage would be undertaken in 2012/2013.
2011 National Household Survey and Immigrant Landing File linkage
Purpose: A record linkage between the 2011 NHS and the Citizenship and Immigration Canada (CIC) Landing File would permit detailed analysis of socio-economic outcomes of immigrants to Canada by entrance characteristics such as admission category (e.g. refugee, family class, etc.). The results from this record linkage would be used to support and evaluate immigration policies and programs. In addition, the file will be used to strengthen inputs into Statistics Canada's DEMOSIM micro-simulation model.
Description: The National Household Survey (NHS) provides detailed information on the demographic, social and economic characteristics of people in Canada, as well as providing information about the housing units in which they live. The Citizenship and Immigration Canada (CIC) Landing File provides information on immigrants to Canada from 1980–2011 such as admission category and knowledge of official languages at time of landing.
The record linkage between the Landing File and the 2011 National Household Survey was a two-stage process that involved first linking the Landing File to the 2011 Census and then making use of the already existing linkage between the 2011 Census and the 2011 NHS. The record linkage between the Landing File and the 2011 Census employed a hierarchical deterministic record linkage program developed by HSMD for the IMDB. Only NHS respondents will be maintained for this record linkage.
Output: Only aggregate statistical estimates that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Outputs for the Landing File and the NHS linkage will include a wide range of analysis and standard data tables, as well as custom tabulations.
A linkage key will be retained indefinitely as part of this record linkage. The linkage results including variables used to perform the record linkage such as personal identifiers and information used to measure the linkage quality will be destroyed by August 31, 2014. All files will be kept on a server in a secure area. Access to these files is restricted to Statistics Canada employees whose assigned work activities require such access.
2012 Aboriginal Peoples Survey and 2011 National Household Survey Linkage
Purpose: The Aboriginal Peoples Survey (APS) is a national post-censal survey of Aboriginal peoples (First Nations peoples living off reserve, Métis and Inuit) in Canada. The survey provides valuable data on the social and economic conditions of Aboriginal people 6 years of age and over. Data from the APS inform policy and programming activities aimed at improving the well-being of Aboriginal peoples in Canada.
Linking the 2012 APS and the 2011 National Household Survey (NHS), will allow methodologists to derive weights for the APS. As well, an APS-NHS linkage will enrich the analytical potential of the 2012 APS microdata file by allowing data users to analyse APS data with reference to person-level, family-level and household-level information collected in the NHS. NHS data complement the findings of the APS, providing information on topics that were either beyond the scope of the APS or which were explored in the survey in only a very limited way in order to reduce response burden.
Description: Responses to the 2012 Aboriginal Peoples Survey and 2011 National Household Survey will be matched for each respondent using the variables frame_id (which identifies a household uniquely in Canada) and persnr (which identifies a person uniquely within the household). This linkage will result in the APS-NHS linked file. This composite file will be used to create an analytical file which will become the base from which other products will be developed.
Output: Only aggregate statistical estimates that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Linked information from the 2012 Aboriginal Peoples Survey and 2011 NHS will be used in analytical articles and other data products released from the 2012 Aboriginal Peoples Survey. All products containing linked data will be disseminated in accordance with Statistics Canada's policies, guidelines and standards.
The APS analytical file, including linked NHS records, will be created and retained indefinitely by Social and Aboriginal Statistics Division (SASD). The analytical file will not contain any personal identifiers.
2012 Canadian Survey on Disability and 2011 National Household Survey Linkage
Purpose: The Canadian Survey on Disability (CSD) is a post-censual survey which provides information on Canadians whose everyday activities may be limited because of a condition or health-related problem. Information from this survey is essential for the effective development and operation of national programs such as employment equity and is required by the Government of Canada to fulfill various international commitments, including the United Nations Convention on the Rights of Persons with Disabilities.
Linking the 2012 CSD and the 2011 National Household Survey (NHS) is necessary to derive accurate weights for the CSD. As well, a CSD-NHS linkage will enrich the analytical potential of the CSD microdata file by allowing for the analysis of CSD data with reference to person-level, family-level and household-level information collected in the NHS Furthermore, it will allow for the calculation of disability rates and for the comparison of the characteristics of activity-limited CSD respondents with those of non-activity-limited NHS respondents. NHS data complement the findings of the CSD, providing information on topics that were either beyond the scope of the CSD or which were explored in the survey in only a very limited way in order to reduce response burden.
Description: Responses to the 2012 CSD and 2011 NHS will be matched for each respondent using the variables frame_id (which identifies a household uniquely in Canada) and persnr (which identifies a person uniquely within the household). This linkage will result in the CSD-NHS linked analytical microdata file.
Output: Linked data from the 2012 CSD and 2011 NHS will be disseminated on the analytical microdata file produced for the 2012 CSD. All products containing linked data are disseminated in accordance with Statistics Canada's policies, guidelines and standards. Only aggregate statistical estimates that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.
The linked 2012 CSD-2011 NHS analytical file will be created and retained indefinitely by Social and Aboriginal Statistics Division (SASD). The analytical file will not contain any personal identifiers.
Update: Gross Flows of Workers Into and Out of Industries in Newfoundland and Labrador, 2000 to 2011
Purpose: To estimate the supply of workers in the Newfoundland and Labrador labour market, particularly the replacement demand, that is, the workers needed to replace those who are no longer employed in the province. The focus will be on three groups: stayers (workers employed in the same industry for one or more years), leavers (workers who leave the labour market or industry) and entrants (workers entering the labour market, including those returning to the workforce).
Description: The annual T1 Family File (T1FF) records of tax-filers who lived in Newfoundland and Labrador for at least one year from 2000 to 2011 will be selected for this research study. The T1FF records will be linked over this period using Social Insurance Numbers (SIN) to produce a longitudinal analysis file. All direct identifiers, including the SINs, will be removed from the analysis file following completion of the linkage.
The linkage and specific data requirements will be produced by Statistics Canada staff on the agency's premises.
Output: Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Research findings will be used by the Government of Newfoundland and Labrador to enhance their labour market policies. To support on-going analysis, the linked analysis file will be retained at Statistics Canada until April 30, 2016, or sooner if no longer required, at which time it will be destroyed. Access to the linked analysis file will be restricted to Statistics Canada employees whose assigned work activities require such access.
Examination of Sentencing Trends for Drug Offences and Motor Vehicle Theft to Revise the Crime Severity Index Weights
Purpose: In order to revise the weights for the Crime Severity Index (CSI), this project links data from the Incident-based Uniform Crime Reporting Survey (UCR2) and the Integrated Criminal Court Survey (ICCS) to examine the hypothesis that court sentencing patterns for drug offences differ according to type of drug involved and that sentencing patterns for motor vehicle theft differs from sentencing patterns for the broader categories of theft. The results of these examinations will be used to inform decisions on appropriate revised weights for these offences in the Index. Conforming to the recommendation to revise the weights every five years, this current project marks the first revision to the Crime Severity Index.
The Crime Severity Index (CSI) is an important addition to existing measures of crime because it measures change in the volume of crime while also taking into account the relative seriousness of crime. The Index relies on courts sentencing data to measure the relative seriousness of crime. Consultations with the justice community have indicated a need to better understand the effects of sentencing practices for offence types.
This record linkage project will contribute to the public good by increasing the confidence of the general public, as well as the police, the academic and justice communities in the measures of crime in Canada. Further, it is in the public interest to better understand if and how the consequences of being convicted of drug offences differs according to the type of drug involved and how the consequences of being convicted of motor vehicle theft differs from the consequences of being convicted of a more general theft.
Description: In order that the weights for the Index can be updated, this linkage will bring together police-reported records from the UCR2 and courts-reported records from the ICCS to allow an analysis of sentencing patterns for drug offences and motor vehicle thefts. ICCS charge records will be linked to UCR2 records using a direct matching methodology of identifiers and linking keys.
Drug offences by specific drug types can only be identified in the police-reported records, so the linkage will allow their identification in courts data. Motor vehicle thefts can only be identified by using incident characteristics data from police-reported records. Prior to November 5, 2010 there was not yet a specific Criminal Code offence for motor vehicle thefts so the linkage will allow their identification in courts data.
Output: A data file will be produced containing court-based charge information with the imputed UCR2 violation code. Results of analysis of this file will be published in a report entitled "Updating the Crime Severity Index Weights: Refinements to the Methodology" (scheduled for release in May 2013). This report is intended to provide information on the methodology behind the revisions made to the weights to be use for the Crime Severity Index for 2012 (including revised 2011 UCR data) to 2016. A discussion of the results of this record linkage will be included. Composite or linked files will be retained until no longer required, up to, April 30, 2018, at which point they will be destroyed, in order for the file to be available for reference next time the CSI weights require updating.
Analysis of the legal components involved in self-identification among First Nations peoples
Purpose: The starting point for this analysis is the mismatch between estimates of the number of Aboriginal people in the Census and in the 2006 Aboriginal Peoples Survey (APS). For Quebec, for example, the APS shows 40% more registered Indians living off reserve than in the Census. This raises the question of how consistently Aboriginal people self-identify. Since most studies on Aboriginal people use either Aboriginal identity or registered Indian status as the variable determining the category, it is clear that if responses to questions are not reliable, the analyses based on them must be verified in light of this information. Guimond (2007) studied this issue by looking at intergenerational and intragenerational mobility, or the tendency for self-reporting to increase over time and in the generations resulting from couples of mixed ancestry. For our part, we are interested in how the reporting for the same individual fluctuates in two separate surveys scarcely a few months apart.
This project is structured around the concept of "fluidity of ethnic identity." It will focus on the components relating to "legal status," i.e., registered Indian status. In the case of Quebec, the project seeks to shed light on the differences between the new identification categories—consistent or inconsistent—and sociodemographic characteristics including education and status. This study will ultimately serve to validate or invalidate various previous studies that did not take fluidity of identity into account.
Description: To perform the planned analyses, access to the 2006 Census-APS linked file, in which the census variables reporting origin, identity and registered Indian status were added to the APS file, is requested. A first series of multivariate descriptive analyses (analysis of matches) will provide an overall picture of the situation. Subsequently, multi-level analyses that take into account the municipality of residence will determine how the consistency of reporting affects variables of interest such as education and income.
Output: Only weighted (rounded) statistics, aggregate analyses and models complying with the confidentiality provisions of the Statistics Act will be disseminated outside Statistics Canada.
Statistics Canada will retain the linked files for five years, until May 15, 2018, or no longer needed, at which time, they will be destroyed.
Atlantic Canada Opportunities Agency (ACOA) - Update of Business Performance Evaluation Report (2010)
Purpose: To assess the effectiveness of ACOA's programs and activities and the usefulness of the Agency's efforts to assist small businesses, and to determine more effective means of providing assistance to the small business community. ACOA assists businesses by providing loans, as well as a broad range of programs and services, for purposes of establishing, expanding, or modernizing businesses, and for the development of human resources. Information resulting from the linkage will be used by ACOA to measure the performance of businesses which received financial assistance under the Agency's programs, and compare it to the performance of other firms in the Atlantic region. Employment dynamics, businesses entering and exiting, selected financial statistics, as well as measures of labour productivity will be analyzed. Findings from this assessment may be used by ACOA to improve assistance to businesses.
Description: A list of ACOA-assisted businesses will be linked to the following files: 2004 to 2010 Business Register, 2010 Longitudinal Employment Analysis Program (LEAP) file, 2004 to 2010 Corporate Tax-General Index of Financial Information (GIFI), 2004 to 2010 Exporter Registry and 2004 to 2010 Research and Development in Canadian Industry database. The files will be linked using the Business Number (BN), Statistical Enterprise Number (SNUM) and the legal/operating name.
Output: Only aggregate statistical outputs and analyses that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. These will be in the form of statistical tables at the business sector and business size level for Atlantic Canada; as well, research and development estimates will be produced at the Canada level. ACOA will publish these results in their annual performance report to Parliament, which will be available on the ACOA website, and in research studies on topics such as entrepreneurial start-ups, employment patterns and growth in Atlantic Canada.
Analyzing the differences in self-identification on the Census and Aboriginal Peoples Survey
Purpose: When we compare the estimates of the number of Aboriginal people living off reserve from the Census and the Aboriginal Peoples Survey (APS), we note a significant difference between the two. This difference is not due to methodology-related issues such as weighting or margin of error. For example, for Quebec, the APS estimates 40 per cent more Aboriginal people living off reserve than the Census. There is only one way to uncover the source of this difference and understand why this is happening: we need to link the two files. We believe that the same individuals provide different information to the two surveys. As a result, an analysis of the factors behind this "inconsistent" reporting is necessary. It is all the more important to understand it because studies on Aboriginal people depend on these responses and consider them "error-free."
Firstly, this study will seek to better understand Aboriginal identification, more specifically the individual characteristics and contexts—province, region, municipality, etc.—that explain the difference in reporting. The findings of this first phase will be used to review the results of analyses that relied on identification criteria from only the Census or the APS to study the situation of Aboriginal people.
Description: For the time being, we would like to have access to a file that provides the identity variables from the census for the same individuals, in addition to APS data. These variables will help us to create new identification variables that take into account the consistency of reporting and to analyze the factors related to this consistency. Initially, an analysis of the matches and classification could provide an overall picture of the groups concerned. Next, standard regression and logistical analyses will be used to identify the variables at the individual level associated with inconsistent reporting. Lastly, multilevel regression methods will help to determine the contexts likely to be linked to inconsistent reporting.
Output: Only aggregate statistics and analyses conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada.
Statistics Canada will retain the linked analysis files for five years, until April 30, 2018 or sooner if no longer required, at which time the said files will be destroyed.
Canadian Employer-Employee Dynamics Database
Purpose: The activities and economic outcomes of workers are shaped in many ways by the firms or organizations in which they are employed, and conversely, worker characteristics have implications for firm performance. Together, individual-level and firm-level data that are integrated to facilitate a more comprehensive understanding of labour market processes and economic outcomes than is possible using either type of data in isolation.
The CEEDD will be a multi-purpose file capable of supporting research on many issues. In addition, a number of priority projects are proposed for 2013-14 and 2014-2015. These include projects on business start-ups and job creation, with particular emphasis on the role of immigrant entrepreneurs; the distribution of immigrants across business enterprises and how this differs from the distribution of Canadian-born workers; how workforce aging is playing out within business enterprises, including its effect on labour productivity; local labour market information, including hiring rates, separation rates, layoff rates, and aggregate turnover rates within sub-provincial regions and the impacts of organizational changes, such as mergers and acquisitions, on individual-level outcomes.
Description: Information at the level of the business-enterprise will be drawn from the National Accounts Longitudinal Micro data File (NALMF) while individual- and job-level data will be drawn from T1 files, the T4 Statement of Remuneration Paid file, the Record of Employment (ROE) file, the Longitudinal Immigration Data Base (IMDB), and the Temporary Foreign Work file, for the years 1999 onward. All linkages will be done on a deterministic basis using Business Numbers (BNs) and/or Social Insurance Numbers (SINs).
Business Numbers and SINs will be transformed into unique personal identifiers that will remain on the business-level and individual-level files in a scrambled form. The use of scrambled identifiers will allow users to differentiate units in the cross-sectional data, and to follow them longitudinally over time. Postal code information will be used to create aggregated geography variables and then removed from the files.
All BNs, SINs and postal codes will be removed from the analytical files and stored in a separate location accessible only to indeterminate Statistics Canada employees who job duties require them to access this information.
Output: The outputs of the proposed database will include two components. One is labour market indicators (e.g. hiring, separations, job creation/destruction) at national, provincial and sub-provincial levels of geography. This information will conform to the confidentiality provisions of the Statistics Act. The other is the longitudinal files that will serve the internal and external researchers. Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The linked file and linking key file will be retained by Statistics Canada until no longer required, up to, March 31, 2023, at which time they will be destroyed.
The linked files will be hosted by the Center of Data Development and Economic Research (CDER) of Economic Analysis Division. External researchers will be able to access the linked data on Statistics Canada premises, under the Policy on the Use of Deemed Employees and current MOUs with the CDER. Synthetic files will be created for external researchers for direct access, while the original files will only be accessed via batch mode with no viewing function. Research studies will be published in Statistics Canada's Research Paper Series as well as in academic journals. Research studies will also be presented at professional conferences.
Long Term Income and Employment Among Breast Cancer Survivors: Linkage of British Columbia Breast Cancer Data and the Longitudinal Administrative Databank
Purpose: The goal of this study is to provide information on income and employment of breast cancer survivors that will inform the development of strategies and supports to facilitate return to work, reduce financial hardship, and improve the long-term financial circumstances of breast cancer survivors, from diagnosis to end of life.
Description: The client proposes to use a linked database consisting of cancer registry clinical databases from the British Columbia Cancer Agency, and Statistics Canada's Longitudinal Administrative Databank (LAD) 1982 to 2010, to directly measure long-term burden and change in income and employment, and determinants of income, employment, and change over time and stage in the cancer trajectory, among those with breast cancer in British Columbia. Only indeterminate authorized Statistics Canada personnel will perform the linkage. Following completion of the linkage, all personal identifiers will then be removed from the linked file. No names or addresses will be on the final linked analysis data file. Detailed data analysis of this final linked file will be undertaken by Statistics Canada personnel and/or deemed Statistics Canada employees under the strict provisions of the Statistics Act.
Output: Only aggregate data and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Findings will be disseminated in a variety of professional and academic venues including research papers as well as presentations to Canadian academic conferences. Depending on requests, there may be wider dissemination of the research results. The linked file and identifiers will be retained separately at Statistics Canada only until 31 March, 2018, or sooner, after which time they will be destroyed.
Amendment to Canada Student Loans Program Linkage to the Longitudinal Administrative Databank (CSLP-LAD)
Purpose: On August 1, 2000, a direct loan regime was adopted for Canada Student Loans in which the Government of Canada issues loans directly to students and receives the repayments of those loans. The purpose of the original linkage (011-05) was to assess the effectiveness of this new regime in reducing loan repayment difficulties. This request is for an extension to the date of retention of the linkage, between Canada Student Loan Program (CSLP) data and the Longitudinal Administrative Databank (LAD), from February 2013 to March 31, 2016.
Description: LAD data files for the years 1985 to 2003 are linked to administrative records from the Canada Student Loan Program (CSLP) for the years 1991 to 2000. The LAD data prior to 1991 (the beginning of the CSLP data) are to be included because it provides information about the socio-economic and demographic situation of students prior to, and during their PSE years, through variables such as parental income, parental urban/rural residency, parental provincial residency and student's employment during PSE study. All direct identifiers, including the SINs, will be removed from the analysis file following completion of the linkage.
The linkage and specific data requirements will be produced by Statistics Canada staff on the agency's premises.
Output: Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Research findings will be used by Student Loans Program, Human Resources and Skills Development to enhance student loan policies. To support on-going analysis, the linked analysis file will be retained at Statistics Canada until March 31, 2016, or sooner if no longer required, at which time it will be destroyed. Access to the linked analysis file will be restricted to Statistics Canada employees whose assigned work activities require such access.
Expanding the Longitudinal Analytical Databank (LAD) to include Tax Free Savings Account (TFSA) information
Purpose: The debut of the Tax Free Savings Account (TFSA) in 2009, introduced a new program that is separate from and is not captured by the T1 tax file, upon which the Longitudinal Analytical Databank (LAD) is based. The TFSA program may have an effect on the measurement of income, particularly when focusing on the retired population, as well as more generally on the tax and transfer system. The proposed addition of TFSA information to the LAD will permit Statistics Canada to improve its data reporting on income and wealth, and thereby assist our clients with their research in this area.
Description: The project involves linking on a continuing basis Statistics Canada's LAD file to the TFSA file from Canada Revenue Agency (CRA). The LAD and TFSA will be linked deterministically using the Social Insurance Number (SIN) of individuals. Following completion of the linkage, the SIN and other identifiers will then be removed and kept securely separate from the final linked file. No names or addresses will be used at any point in the linkage process or be on any of the files used. The linkage will be performed by Statistics Canada personnel.
Output: The output will be a linked database file combining LAD and TFSA information. Only aggregate data and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. The linked file will be retained at Statistics Canada.
Update: Gross Flows of Workers Into and Out of Industries in Newfoundland and Labrador, 2000 to 2010
Purpose: To estimate the supply of workers in the Newfoundland and Labrador labour market, particularly the replacement demand, that is, the workers needed to replace those who are no longer employed in the province. The focus will be on three groups: stayers (workers employed in the same industry for one or more years), leavers (workers who leave the labour market or industry) and entrants (workers entering the labour market, including those returning to the workforce).
Description: The annual T1 Family File (T1FF) records of tax-filers who lived in Newfoundland and Labrador for at least one year from 2000 to 2010 will be selected for this research study. The T1FF records will be linked over this period using Social Insurance Numbers (SIN) to produce a longitudinal analysis file. All direct identifiers, including the SINs, will be removed from the analysis file following completion of the linkage.
The linkage and specific data requirements will be produced by Statistics Canada staff on the agency's premises.
Output: Only aggregate statistics and analysis conforming to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. Research findings will be used by the Government of Newfoundland and Labrador to enhance their labour market policies. To support on-going analysis, the linked analysis file will be retained at Statistics Canada until April 30, 2015, or sooner if no longer required, at which time it will be destroyed. Access to the linked analysis file will be restricted to Statistics Canada employees whose assigned work activities require such access.
Economic Development Agency of Canada for the Regions of Quebec (EDAC): Economic Impact – 2001 to 2010
Purpose: To support the evaluation of the Economic Development Agency of Canada for the Regions of Quebec's (EDAC) financing services program, by producing objective measures of its economic impact on the performance of small and medium-sized enterprises (SMEs). Key performance indicators, and value-added measures such as sales, profits, firm survival rate, and employment, will be calculated for EDAC client businesses and for comparable non-client businesses.
Description: A list of firms that were EDAC clients in the period 2001 to 2010 will be linked to the Business Register to obtain the Business Number and Statistical Enterprise Number, to facilitate linkage to payroll and tax data. In order to measure the effectiveness and the impact of EDAC financing services, a comparison group of non-EDAC client firms with similar characteristics will be selected.
Records of EDAC clients and the businesses in the comparison group will be linked to the Payroll Deduction Account (PD7), the T1 Unincorporated Business Tax Data, the T2 Corporate Tax data, the General Index of Financial Information (GIFI), Exporter Registry and Research and Development in Canadian Industry (RDCI) for the period 2001 to 2010. The records will be linked using the Business Number and Statistical Enterprise Number. The resulting linked analysis file will enable longitudinal analysis of each cohort. This is a one-time linkage.
Output: Only non-confidential aggregate statistical outputs and analysis that conform to the confidentiality provisions of the Statistics Act will be released outside of Statistics Canada. These will be in the form of separate summary tables of regression analysis results relating to the study hypotheses of the economic impact of EDAC's financing services, in addition to profiling tables. A methodology report will be prepared, explaining the file matching processes and constraints and key issues related to the quality of the data. An analytic report will be produced by Statistics Canada.
Linkage of Survey of Intellectual Property Management (SIPM) 2010 to the Linkable File Environment (LFE)
Purpose: The purpose of this record linkage is provide Industry Canada researchers and other researchers the opportunity to carry out policy relevant research using the data in the SIPM 2010 and the data available in other databases that are in the LFE.
Description: This is a request to link the Survey of Intellectual Property Management (SIPM) 2010 to the Linkable File Environment (LFE).
Output: The linkable SIPM dataset will be housed at Statistics Canada's Centre for Special Business Project (CSBP). When a research project is formally approved by Statistics Canada, Statistics Canada's Centre for Special Business Projects will extract a researcher database from the LFE which will contain data for the variables that are listed in the research proposal for the population that has been specified. Access to the researcher database will be facilitated and managed by Statistics Canada's Centre for Data Development and Economic Research (CDER).