New data tables that provide the revision history of 28 economic and social time series are now available. Statistics Canada has always provided its users with the most recent data available, but after consulting some of its expert data users, the agency identified a need for real-time data—or vintage data—to make certain types of analysis easier. These new tables were created to fill this data gap.
Initially, the tables will contain vintages of data as of January 2015. However, some may be expanded to provide users with a longer time series. The real-time table will be released approximately one week after the standard data table.
Background
Statistical revisions are carried out regularly in the compilation of economic and social statistics. These revisions incorporate the most complete and current information from many sources (including surveys, administrative data and public accounts) and use improved estimation methods. While the majority of revisions are done within the months or quarters of a given reference year or on an annual basis, going back two to three years to incorporate benchmark information, some revisions are carried back further to incorporate major changes to concepts or classifications.
Statistics Canada's economic and social statistics programs have well-established policies that govern revisions. Every time Statistics Canada revises data for a given time period, it replaces the existing data table information with the revised data. This ensures that users always have the most up-to-date statistics.
This up-to-date (or revised) information meets the data requirements of most users. However, some users have said that they would like Statistics Canada to provide access to the different vintages of a given time series of economic or social data within a single table or database. A table or database that contains vintages of data is referred to in the international community as a real-time database.
Real-time databases allow users to examine a given time series of economic or social data as it appeared (and was used) at a given point in time before it was revised. This is helpful to users who may want to examine a policy decision—such as a change in interest rates or tax policy—based on the information that was available to policy makers at the time of the decision. These real-time tables help economic and social statistics users to better analyze the impact and development of policy, to prepare forecasts, and to test econometric models.
The revisions in the real-time data tables are not corrections to errors. They represent a normal step in the statistical process, in which statistical agencies produce new vintages of higher quality data as new information becomes available.
Publishing real-time data tables reflects Statistics Canada's values of transparency, accessibility, interpretability, and increased data relevance for users.
Real-time data tables
Statistics Canada will release real-time data for 21 economic and social time series (Table 1).
The real-time data tables will not replace the current data tables for these time series; they are a new product for data users.
The real-time data tables for these economic and social time series will be released approximately one week after the corresponding standard tables have been released and will have their own reference number. At this point, the tables will contain vintages of data starting with the January 2015 reference period. At a later date, some programs may include earlier reference periods to provide users with a longer time series.
The real-time data tables show all revisions of a specific data point over time. Typically, Statistics Canada releases initial estimates for a given period (month or quarter), revises them in subsequent periods based on new information, then revises them again in an annual or historical revision process. Statistics Canada has determined that the most transparent way to present the vintages is to record the date that the data were released in The Daily, the agency's official release vehicle. The real-time data tables are as easy to use as standard tables, but with one important exception: they contain a vintage dimension that records the date of official release.
For example, let us suppose that on November 29, 2019, gross domestic product data for the third quarter of 2019 were released for the first time. The vintage (release) dimension would record the date as November 29, 2019. Suppose that on May 29, 2020, gross domestic product data for the first quarter of 2020 were released for the first time, along with a revised estimate for the third and the last quarter of 2019. A second entry would be made in the table for the third and the fourth quarter of 2019, and the vintage (release) dimension would have the value of May 29, 2020. As new vintages are added, the real-time tables will display the revised data for selected reference periods in columns.
Likewise, the initial estimates for each reference period appear as the last (non-missing) figure in each column. The comparison between the initial and the most recent estimate therefore represents the difference between the first and last rows in the table for a given reference period. For the most recent reference period, the initial estimate and the most recent estimate are the same.
Twenty-one real-time tables have been released to the public, and represent the majority of the key economic and social indicators produced by Statistics Canada. In most cases, revisions occur because the first vintages of estimates are based on incomplete information. As more up to date information becomes available, the data are revised. In some cases, revisions occur due to changes in concepts or methods. These types of revisions need to be analyzed differently than revisions made using updated information.
To help users in their analysis, the real-time tables will include detailed footnotes to provide context for the revisions. These footnotes should be used along with the data to understand how to interpret the various vintages. Specifically, users should exercise caution, since not taking the footnotes and other related metadata into consideration when using real-time data could lead to erroneous conclusions.