For more information

Archived information

Archived information is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.

To obtain more information on the data, products and services available from the Census of Agriculture, please

  • Call Client Services, Statistics Canada, 1-800-263-1136
  • or contact us by e-mail, fax or phone.

If you have difficulty accessing the Census of Agriculture documents or tables or other Statistics Canada documents, contact the Statistics Canada Help Line at 1-800-949-9491 weekdays from 8:00 a.m. to 5:00 p.m. Eastern Time.

Date modified:

Monthly Retail Trade Survey (MRTS) Data Quality Statement

Objectives, uses and users
Concepts, variables and classifications
Coverage and frames
Sampling
Questionnaire design
Response and nonresponse
Data collection and capture operations
Editing
Imputation
Estimation
Revisions and seasonal adjustment
Data quality evaluation
Disclosure control

1. Objectives, uses and users

1.1. Objective

The Monthly Retail Trade Survey (MRTS) provides information on the performance of the retail trade sector on a monthly basis, and when combined with other statistics, represents an important indicator of the state of the Canadian economy.

1.2. Uses

The estimates provide a measure of the health and performance of the retail trade sector. Information collected is used to estimate level and monthly trend for retail sales. At the end of each year, the estimates provide a preliminary look at annual retail sales and performance.

1.3. Users

A variety of organizations, sector associations, and levels of government make use of the information. Retailers rely on the survey results to compare their performance against similar types of businesses, as well as for marketing purposes. Retail associations are able to monitor industry performance and promote their retail industries. Investors can monitor industry growth, which can result in better access to investment capital by retailers. Governments are able to understand the role of retailers in the economy, which aids in the development of policies and tax incentives. As an important industry in the Canadian economy, governments are able to better determine the overall health of the economy through the use of the estimates in the calculation of the nation’s Gross Domestic Product (GDP).

2. Concepts, variables and classifications

2.1. Concepts

The retail trade sector comprises establishments primarily engaged in retailing merchandise, generally without transformation, and rendering services incidental to the sale of merchandise.

The retailing process is the final step in the distribution of merchandise; retailers are therefore organized to sell merchandise in small quantities to the general public. This sector comprises two main types of retailers, that is, store and non-store retailers. The MRTS covers only store retailers. Their main characteristics are described below. Store retailers operate fixed point-of-sale locations, located and designed to attract a high volume of walk-in customers. In general, retail stores have extensive displays of merchandise and use mass-media advertising to attract customers. They typically sell merchandise to the general public for personal or household consumption, but some also serve business and institutional clients. These include establishments such as office supplies stores, computer and software stores, gasoline stations, building material dealers, plumbing supplies stores and electrical supplies stores.

In addition to selling merchandise, some types of store retailers are also engaged in the provision of after-sales services, such as repair and installation. For example, new automobile dealers, electronic and appliance stores and musical instrument and supplies stores often provide repair services, while floor covering stores and window treatment stores often provide installation services. As a general rule, establishments engaged in retailing merchandise and providing after sales services are classified in this sector. Catalogue sales showrooms, gasoline service stations, and mobile home dealers are treated as store retailers.

2.2. Variables

Sales are defined as the sales of all goods purchased for resale, net of returns and discounts. This includes commission revenue and fees earned from selling goods and services on account of others, such as selling lottery tickets, bus tickets, and phone cards. It also includes parts and labour revenue from repair and maintenance; revenue from rental and leasing of goods and equipment; revenues from services, including food services; sales of goods manufactured as a secondary activity; and the proprietor’s withdrawals, at retail, of goods for personal use. Other revenue from rental of real estate, placement fees, operating subsidies, grants, royalties and franchise fees are excluded.

Trading Location is the physical location(s) in which business activity is conducted in each province and territory, and for which sales are credited or recognized in the financial records of the company. For retailers, this would normally be a store.

Constant Dollars: The value of retail trade is measured in two ways; including the effects of price change on sales and net of the effects of price change. The first measure is referred to as retail trade in current dollars and the latter as retail trade in constant dollars. The method of calculating the current dollar estimate is to aggregate the weighted value of sales for all retail outlets. The method of calculating the constant dollar estimate is to first adjust the sales values to a base year, using the Consumer Price Index, and then sum up the resulting values.

2.3. Classification

The Monthly Retail Trade Survey is based on the definition of retail trade under the NAICS (North American Industry Classification System). NAICS is the agreed upon common framework for the production of comparable statistics by the statistical agencies of Canada, Mexico and the United States. The agreement defines the boundaries of twenty sectors. NAICS is based on a production-oriented, or supply based conceptual framework in that establishments are groups into industries according to similarity in production processes used to produce goods and services.

Estimates appear for 21 industries based on special aggregations of the 2012 North American Industry Classification System (NAICS) industries. The 21 industries are further aggregated to 11 sub-sectors.

Geographically, sales estimates are produced for Canada and each province and territory.

3. Coverage and frames

Statistics Canada’s Business Register ( BR) provides the frame for the Monthly Retail Trade Survey. The BR is a structured list of businesses engaged in the production of goods and services in Canada. It is a centrally maintained database containing detailed descriptions of most business entities operating within Canada. The BR includes all incorporated businesses, with or without employees. For unincorporated businesses, the BR includes all employers with businesses, and businesses with no employees with annual sales that have a Goods and Services Tax (GST) or annual revenue that declares individual taxes.  annual sales greater than $30,000 that have a Goods and Services Tax (GST) account (the BR does not include unincorporated businesses with no employees and with annual sales less than $30,000).

The businesses on the BR are represented by a hierarchical structure with four levels, with the statistical enterprise at the top, followed by the statistical company, the statistical establishment and the statistical location. An enterprise can be linked to one or more statistical companies, a statistical company can be linked to one or more statistical establishments, and a statistical establishment to one or more statistical locations.

The target population for the MRTS consists of all statistical establishments on the BR that are classified to the retail sector using the North American Industry Classification System (NAICS) (approximately 200,000 establishments). The NAICS code range for the retail sector is 441100 to 453999. A statistical establishment is the production entity or the smallest grouping of production entities which: produces a homogeneous set of goods or services; does not cross provincial boundaries; and provides data on the value of output, together with the cost of principal intermediate inputs used, along with the cost and quantity of labour used to produce the output. The production entity is the physical unit where the business operations are carried out. It must have a civic address and dedicated labour.

The exclusions to the target population are ancillary establishments (producers of services in support of the activity of producing goods and services for the market of more than one establishment within the enterprise, and serves as a cost centre or a discretionary expense centre for which data on all its costs including labour and depreciation can be reported by the business), future establishments, establishments with a missing or a zero gross business income (GBI) value on the BR and establishments in the following non-covered NAICS:

  • 4541 (electronic shopping and mail-order houses)
  • 4542 (vending machine operators)
  • 45431 (fuel dealers)
  • 45439 (other direct selling establishments)

4. Sampling

The MRTS sample consists of 10,000 groups of establishments (clusters) classified to the Retail Trade sector selected from the Statistics Canada Business Register. A cluster of establishments is defined as all establishments belonging to a statistical enterprise that are in the same industrial group and geographical region. The MRTS uses a stratified design with simple random sample selection in each stratum. The stratification is done by industry groups (the mainly, but not only four digit level NAICS), and the geographical regions consisting of the provinces and territories, as well as three provincial sub-regions. We further stratify the population by size.

The size measure is created using a combination of independent survey data and three administrative variables: the annual profiled revenue, the GST sales expressed on an annual basis, and the declared tax revenue (T1 or T2). The size strata consist of one take-all (census), at most, two take-some (partially sampled) strata, and one take-none (non-sampled) stratum. Take-none strata serve to reduce respondent burden by excluding the smaller businesses from the surveyed population. These businesses should represent at most ten percent of total sales. Instead of sending questionnaires to these businesses, the estimates are produced through the use of administrative data.

The sample was allocated optimally in order to reach target coefficients of variation at the national, provincial/territorial, industrial, and industrial groups by province/territory levels. The sample was also inflated to compensate for dead, non-responding, and misclassified units.

MRTS is a repeated survey with maximisation of monthly sample overlap. The sample is kept month after month, and every month new units are added (births) to the sample.  MRTS births, i.e., new clusters of establishment(s), are identified every month via the BR’s latest universe. They are stratified according to the same criteria as the initial population. A sample of these births is selected according to the sampling fraction of the stratum to which they belong and is added to the monthly sample. Deaths occur on a monthly basis. A death can be a cluster of establishment(s) that have ceased their activities (out-of-business) or whose major activities are no longer in retail trade (out-of-scope). The status of these businesses is updated on the BR using administrative sources and survey feedback, including feedback from the MRTS. Methods to treat dead units and misclassified units are part of the sample and population update procedures.

5. Questionnaire design

The Monthly Retail Trade Survey incorporates the following sub-surveys:

Monthly Retail Trade Survey - R8

Monthly Retail Trade Survey (with inventories) – R8

Survey of Sales and Inventories of Alcoholic Beverages

The questionnaires collect monthly data on retail sales and the number of trading locations by province or territory and inventories of goods owned and intended for resale from a sample of retailers. The items on the questionnaires have remained unchanged for several years. For the 2004 redesign, the general questionnaires were subject to cosmetic changes only. The questionnaire for Sales and Inventories of Alcoholic Beverages underwent more extensive changes. The modifications were discussed with stakeholders and the respondents were given an opportunity to comment before the new questionnaire was finalized. If further changes are needed to any of the questionnaires, proposed changes would go through a review committee and a field test with respondents and data users to ensure its relevancy.

6. Response and nonresponse

6.1. Response and non-response

Despite the best efforts of survey managers and operations staff to maximize response in the MRTS, some non-response will occur. For statistical establishments to be classified as responding, the degree of partial response (where an accurate response is obtained for only some of the questions asked a respondent) must meet a minimum threshold level below which the response would be rejected and considered a unit nonresponse.  In such an instance, the business is classified as not having responded at all.

Non-response has two effects on data: first it introduces bias in estimates when nonrespondents differ from respondents in the characteristics measured; and second, it contributes to an increase in the sampling variance of estimates because the effective sample size is reduced from that originally sought.

The degree to which efforts are made to get a response from a non-respondent is based on budget and time constraints, its impact on the overall quality and the risk of nonresponse bias.

The main method to reduce the impact of non-response at sampling is to inflate the sample size through the use of over-sampling rates that have been determined from similar surveys.

Besides the methods to reduce the impact of non-response at sampling and collection, the non-responses to the survey that do occur are treated through imputation. In order to measure the amount of non-response that occurs each month, various response rates are calculated. For a given reference month, the estimation process is run at least twice (a preliminary and a revised run). Between each run, respondent data can be identified as unusable and imputed values can be corrected through respondent data. As a consequence, response rates are computed following each run of the estimation process.

For the MRTS, two types of rates are calculated (un-weighted and weighted). In order to assess the efficiency of the collection process, un-weighted response rates are calculated. Weighted rates, using the estimation weight and the value for the variable of interest, assess the quality of estimation. Within each of these types of rates, there are distinct rates for units that are surveyed and for units that are only modeled from administrative data that has been extracted from GST files.

To get a better picture of the success of the collection process, two un-weighted rates called the ‘collection results rate’ and the ‘extraction results rate’ are computed. They are computed by dividing the number of respondents by the number of units that we tried to contact or tried to receive extracted data for them. Non-monthly reporters (respondents with special reporting arrangements where they do not report every month but for whom actual data is available in subsequent revisions) are excluded from both the numerator and denominator for the months where no contact is performed.

In summary, the various response rates are calculated as follows:

Weighted rates:

Survey Response rate (estimation) =
Sum of weighted sales of units with response status i / Sum of survey weighted sales

where i = units that have either reported data that will be used in estimation or are converted refusals, or have reported data that has not yet been resolved for estimation.

Admin Response rate (estimation) =
Sum of weighted sales of units with response status ii / Sum of administrative weighted sales

where ii = units that have data that was extracted from administrative files and are usable for estimation.

Total Response rate (estimation) =
Sum of weighted sales of units with response status i or response status ii / Sum of all weighted sales

Un-weighted rates:

Survey Response rate (collection) =
Number of questionnaires with response status iii/ Number of questionnaires with response status iv

where iii = units that have either reported data (unresolved, used or not used for estimation) or are converted refusals.

where iv = all of the above plus units that have refused to respond, units that were not contacted and other types of non-respondent units.

Admin Response rate (extraction) =
Number of questionnaires with response status vi/ Number of questionnaires with response status vii

where vi = in-scope units that have data (either usable or non-usable) that was extracted from administrative files

where vii = all of the above plus units that have refused to report to the administrative data source, units that were not contacted and other types of non-respondent units.

(% of questionnaire collected over all in-scope questionnaires)

Collection Results Rate =
Number of questionnaires with response status iii / Number of questionnaires with response status viii

where iii = same as iii defined above

where viii = same as iv except for the exclusion of units that were contacted because their response is unavailable for a particular month since they are non-monthly reporters.

Extraction Results Rate =
Number of questionnaires with response status ix / Number of questionnaires with response status vii

where ix = same as vi with the addition of extracted units that have been imputed or were out of scope

where vii = same as vii defined above

(% of questionnaires collected over all questionnaire in-scope we tried to collect)

All the above weighted and un-weighted rates are provided at the industrial group, geography and size group level or for any combination of these levels.

Use of Administrative Data

Managing response burden is an ongoing challenge for Statistics Canada. In an attempt to alleviate response burden and survey costs, especially for smaller businesses, the MRTS has reduced the number of simple establishments in the sample that are surveyed directly and instead derives sales data for these establishments from Goods and Service Tax (GST) files using a statistical model. The model accounts for differences between sales and revenue (reported for GST purposes) as well as for the time lag between the survey reference period and the reference period of the GST file.

For more information on the methodology used for modeling sales from administrative data sources, refer to ‘Monthly Retail Trade Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

Table 1 contains the weighted response rates for all industry groups as well as for total retail trade for each province and territory. For more detailed weighted response rates, please contact the Marketing and Dissemination Section at (613) 951-3549, toll free: 1-877-421-3067 or by e-mail at retailinfo@statcan.

6.2. Methods used to reduce non-response at collection

Significant effort is spent trying to minimize non-response during collection. Methods used, among others, are interviewer techniques such as probing and persuasion, repeated re-scheduling and call-backs to obtain the information, and procedures dealing with how to handle non-compliant (refusal) respondents.

If data are unavailable at the time of collection, a respondent's best estimates are also accepted, and are subsequently revised once the actual data become available.

To minimize total non-response for all variables, partial responses are accepted. In addition, questionnaires are customized for the collection of certain variables, such as inventory, so that collection is timed for those months when the data are available.

Finally, to build trust and rapport between the interviewers and respondents, cases are generally assigned to the same interviewer each month. This action establishes a personal relationship between interviewer and respondent, and builds respondent trust.

7. Data collection and capture operations

Collection of the data is performed by Statistics Canada’s Regional Offices.

Table 1
Weighted response rates by NAICS, for all provinces/territories: February 2013
Table summary
This table displays the results of table 1 weighted response rates by NAICS, for all provinces/territories: February 2013. The information is grouped by NAICS - Canada (appearing as row headers), Weighted Response Rates, Total, Survey, and Administrative (appearing as column headers).
  Weighted Response Rates
Total Survey Administrative
NAICS - Canada
Motor Vehicle and Parts Dealers 92.8 93.7 54.9
Automobile Dealers 94.0 94.4 48.1
New Car Dealers1 95.7 95.7  
Used Car Dealers 68.6 71.8 48.1
Other Motor Vehicle Dealers 77.5 80.6 63.1
Automotive Parts, Accessories and Tire Stores 87.4 93.3 52.1
Furniture and Home Furnishings Stores 81.4 84.4 50.6
Furniture Stores 85.2 87.0 51.5
Home Furnishings Stores 74.8 79.4 50.1
Electronics and Appliance Stores 87.3 88.9 36.3
Building Material and Garden Equipment Dealers 85.0 89.0 49.4
Food and Beverage Stores 87.0 90.8 41.9
Grocery Stores 89.8 93.9 43.9
Grocery (except Convenience) Stores 91.5 95.1 47.6
Convenience Stores 65.9 74.8 18.9
Specialty Food Stores 68.3 75.5 38.8
Beer, Wine and Liquor Stores 79.1 80.6 17.6
Health and Personal Care Stores 90.8 91.8 77.5
Gasoline Stations 85.8 86.8 69.4
Clothing and Clothing Accessories Stores 82.2 83.4 29.3
Clothing Stores 82.4 83.6 32.7
Shoe Stores 89.3 91.1  
Jewellery, Luggage and Leather Goods Stores 75.5 76.1 28.2
Sporting Goods, Hobby, Book and Music Stores 87.1 93.3 21.0
General Merchandise Stores 98.7 99.1 41.8
Department Stores 100.0 100.0  
Other general merchadise stores 97.6 98.3 41.8
Miscellaneous Store Retailers 80.4 86.4 24.1
Total 89.2 91.3 50.4
Regions
Newfoundland and Labrador 90.6 92.0 42.7
Prince Edward Island 89.7 90.9 9.6
Nova Scotia 92.8 94.3 55.7
New Brunswick 84.9 87.3 49.2
Québec 90.0 92.2 60.6
Ontario 89.7 92.1 41.5
Manitoba 89.6 90.0 63.6
Saskatchewan 87.8 89.5 43.0
Alberta 89.0 90.5 59.8
British Columbia 87.3 89.8 39.9
Yukon Territory 76.0 76.0  
Northwest Territories 85.3 85.3  
Nunavut 72.0 72.0  
1 There are no administrative records used in new car dealers

Weighted Response Rates

Respondents are sent a questionnaire or are contacted by telephone to obtain their sales and inventory values, as well as to confirm the opening or closing of business trading locations. Collection of the data begins approximately 7 working days after the end of the reference month and continues for the duration of that month.

New entrants to the survey are introduced to the survey via an introductory letter that informs the respondent that a representative of Statistics Canada will be calling. This call is to introduce the respondent to the survey, confirm the respondent's business activity, establish and begin data collection, as well as to answer any questions that the respondent may have.

8. Editing

Data editing is the application of checks to detect missing, invalid or inconsistent entries or to point to data records that are potentially in error. In the survey process for the MRTS, data editing is done at two different time periods.

First of all, editing is done during data collection. Once data are collected via the telephone, or via the receipt of completed mail-in questionnaires, the data are captured using customized data capture applications. All data are subjected to data editing. Edits during data collection are referred to as field edits and generally consist of validity and some simple consistency edits. They are used to detect mistakes made during the interview by the respondent or the interviewer and to identify missing information during collection in order to reduce the need for follow-up later on. Another purpose of the field edits is to clean up responses. In the MRTS, the current month’s responses are edited against the respondent’s previous month’s responses and/or the previous year’s responses for the current month. Field edits are also used to identify problems with data collection procedures and the design of the questionnaire, as well as the need for more interviewer training.

Follow-up with respondents occurs to validate potential erroneous data following any failed preliminary edit check of the data. Once validated, the collected data is regularly transmitted to the head office in Ottawa.

Secondly, editing known as statistical editing is also done after data collection and this is more empirical in nature. Statistical editing is run prior to imputation in order to identify the data that will be used as a basis to impute non-respondents. Large outliers that could disrupt a monthly trend are excluded from trend calculations by the statistical edits. It should be noted that adjustments are not made at this stage to correct the reported outliers.

The first step in the statistical editing is to identify which responses will be subjected to the statistical edit rules. Reported data for the current reference month will go through various edit checks.

The first set of edit checks is based on the Hidiriglou-Berthelot method whereby a ratio of the respondent’s current month data over historical (last month, same month last year) or auxiliary data is analyzed. When the respondent’s ratio differs significantly from ratios of respondents who are similar in terms of industry and/or geography group, the response is deemed an outlier.

The second set of edits consists of an edit known as the share of market edit. With this method, one is able to edit all respondents, even those where historical and auxiliary data is unavailable. The method relies on current month data only. Therefore, within a group of respondents, that are similar in terms of industrial group and/or geography, if the weighted contribution of a respondent to the group’s total is too large, it will be flagged as an outlier.

For edit checks based on the Hidiriglou-Berthelot method, data that are flagged as an outlier will not be included in the imputation models (those based on ratios). Also, data that are flagged as outliers in the share of market edit will not be included in the imputation models where means and medians are calculated to impute for responses that have no historical responses.

In conjunction with the statistical editing after data collection of reported data, there is also error detection done on the extracted GST data. Modeled data based on the GST are also subject to an extensive series of processing steps which thoroughly verify each record that is the basis for the model as well as the record being modeled. Edits are performed at a more aggregate level (industry by geography level) to detect records which deviate from the expected range, either by exhibiting large month-to-month change, or differing significantly from the remaining units. All data which fail these edits are subject to manual inspection and possible corrective action.

9. Imputation

Imputation in the MRTS is the process used to assign replacement values for missing data. This is done by assigning values when they are missing on the record being edited to ensure that estimates are of high quality and that a plausible, internal consistency is created. Due to concerns of response burden, cost and timeliness, it is generally impossible to do all follow-ups with the respondents in order to resolve missing responses. Since it is desirable to produce a complete and consistent microdata file, imputation is used to handle the remaining missing cases.

In the MRTS, imputation is based on historical data or administrative data (GST sales). The appropriate method is selected according to a strategy that is based on whether historical data is available, auxiliary data is available and/or which reference month is being processed.

There are three types of historical imputation methods. The first type is a general trend that uses one historical data source (previous month, data from next month or data from same month previous year). The second type is a regression model where data from previous month and same month previous year are used simultaneously. The third type uses the historical data as a direct replacement value for a non-respondent. Depending upon the particular reference month, there is an order of preference that exists so that top quality imputation can result. The historical imputation method that was labelled as the third type above is always the last option in the order for each reference month.

The imputation methods using administrative data are automatically selected when historical information is unavailable for a non-respondent. The administrative data source (annual GST sales) is the basis of these methods. The annual GST sales are used for two types of methods. One is a general trend that will be used for simple structure, e.g. enterprises with only one establishment, and a second type is called median-average that is used for units with a more complex structure.

10. Estimation

Estimation is a process that approximates unknown population parameters using only part of the population that is included in a sample. Inferences about these unknown parameters are then made, using the sample data and associated survey design. This stage uses Statistics Canada's Generalized Estimation System (GES).

For retail sales, the population is divided into a survey portion (take-all and take-some strata) and a non-survey portion (take-none stratum). From the sample that is drawn from the survey portion, an estimate for the population is determined through the use of a Horvitz-Thompson estimator where responses for sales are weighted by using the inverses of the inclusion probabilities of the sampled units. Such weights (called sampling weights) can be interpreted as the number of times that each sampled unit should be replicated to represent the entire population. The calculated weighted sales values are summed by domain, to produce the total sales estimates by each industrial group / geographic area combination. A domain is defined as the most recent classification values available from the BR for the unit and the survey reference period. These domains may differ from the original sampling strata because units may have changed size, industry or location. Changes in classification are reflected immediately in the estimates and do not accumulate over time. For the non-survey portion, the sales are estimated with statistical models using monthly GST sales.

For more information on the methodology for modeling sales from administrative data sources which also contributes to the estimates of the survey portion, refer to ‘Monthly Retail Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

The measure of precision used for the MRTS to evaluate the quality of a population parameter estimate and to obtain valid inferences is the variance. The variance from the survey portion is derived directly from a stratified simple random sample without replacement.

Sample estimates may differ from the expected value of the estimates. However, since the estimate is based on a probability sample, the variability of the sample estimate with respect to its expected value can be measured. The variance of an estimate is a measure of the precision of the sample estimate and is defined as the average, over all possible samples, of the squared difference of the estimate from its expected value.

11. Revisions and seasonal adjustment

Revisions in the raw data are required to correct known non-sampling errors. These normally include replacing imputed data with reported data, corrections to previously reported data, and estimates for new births that were not known at the time of the original estimates. Raw data are revised, on a monthly basis, for the month immediately prior to the current reference month being published. That is, when data for December are being published for the first time, there will also be revisions, if necessary, to the raw data for November. In addition, revisions are made once a year, with the initial release of the February data, for all months in the previous year. The purpose is to correct any significant problems that have been found that apply for an extended period. The actual period of revision depends on the nature of the problem identified, but rarely exceeds three years. Time series contain the elements essential to the description, explanation and forecasting of the behaviour of an economic phenomenon: "They are statistical records of the evolution of economic processes through time."1 Economic time series such as the Monthly Retail Trade Survey can be broken down into five main components: the trend-cycle, seasonality, the trading-day effect, the Easter holiday effect and the irregular component.

The trend represents the long-term change in the series, whereas the cycle represents a smooth, quasi-periodical movement about the trend, showing a succession of growth and decline phases (e.g., the business cycle). These two components—the trend and the cycle—are estimated together, and the trend-cycle reflects the fundamental evolution of the series. The other components reflect short-term transient movements.

The seasonal component represents sub-annual, monthly or quarterly fluctuations that recur more or less regularly from one year to the next. Seasonal variations are caused by the direct and indirect effects of the climatic seasons and institutional factors (attributable to social conventions or administrative rules; e.g., Christmas).

The trading-day component originates from the fact that the relative importance of the days varies systematically within the week and that the number of each day of the week in a given month varies from year to year. This effect is present when activity varies with the day of the week. For instance, Sunday is typically less active than the other days, and the number of Sundays, Mondays, etc., in a given month changes from year to year.

The Easter holiday effect is the variation due to the shift of part of April’s activity to March when Easter falls in March rather than April.

Lastly, the irregular component includes all other more or less erratic fluctuations not taken into account in the preceding components. It is a residual that includes errors of measurement on the 1. A Note on the Seasonal adjustment of Economic Time Series», Canadian Statistical Review, August 1974.  A variable itself as well as unusual events (e.g., strikes, drought, floods, major power blackout or other unexpected events causing variations in respondents’ activities).

Thus, the latter four components—seasonal, irregular, trading-day and Easter holiday effect—all conceal the fundamental trend-cycle component of the series. Seasonal adjustment (correction of seasonal variation) consists in removing the seasonal, trading-day and Easter holiday effect components from the series, and it thus helps reveal the trend-cycle. While seasonal adjustment permits a better understanding of the underlying trend-cycle of a series, the seasonally adjusted series still contains an irregular component. Slight month-to-month variations in the seasonally adjusted series may be simple irregular movements. To get a better idea of the underlying trend, users should examine several months of the seasonally adjusted series.

Since April 2008, Monthly Retail Trade Survey data are seasonally adjusted using the X-12- ARIMA2 software. The technique that is used essentially consists of first correcting the initial series for all sorts of undesirable effects, such as the trading-day and the Easter holiday effects, by a module called regARIMA. These effects are estimated using regression models with ARIMA errors (auto-regressive integrated moving average models). The series can also be extrapolated for at least one year by using the model. Subsequently, the raw series—pre-adjusted and extrapolated if applicable— is seasonally adjusted by the X-11 method.

The X-11 method is used for analysing monthly and quarterly series. It is based on an iterative principle applied in estimating the different components, with estimation being done at each stage using adequate moving averages3. The moving averages used to estimate the main components—the trend and seasonality—are primarily smoothing tools designed to eliminate an undesirable component from the series. Since moving averages react poorly to the presence of atypical values, the X-11 method includes a tool for detecting and correcting atypical points. This tool is used to clean up the series during the seasonal adjustment. Outlying data points can also be detected and corrected in advance, within the regARIMA module.

Lastly, the annual totals of the seasonally adjusted series are forced to the annual totals of the original series.

Unfortunately, seasonal adjustment removes the sub-annual additivity of a system of series; small discrepancies can be observed between the sum of seasonally adjusted series and the direct seasonal adjustment of their total. To insure or restore additivity in a system of series, a reconciliation process is applied or indirect seasonal adjustment is used, i.e. the seasonal adjustment of a total is derived by the summation of the individually seasonally adjusted series.

12. Data quality evaluation

The methodology of this survey has been designed to control errors and to reduce their potential effects on estimates. However, the survey results remain subject to errors, of which sampling error is only one component of the total survey error. Sampling error results when observations are made only on a sample and not on the entire population. All other errors arising from the various phases of a survey are referred to as nonsampling errors. For example, these types of errors can occur when a respondent provides incorrect information or does not answer certain questions; when a unit in the target population is omitted or covered more than once; when GST data for records being modeled for a particular month are not representative of the actual record for various reasons; when a unit that is out of scope for the survey is included by mistake or when errors occur in data processing, such as coding or capture errors.

Prior to publication, combined survey results are analyzed for comparability; in general, this includes a detailed review of individual responses (especially for large businesses), general economic conditions and historical trends.

A common measure of data quality for surveys is the coefficient of variation (CV). The coefficient of variation, defined as the standard error divided by the sample estimate, is a measure of precision in relative terms. Since the coefficient of variation is calculated from responses of individual units, it also measures some non-sampling errors.

The formula used to calculate coefficients of variation (CV) as percentages is:

CV (X) = S(X) * 100% / X
where X denotes the estimate and S(X) denotes the standard error of X.

Confidence intervals can be constructed around the estimates using the estimate and the CV. Thus, for our sample, it is possible to state with a given level of confidence that the expected value will fall within the confidence interval constructed around the estimate. For example, if an estimate of $12,000,000 has a CV of 2%, the standard error will be $240,000 (the estimate multiplied by the CV). It can be stated with 68% confidence that the expected values will fall within the interval whose length equals the standard deviation about the estimate, i.e. between $11,760,000 and $12,240,000.

Alternatively, it can be stated with 95% confidence that the expected value will fall within the interval whose length equals two standard deviations about the estimate, i.e. between $11,520,000 and $12,480,000.

Finally, due to the small contribution of the non-survey portion to the total estimates, bias in the non-survey portion has a negligible impact on the CVs. Therefore, the CV from the survey portion is used for the total estimate that is the summation of estimates from the surveyed and non-surveyed portions.

13. Disclosure control

Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.

Confidentiality analysis includes the detection of possible "direct disclosure", which occurs when the value in a tabulation cell is composed of a few respondents or when the cell is dominated by a few companies.

1.0 Introduction

Archived information

Archived information is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.

1.1 Drivers of the Agriculture Statistics Program Review

The need for the information collected from the CEAG goes well beyond the requirements of the agriculture sector. It is used to respond to a broad spectrum of issues such as health, food security and safety, natural resource use, climate change and crisis management. These interconnections are an important distinguishing feature of the data generated by the agriculture statistics program.

The Food and Agriculture Organization (FAO) of the United Nations (UN) summarizes the dilemma that Canada and many countries around the world are currently facing with respect to agriculture data collection:

At the same time as governments face pressure to cut costs, they are also being confronted with increasing and more complex demands for data. There has been growing interest in topics such as food security, the environment, farm labour, and special agricultural practices like organic farming. 5

To meet these challenges, the Agriculture Division of Statistics Canada continually reviews its program to maintain relevance and efficiency. Through the processes of program performance (biennial and quadrennial review), the consultations with AAFC (as part of the interdepartmental letter of agreement) and the quinquennial CEAG user consultations, changing data requirements are regularly reviewed and reflected in the program.

Nevertheless, there are a number of additional factors that warrant a more extensive review of the entire agriculture statistics program at this time:

  • Statistics Canada must review its census programs, including the CEAG, prior to the next cycle.
  • Increasing concern regarding the burden placed on producers to provide not only statistical information, but also information for the administration of agricultural programs, warrants a review of the means by which data on the industry are collected. In keeping with the need to reduce response burden is the federal government's launching of the Red Tape Reduction Commission, whose mandate is (in part) "… to get rid of unnecessary intrusions." 6
  • Statistics Canada is rationalizing and centralizing the way that business surveys will be collected and processed. In preparation for these changes, the review (and subsequent transition) of the agriculture statistics program has been in progress since 2010/2011.7
  • Additionally, all federal government departments have been tasked with undertaking a strategic review of current programs and processes with a view to gaining efficiencies as part of the federal deficit reduction action plan.
  • The Statistics Act includes a provision for cancelling a quinquennial CEAG (in years ending in "6"). Therefore, it needs to be determined whether the 2016 CEAG is necessary, and if so, to establish the requirements for the 2016 CEAG.

As a result of these factors, the Agriculture Division endeavoured to answer the following key questions:

  1. Is a CEAG still the best way to meet the data requirements for policy and program purposes? If so, what should its frequency be? More specifically, is a CEAG required in 2016?
  2. Given the data requirements for policy purposes, is the CEAG in its current form the most efficient way to gather the information, and are there efficiencies to be gained in the CEAG?
  3. How can the agriculture statistics program as a whole be streamlined to reduce response burden and costs, while continuing to meet priority data requirements?

To respond to these questions, the Agriculture Division undertook the following activities:

  • A legislative review was conducted.
  • Consultations were held with key stakeholders from federal, provincial and territorial governments, municipal and regional land-use planners, agriculture producer organizations and industry representatives, as well as the Advisory Committee on Agriculture Statistics. A detailed survey was conducted of the Agriculture Division's clients to illuminate data requirements and to narrow down which ones are critical. Workshops were held with the key stakeholders and with Agriculture Division staff.
  • Respondent burden within the current program was analyzed. Discussions with the provinces were also held regarding burden placed on agriculture respondents through data collection at the provincial level.
  • The current program and its integration with other programs within the Agency were reviewed in depth.
  • An international review of other countries' agriculture statistics programs was conducted.

The information gathered from conducting these exercises resulted in the development of several alternative options for the delivery of the agriculture statistics program. The report analyzes the advantages and disadvantages of various options. Strategies are identified for reducing response burden and finding further efficiencies for the delivery of the entire agriculture statistics program while bearing in mind the requirements for agriculture data.

1.2 Structure of the report

The report first presents an overview of the need for agriculture data, particularly in the current state of volatility in the food production industry. Next, a brief summary of the legislative review is presented, followed by an examination of the different applications of the quinquennial CEAG data.

The international review presents agriculture survey programs, CEAGs and agricultural remote-sensing applications in various countries throughout the world. The current Canadian program was examined in light of the information gleaned from the international review, which resulted in the development and evaluation of different options for delivery of the Canadian agriculture statistics program.

Date modified:

5.0 Answering the key questions

Archived information

Archived information is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.

As a result of the research done, the Agriculture Division is able to respond to the three questions asked at the outset of this review.

1) Is a CEAG still the best way to meet the data requirements for policy and program purposes? If so, what should its frequency be? More specifically, is a CEAG required in 2016?

The review confirmed that a complete enumeration of the agriculture industry (CEAG) is needed to meet the policy and program requirements of federal and provincial governments, industry and other key stakeholders. The activities conducted by the federal and provincial governments that depend on this completeness are numerous, including health policy, land planning, crisis management, international trade, environmental accounting and reporting, and global commitments to the international community. Several of these inter-disciplinary activities are massive undertakings spanning more than one federal department as well as provincial governments. However, it is not solely the agriculture industry that relies on data from the CEAG. In the absence of another source of data that fully enumerates the industry, a CEAG is still the best way to meet these requirements.

The quinquennial CEAG data required to re-align survey estimates and survey frames are also critical to Statistics Canada, other federal departments (particularly AAFC) and provincial governments. At this time, no other source of information exists from which to extract this information. Intercensal frame deterioration is a current challenge despite the fact that a CEAG is conducted every five years. The magnitude of the intercensal revisions can sometimes be significant as a result. For policy and program evaluation and performance reporting to Treasury Board, re-alignment of the estimates decennially is insufficient, as it can lead to programs that are out of alignment for a long period of time. Accurate estimates are especially important given the volatility in the industry and the level of support that governments disburse to the agri-food industry.

Through consultations with key data users and through the assessment of the requirements for agriculture statistics thus far, it has become evident that a CEAG conducted less frequently than every five years will result in data gaps that could not be filled by any other means in the short and medium term. Preliminary discussions with the major stakeholders revealed that they are unprepared for these data gaps. In the absence of the 2016 CEAG, the work performed by several federal and provincial departments would be impacted due to the cross-sectional nature of many policies and programs. The volatility in the industry discussed earlier in the report further raises the need for a quinquennial enumeration of the industry. The most affected external departments would be AAFC, Health Canada, Environment Canada and the provincial governments. The absence of the 2016 CEAG data would also impact the entire agriculture statistics program, due to its integrated nature. These reasons support conducting a CEAG in 2016.

2) Given the data requirements for policy purposes, is the CEAG in its current form the most efficient way to gather the information, and are there efficiencies to be gained in the CEAG?

There are efficiencies that could be gained in the CEAG over time that could provide for the requirements of complete enumeration and survey re-alignment. Several features have been identified that could reduce some of the burden and cost associated with the current CEAG, while continuing to satisfy the requirements of complete enumeration and survey re-alignment that are so critical on a quinquennial basis.

The chosen option must take into consideration the requirements for policy and program development, monitoring and evaluation by collecting the required information from all agriculture producers, but limiting the CEAG content to these specific requirements.24 Regardless of the chosen option, eliminating questions that can be replaced with taxation data or other administrative data could reduce the CEAG content considerably over time.

3) How can the agriculture statistics program as a whole be streamlined to reduce response burden and costs, while continuing to meet priority data requirements?

The current Canadian agriculture statistics program comprises a quinquennial CEAG, the commodity-specific surveys, administrative and taxation data, remote sensing, the agriculture economic statistics series as well as the research and analysis work. These components can be considered bricks that constitute the foundation of the program.

Taking a progressive approach to integrating new features into the current Canadian system reduces the risk of incurring significant investments in an entirely revised program and also reduces the risks of error and loss of coherence associated with more substantial changes. By maintaining the foundation of the current program, the basic structure could remain intact while being adapted over a period of several years. In this way, the strategies of replacing survey data with administrative data, taxation data and remote sensing technology will result in efficiencies, while minimizing the risks to the relevance, coherence and accuracy of the program.

Respondent burden could be reduced over time as new administrative data sources are identified, evaluated and incorporated into the agriculture statistics program. Remote sensing also has the potential to play a more important role in supporting the agriculture statistics program. Further work will be required to quantify the investments, savings and timelines associated with the adoption of administrative data and remote sensing technology.

Survey response burden could be alleviated by reducing either the target population or the survey population. Reducing the target population would affect the coherence and comparability of the data, whereas maintaining the same target population and reducing the number of farms eligible to be surveyed could allow the currently published estimates to be maintained. The quinquennial CEAG provides regular data for the modelling of the non-surveyed population. This strategy requires no investment to adjust historical data to a new target population definition.

Further cost savings could also be introduced into the program by rationalizing and reducing the number of survey occasions per year for some crop, horticulture and livestock surveys where user data requirements can continue to be met. In addition, further response co-ordination in the CEAG years could reduce response burden over time as other sources of data become incorporated into the program.

It is possible to continue to develop a revised agriculture statistics program that respects the international agriculture statistics priorities and guidelines, namely:

Several of the features presented in the various options could result in further opportunities for reducing response burden and finding cost efficiencies. Increasing the utilization of remote sensing, increasing the incorporation of taxation data and administrative data will result in reduced response burden, realized cost efficiencies and a good quality program in the medium to longer term.

Date modified:

9.0 Endnotes

Archived information

Archived information is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.

  1. For the purposes of this report, the word "farm" represents all agricultural operations as per the Census of Agriculture:
    Agricultural operation
    A farm, ranch or other agricultural operation producing agricultural products for sale. Also includes: feedlots, greenhouses, mushroom houses and nurseries; farms producing Christmas trees, fur, game, sod, maple syrup or fruit and berries; beekeeping and poultry hatchery operations; operations with alternative livestock (bison, deer, elk, llamas, alpacas, wild boars, etc.) or alternative poultry (ostriches, emus, etc.), when the animal or derived products are intended for sale; backyard gardens if agricultural products are intended for sale; operations involved in boarding horses, riding stables and stables for housing and/or training horses even if no agriculture products are sold. Sales in the past 12 months not required but there must be the intention to sell.
    NOTE: For the Yukon, Nunavut and Northwest Territories only, the definition also includes operations involved in the following: herding wild animals (such as caribou and muskox), breeding sled dogs, horse outfitting and rigging, and harvesting indigenous plants and berries.
    http://www.statcan.gc.ca/ca-ra2006/gloss-eng.htm
     
  2. Response burden reduction in this report also includes the need to reduce red tape as identified by the Government of Canada's (GoC) Red Tape Reduction Commission.
    http://www.reduceredtape.gc.ca/why-pourquoi/grow-croitre01-eng.asp#toc2 (accessed June 4, 2012).
     
  3. The key federal users consulted were Agriculture and Agri-Food Canada (AAFC), Environment Canada, Health Canada and the following AAFC portfolio partners: the Canadian Food Inspection Agency (CFIA), the Canadian Grain Commission, the Canadian Dairy Commission, Farm Credit Canada and the Farm Products Council of Canada. The key provincial users consulted were the stakeholders from the provincial and territorial agriculture ministries and statistics agencies. The key industry stakeholders represented producer organizations and industry clients of the Agriculture Division.
     
  4. For the purposes of CEAG planning, the short term is defined as the next five years; medium term is defined as the next five to ten years; and the long term is any timeframe longer than ten years with further precision indicated where possible.
     
  5. Food and Agriculture Organization (FAO) of the United Nations, 2007, A System of Integrated Agricultural Censuses and Surveys, Volume 1, World Programme for the Census of Agriculture 2010, Rome.
    http://www.fao.org/docrep/009/a0135e/A0135E04.htm#ch1 (accessed June 4, 2012).
     
  6. PM Announces Red Tape Reduction Commission.http://pm.gc.ca/eng/media.asp?id=3894 (accessed June 4, 2012).
     
  7. Specifically, the Corporate Business Architecture (CBA) initiative.
     
  8. Derek Burleton and Dina Cover, 2011, Unprecedented Volatility A Hallmark of Agriculture's New Age, TD Economics, p. 4.
    http://www.td.com/document/PDF/economics/special/dc1111_agriculture.pdf (accessed June 4, 2012).
     
  9. Ministerial Declaration: "Action Plan on Food Price Volatility and Agriculture," Meeting of the G20 Agriculture Ministers, (Paris), June 22-23, 2011. p. 2.
    http://un-foodsecurity.org/sites/default/files/110623_G20_AgMinisters
    _Action_Plan_Agriculture_Food_Price_Volatility.pdf
    (accessed June 4, 2012).
     
  10. Canada's response to Action Two of the "Action Plan on Food Price Volatility and Agriculture" states that the data on crops produced eight times per crop year meet the AMIS requirements. It also states that the data collected on the major classes of livestock are sufficient. It further cites Statistics Canada's remote sensing Crop Condition Assessment Program (CCAP) data meet the requirements to estimate yield models and production of some crops in Canada. Accountability for G20 Food Security Commitments, AAFC correspondence, September 2011.
     
  11. FAO, op.cit., p. 60.
     
  12. Agriculture and Agri-Food Canada, 2011, An Overview of the Canadian Agriculture and Agri-food System, Ottawa, Ontario.
    http://www4.agr.gc.ca/AAFC‑AAC/display-afficher.do?id=1295963199087&lang=eng (accessed June 4, 2012).
     
  13. Statistics Act, section 20 : A census of agriculture of Canada shall be taken by Statistics Canada
    • (a) in the year 1971 and in every tenth year thereafter; and
    • (b) in the year 1976 and in every tenth year thereafter, unless the Governor in Council otherwise directs in respect of any such year. 1970-71-72, c. 15, s. 19.
      http://laws-lois.justice.gc.ca/eng/acts/S-19/FullText.html (accessed June 4, 2012).
       
  14. Statistics Act, section 22: Without limiting the duties of Statistics Canada under Section 3 or affecting any of its powers or duties in respect of any specific statistics that may otherwise be authorized or required under this Act, the Chief Statistician shall, under the direction of the Minister, collect, compile, analyze, abstract and publish statistics in relation to all or any of the following matters in Canada:
    • (a) population;
    • (b) agriculture;
    • (c) health and welfare;
    • (d) law enforcement, the administration of justice and corrections;
      …; and
    • (u) any other matters prescribed by the Minister or by the Governor in Council. 1970-71-72, c. 15, s. 21; 1976-77, c. 54, s. 74.
      http://laws-lois.justice.gc.ca/eng/acts/S-19/FullText.html (accessed June 4, 2012).
       
  15. Early censuses included questions on population and agriculture together. Starting in 1896, a separate CEAG was conducted in Manitoba, and in Alberta and Saskatchewan beginning in 1906. The CEAG has been conducted every five years in the Prairie provinces since 1906.
     
  16. The Council on Food, Agricultural and Resource Economics (C_FARE), 2007, Improving Information About America's Farms and Ranches: A Review of the Census of Agriculture, Washington, D.C.
    http://www.cfare.org/publications/20070307cfare_census_review_Full_Report.pdf,
    (accessed June 4, 2012).
     
  17. The Survey of Household Spending shows that in 2009 the broadband connection for internet use in rural Canada was 28% compared with 50% for population centres of 500,000 and over. Statistics Canada, 2010, Survey of Household Spending, 2009, Ottawa, Ontario.
     
  18. The Regulations (EC) No 1166/2008 and No 1200/2009 regulate the content and conduct of the European agricultural surveys and census.
     
  19. Statistical Clearing House, Australian Bureau of Statistics, http://www.nss.gov.au/nss/home.nsf/pages/About+SCH (accessed June 4, 2012).
     
  20. The Farm Register will be migrated to the Business Register (BR) in 2012.
     
  21. FAO, op.cit. p. 18.
     
  22. Ibid.
     
  23. Don Royce, 2011, Preliminary Report on Methodology Options for the 2016 Census, Statistics Canada, Ottawa, Ontario.
    http://www12.statcan.gc.ca/strat/index-eng.cfm (accessed June 4, 2012).
     
  24. The FAO recommendations for conducting a CEAG state that a CEAG should be conducted more frequently than every ten years. The recommendations suggest that a CEAG be conducted based on complete enumeration for the core content required by policy makers. Further data not considered essential for policy making are to be collected from a sample of the population either concurrently with the CEAG or post-censally. The FAO recommendations are based on the internationally recognized need to reduce collection and processing costs for agriculture censuses as well as to provide for the increasing amount of information sought from the CEAG.
    Source: World Bank and FAO, 2010, The Global Strategy to Improve Agricultural and Rural Statistics, Report Number 56719-GLB, Washington D.C.
    http://www.fao.org/fileadmin/templates/ess/documents/meetings_and_workshops/
    seminar_on_global_strategy_22_06_2009/global_strategy_document_20090622.pdf
    (accessed June 4, 2012).
     
  25. FAO, op.cit.
     
  26. World Bank and FAO, op.cit.
Date modified:

4.0 Options

Archived information

Archived information is provided for reference, research or recordkeeping purposes. It is not subject to the Government of Canada Web Standards and has not been altered or updated since it was archived. Please "contact us" to request a format other than those available.

The development and evaluation of alternative options involved several steps:

  • A review team was developed consisting of a group of Agriculture Division managers, methodologists from Business Surveys Methods Division and a representative from AAFC.
  • The key overarching considerations to be taken into account during the assessment of the alternative options were developed and agreed upon by the Review Team and Senior Management at Statistics Canada.
  • Three alternative options were developed based on the priority data requirements for the Canadian program, coupled with the international review.
  • Detailed criteria were developed to evaluate the alternative models. (These criteria are presented in Table 2 on the following page.)
  • The three alternative options were subsequently evaluated against the current Canadian program. The essential conditions and investments required for implementation of the models were determined along with each option's strengths, weaknesses and risks.
  • The most attractive attributes of the three alternative options within the Canadian context were combined to develop two hybrid options, which were subsequently evaluated.

4.1 Evaluation of the options

The following table lists the criteria that were used to evaluate the options. A total of 32 evaluation criteria were identified and organized into 10 categories. They include Statistics Canada's six elements of quality (relevance, accuracy, timeliness, coherence, interpretability and accessibility) as well as a number of other categories that merit special consideration in the context of the agriculture statistics program (cost, response burden, operational feasibility and acceptability).

Table 2: Option evaluation criteria
Quality Evaluation criteria
Relevance: Content
Frequency
Target population
Small area data needs
Accuracy: (Reliability) CVs of important survey variables
Bias of important survey variables
Available information to identify the target population and associate it with the data
Quality of 'take-none' modelling (defining the survey population)
Accuracy of the data sources
Coherence:(Comparability) Coherence of important survey variable time series
Coherence of data between sources
Timeliness: Impact of the data source on the timeliness
Interpretability: Details available on administrative files
Accessibility: Data suppression
Availability of supplementary information
Respondent Burden: Number of contacts per unit
Interviewing time per unit
Sensitive content
Burden on people other than survey respondents
Burden placed on respondents by entities other than Statistics Canada
Cost: Collection costs
Post-collection costs
Development costs
Cost sustainability
Costs to other organizations in the system
Compliance costs to farmers
Operations: Ability to react quickly to new needs
Ability to conduct large occasional surveys
Timing of and time necessary for implementation
Statistics Canada Corporate Business Architecture compliance
Acceptability: Acceptance in the data user community
Acceptance in the respondent community

The key features, strengths, weaknesses, risks and investments of the current Canadian program are presented next, followed by the evaluation of the alternative models that were explored.

4.2 Baseline option: The current Canadian program

The current Canadian program is a highly integrated system. The production data from both the crops and livestock sections combined with prices from both survey and administrative sources generate the farm cash receipts. Expenses, derived largely from administrative sources, serve to generate the net farm income estimates. As well, data from CEAG flow into the commodity programs, while data from the commodity, farm income and prices programs are used to validate the CEAG data. Survey frame updates flow from the survey programs into the Farm Register,20 and, subsequently, into the CEAG, and vice versa. The integrated nature of the program requires that for any proposed change to part of the program, impacts on the other components of the program must be assessed.

Description for figure 1

Figure 1: The Current Agriculture Statistics ProgramThe Current Agriculture Statistics Program

4.2.1 Key features

  • A CEAG is conducted nationally every five years in years ending in "1" and "6." Response burden is minimized during the years that the CEAG is conducted. For the 2011 CEAG, follow-up calls were eliminated or co-ordinated for the FFS sample, some surveys were cancelled, the cap-on-calls for the majority of surveys was reduced and the sample size for the July livestock survey was reduced significantly.
  • The CEAG is linked to the CEPOP/NHS in years ending in "1" and "6" to provide socioeconomic data.
  • The target population for both the CEAG and surveys includes all farms with the intention to sell agricultural products. This definition provides comprehensive coverage to users.
  • The survey population varies by survey; some survey samples exclude operations based on a minimum size threshold or for specific farm types. (For example, the FFS excludes operations with complex structures, farms on First Nations Reserves, community pastures and farms with less than $10,000 in gross sales.)
  • A frame maintenance program includes information from both administrative sources and a short survey to update and maintain the Farm Register.
  • The survey program is commodity specific and comprises field crop, horticulture, livestock and financial surveys.
  • Remote sensing delivers the CCAP, which combines earth observation, geographic information systems (GIS) and the Internet to provide near-real time information on crop and pasture/rangeland conditions using a mapping application for agricultural land.
  • Administrative data are an integral part of the program (approximately 140 different sources are incorporated) including tax data, marketings, prices, imports, exports, production, debt, inspections data, etc. These data are provided by Canada Revenue Agency, provincial administrations, national producer organizations, AAFC and Statistics Canada's International Trade Division.

4.2.2 Strengths

  • The five-year interval between CEAGs maintains the relevance and usefulness of the data to users. The CEAG data used for policy development and evaluation, program monitoring, benchmarking, measuring industry structural changes, supporting legislative and regulatory instruments and for trade purposes are perceived to be sufficiently frequent.
  • Although some data gaps exist, this model meets the majority of user requirements for small area data, benchmarking and critical survey frame information.
  • This model has the advantage of reliability and predictability since the program has been running successfully for a long period of time.

4.2.3 Weaknesses

  • Response burden is a concern in an environment where the Government of Canada is firmly committed to reducing red tape.
  • The cost of the program is a concern in an environment of deficit reduction and increased efficiency.
  • Despite some very rapidly produced statistics, there are some concerns with the timeliness of some of the statistics.

4.2.4 Essential conditions

  • The current program has developed over time with the funding, technology and infrastructure required, so the essential conditions for this option are in place. However, fiscal pressures and commitments to reduce response burden are raising uncertainty as to the sustainability of this model.
  • The Corporate Business Architecture (CBA) is transforming the way that Statistics Canada collects and compiles data. The entire agriculture statistics program will complete the transition to the CBA in 2014-15. The CBA is expected to increase the overall efficiency of the agriculture program.
  • The transition from the Farm Register to the Business Register in 2012 is also expected to reduce the cost of the frame and will allow the Division to measure and manage response burden in a more global manner.

4.2.5 Required investments

  • Regular maintenance costs and post-censal redesign for an existing survey system.
  • Investments required for each CEAG cycle through Treasury Board submissions.

4.2.6 Risks

  • Should the 2016 CEAG be cancelled by Order in Council, there would not be time to fully replace it. There would therefore be significant data gaps, particularly related to benchmark data, small area data and frame update information.

4.3 Option 1: The modified British Model

4.3.1 Key features

  • A CEAG would be conducted every 10 years (in years ending in "1").
  • The CEAG would be linked to the CEPOP/NHS in years ending in "1" to provide socioeconomic data.
  • Two annual modular surveys would replace 12 of the current commodity-specific surveys conducted throughout the year. These surveys would be conducted in June and December. Different commodities would be collected together but subsequently processed and disseminated separately. Overlap between commodities would be controlled to reduce burden for diversified farms. (For example, a mixed livestock and crops farm may only receive the crops module for one survey occasion and the livestock module on another survey occasion. The operation would not automatically receive both modules on every selected survey occasion.) The number of survey occasions per year would be reduced for field crop, horticulture and livestock surveys.
  • The sample size and content of the June Modular Survey would be expanded in the years ending in "5" and "8" to compensate for some of the data loss due to the absence of a CEAG in years ending in "6." In these two years, comprehensive survey modules would be integrated, so that analysis can be conducted at the whole farm level as is presently the case with the CEAG. (For example, a mixed livestock and crops farm will receive both the crops and the livestock modules in these years.)
  • Tax data would be used to replace all comparable financial questions on the CEAG and on surveys.
  • The target population for both the CEAG and surveys would exclude smaller farms under a specified production threshold, (for example, an amount of cultivated land, livestock, other criteria or combination thereof), for reasons of burden and cost.
  • The survey population would be equivalent to the target population.
  • A regular frame maintenance program would include a short survey to complete missing information from new farm tax filer records and those operations not recently surveyed to update and maintain the agriculture frame on the Business Register. (It should be noted that some of these activities are already carried out in the current program.)
  • A small number of commodity-specific surveys (Greenhouse, Sod and Nursery Survey, Mushroom, Maple, etc.) would continue to exist because of their unique requirements.

4.3.2 Strengths

  • This option mitigates some of the risk of data loss by providing a subset of data requirements should the CEAG in years ending in "6" be cancelled by Order in Council.
  • Structural changes, new production and trends would be captured by the CEAG in years ending in "1" and partially captured with the two expanded occasions of the June Modular Survey in years ending in "5" and "8."
  • The new annual modular surveys provide a flexible, regular vehicle to identify and address emerging issues.

4.3.3 Weaknesses

  • This option's content and sample size increases in years ending in "5" and "8" do not meet user needs for small area and custom geographic data, for provincial benchmarking data or for the enumeration of rare or emerging commodities.
  • This option provides a reduced level of survey frame information, even with an increase in sample size in years ending in "5" and "8." This would lead to frame deterioration and a related decrease in data accuracy from the survey program over the intercensal period.
  • The ten year gap between CEAGs would reduce the relevance and usefulness of the data to users. The CEAG data used for policy development and evaluation, support of legislative and regulatory instruments and for trade purposes is likely to become out of date before the next CEAG is conducted.
  • The generalized survey design for integrated surveys would not be ideal for some commodities.
  • The timeliness of data releases would be affected.
  • This option does not allow the entire population to be measured. The program will no longer cover 100% of agriculture activity in Canada. This loss of coherence and comparability of the data would require transition data (back-casting) and technical assistance to data users to make adjustments for changes to the coverage of the target population and the availability and frequency of data.

4.3.4 Essential conditions

  • An Order in Council would be required to cancel a CEAG in years ending in "6."
  • Technology and procedures would have to exist to deliver the modular surveys in an intelligent manner, so that response burden and collection costs could be controlled. To minimize burden and collection costs, delivery and collection of the appropriate modules (crops, livestock, financial, other) would need to be established prior to collection. The appropriate module would be determined from CEAG information and frame and survey update information.
  • Due to the lengthy gap between CEAGs, an enhanced coverage program would have to be implemented to maintain the data required to determine whether an operation should be included in the target population based on the predetermined threshold. This coverage program could include access to AAFC administrative program data such as crop insurance, AgriInvest and AgriStability, supplemented with a frame update survey. Both the access to and processing of the administrative data would require development.

4.3.5 Required investments

  • The new modular surveys would have to be designed, tested, developed and implemented.
  • Historical data would have to be adjusted to match the new target population definition. This includes the development of user training material to avoid data misuse and misinterpretation and to clarify the impacts of the change to the target population.
  • Alternative sources of commodity data would have to be developed at the micro level (for example, program data such as crop insurance, AgriInvest and AgriStability). This is necessary to establish and maintain threshold information on the agriculture frame on the Business Register between CEAGs and the large survey years.

4.3.6 Risks

  • There could be negative reaction from data users regarding
    • changes to the target population
    • the loss of small area data
    • the loss of provincial benchmarking data
    • the increase in reaction time to capture new trends and industry structural changes
    • the timeliness of specific annual commodity data that would be included in the two integrated surveys.
  • The two very large and comprehensive surveys in years ending in "5" and "8" may result in as much response burden as the CEAG in years ending in "6".

4.4 Option 2: The modified Australian/American Model

4.4.1 Key features

  • A CEAG would be conducted every 10 years (in years ending in "1"). (It should be noted, however, that both Australia and the US conduct censuses of agriculture every five years.)
  • The CEAG would be linked to the CEPOP/NHS in years ending in "1," to provide socioeconomic data.
  • One new large survey would be conducted in years ending in "6" to compensate for some of the data loss due to the lack of a CEAG in those years.
  • Tax data would be used to replace all comparable financial questions on the CEAG and surveys.
  • The target population for both the CEAG and surveys would exclude farms under an estimated value of agricultural operation. (For example, US = $1,000 USD; Australia = $5,000 AUD.)
  • The survey population would be equivalent to the target population.
  • A regular frame maintenance program would include a short survey to complete missing information from new farm tax filer records and those operations not recently surveyed to update and maintain the agriculture frame on the Business Register.
  • The survey program would remain commodity specific much like the current program. However, the number of survey occasions per year would be reduced for some crop, horticulture and livestock surveys.
  • Remote sensing would play an increasingly important role. This technology would be integrated into the agriculture statistics program as it becomes mature. The initial focus would be on replacing the Potato Area Survey and the July and September Field Crop Reporting Surveys in the Prairie provinces.
  • Administrative data would play an increasingly important role. These data would be integrated into the agriculture statistics program as they become available.
  • This option expands on current partnerships and promotes new partnerships with federal, provincial and industry stakeholders. These partnerships would be necessary to share responsibility for the development, collection and compilation of administrative data (such as AAFC's AgriInvest and AgriStability programs and livestock traceability data).

4.4.2 Strengths

  • This option would achieve cost savings and reduce response burden by conducting a new large survey in years ending in "6" instead of conducting a CEAG.
  • This option mitigates some of the risk of data loss by providing a subset of data requirements should the CEAG in years ending in "6" be cancelled by Order in Council.
  • Since the annual agriculture surveys remain relatively similar, this option would be expected to have little impact on data users in terms of timeliness and survey content.

4.4.3 Weaknesses

  • Despite the content and sample size increases of the new large survey in years ending in "6," this option does not meet user needs for small area data, custom geographic data, provincial benchmarking data or for the enumeration of rare or emerging commodities.
  • This option provides a reduced level of survey frame information, even with an increase in sample size in years ending in "6." This would lead to frame deterioration and a related decrease in data accuracy from the survey program over the intercensal period.
  • The ten year gap between CEAGs would reduce the relevance and usefulness of the data to users. The CEAG data used for policy development and evaluation, support of legislative and regulatory instruments and for trade purposes is likely to become out of date before the next CEAG is conducted.
  • This option does not allow the entire population to be measured. The program will no longer cover 100% of agriculture activity in Canada. This loss of coherence and comparability of the data would require transition data (back-casting) and technical assistance to data users to make adjustments for changes to the coverage of the target population and the availability and frequency of data.

4.4.4 Essential conditions

  • An Order in Council would be required to cancel a CEAG in years ending in "6."
  • To increase the use of administrative data, it would be necessary to renegotiate existing partnerships or develop new ones with federal, provincial and industry stakeholders. These agreements would establish protocols for data sharing, confidentiality and protection. The collaboration of multiple players over several jurisdictions would have to be established and maintained. This commitment must begin at the highest levels in the participating organizations and extend to the working level.
  • Federal, provincial and industry data holders would need to include a declaration to their data providers (farm operators) regarding the provision of data for statistical purposes. There may be a need to change legislation.
  • Agriculture respondents would have to be aware of and support the increasing use of administrative data, being aware of the associated benefits and risks.
  • A feasibility study would be required to fully evaluate the costs, benefits, risks and potential timeframes for incorporating administrative data sources and increased use of technology (such as remote sensing) into the program.
  • A methodologically sound and realistic framework through which new sources of administrative data could be identified, evaluated, incorporated and operationalized in the program must be developed to reduce the risk of errors.

4.4.5 Required investments

  • Historical data would have to be adjusted to match the new target population definition. This includes the development of user training material to avoid data misuse and misinterpretation and to clarify the impacts of the change to the target population.
  • Alternative sources of commodity data would have to be developed at the micro level (for example, program data such as crop insurance, AgriInvest and AgriStability) for statistical purposes and to establish and maintain threshold information on the agriculture frame on the Business Register between CEAGs and the new large survey years.
  • Remote sensing would have to be developed to completely or partially replace traditional field crop surveys. A Land Area Survey would need to be developed. The data from this survey combined with administrative data (e.g., crop insurance data) would be used to calibrate remote sensing results.
  • A new large survey to replace the CEAG in years ending in "6" would have to be developed and implemented.

4.4.6 Risks

  • There could be negative reaction from data users regarding
    • changes to the target population
    • the loss of small area data
    • the loss of provincial benchmarking data
    • the increase in reaction time to capture new trends and industry structural changes.
  • Increased reliance on administrative data sources may put the coherence, comparability and sustainability of the data at risk due to changes in programs, regulations or provider partners over time.

4.5 Option 3: The modified Scandinavian Model

4.5.1 Key features

  • Administrative data form the basis of Option 3. There would be no traditional CEAG. (It should be noted, however, that all Scandinavian countries conduct a traditional CEAG every 10 years.) An administratively based CEAG could potentially be conducted on an annual basis if sufficient information existed.
  • Linking of the CEAG to the CEPOP/NHS should be possible for the years that the CEPOP/NHS are conducted to provide socioeconomic data.
  • Existing administrative data sources would be expanded to include new sources, as they become available. For example,
    • crop insurance
    • AgriInvest, AgriStability and Business Risk Management programs
    • CFIA data
    • national producer organizations (NPOs)
    • livestock traceability systems.
  • A farm structure survey would be conducted every three years to address administrative data gaps, monitor changes, measure emerging trends and perform frame updates and maintenance.
  • Tax data would be used to replace all comparable financial questions on the CEAG and surveys.
  • The target population for both the CEAG and surveys would exclude farms under a specific sales threshold.
  • The survey population would be equivalent to the target population.
  • A regular frame maintenance program would include a short survey to complete missing information from farm tax filer or administrative data records to update and maintain the agriculture frame on the Business Register.
  • A small program of other surveys would be run each year, if necessary, to cover data requirements not covered by administrative data or farm structure survey data (on specific commodities such as fur production). As more administrative sources become available, more survey data would be replaced by administrative data.
  • This option depends on reliable, complete, timely, stable and accessible administrative information covering the target population.

4.5.2 Strengths

  • This option achieves significant cost savings in the long term by cancelling the CEAG in years ending in "1" and "6."
  • Because it uses data already collected for administrative purposes, the marginal costs of producing statistics are generally much less than for a traditional CEAG or commodity-specific survey (once the databases, systems, and data-sharing and protection protocols are in place).
  • This option has the potential to reduce survey response burden by replacing the traditional CEAG and survey program with an administratively based CEAG and survey program that uses data already collected for other purposes.
  • Like the traditional CEAG, the administratively based CEAG can meet the objectives of the FAO features of a CEAG, which are to provide data on the structure of agriculture (from small administrative units) that enable detailed cross-tabulations to use as benchmarks for current agriculture statistics and frames for agricultural sample surveys.21
  • An administratively based CEAG may be able to produce data on a yearly basis, compared to every five or ten years for a traditional CEAG.
  • The administratively based CEAG may be used to identify subgroups for surveys, if needed, depending on the variables available.

4.5.3 Weaknesses

  • Few of the essential conditions currently exist for this option to be successfully implemented in the short or medium term. Development of an administratively based CEAG would be a longer term process, requiring several years or even decades.
  • A significant front-end investment would be required to implement this option. In addition, negotiating agreements among many players and across multiple levels of government and non-governmental organizations would be necessary. Maintaining systems, definitions, concepts, as well as ongoing oversight would require additional resources and funding.
  • The program content would initially be limited to the data variables already available in the administrative databases. Over time, the required variables could be added to the administrative requirements of the programs, so that they could be collected for statistical purposes. This would likely require additional legislation and funding. It may also require enforcement strategies to ensure compliance with the statistical requirements and data-sharing agreements.
  • The concepts and definitions that apply to data in the administrative databases may not correspond to those desired for statistical purposes. Linkage of different administrative databases for the same unit may result in data inconsistencies that may be difficult to resolve without significant investments. Changes to and differences in concepts, definitions, target populations, etc., of administrative sources across jurisdictions and over time may limit the data availability, comparability and accuracy of the data for statistical purposes.
  • Unlike the traditional CEAG, the administratively based model cannot provide a snapshot of the entire country at one point in time during a census year. Data from multiple administrative sources are unlikely to reference one date.
  • There would likely be an increase in overall response burden due to the fact that every agriculture producer would be required to provide administrative data to fill statistical requirements whereas a survey approach requires only a sample of operators to provide such data.
  • There would be increased burden placed on the providers of administrative data to meet the requirements of the national statistical agency.
  • Developing and expanding partnerships with federal, provincial and industry stakeholders as well as with academia will require an investment of time and resources.

4.5.4 Essential conditions

  • The Statistics Act may need to be revised to cancel the CEAG in years ending in "1" and "6." (An investigation would need to determine if an administratively based CEAG would meet the legal requirements for a CEAG.)
  • Legislation to provide a stronger regulatory framework to develop, collect and acquire administrative data sources would be necessary. The legislation would also have to provide a detailed definition of data protection; for example, it should specify that the statistical data produced by the linkage process cannot be fed back to the administrative databases (known as the "one-way traffic" principle). In other words, the Statistics Act allows for Statistics Canada to procure data, but prevents Statistics Canada from feeding any data back to the source (for example, the Canada Revenue Agency) as stipulated in the Statistics Act.
  • A strong infrastructure covering legislative, regulatory and operational requirements, along with inter-agency cooperation across jurisdictions would be necessary. This would require the adoption of a "clearing house" approach to ensure that the same data are not collected more than once by different organizations.
  • To increase the use of administrative data, it would be necessary to renegotiate existing partnerships or develop new ones with federal, provincial and industry stakeholders. These agreements would establish protocols for data sharing, confidentiality and protection. The collaboration of multiple players over several jurisdictions would have to be established and maintained. This commitment must begin at the highest levels in the participating organizations and extend to the working level.
  • Federal, provincial and industry data holders would need to include a declaration to their data providers (farm operators) regarding the provision of data for statistical purposes. There may be a need to change legislation.
  • There would have to be a unique common identifier for all agricultural operations. This identifier should be used to conduct virtually all transactions with government (at all levels) thus enabling its use to link administrative data across all government sources.
  • Agriculture respondents would have to be aware of and support the increasing use of administrative data, being aware of the associated benefits and risks.
  • It would be necessary to develop a good set of register systems that fulfill administrative needs and that also contain data covering the most important subject areas for the statistical system. The coverage of these databases and the quality of the data contained within them would have to be sufficient to be useful for statistical purposes.
  • There would have to be incentives, such as legal requirements, for the target farm population to register and to inform authorities of changes or events (for example, changes of address, operator or ownership, bankruptcies). This documentation would have to be reliably recorded and with minimal delay.
  • It would be necessary to have a reliable method of assigning units to a detailed geographic level (geocoding) to produce small-area detail (for example, assigning owners, operators or establishments to specific geocodes).
  • In the absence of a CEAG, the frame maintenance would depend entirely on administrative data rather than drawing from a CEAG (and other sources).
  • Highly skilled, professional staff and training would be required to maintain this program due to the complexity that arises when data are procured from many different sources for many different programs. It would be necessary for analysts to be able to interpret the differences in concepts, definitions, scope and history of the administrative sources of data, particularly if attempting to conduct analysis in an integrated approach using data from different administrative sources. In addition, it would be necessary to educate users to accurately interpret the data due to the complexity of this option, both in terms of its operations and the resulting data.

4.5.5 Required investments

  • Historical data would have to be adjusted to match the new target population definition. This includes the development of user training material to avoid data misuse and misinterpretation and to clarify the impacts of the change to the target population.
  • Alternative sources of commodity data would have to be developed at the micro level (for example, program data such as crop insurance, AgriInvest and AgriStability) for statistical purposes and to establish and maintain threshold information on the agriculture frame on the Business Register.
  • Developmental costs may be substantial in the short and medium term. The cost of developing and maintaining these data and the systems required may be shifted from Statistics Canada to the providers of the administrative data. There would likely be costs related to the cleaning of the data and ensuring coherence among the sources. As such, Statistics Canada may be required to share these costs with the data holders.

4.5.6 Risks

  • There could be negative reaction from data users regarding
    • changes to the target population
    • the increase in reaction time to capture new trends and industry structural changes
    • the timeliness of specific commodity statistics.
  • The potential exists to increase response burden by requiring all producers to provide information currently collected by a sample of the population. For example, data currently collected from a comparatively small sample of survey respondents represents the larger target population; however, if these same data were required on an administrative form, all program participants would be required to provide this information thereby significantly increasing response burden. (An example of this would be adding a data variable, currently collected on a sample survey, to a tax form, which all farm tax filers would be required to provide.)
  • Increased reliance on administrative data sources may put coherence, comparability and sustainability of the data at risk due to changes in programs, regulations or provider partners over time.
  • The perception of intrusiveness and loss of privacy may lead to a loss of cooperation by the agriculture community.

4.6 Summary of the alternative options

To better assess the potential of these three options to revamp the current Canadian agriculture statistics program, the advantages and disadvantages of each need to be compared and contrasted against the others. The following presents a summary of that evaluation.

Summary of Options 1, 2 and 3

Options 1 and 2 (modified British and Australian/American Models) both feature a CEAG conducted every ten years. Their main difference is in how they go about filling the intercensal data gaps; each model takes a different approach.

Option 1 (Modified British Model) involves a CEAG conducted every 10 years. It incorporates two modular surveys that would be conducted in June and December each year. A range of commodities would be collected together but subsequently processed and disseminated separately to maintain relevance to data users. Overlap between commodities would be controlled to reduce response burden for diversified farms.

A small number of commodity specific surveys would continue to exist because of their unique requirements. The sample size and content of the June Modular Survey would be expanded in the years ending in "5" and "8" to compensate for the absence of a CEAG in years ending in "6." In these two years, comprehensive survey modules would be integrated so that analysis could be conducted at the whole farm level as is presently possible with the current CEAG. The target population for both the CEAG and the surveys would exclude smaller farms under a specified production threshold. The survey population would be equivalent to the target population.

Option 2 (Modified Australian/American Model) involves a CEAG conducted every 10 years (although both countries conduct a CEAG every five years). It would comprise a commodity specific intercensal survey program much like the current Canadian program. One new large survey would be conducted to compensate for the loss of some of the data due to the absence of the CEAG in years ending in "6." The target population for both the CEAG and the surveys excludes farms under an estimated value of agricultural operation. For example, US = $1,000 USD; Australia = $5,000 AUD.) The survey population would be equivalent to the target population as in Option 1. However Option 2 features increased incorporation of remote sensing technology and administrative data compared with Option 1.

Option 3 (the Modified Scandinavian Model) is largely based on administrative data and therefore no traditional CEAG would be required (although it should be noted that all Scandinavian countries conduct a traditional CEAG every ten years). A farm structure survey would be conducted every three years to address administrative data gaps, monitor changes, measure emerging trends and perform frame updates and maintenance. If necessary, a small number of special surveys (for example, on specific commodities such as fur production) would be run each year to meet data requirements not covered by administrative data or by the farm structure survey. The target population for both the administratively based CEAG and the special surveys would exclude farms under a specific sales threshold. The survey population would be equivalent to the target population. This option would largely eliminate the need to conduct many agriculture surveys presently necessary with the current Canadian model, but is only possible when comprehensive databases and administrative data sources are available.

Advantages of Options 1, 2 and 3

The examination of these options led to the identification of their key strengths and weaknesses as well as to the investments that would be required for implementation in Canada. The most promising features that emerged from this evaluation include the modular survey approach of Option 1, the similarities with the current Canadian program of Option 2 and the incorporation of administrative data of Option 3.

Both Options 1 and 2 mitigate some of the risk of data loss by providing a subset of data requirements should the CEAG in years ending in "6" be cancelled. In their own way, each of these options would realize cost efficiencies and reduce response burden in the years ending in "6." With both options, structural changes, new production and trends would be captured by the CEAG in years ending in "1" and partially captured intercensally.

In addition, Option 2 provides the benefit of being relatively similar to the current survey program and therefore would be expected to have less impact on data users in terms of timeliness and survey content.

Option 3 achieves cost savings by cancelling the CEAG in years ending in "1" and "6." For the data already collected for existing administrative purposes, the marginal costs of producing statistics would be generally much less than for a traditional CEAG or commodity-specific survey (once the databases, systems, and data-sharing and protection protocols are in place). This model has the potential to reduce survey response burden by replacing traditional surveys with administrative data.

Like the traditional CEAG, the administratively based CEAG can meet the objectives of the FAO features of a CEAG, which are to provide data on the structure of agriculture (from small administrative units) that enable detailed cross-tabulations to use as benchmarks for current agriculture statistics and frames for agricultural sample surveys.22 An administratively based CEAG may be able to produce data on a yearly basis, compared to every five or ten years with a traditional CEAG.

Disadvantages of Options 1, 2 and 3

The weaknesses of these options were determined to be sufficiently significant that none of them could be adapted in their entirety to the Canadian context. These options are unable to adequately fill the data needs to replace the quinquennial CEAG, particularly when it comes to the need for benchmarking and small area data.

Option 1 would require significant restructuring of the current program including design, development, testing and implementation of the two new integrated modular surveys. As well, the integrated survey approach would adversely affect the timeliness for some crop and livestock estimates. In spite of this option's survey content and sample size increases in years ending in "5" and "8," this strategy would not meet user needs for small area and custom geographic data, for provincial benchmarking data and for the enumeration of rare or emerging commodities that only a CEAG based on complete enumeration can give.

The key disadvantages would be losses to coherence, data gaps and relevance related to

  • changes to the target population
  • the loss of small area data
  • the loss of provincial benchmarking data
  • the increased delay in capturing new trends and structural changes in the industry.

For Option 1, the two large and comprehensive surveys in years ending in "5" and "8" may result in as much response burden as the CEAG in years ending in "6" that they replace, without providing the benefits of complete enumeration at one point in time.

For both Options 1 and 2, the ten year gap between CEAGs would reduce the relevance and usefulness of the data to users. The CEAG data used for policy development and evaluation, support of legislative and regulatory instruments and for trade purposes would become out of date before the next CEAG is conducted. Like Option 1, Option 2 does not meet the user needs for small area data, provincial benchmarking data and for the enumeration of rare or emerging commodities.

In addition, these two options do not allow the entire population to be measured. The program will no longer cover 100% of agriculture activity in Canada. This would cause a loss of coherence and comparability of the data, which would require transition data (back-casting) and technical assistance to data users to adjust for changes to the target population and the availability and frequency of data. This work would also require the development of user training material to avoid data misuse and misinterpretation and to clarify the impacts of the change to the target population.

Option 3 would require an extensive administrative framework that does not currently exist in Canada and that would require significant time and investment to establish. As was discovered in the CEPOP review,23 a common unique identifier permitting efficient linkages of multiple datasets would be required for such an administrative model to function. Such an identifier does not currently exist in Canada.

Additionally, response burden would likely be increased in such a model due to the fact that every agriculture producer would be required to provide administrative data to fill statistical requirements, whereas the present survey approach requires only a sample of operators to provide such data. Administrative concepts would have to be aligned with statistical concepts to ensure coherence. Privacy and confidentiality aspects of a program based on administrative data would have to be evaluated.

The evaluation determined that none of these options on their own would be an adequate replacement for the current agriculture statistics program. The investments required coupled with the data losses and compromises to the quality, timeliness, relevance and coherence of the data are not outweighed by the reduction in response burden and costs.

However, specific components of these alternative options were identified as being productive and efficient.

4.7 Refining the options for further consideration

The evaluation of the three options led to the identification of their most attractive features and their major weaknesses when considered in the Canadian context. Consequently, two hybrids of these options were constructed that incorporate these advantages while minimizing the disadvantages.

A description of these two hybrid options follows.

4.8 Option 4: Hybrid A

4.8.1 Key features

Hybrid A features a full decennial CEAG with increases in content and sample sizes of commodity-specific surveys in years ending in "6," coupled with increased use of administrative data and remote sensing.

More specifically:

  • A CEAG would be conducted every 10 years (in years ending in "1"). The questionnaire content would be similar to the current Canadian option, with the following distinctions:
    • the detailed expense questions would be replaced with taxation data (i.e., the CEAG would exclude these questions)
    • any questions that could be replaced with comparable and available administrative data would be excluded.
  • The CEAG would be linked to the CEPOP/NHS in years ending in "1" to provide socioeconomic data.
  • There would be an increase in content and sample size in the existing commodity-specific surveys in years ending in "6" to compensate for some of the data loss due to the absence of a quinquennial CEAG.
  • The survey program would remain commodity specific much like the current program. The number of survey occasions per year would be reduced for some crop, horticulture and livestock surveys.
  • Tax data would be used to replace all comparable financial questions on the CEAG and in surveys.
  • The target population would remain the same as for the current program (i.e., the target population includes all farms that produce agricultural products intended for sale).
  • The survey population would continue to exclude smaller farms under a specified threshold for reasons of burden and cost. The option to raise the threshold for specific surveys needs to be investigated further. The non-surveyed population would continue to be estimated (using statistical models) and included in published estimates.
  • An annual rolling frame update program would provide frame maintenance on a continuous basis for frame update and sampling efficiency purposes that would have been provided by a quinquennial CEAG. The program would include a short annual survey to a rotating percentage of the target population to complete missing information for new farm tax filers and operations not recently surveyed, so that the agriculture frame on the Business Register can be updated and maintained.
  • Remote sensing would play an increasingly important role. This technology would be integrated into the agriculture statistics program as it becomes mature. The initial focus would be on replacing the Potato Area Survey and the July and September Field Crop Reporting Surveys in the Prairie provinces.
  • Administrative data would play an increasingly important role. These data would be integrated into the agriculture statistics program, replacing content on surveys and censuses as they become available.
  • This option expands on current partnerships and promotes new partnerships with federal, provincial and industry stakeholders. These partnerships would be necessary to share responsibility for the development, collection, and compilation of administrative data and for Statistics Canada to obtain access (such as to AAFC's AgriInvest and AgriStability programs and livestock traceability data).

4.8.2 Strengths

  • This option provides an evolutionary approach to change within the agriculture statistics program, reducing risks to the relevance, coherence and accuracy of the program.
  • This option would achieve cost savings and reduce response burden by replacing the CEAG in years ending in "6." These reductions would be partially offset with increases in sample size and content for the main annual surveys in years ending in "6" and an increased frame update survey.
  • This option mitigates some of the risk of data loss by providing a subset of data requirements should the CEAG in years ending in "6" be cancelled by Order in Council.
  • The current target population definition would remain unchanged and therefore the coherence of agriculture data would not be affected. No investment would be required to adjust historical data for a new target population definition. Similarly, there would be no investment required to develop training material for users to avoid data misuse and misinterpretation and to clarify impacts of changes to the target population.
  • Since the annual agriculture surveys remain relatively similar to the current program, this option would be expected to have little impact on data users in terms of timeliness and survey content.

4.8.3 Weaknesses

  • This option's survey content and sample size increases in years ending in "6" do not meet user needs for small area and custom geographic data, for provincial benchmarking data or for the enumeration of rare or emerging commodities.
  • This option provides a reduced level of survey frame information, even with a rolling frame update, leading to frame deterioration and a related decrease in data accuracy from the survey program over the intercensal period.
  • The ten year gap between CEAGs would reduce the relevance and usefulness of the data to users. The CEAG data used for policy development and evaluation, support of legislative and regulatory instruments and for trade purposes is likely to become out of date before the next CEAG is conducted.
  • The loss of the quinquennial CEAG would impact the ability to model for the non-surveyed portion of the population in the survey program.

4.8.4 Essential conditions

  • An Order in Council would be required to cancel the CEAG in years ending in "6."
  • To increase the use of administrative data, it would be necessary to renegotiate existing partnerships or develop new ones with federal, provincial and industry stakeholders. These agreements would establish protocols for data sharing, confidentiality and protection. The collaboration of multiple players over several jurisdictions would have to be established and maintained. This commitment must begin at the highest levels in the participating organizations and extend to the working level.
  • Federal, provincial and industry data holders would need to include a declaration to their data providers (farm operators) regarding the provision of data for statistical purposes. There may be a need to change legislation.
  • Agriculture respondents would have to be aware of and support the increasing use of administrative data, being aware of the associated benefits and risks.
  • A feasibility study would be required to fully evaluate the costs, benefits, risks and potential timeframes for incorporating administrative data sources and increased use of technology (such as remote sensing) into the program.
  • A methodologically sound and realistic framework through which new sources of administrative data could be identified, evaluated, incorporated and operationalized in the program would have to be developed to reduce the risk of error.

4.8.5 Required investments

  • Alternative sources of commodity data would have to be acquired, adapted and incorporated at the micro level (for example, program data such as crop insurance, AgriInvest and AgriStability).
  • Remote sensing would have to be developed to completely or partially replace traditional field crop surveys. A Land Area Survey would need to be developed. The data from this survey combined with administrative data (e.g., crop insurance data) would be used to calibrate remote sensing results.

4.8.6 Risks

  • There could be negative reaction from data users regarding
    • the loss of some small area data
    • the loss of some provincial benchmarking data
    • the increase in reaction time to capture new trends and industry structural changes.
  • Increased reliance on administrative data sources may put the coherence, comparability and sustainability of the data at risk due to changes in programs, regulations or provider partners over time.

4.9 Option 5: Hybrid B

4.9.1 Key features

Hybrid B features a full decennial CEAG with a reduced quinquennial CEAG, coupled with increased use of administrative data and remote sensing. More specifically:

  • A CEAG would be conducted every five years:
    • In years ending in "1" the questionnaire content would be similar to the current Canadian program, with the following modifications to reduce response burden:
      • The detailed expense questions would be replaced with taxation data (i.e. the CEAG would exclude these questions). Although expense questions represent 7% of the content of the questionnaire, their impact on the level of burden is much greater, due to the need to access reference documents and the potential sensitivity of the questions.
      • Any other questions that could be replaced with comparable and available administrative data would also be excluded.
    • For the years ending in "6" a core CEAG would be defined by conducting user consultations and respondent testing. The content of the CEAG in years ending in "6" would be cut to a strict minimum (core) to provide
      • small area data
      • information on the structure of agriculture
      • data to use as benchmarks for required agriculture statistics
      • information required to maintain the frames necessary for the agricultural sample surveys.
        The core content needs to be determined in consultation with key stakeholders to identify priority data requirements; however there is potential to reduce current content by at least 50%. Over time, an increasing amount of content would be obtained through administrative sources rather than from a traditional CEAG.
    • Critical data requirements that do not fit the core CEAG criteria could be obtained using a modular approach targeting only a subset of the population, (such as specific farm types or farms located in specific regions), and linking to the fully enumerated results (as recommended by the FAO).
  • The CEAG would continue to be linked to the CEPOP/NHS in years ending in "1" and "6" to provide socioeconomic data.
  • The survey program would remain commodity specific much like the current program. The number of survey occasions per year would be reduced for some crop, horticulture and livestock surveys.
  • Tax data would be used to replace all comparable financial questions on the CEAG and in surveys.
  • The target population would remain the same as for the current program (i.e. the target population would consist of all farms that produce agricultural products intended for sale). The CEAG would collect data for the entire target population.
  • The survey population would exclude, to a greater extent than currently, smaller farms under a specified threshold for reasons of burden and cost. To determine the optimal reduction in the survey population further investigation is required. The non-surveyed population would continue to be estimated (using statistical models largely based on CEAG data) and included in published estimates.
  • A regular frame maintenance program would include a short survey to complete missing information from farm tax filer records and other administrative data sources to update and maintain the agriculture frame on the Business Register.
  • Remote sensing would play an increasingly important role. This technology would be integrated into the agriculture statistics program as it becomes mature. The initial focus would be on replacing the national Potato Area Survey and the July and September Field Crop Reporting Surveys in the Prairie provinces.
  • Administrative data would play an increasingly important role. These data would be integrated into the agriculture statistics program, replacing content on surveys and censuses as they become available.
  • This option expands on current partnerships and promotes new partnerships with federal, provincial and industry stakeholders. These partnerships would be necessary to share responsibility for the development, collection, and compilation of administrative data and for Statistics Canada to obtain access (such as to AAFC's AgriInvest and AgriStability programs and livestock traceability data).

4.9.2 Strengths

  • This option meets user needs for small area, provincial benchmarking and critical survey frame information. These were identified as a major weakness in the other options.
  • This option allows for the survey population threshold for specific surveys to be raised because a quinquennial CEAG provides complete and regular data for modelling the non-surveyed population.
  • This option provides the best coverage of data users' needs, albeit less than the current model. In particular, the requirements for provincial benchmark data, small area data, data on rare and emerging commodities and the ability to perform cross-tabulation analysis are best met with this alternative.
  • The modular approach for non-core data in years ending in "6" (either concurrently or post-censally) has the flexibility to target only a subset of the population, (such as a specific farm type or farms located in specific regions), as recommended by the FAO.
  • This option provides an evolutionary approach to change within the agriculture statistics program, reducing risks to the relevance, coherence and accuracy of the program. It could also be implemented in a reasonable timeframe.
  • This option would achieve cost savings and response burden by replacing some of the content of the decennial CEAG in years ending in "1" and by reducing the content further to the core in CEAG years ending in "6." Research into administrative sources and consultation with users and stakeholders will provide the information required to quantify the savings and response burden reductions to be realized.
  • The current target population definition remains unchanged and therefore the coherence of the agriculture data over time is not affected. No investment would be required to adjust historical data for a new target population definition. Similarly, there would be no investment required to develop training material for users to avoid data misuse and misinterpretation and to clarify impacts of the change to the target population.
  • Since the annual agriculture surveys remain similar, this option is expected to have less impact on data users in terms of timeliness and content.

4.9.3 Weaknesses

  • The potential exists to increase response burden if coordination, technology and procedures are not well defined with administrative data providers. For example, data currently collected from a comparatively small sample of survey respondents represents the larger target population; however, if these same data had to be added to an administrative form, all program participants would be required to provide this information, thereby significantly increasing response burden. (An example of this would be in adding a data variable, currently collected on a sample survey, to the tax form where all farm tax filers would be required to provide it.) The goal is to reduce overall burden, not to simply transfer it from one organization to another.
  • Increased reliance on administrative data sources may put coherence, comparability and sustainability of the data at risk due to changes in programs, concepts, regulations or provider partners over time.

4.9.4 Essential conditions

  • To increase the use of administrative data, it would be necessary to renegotiate existing partnerships or develop new ones with federal, provincial and industry stakeholders. These agreements would establish protocols for data sharing, confidentiality and protection. The collaboration of multiple players over several jurisdictions will have to be established and maintained. This collaboration must begin at the highest level in the participating organizations and extend to the working level.
  • Federal, provincial and industry data holders would need to include a declaration to their data providers (farm operators) regarding the provision of data for statistical purposes. There may be a need to change legislation.
  • Agriculture respondents would have to be aware of and support the increasing use of administrative data, being aware of the associated benefits and risks.
  • A feasibility study would be required to fully evaluate the costs, benefits, risks and potential timeframes for incorporating administrative data sources and increased use of technology (such as remote sensing) into the program.
  • A methodologically sound, integrated and realistic framework through which new sources of administrative data could be identified, evaluated, incorporated and operationalized in the program must be developed to reduce the risk of error.

4.9.5 Required investments

  • Alternative sources of commodity data would have to be acquired, adapted and incorporated at the micro level (for example, program data such as crop insurance, AgriInvest and AgriStability).
  • Remote sensing would have to be developed to completely or partially replace traditional field crop surveys. A Land-Use Area Frame Survey would need to be developed. The data from this survey combined with administrative data (e.g., crop insurance data) would be used to calibrate remote sensing results.

4.9.6 Risks

  • There could be negative reaction from data users regarding
    • the loss of some small area data
    • the loss of some provincial benchmarking data.
  • This option does not immediately mitigate the risk of data loss should the CEAG be cancelled in years ending in "6" by Order in Council. Over time, this risk would be mitigated with increasing incorporation of data from administrative sources (including taxation data).

4.10 Summary of the hybrid options

The hybrid options were developed to capitalize on the most attractive features of the first three options, while minimizing those aspects that scored the least in the evaluation.

The principal difference between the two hybrid options is the frequency with which the CEAG is conducted. With Hybrid A, there would be no CEAG in years ending in "6," but the current commodity-specific surveys would be increased in content and sample size in those years.

With Hybrid B, a CEAG in years ending in "6" would be conducted, but would be reduced in size to core requirements providing

  • small area data
  • information on the structure of agriculture
  • data used as benchmarks (re-alignment) for current agriculture statistics
  • information required to maintain the agriculture frames necessary for the agricultural sample surveys.

The two hybrids were evaluated against the current program using the same method as the evaluation of the first three options. This exercise was undertaken primarily to determine whether a blend of the most attractive features of the first three options could adequately compensate for the absence of a quinquennial CEAG. With alternatives in place, would it be possible to conduct a full CEAG every ten years and continue to meet priority data requirements?

The alternatives presented in the first three options were found to be lacking in terms of their ability to adequately compensate for the loss of quinquennial CEAG data. For Hybrid A, another option was evaluated: that of increasing the sample size and content on the entire survey program during the CEAG years ending in "6."

Hybrid A would compensate in part for this loss by increasing the content and sample size of the entire survey program during the years ending in "6." With this option, no new surveys would have to be developed and therefore development costs would be minimized. Limited benchmarking would be possible, and keeping the same target and survey population definitions would provide some continuity to the time series data. Data would be released in the same timeframe as the current program, providing sufficient resources were available to process the larger volume of data.

Hybrid B, on the other hand, would provide comprehensive coverage of the entire population every five years. Hybrid B provides all of the advantages of Hybrid A in addition to providing an answer to the principal problems with the other options, namely:

  • it provides stability with regard to the target population definition, although the survey population definition should be studied with a view to reducing survey response burden
  • it provides for small area data at a frequency that users require
  • it provides provincial benchmarking data at a frequency that users require
  • it minimizes the delay in capturing new trends and structural changes in the industry
  • it builds on the already solid foundation of the current program thereby minimizing risks to the quality, relevance, coherence and accuracy of the program's data.

Table 3 summarizes the key features of the options explored and evaluated.

Based on these criteria, the option with the least response burden is Hybrid B. The most burdensome option is the Scandinavian Model because response burden is increased since the entire population of producers is required to provide data that are presently collected from a sample of producers. The burden on the organizations collecting the administrative data is also increased. Therefore, there is some shifting of costs and burden from Statistics Canada to other organizations.

Based on the cost criteria, the least costly option is the present Canadian program largely due to the fact that no additional costs are required to develop alternative collection vehicles. The most expensive option is the Scandinavian Model since aside from lowering Statistics Canada's collection costs this option scored higher for all of the remaining cost criteria.

Table 3: Summary of agriculture statistical program options evaluated

Date modified: