Business Payrolls Survey

Vacation Reporting

This notice is intended to assist respondents who may not be familiar with the correct procedures for vacation reporting. Improper reporting can seriously affect the published statistics on levels of employment, earnings and hours and canmisrepresent your industry and area.

The most common reporting errors are:

a) Exclusion of employees on paid vacation: The survey covers employees on paid absence as well as employeesat work.

b) Inclusion of advance vacation pay with the regular pay for the reference period: See #2 below on how to report this data correctly.

Please review your procedures for vacation reporting in light of the following guidelines:

1. Vacation Paid When Taken

Data for employees receiving regular pay on vacation should be reported along with the regular employees.

2. Advance Vacation Pay

a) Added to Regular Paycheque: If the vacation pay is paid to the employees as a percentage of their regular pay throughout the year, include the amounts in Section A and in the other appropriate section(s).

b) Lump Sum Payment: When an employee receives his vacation pay in advance of the actual period of absence (for example, in conjunction with the immediately preceding regular pay or at any other time during the year), the advance vacation pay should be reported separately as a special payment.

If it is impossible to separate such advance vacation pay from the regular pay, report an earlier pay period which best reflects the regular level of activity.

3. Vacation Payments Withheld by Employer or Paid into a Trust

The proportion of vacation pay earned during the last pay period, and put aside or paid into a trust should be reported regularly with the last pay period.

Since these amounts will have already been reported on the Business Payrolls Survey when set aside by the employer, they are not to be included when paid out to the employees by the employer or from the union or association trust.

4. Vacation Closing

If the organization closes down for vacation during the reference period, report for employees that continue receiving their regular pay. Do not report for the employees for whom vacation pay has already been paid or attributed(see above).

If for any reason, you are unable to report according to the above guidelines, please include an explanation of the circumstances. If you have any questions concerning this or other matters related to the completion of the report, please contact Statistics Canada at the number provided in your documentation.

Business Payrolls Survey

Special Payments Made During the Month of December

Many special payments are paid to employees at the end of each year. This guide should help you in providing us with the necessary information concerning these specialpayments.

Report for special payments which are paid to employees for work performed or for other entitlements that are separate from regular wages and salaries; are paid at any time during the month of December and are not related exclusively to the last pay period of the month of December; and are usually recorded in the books usingthe cash basis method of accounting.

Following is a list of typical special payments made at the end of the year. This is not a comprehensive list. There may be other payments unique to your organization. If in doubt,please report the payment and explain what it covers.

Special payments can be for:

  • Any overtime paid during the reference month, including overtime accumulated from previous months
  • accumulated vacation pay
  • advance vacation pay
  • annual bonuses
  • Christmas bonuses
  • commissions and/or commission adjustments
  • cost of living adjustments
  • draws: annual, quarterly, monthly and/or periodic draws by owners of incorporated companies
  • other bonuses: ability, incentive, merit, piece work, production, sales, etc.
  • termination, severance and retirement payments

Report all moneys paid to your employees in the month of December. It is very important that you report the dates of the period the payment covers and not the payroll month in which the payment was paid out.

We thank you for your continuous cooperation and we wish you the very best during the New Year.

The LFS application consists of several questionnaire components (Contact, Household, Demographics, Rent, Labour Force Information and Exit), each of which is summarized below, followed by the lists of codesets. Each of the questionnaire components is comprised of a number of question blocks. For simplicity, as a result of the complexity of the logic within the application, not all possible questions and flows are presented. This is especially the case within the Contact Component where the scope of possible questions and flows is somewhat greater than that summarized below.

Selected dwellings are in the survey for six consecutive months. A birth interview corresponds to the first interview for a new household, and is usually conducted in person. Some birth interviews are now also conducted by telephone from centralized CATI work sites. Subsequent interviews are conducted in the following months, and are usually done by telephone.

Skip to text

Text begins

Contact component

The following information is collected at the start of each contact attempt.

II_R01A — Hello, I’m calling from Statistics Canada. My name is …
If interview in person, go to IC_R01
If birth interview by telephone, go to AR_Q01
If subsequent interview by telephone, go to SR_Q01

SR_Q01 — May I speak with … ?
If “Speaking”, go to IC_R01
If “Available”, go to II_R01B
If “Not available” or “No longer a household member”, go to AR_Q01
If “Wrong number”, go to TC_Q01

II_R01B — Hello, I’m calling from Statistics Canada. My name is …
Go to IC_R01

TC_Q01 — I would like to make sure I’ve dialed the right number. Is this: [telephone number]?
If yes, go to AR_Q01
If no, thank person and end call

AR_Q01 — May I speak with an adult member of the household?
If “Speaking” and CATI birth interview, go to TFCC_Q01
If “Speaking” and not CATI birth interview, go to IC_R01
If “Available”, go to II_R01C
If “Not available” and birth interview, go to ARA_Q01
If “Not available” and subsequent interview and SR_Q01= “Not available”, go to SRA_Q01
If “Not available” and subsequent interview and SR_Q01= “No longer a household member” or “Wrong number”, go to ARA_Q01

II_R01C — Hello, I’m calling from Statistics Canada. My name is …
If CATI birth interview, go to TFCC_Q01
If not CATI birth interview, go to IC_R01

SRA_Q01 — I would like to contact … When would he/she be available?
If “Available”, make appointment and then thank person and end call
If “Not available”, go to ARA_Q01

ARA_Q01 — When would an adult member of the household be available?
If “Available”, make appointment and then thank person and end call
If “Not available”, thank person and end call

TFCC_Q01 — In order to make sure I’ve reached the correct household, I need to confirm your address. Is it: [listing address]?
If yes, go to IC_R01
If no, go to TFCC_Q02

TFCC_Q02 — I would like to make sure I’ve dialed the right number. Is this: [telephone number]?
Thank person and end call

IC_R01
I’m calling regarding the Labour Force Survey.

LP_Q01 — Would you prefer to be interviewed in English or in French?
If CATI interview, go to MON_R01
If not CATI interview, go to Household Component

MON_R01
My supervisor may listen to this call for the purpose of quality control.

Household and demographics component

Household blocks

LA_N01
If CATI birth interview, go to MA_Q01
If subsequent interview in person, go to CMA_Q01
If subsequent interview by telephone, go to SD_Q01

Confirm the listing address.
Go to MA_Q01

SD_Q01 — I would like to confirm your address. Are you still living at: [listing address]?
If yes and listing address is the same as mailing address, go to CHM_Q01
If yes and listing address is different from mailing address, go to CMA_Q01
If no, go to SD_Q02
If “Respondent never lived there”, go to SD_Q05

SD_Q02 — Does anyone who was living with you at that address still live there?
If yes, go to SD_Q03
If no, thank person and end call

SD_Q03 — Can you provide me with the current telephone number for that address?
If yes, go to SD_Q04
If no, thank person and end call

SD_Q04 — What is that telephone number, including the area code?
Thank person and end call

SD_Q05 — I would like to make sure I’ve dialed the right number. Is this: [telephone number]?
Thank person and end call

CHM_Q01 — Is this also your mailing address?
If yes, go to TN_Q01
If no, go to MA_Q01

CMA_Q01 — I would like to confirm your mailing address. Is it: [mailing address]?
If yes, go to TN_Q01
If no, go to MA_Q01

MA_Q01 — What is your correct mailing address?
If birth interview in person, go to DW_N02
If birth interview by telephone, go to DW_Q01
If subsequent interview, go to TN_Q01

DW_Q01 — What type of dwelling do you live in? Is it a:
Read categories to respondent.
Go to TN_Q01

DW_N02
Select the dwelling type.

TN_Q01 — Is this dwelling owned by a member of this household?

RS_R01
The next few questions ask for important basic information on the people in your household.
If birth interview, go to USU_Q01
If subsequent interview, go to PV2_Q01

USU_Q01 — What are the names of all persons who usually live here?
Begin with adults who have responsibility for the care or support of the family.

RS_Q02 — Is anyone staying here temporarily?
If yes, go to TEM_Q01
If no, go to RS_Q04

TEM_Q01 — What are the names of all persons who are staying here temporarily?
Add a person only if he/she has no other usual residence elsewhere.

RS_Q04 — Are there any other persons who usually live here but are now away at school, in hospital, or somewhere else?
If yes, go to OTH1_Q01
If no, go to Individual Demographics

OTH1_Q01 — What are the names of the other people who live or stay here?
Add a person only if he/she has no other usual residence elsewhere.
Go to Individual Demographics

PV2_Q01 — Do the following people still live or stay in this dwelling?
If yes, go to RS_Q05
If no, go to RES_Q02

PV2_Q01_RES_Q02 — Is … no longer a member of the household or deceased?

RS_Q05 — Does anyone else now live or stay here?
If yes, go to OTH2_Q01
If no, go to Individual Demographics

OTH2_Q01 — What are the names of the other people who live or stay here?
Add a person only if he/she has no other usual residence elsewhere.

Demographics blocks

The following demographic information is collected for each household member.

ANC_Q01 — What is …’s date of birth?

ANC_Q02 — So …’s age on [date of last day of reference week] was [calculated age]. Is that correct?
If yes, go to SEX_Q01
If no, go to ANC_Q03

ANC_Q03 — What is …’s age?

SEX_Q01
Enter …’s sex.

MSNC_Q01
If age<16, go to FI_N01
What is …’s marital status? Is he/she:
Read categories to respondent.

FI_N01
Enter …’s family identifier: A to Z.
Assign the same letter to all persons related by blood, marriage or adoption.

RR_N01
Determine a reference person for the family and select…’s relationship to that reference person. The reference person should be an adult involved in the care or support of the family.

IMM_Q01 — In what country was … born?
Specify country of birth according to current boundaries.
If 01-Canada, go to ABO_Q01

IMM_Q02 — Is ... now, or has he/she ever been, a landed immigrant in Canada?
A landed immigrant (permanent resident) is a person who has been granted the right to live in Canada permanently by immigration authorities.
If yes, go to IMM_Q03
If no, go to ABO_Q01

IMM_Q03 — In what year did … first become a landed immigrant?
Year:

IMM_Q04
If IMM_Q03 is more than five years ago go to ABO_Q01
In what month?
Month:

ABO_Q01
If Country of Birth is not Canada , USA or Greenland go to ED_Q01
Is ... an Aboriginal person, that is, North American Indian, Métis or Inuit?
If yes, go to ABO_Q02
If no, go to ED_Q01

ABO_Q02
If respondent has already specified the Aboriginal group(s), select the group(s) from list below; if not, ask: Is ... a North American Indian, Métis or Inuit?
Mark all that apply.

ED_Q01
If age<14, go to CAF_Q01
What is the highest grade of elementary or high school … ever completed?
If “Grade 8 or lower” or “Grade 9 – 10”, go to ED_Q03
If “Grade 11 – 13”, go to ED_Q02

ED_Q02 — Did … graduate from high school (secondary school)?

ED_Q03 — Has … received any other education that could be counted towards a degree, certificate or diploma from an educational institution?
If yes, go to ED_Q04
If no, go to CAF_Q01

ED_Q04 — What is the highest degree, certificate or diploma… has obtained?

CHE_Q01
If (Country of Birth is Canada) or (IMM_Q02 is No) or (respondent has not received a post-secondary degree, certificate or diploma) go to CAF_Q01
In what country did … complete his/her highest degree, certificate or diploma?
Specify country of highest education according to current boundaries.

CAF_Q01
If age<16 or age>65, go to ANC_Q01 for next household member
Is … a full-time member of the regular Canadian Armed Forces?

For each person aged 15 or over who is not a full-time member of the regular armed forces complete the Labour force information component.

Rent component

The Rent Component is generated only for cases where the answer to TN_Q01 (“Is this dwelling owned by a member of this household?”) in the Household Component is “No”, and province/territory is not Yukon Territory, Northwest Territories or Nunavut.

RRF_R01

The next few questions are about your rent. The information collected is used to calculate the rent portion of the Consumer Price Index.

RM_Q01
If rent information exists from the previous month, go to RM_Q04
If dwelling type is not “Low-rise apartment” and not “High-rise apartment”, go to RM_Q02

On which floor do you live?

RM_Q02 — To the best of your knowledge, how old is your building?

RM_Q03 — How many bedrooms are there in your dwelling?

RM_Q04 — This month, is the rent for your dwelling subsidized by government or an employer, or a relative?
If yes, go to RM_Q04A
If no, go to RM_Q05

RM_Q04A — In what manner is the rent for your dwelling subsidized?

RM_Q05 — This month, are you operating a business from your dwelling?
If yes, go to RM_Q05A
If no, go to RM_Q06

RM_Q05A — Does the business affect the amount of rent paid?

RM_Q06 — How much is the total monthly rent for your dwelling?
If $0, go to RM_Q07
If >$0, go to RM_Q08

RM_Q07 — What is the reason that the rent is $0?
If RM_Q04 = yes, go to end of Rent Component

RM_Q08
If rent information does not exist from the previous month, go to RM_Q09B
If there has been a complete change in household membership, go to RM_Q09B
If RM_Q04 = yes, go to RM_Q09B

Since last month, have there been any changes in the amount of rent paid?
If yes, go to RM_Q08A
If no, go to RM_Q09B

RM_Q08A — What is the reason for the change in rent since last month?
Mark all that apply.

RM_Q09B
If dwelling type is not “Low-rise apartment” and not “High-rise apartment”, go to RM_Q14
If rent information exists from the previous month and there has not been a complete change in household membership, go to RM_Q09S

Does this month’s rent include parking facilities?
If yes, go to RM_Q10
If no, go to RM_Q14

RM_Q09S — Since last month, have there been any changes in the parking facilities?
If yes, go to RM_Q10
If no, go to RM_Q14

RM_Q10 — What types of parking facilities are included in your rent?
Mark all that apply.

RM_Q11
If “Closed garage or indoor parking” is not marked in RM_Q10, go to RM_Q12
How many closed garage or indoor parking spaces are included in your rent?

RM_Q12
If “Outside parking with plug-in” is not marked in RM_Q10, go to RM_Q13
How many outside parking spaces with plug-in are included in your rent?

RM_Q13
If “Outside parking without plug-in” is not marked in RM_Q10, go to RM_Q14
How many outside parking spaces without plug-in are included in your rent?

RM_Q14
If rent information does not exist from the previous month, go to RM_Q15
If there has been a complete change in household membership, go to RM_Q15
If “Change in utilities, services, appliances, or furnishings” is marked in RM_Q08A, go to RM_Q15

Since last month, have there been any changes in the utilities, services, household appliances, or furnishings included in the rent?
If yes, go to RM_Q15
If no, go to end of Rent Component

RM_Q15 — Which of the following utilities, services, household appliances, or furnishings are included as part of the monthly rent?
Read list to respondent. Mark all that apply.

Labour force information component

In this component, a path is assigned according to the answers provided. This path is used to control the flow through the component. For paths 1, 2, 6, and 7 the path determines the labour force status, but for paths 3, 4 and 5 other conditions (for example, availability for work) must be considered to distinguish between those who are unemployed and those who are not in the labour force.

PATHS

1 Employed, at work
2 Employed, absent from work
3 Temporary layoff
4 Job seeker
5 Future start
6 Not in labour force, able to work
7 Not in labour force, permanently unable to work

Job attachment block

LFI_Q100 — Many of the following questions concern ...’s activities last week. By last week, I mean the week beginning on Sunday, [date of first day of reference week], and ending last Saturday, [date of last day of reference week].
Last week, did ... work at a job or business?
(regardless of the number of hours)

If yes, then PATH = 1 and go to 102
If no, go to 101
If “Permanently unable to work”, then PATH = 7 and go to 104

LFI_Q101 — Last week, did ... have a job or business from which he/she was absent?
If no, go to 104

LFI_Q102 — Did he/she have more than one job or business last week?
If no, go to 110

LFI_Q103 — Was this a result of changing employers?
Go to 110

Past job attachment block

LFI_Q104 — Has he/she ever worked at a job or business?
If no, go to 170

LFI_Q105 — When did he/she last work?
If subsequent interview and no change in 105 and last month’s PATH = 3, go to 131
Else if subsequent interview and no change in 105 and last month’s PATH = 4 to 7, go to 170
Else if not within past year, go to 170
Else if PATH = 7, go to 131
Else if PATH not 7, go to 110

Job description block

LFI_Q110— If 103 = yes, I am now going to ask some questions about …’s new job or business. Was he/she an employee or self-employed?
If 103 = no, I am now going to ask some questions about the job or business at which he/she usually works the most hours. Was he/she an employee or self-employed?
Otherwise, Was he/she an employee or self-employed?
If not “Self-employed”, go to 114

LFI_Q111 — Did he/she have an incorporated business?

LFI_Q112 — Did he/she have any employees?

LFI_Q113 — What was the name of his/her business?
Go to 115

LFI_Q114 — For whom did he/she work?
(name of business, government department or agency, or person)

LFI_Q115 — What kind of business, industry or service was this?
(e.g., cardboard box manufacturing, road maintenance, retail shoe store, secondary school, dairy farm, municipal government)

LFI_Q116 — What kind of work was he/she doing?
(e.g., babysitting in own home, factory worker, forestry technician)

LFI_Q117 — What were his/her most important activities or duties?
(e.g., caring for children, stamp press machine operator, forest examiner)

LFI_Q118 — When did he/she start working for/at [name of employer/name of business]?

Absence – Separation block

LFI_Q130
If PATH = 1, go to 150
If 101 = no, go to 131

What was the main reason ... was absent from work last week?
If “Temporary layoff due to business conditions”, go to 134
If “Seasonal layoff”, go to 136
If “Casual job, no work available”, go to 137
Otherwise, PATH = 2 and go to 150

LFI_Q131 — What was the main reason ... stopped working at that [job/business]?
If not “ Lost job, laid off or job ended”, go to 137

LFI_Q132 — Can you be more specific about the main reason for his/her job loss?
If PATH = 7, go to 137
Else if “Business conditions”, go to 133
Otherwise, go to 137

LFI_Q133 — Does he/she expect to return to that job?
If no or “Not sure”, go to 137

LFI_Q134 — Has ...’s employer given him/her a date to return?
If yes, go to 136

LFI_Q135 — Has he/she been given any indication that he/she will be recalled within the next 6 months?

LFI_Q136 — As of last week, how many weeks had ... been on layoff?
If 130 = “Seasonal layoff”, go to 137
Else if 134 = no and 135 = no, go to 137
Else if on layoff more than 52 weeks, go to 137
Otherwise, PATH = 3 and go to 137

LFI_Q137 — Did he/she usually work more or less than 30 hours per week?
If PATH = 3, go to 190
Otherwise, go to 170

Work hours (Main job) block

LFI_Q150
The following questions refer to ...’s work hours at his/her [new] [job/business] [at name of employer].
If 110 = “Employee”, Excluding overtime, does the number of paid hours ... works vary from week to week?
Otherwise, Does the number of hours ... works vary from week to week?
If yes, go to 152

LFI_Q151
If 110 = “Employee”, Excluding overtime, how many paid hours does ... work per week?
Otherwise, How many hours does ... work per week?
If PATH = 2, go to 158
If 110 = “Employee”,go to 153
Otherwise, go to 157

LFI_Q152
If 110 = “Employee”, Excluding overtime, on average, how many paid hours does ... usually work per week?
Otherwise, On average, how many hours does ... usually work per week?
If PATH = 2, go to 158
If 110 = “Employee”, go to 153
Otherwise, go to 157

LFI_Q153 — Last week, how many hours was he/she away from this job because of vacation, illness, or any other reason?
If 0 hours, go to 155

LFI_Q154 — What was the main reason for that absence?

LFI_Q155 — Last week, how many hours of paid overtime did he/she work at this job?

LFI_Q156 — Last week, how many extra hours without pay did he/she work at this job?
If 150 = no, then actual hours = 151 - 153 + 155 + 156 and go to 158

LFI_Q157 — Last week, how many hours did he/she actually work at his/her [new] [job/business] [at name of employer]?

LFI_Q158
If 151 >= 29.5 or 152 >= 29.5, and PATH = 2, go to 162
If 151 >= 29.5 or 152 >= 29.5, and PATH = 1, go to 200

Does he/she want to work 30 or more hours per week [at a single job]?
If yes, go to 160

LFI_Q159 — What is the main reason ... does not want to work 30 or more hours per week [at a single job]?
If PATH = 2, go to 162
Otherwise, go to 200

LFI_Q160 — What is the main reason ... usually works less than 30 hours per week [at his/her main job]?
If not (“Business conditions” or “Could not find work with 30 or more hours per week”) and PATH = 2, go to 162
If not (“Business conditions” or “Could not find work with 30 or more hours per week”) and PATH = 1, go to 200

LFI_Q161 — At any time in the 4 weeks ending last Saturday, [date of last day of reference week], did he/she look for full-time work?
If PATH = 2, go to 162
Otherwise, go to 200

Absence block

LFI_Q162 — As of last week, how many weeks had ... been continuously absent from work?
If (110 is “Employee”) or (110 is “Self-employed” and 111 is yes), go to 163
Otherwise, go to 200

LFI_Q163 — Is he/she getting any wages or salary from his/her [employer/business] for any time off last week?
Go to 200

Job search - Future start block

LFI_Q170
If PATH = 7, go to 500
In the 4 weeks ending last Saturday, [date of last day of reference week], did ... do anything to find work?
If no and age >= 65, then PATH = 6 and go to 500
If no and age <= 64, go to 174
If yes, then PATH = 4 and go to 171

LFI_Q171 — What did he/she do to find work in those 4 weeks? Did he/she do anything else to find work?

LFI_Q172 — As of last week, how many weeks had he/she been looking for work?
[since the date last worked]

LFI_Q173 — What was his/her main activity before he/she started looking for work?
Go to 177

LFI_Q174 — Last week, did ... have a job to start at a definite date in the future?
If no, then PATH = 6 and go to 176

LFI_Q175 — Will he/she start that job before or after Sunday, [date of the first day after four weeks from the last day of reference week]?
If “Before the date above”, then PATH = 5 and go to 190
If “On or after the date above”, then PATH = 6 and go to 500

LFI_Q176 — Did he/she want a job last week?
If no, go to 500

LFI_Q177 — Did he/she want a job with more or less than 30 hours per week?

LFI_Q178
If PATH = 4, go to 190
What was the main reason he/she did not look for work last week?
If “Believes no work available”, go to 190
Otherwise, go to 500

Availability block

LFI_Q190 — Could he/she have worked last week [if he/she had been recalled/if a suitable job had been offered]?
If yes, go to 400

LFI_Q191 — What was the main reason ... was not available to work last week?
Go to 400

Earnings block

LFI_Q200
If 110 is not “Employee”, go to 300
If subsequent interview and no change in 110, 114, 115, 116, 117, 118, go to 300

Now I’d like to ask a few short questions about ...’s earnings from his/her [new] job [at name of employer].
Is he/she paid by the hour?

LFI_Q201 — Does he/she usually receive tips or commissions?
If 200 = no, go to 204

LFI_Q202 — [Including tips and commissions,] what is his/her hourly rate of pay?
Go to 220

LFI_Q204 — What is the easiest way for you to tell us his/her wage or salary, [including tips and commissions,] before taxes and other deductions?
Would it be yearly, monthly, weekly, or on some other basis?

If “Yearly”, go to 209
If “Monthly”, go to 208
If “Semi-monthly”, go to 207
If “Bi-weekly”, go to 206
If “Weekly” or “Other”, go to 205

LFI_Q205 — [Including tips and commissions,] what is his/her weekly wage or salary, before taxes and other deductions?
Go to 220

LFI_Q206 — [Including tips and commissions,] what is his/her bi-weekly wage or salary, before taxes and other deductions?
Go to 220

LFI_Q207 — [Including tips and commissions,] what is his/her semi-monthly wage or salary, before taxes and other deductions?
Go to 220

LFI_Q208 — [Including tips and commissions,] what is his/her monthly wage or salary, before taxes and other deductions?
Go to 220

LFI_Q209 — [Including tips and commissions,] what is his/her yearly wage or salary, before taxes and other deductions?
Go to 220

Union block

LFI_Q220 — Is he/she a union member at his/her [new] job [at name of employer]?
If yes, go to 240

LFI_Q221 — Is he/she covered by a union contract or collective agreement?

Permanence block

LFI_Q240 — Is ...’s [new] job [at name of employer] permanent, or is there some way that it is not permanent?
(e.g., seasonal, temporary, term or casual)

If “Permanent”, go to 260

LFI_Q241 — In what way is his/her job not permanent?
Go to 260

Firm size block

LFI_Q260 — About how many persons are employed at the location where ... works for [name of employer]?
Would it be: [Less than 20, 20 to 99, 100 to 500, or over 500]?

Read categories to respondent.

LFI_Q261 — Does [name of employer] operate at more than one location?
If no, or 260 = “Over 500”, go to 300

LFI_Q262 — In total, about how many persons are employed at all locations?
Would it be: [Less than 20, 20 to 99, 100 to 500, or over 500]?

Read categories to respondent
Go to 300

Class of worker – Hours at other job block

LFI_Q300
If 102 = no, go to 400
Now I have a couple of questions about ...’s [other/old] job or business. Was he/she an employee or self-employed?
If not “Self-employed”, go to 320

LFI_Q301 — Did he/she have an incorporated business?

LFI_Q302 — Did he/she have any employees?

LFI_Q320
If 300 = “Employee”, Excluding overtime, how many paid hours [does/did] ... usually work per week at this job?
Otherwise, How many hours [does/did] ... usually work per week at this [business/family business]?
If PATH = 2, go to 400

LFI_Q321 — Last week, how many hours did ... actually work at this [job/business/family business]?
Go to 400

Temporary layoff job search block

LFI_Q400
If PATH not 3, go to 500
In the 4 weeks ending last Saturday, [date of last day of reference week], did ... look for a job with a different employer?
Go to 500

School attendance block

LFI_Q500
If age >= 65, go to END
Last week, was ... attending a school, college or university?
If no, go to 520

LFI_Q501 — Was he/she enrolled as a full-time or part-time student?

LFI_Q502 — What kind of school was this?
Go to 520

Returning students block

LFI_Q520
If survey month not May through August, go to END
Else if age not 15 to 24, go to END
Else if subsequent interview and 520 in previous month was “no”, go to END
Else if subsequent interview and 520 in previous month was “yes”, go to 521

Was he/she a full-time student in March of this year?
If no, go to END

LFI_Q521 — Does he/she expect to be a full-time student this fall?

Exit component

The following information is collected at the end of the LFS interview each month to gather information for future contacts and to thank respondents for their participation. In many cases, this information will be pre-filled for confirmation in subsequent interviews.

EI_R01
If rotate-out (for example, last month for interview), go to TY_R02
Before we finish, I would like to ask you a few other questions.

FC_R01
As part of the Labour Force Survey, we will contact your household next month during the week of [date of first day of next month survey week].
After this month, this dwelling has [calculated number of remaining interviews] LFS interview(s) left.

HC_Q01 — Who would be the best person to contact?

TEL_Q01
If no telephone number exists, go to TEL_Q02
I would like to confirm your telephone number. Is it [telephone number]?
If yes, go to PC_Q01
If no, go to TEL_Q02

TEL_Q02 — What is your telephone number, including the area code?

PC_Q01
If CATI interview, go to PTC_Q01
May we conduct the next interview by telephone?
If yes, go to PTC_Q01
If no, go to PV_R01

PV_R01
In this case we will make a personal visit next month during the week of [date of first day of next month survey week].

PTC_Q01
If preferred time to call information does not exist from the previous month, go to PTC_Q02
I would like to confirm the time of day you would prefer that we call. Is it [preferred time to call]?
If yes, go to PTC_N03
If no, go to PTC_Q02

PTC_Q02 — What time of day would you prefer that we call? Would it be the morning, the afternoon, the evening, or ANY TIME?
Mark all that apply.

PTC_N03
Enter any other information about the preferred time to call.

LQ_Q01
If CATI interview, go to TY_R01
If subsequent interview, go to TY_R01
If dwelling type is not “Single detached” and not “Double” and not “Row or terrace” and not “Duplex”, go to TY_R01

Is there another set of living quarters within this structure?
If yes, go to LQ_N02
If no, go to TY_R01

LQ_N02
Remember to verify the cluster list and add one or more multiples if necessary.

TY_R01
Thank you very much for your participation in this month’s Labour Force Survey. Have a nice [day/evening].

Go to END

TY_R02
Thank you very much for your participation in the Labour Force Survey. Although your six months in the Labour Force Survey are over, your household may be contacted by Statistics Canada some time in the future for another survey. Have a nice [day/evening].

List of codesets

Codes for Contact component

SR_Q01

1 Yes, speaking to respondent
2 Yes, respondent available
3 No, respondent not available
4 No, respondent no longer a household member
5 Wrong number

AR_Q01

1 Yes, speaking to an adult member
2 Yes, an adult member is available
3 No, an adult member is not available

SRA_Q01 / ARA_Q01

1 Make hard appointment
2 Make soft appointment
3 Not available

LP_Q01

1 English
2 French
3 Other

Codes for Household component

SD_Q01

1 Yes
2 No
3 No, respondent never lived there

DW_Q01 / DW_N02

01 Single detached
02 Double
03 Row or terrace
04 Duplex
05 Low rise apartment of fewer than 5 stories or a flat
06 High rise apartment of 5 stories or more
07 Institution
08 Hotel; rooming/lodging house; camp
09 Mobile home
10 Other – Specify

PV2_Q01_RES_Q02

1 No longer a member
2 Deceased

Codes for Individual demographics

SEX_Q01

1 Male
2 Female

MSNC_Q01

1 Married
2 Living common-law
3 Widowed
4 Separated
5 Divorced
6 Single, never married

RR_N01

1 Reference person
2 Spouse
3 Son or daughter (birth, adopted or step)
4 Grandchild
5 Son-in-law or daughter-in-law
6 Foster child (less than 18 years of age)
7 Parent
8 Parent-in-law
9 Brother or sister
10 Other relative - Specify

IMM_Q01

Responses that do not correspond to one of the twelve countries explicitly listed are recorded as "Other –Search" and invoke a country search file containing a list of all current countries.

01 Canada
02 United States
03 United Kingdom
04 Germany
05 Italy
06 Poland
07 Portugal
08 China (People’s Republic of)
09 Hong Kong
10 India
11 Philippines
12 Vietnam
13 Other – Search

IMM_Q02

1 Yes
2 No

ABO_Q01

1 Yes
2 No

ABO_Q02

Mark all that apply.

1 North American Indian
2 Métis
3 Inuit (Eskimo)

ED_Q01

1 Grade 8 or lower (Quebec: Secondary II or lower)
2 Grade 9 - 10 (Quebec: Secondary III or IV, Newfoundland and Labrador: 1st year of secondary)
3 Grade 11 - 13 (Quebec: Secondary V, Newfoundland and Labrador: 2nd to 4th year of secondary)

ED_Q04

1 No postsecondary degree, certificate or diploma
2 Trade certificate or diploma from a vocational school or apprenticeship training
3 Non-university certificate or diploma from a community college, CEGEP, school of nursing, etc.
4 University certificate below bachelor’s level
5 Bachelor’s degree
6 University degree or certificate above bachelor’s degree

CHE_Q01

Responses that do not correspond to one of the twelve countries explicitly listed are recorded as "Other –Search" and invoke a country search file containing a list of all current countries.

01 Canada
02 United States
03 United Kingdom
04 Germany
05 Italy
06 Poland
07 Portugal
08 China (People’s Republic of)
09 Hong Kong
10 India
11 Philippines
12 Vietnam
13 Other – Search

Codes for Rent component

RM_Q02

1 No more than 5 years old
2 More than 5 but no more than 10 years old
3 More than 10 but no more than 20 years old
4 More than 20 but no more than 40 years old
5 More than 40 years old

RM_Q04A

1 Income-related/Government agencies
2 Employer
3 Owned by a relative
4 Other - Specify

RM_Q08A

1 Change in utilities, services, appliances, or furnishings
2 Change in parking facilities
3 New Lease
4 Other - Specify

RM_Q10

1 Closed garage or indoor parking
2 Outside parking with plug-in
3 Outside parking without plug-in

RM_Q15

1 Heat – Electric
2 Heat - Natural Gas
3 Heat - Other Specify
4 Electricity
5 Cablevision
6 Refrigerator
7 Range
8 Washer
9 Dryer
10 Other major appliance – Specify
11 Furniture
12 None of the above

Codes for Labour force information

LFI_Q100

1 Yes
2 No
3 Permanently unable to work

LFI_Q110 / LFI_Q300

1 Employee
2 Self-employed
3 Working in a family business without pay

LFI_Q130

01 Own illness or disability
02 Caring for own children
03 Caring for elder relative (60 years of age or older)
04 Maternity or parental leave
05 Other personal or family responsibilities
06 Vacation
07 Labour dispute (strike or lockout) (Employees only)
08 Temporary layoff due to business conditions (Employees only)
09 Seasonal layoff (Employees only)
10 Casual job, no work available (Employees only)
11 Work schedule (e.g., shift work) (Employees only)
12 Self-employed, no work available (Self-employed only)
13 Seasonal business (excluding employees)
14 Other - Specify

LFI_Q131

01 Own illness or disability
02 Caring for own children
03 Caring for elder relative (60 years of age or older)
04 Pregnancy (Females only)
05 Other personal or family responsibilities
06 Going to school
07 Lost job, laid off or job ended (Employees only)
08 Business sold or closed down (excluding employees)
09 Changed residence
10 Dissatisfied with job
11 Retired
12 Other - Specify

LFI_Q132

1 End of seasonal job
2 End of temporary, term or contract job (non-seasonal)
3 Casual job
4 Company moved
5 Company went out of business
6 Business conditions (e.g., not enough work, drop in orders, retooling)
7 Dismissal by employer (e.g., fired)
8 Other - Specify

LFI_Q133 / LFI_Q521

1 Yes
2 No
3 Not sure

LFI_Q137 / LFI_Q177

1 30 or more hours per week
2 Less than 30 hours per week

LFI_Q154

01 Own illness or disability
02 Caring for own children
03 Caring for elder relative (60 years of age or older)
04 Maternity or parental leave
05 Other personal or family responsibilities
06 Vacation
07 Labour dispute (strike or lockout)
08 Temporary layoff due to business conditions
09 Holiday (legal or religious)
10 Weather
11 Job started or ended during week
12 Working short-time (e.g., due to material shortages, plant maintenance or repair, etc.)
13 Other - Specify

LFI_Q159

1 Own illness or disability
2 Caring for own children
3 Caring for elder relative (60 years of age or older)
4 Other personal or family responsibilities
5 Going to school
6 Personal preference
7 Other - Specify

LFI_Q160

1 Own illness or disability
2 Caring for own children
3 Caring for elder relative (60 years of age or older)
4 Other personal or family responsibilities
5 Going to school
6 Business conditions
7 Could not find work with 30 or more hours per week
8 Other - Specify

LFI_Q171

1 Public employment agency
2 Private employment agency
3 Union
4 Employers directly
5 Friends or relatives
6 Placed or answered ads
7 Looked at job ads
8 Other - Specify

LFI_Q173

1 Working
2 Managing a home
3 Going to school
4 Other - Specify

LFI_Q175

1 Before the date above
2 On or after the date above

LFI_Q178

1 Own illness or disability
2 Caring for own children
3 Caring for elder relative (60 years of age or older)
4 Other personal or family responsibilities
5 Going to school
6 Waiting for recall (to former job)
7 Waiting for replies from employers
8 Believes no work available (in area, or suited to skills)
9 No reason given
10 Other - Specify

LFI_Q191

1 Own illness or disability
2 Caring for own children
3 Caring for elder relative (60 years of age or older)
4 Other personal or family responsibilities
5 Going to school
6 Vacation
7 Already has a job
8 Other - Specify

LFI_Q204

1 Yearly
2 Monthly
3 Semi-monthly
4 Bi-weekly
5 Weekly
6 Other - Specify

LFI_Q241

1 Seasonal job
2 Temporary, term or contract job (non-seasonal)
3 Casual job
5 Other - Specify

LFI_Q260 / LFI_Q262

1 Less than 20
2 20 to 99
3 100 to 500
4 Over 500

LFI_Q501

1 Full-time
2 Part-time

LFI_Q502

1 Elementary, junior high school, high school or equivalent
2 Community college, junior college, or CEGEP
3 University
4 Other - Specify

Codes for Exit component

PTC_Q02

1 ANY TIME
2 Morning
3 Afternoon
4 Evening
5 NOT morning
6 NOT afternoon
7 NOT evening

 

Update of the Classification of Instructional Programs (CIP) Canada 2011

April 1, 2014 (Previous notice)

The Classification of Instructional Programs (CIP) Canada is currently being updated. The updated CIP will be available in 2016.

At this time, Statistics Canada is soliciting input from data producers and data users to ensure their needs continue to be met by CIP, and from individuals, user groups, representatives of educational institutions or ministries of education and educational experts to ensure that the illustrative examples in CIP remain current. Proposals for updates to CIP should be submitted to standards-normes@statcan.gc.ca. Guidelines for submissions are presented below to assist you in providing your input.

Input is requested by January 05, 2015 but will be accepted until March 31, 2015. Decisions on proposed updates will be made between April 2014 and April 2015. To enable us to fully consider your suggestions in time for inclusion in this update, please send them early in the consultation period. You may send more than one submission, if that enables you to comment earlier.

Guidelines

This is an update of the classification only. It is not a revision. Accordingly, class descriptions and titles cannot be changed; no new classes will be added; no classes will be moved within the classification; no classes will be merged; and no content will be moved between classes.

Updates that will be considered include:

  • Suggestions for new postsecondary educational programs to be added as examples that were not included in CIP Canada 2011. When making such proposals, please suggest to which existing CIP class the educational program should be added as an example. Please also include any information available concerning the content of the program and any institutions where it is offered.
  • Suggestions for revisions to examples that were included in CIP Canada 2011. Perhaps some examples reflect terminology that is no longer in use, or perhaps some examples should have been more specific. Please include any available supporting information.

Proposals should include supporting information on the rationale for the suggested example(s).

Submissions may be in either official language. They should contain contact information to allow follow up with the submitter to obtain further information or clarification, if required.

CIP Canada 2011 may be viewed on the Statistics Canada website at: Classification of Instructional Programs (CIP) Canada 2011.

Criteria for updates to CIP

Please consider the following criteria when preparing your input to the update of the Classification of Instructional Programs.

Proposed changes should:

  1. Be consistent with classification principles of mutual exclusivity, exhaustiveness, and homogeneity within classes.
  2. Be relevant, that is, the updates must be of analytical interest, result in enhanced data usefulness to users and be based on appropriate research or subject matter expertise.

Statistics Canada - Producer Prices Division

XXXX

Collected under the authority of the Statistics Act
Revised Statutes of Canada, 1985, Chapter S19.
Completion of this questionnaire is a legal requirement under this act.

Month

Survey purpose

The prices you report are essential to the production of indexes measuring the movement of prices in the Canadian economy. In order to enhance the information you provide in this survey, Statistics Canada plans to combine the responses relating to your organization with the information you previously provided on this survey.

The reporting form sets out our request for price information for the period shown. We urge you to read the instructions carefully and fill in the requested information. Thank you for your cooperation.

If necessary, please make address label corrections below.

Company
Attn:
Street
City, Province
A1A 1A1

Please return the questionnaire by the end of the month.

Statistics Canada, Producer Prices Division, 170, Tunney's Pasture Driveway, Ottawa, Ontario K1A 0T6 Fax:  613-951-3117 or 1-855-314-8765

Statistics Canada

Should you require further information with respect to this report, please contact the Producer Prices Division Contact indicated on the reverse side. Please feel free to call collect or call 1-888-951-4550 for general enquiries.

The information and data pre-coded on this form reflects the respondent's preference.

Record linkages

To enhance the data from this survey, Statistics Canada may combine it with information from other surveys or from administrative sources.

Confidentiality

Statistics Canada is prohibited by law from releasing any information it collects which could identify any person, business, or organization, unless consent has been given by the respondent or as permitted by the Statistics Act. Statistics Canada will use the information from this survey for statistical purposes.

Fax or other electronic transmission disclosure
Statistics Canada advises you that there could be a risk of disclosure during facsimile or other electronic transmission.

However, upon receipt, Statistics Canada will provide the guaranteed level of protection afforded all information collected under the authority of the Statistics Act.

Respondent:

Contact :

CONTACT NAME

Email address@statcan.gc.ca

Commodity Specialist

(613) 951-

Commodity Specialist

To complete this Price Report:

1) In Box A enter the transaction price in effect on the 15th of the month indicated.

2) In Box B enter “NS” if no sales occurred and give an estimate in Box A for the transaction prices.

3) If there is any change in the description of product and/or transaction description please amend.

Product ID

  • Commodity Description:
  • Description of Product:
  • Transaction Description:
    • C1:
    • C3:
    • C2:
    • C4:

Date of last reported price change :
YYYY-MM

C1 to C4 Transaction description as specified above
Circle reasons for price change
Further explanation of price change
(pertinent market information)

Date:

  • A
  • B
  • C1
  • C2
  • C3
  • C4
  • D
    • 1
    • 2
    • 3
    • 4
    • 5
    • 6
    • 7

Reasons for price change :

  1. Material costs
  2. Labour costs
  3. Competitive factors
  4. Physical content
  5. Terms of sale
  6. Exchange rate
  7. Others - describe

Monthly Retail Trade Survey (MRTS) Data Quality Statement

Objectives, uses and users
Concepts, variables and classifications
Coverage and frames
Sampling
Questionnaire design
Response and non-response
Data collection and capture operations
Editing
Imputation
Estimation
Revisions and seasonal adjustment
Data quality evaluation
Disclosure control

1. Objectives, uses and users

1.1. Objective

The Monthly Retail Trade Survey (MRTS) provides information on the performance of the retail trade sector on a monthly basis, and when combined with other statistics, represents an important indicator of the state of the Canadian economy.

1.2. Uses

The estimates provide a measure of the health and performance of the retail trade sector. Information collected is used to estimate level and monthly trend for retail sales. At the end of each year, the estimates provide a preliminary look at annual retail sales and performance.

1.3. Users

A variety of organizations, sector associations, and levels of government make use of the information. Retailers rely on the survey results to compare their performance against similar types of businesses, as well as for marketing purposes. Retail associations are able to monitor industry performance and promote their retail industries. Investors can monitor industry growth, which can result in better access to investment capital by retailers. Governments are able to understand the role of retailers in the economy, which aids in the development of policies and tax incentives. As an important industry in the Canadian economy, governments are able to better determine the overall health of the economy through the use of the estimates in the calculation of the nation’s Gross Domestic Product (GDP).

2. Concepts, variables and classifications

2.1. Concepts

The retail trade sector comprises establishments primarily engaged in retailing merchandise, generally without transformation, and rendering services incidental to the sale of merchandise.

The retailing process is the final step in the distribution of merchandise; retailers are therefore organized to sell merchandise in small quantities to the general public. This sector comprises two main types of retailers, that is, store and non-store retailers. The MRTS covers only store retailers. Their main characteristics are described below. Store retailers operate fixed point-of-sale locations, located and designed to attract a high volume of walk-in customers. In general, retail stores have extensive displays of merchandise and use mass-media advertising to attract customers. They typically sell merchandise to the general public for personal or household consumption, but some also serve business and institutional clients. These include establishments such as office supplies stores, computer and software stores, gasoline stations, building material dealers, plumbing supplies stores and electrical supplies stores.

In addition to selling merchandise, some types of store retailers are also engaged in the provision of after-sales services, such as repair and installation. For example, new automobile dealers, electronic and appliance stores and musical instrument and supplies stores often provide repair services, while floor covering stores and window treatment stores often provide installation services. As a general rule, establishments engaged in retailing merchandise and providing after sales services are classified in this sector. Catalogue sales showrooms, gasoline service stations, and mobile home dealers are treated as store retailers.

2.2. Variables

Sales are defined as the sales of all goods purchased for resale, net of returns and discounts. This includes commission revenue and fees earned from selling goods and services on account of others, such as selling lottery tickets, bus tickets, and phone cards. It also includes parts and labour revenue from repair and maintenance; revenue from rental and leasing of goods and equipment; revenues from services, including food services; sales of goods manufactured as a secondary activity; and the proprietor’s withdrawals, at retail, of goods for personal use. Other revenue from rental of real estate, placement fees, operating subsidies, grants, royalties and franchise fees are excluded.

Trading Location is the physical location(s) in which business activity is conducted in each province and territory, and for which sales are credited or recognized in the financial records of the company. For retailers, this would normally be a store.

Constant Dollars: The value of retail trade is measured in two ways; including the effects of price change on sales and net of the effects of price change. The first measure is referred to as retail trade in current dollars and the latter as retail trade in constant dollars. The method of calculating the current dollar estimate is to aggregate the weighted value of sales for all retail outlets. The method of calculating the constant dollar estimate is to first adjust the sales values to a base year, using the Consumer Price Index, and then sum up the resulting values.

2.3. Classification

The Monthly Retail Trade Survey is based on the definition of retail trade under the NAICS (North American Industry Classification System). NAICS is the agreed upon common framework for the production of comparable statistics by the statistical agencies of Canada, Mexico and the United States. The agreement defines the boundaries of twenty sectors. NAICS is based on a production-oriented, or supply based conceptual framework in that establishments are groups into industries according to similarity in production processes used to produce goods and services.

Estimates appear for 21 industries based on special aggregations of the 2012 North American Industry Classification System (NAICS) industries. The 21 industries are further aggregated to 11 sub-sectors.

Geographically, sales estimates are produced for Canada and each province and territory.

3. Coverage and frames

Statistics Canada’s Business Register ( BR) provides the frame for the Monthly Retail Trade Survey. The BR is a structured list of businesses engaged in the production of goods and services in Canada. It is a centrally maintained database containing detailed descriptions of most business entities operating within Canada. The BR includes all incorporated businesses, with or without employees. For unincorporated businesses, the BR includes all employers with businesses, and businesses with no employees with annual sales that have a Goods and Services Tax (GST) or annual revenue that declares individual taxes.  annual sales greater than $30,000 that have a Goods and Services Tax (GST) account (the BR does not include unincorporated businesses with no employees and with annual sales less than $30,000).

The businesses on the BR are represented by a hierarchical structure with four levels, with the statistical enterprise at the top, followed by the statistical company, the statistical establishment and the statistical location. An enterprise can be linked to one or more statistical companies, a statistical company can be linked to one or more statistical establishments, and a statistical establishment to one or more statistical locations.

The target population for the MRTS consists of all statistical establishments on the BR that are classified to the retail sector using the North American Industry Classification System (NAICS) (approximately 200,000 establishments). The NAICS code range for the retail sector is 441100 to 453999. A statistical establishment is the production entity or the smallest grouping of production entities which: produces a homogeneous set of goods or services; does not cross provincial boundaries; and provides data on the value of output, together with the cost of principal intermediate inputs used, along with the cost and quantity of labour used to produce the output. The production entity is the physical unit where the business operations are carried out. It must have a civic address and dedicated labour.

The exclusions to the target population are ancillary establishments (producers of services in support of the activity of producing goods and services for the market of more than one establishment within the enterprise, and serves as a cost centre or a discretionary expense centre for which data on all its costs including labour and depreciation can be reported by the business), future establishments, establishments with a missing or a zero gross business income (GBI) value on the BR and establishments in the following non-covered NAICS:

  • 4541 (electronic shopping and mail-order houses)
  • 4542 (vending machine operators)
  • 45431 (fuel dealers)
  • 45439 (other direct selling establishments)

4. Sampling

The MRTS sample consists of 10,000 groups of establishments (clusters) classified to the Retail Trade sector selected from the Statistics Canada Business Register. A cluster of establishments is defined as all establishments belonging to a statistical enterprise that are in the same industrial group and geographical region. The MRTS uses a stratified design with simple random sample selection in each stratum. The stratification is done by industry groups (the mainly, but not only four digit level NAICS), and the geographical regions consisting of the provinces and territories, as well as three provincial sub-regions. We further stratify the population by size.

The size measure is created using a combination of independent survey data and three administrative variables: the annual profiled revenue, the GST sales expressed on an annual basis, and the declared tax revenue (T1 or T2). The size strata consist of one take-all (census), at most, two take-some (partially sampled) strata, and one take-none (non-sampled) stratum. Take-none strata serve to reduce respondent burden by excluding the smaller businesses from the surveyed population. These businesses should represent at most ten percent of total sales. Instead of sending questionnaires to these businesses, the estimates are produced through the use of administrative data.

The sample was allocated optimally in order to reach target coefficients of variation at the national, provincial/territorial, industrial, and industrial groups by province/territory levels. The sample was also inflated to compensate for dead, non-responding, and misclassified units.

MRTS is a repeated survey with maximisation of monthly sample overlap. The sample is kept month after month, and every month new units are added (births) to the sample.  MRTS births, i.e., new clusters of establishment(s), are identified every month via the BR’s latest universe. They are stratified according to the same criteria as the initial population. A sample of these births is selected according to the sampling fraction of the stratum to which they belong and is added to the monthly sample. Deaths occur on a monthly basis. A death can be a cluster of establishment(s) that have ceased their activities (out-of-business) or whose major activities are no longer in retail trade (out-of-scope). The status of these businesses is updated on the BR using administrative sources and survey feedback, including feedback from the MRTS. Methods to treat dead units and misclassified units are part of the sample and population update procedures.

5. Questionnaire design

The Monthly Retail Trade Survey incorporates the following sub-surveys:

Monthly Retail Trade Survey - R8

Monthly Retail Trade Survey (with inventories) – R8

Survey of Sales and Inventories of Alcoholic Beverages

The questionnaires collect monthly data on retail sales and the number of trading locations by province or territory and inventories of goods owned and intended for resale from a sample of retailers. The items on the questionnaires have remained unchanged for several years. For the 2004 redesign, the general questionnaires were subject to cosmetic changes only. The questionnaire for Sales and Inventories of Alcoholic Beverages underwent more extensive changes. The modifications were discussed withstakeholders and the respondents were given an opportunity to comment before the new questionnaire was finalized. If further changes are needed to any of the questionnaires, proposed changes would go through a review committee and a field test with respondents and data users to ensure its relevancy.

6. Response and non-response

6.1. Response and non-response

Despite the best efforts of survey managers and operations staff to maximize response in the MRTS, some non-response will occur. For statistical establishments to be classified as responding, the degree of partial response (where an accurate response is obtained for only some of the questions asked a respondent) must meet a minimum threshold level below which the response would be rejected and considered a unit non-response.  In such an instance, the business is classified as not having responded at all.

Non-response has two effects on data: first it introduces bias in estimates when non-respondents differ from respondents in the characteristics measured; and second, it contributes to an increase in the sampling variance of estimates because the effective sample size is reduced from that originally sought.

The degree to which efforts are made to get a response from a non-respondent is based on budget and time constraints, its impact on the overall quality and the risk of non-response bias.

The main method to reduce the impact of non-response at sampling is to inflate the sample size through the use of over-sampling rates that have been determined from similar surveys.

Besides the methods to reduce the impact of non-response at sampling and collection, the non-responses to the survey that do occur are treated through imputation. In order to measure the amount of non-response that occurs each month, various response rates are calculated. For a given reference month, the estimation process is run at least twice (a preliminary and a revised run). Between each run, respondent data can be identified as unusable and imputed values can be corrected through respondent data. As a consequence, response rates are computed following each run of the estimation process.

For the MRTS, two types of rates are calculated (un-weighted and weighted). In order to assess the efficiency of the collection process, un-weighted response rates are calculated. Weighted rates, using the estimation weight and the value for the variable of interest, assess the quality of estimation. Within each of these types of rates, there are distinct rates for units that are surveyed and for units that are only modeled from administrative data that has been extracted from GST files.

To get a better picture of the success of the collection process, two un-weighted rates called the ‘collection results rate’ and the ‘extraction results rate’ are computed. They are computed by dividing the number of respondents by the number of units that we tried to contact or tried to receive extracted data for them. Non-monthly reporters (respondents with special reporting arrangements where they do not report every month but for whom actual data is available in subsequent revisions) are excluded from both the numerator and denominator for the months where no contact is performed.

In summary, the various response rates are calculated as follows:

Weighted rates:

Survey Response rate (estimation) =
Sum of weighted sales of units with response status i / Sum of survey weighted sales

where i = units that have either reported data that will be used in estimation or are converted refusals, or have reported data that has not yet been resolved for estimation.

Admin Response rate (estimation) =
Sum of weighted sales of units with response status ii / Sum of administrative weighted sales

where ii = units that have data that was extracted from administrative files and are usable for estimation.

Total Response rate (estimation) =
Sum of weighted sales of units with response status i or response status ii / Sum of all weighted sales

Un-weighted rates:

Survey Response rate (collection) =
Number of questionnaires with response status iii/ Number of questionnaires with response status iv

where iii = units that have either reported data (unresolved, used or not used for estimation) or are converted refusals.

where iv = all of the above plus units that have refused to respond, units that were not contacted and other types of non-respondent units.

Admin Response rate (extraction) =
Number of questionnaires with response status vi/ Number of questionnaires with response status vii

where vi = in-scope units that have data (either usable or non-usable) that was extracted from administrative files

where vii = all of the above plus units that have refused to report to the administrative data source, units that were not contacted and other types of non-respondent units.

(% of questionnaire collected over all in-scope questionnaires)

Collection Results Rate =
Number of questionnaires with response status iii / Number of questionnaires with response status viii

where iii = same as iii defined above

where viii = same as iv except for the exclusion of units that were contacted because their response is unavailable for a particular month since they are non-monthly reporters.

Extraction Results Rate =
Number of questionnaires with response status ix / Number of questionnaires with response status vii

where ix = same as vi with the addition of extracted units that have been imputed or were out of scope

where vii = same as vii defined above

(% of questionnaires collected over all questionnaire in-scope we tried to collect)

All the above weighted and un-weighted rates are provided at the industrial group, geography and size group level or for any combination of these levels.

Use of Administrative Data

Managing response burden is an ongoing challenge for Statistics Canada. In an attempt to alleviate response burden and survey costs, especially for smaller businesses, the MRTS has reduced the number of simple establishments in the sample that are surveyed directly and instead derives sales data for these establishments from Goods and Service Tax (GST) files using a statistical model. The model accounts for differences between sales and revenue (reported for GST purposes) as well as for the time lag between the survey reference period and the reference period of the GST file.

For more information on the methodology used for modeling sales from administrative data sources, refer to ‘Monthly Retail Trade Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

Table 1 contains the weighted response rates for all industry groups as well as for total retail trade for each province and territory. For more detailed weighted response rates, please contact the Marketing and Dissemination Section at (613) 951-3549, toll free: 1-877-421-3067 or by e-mail at retailinfo@statcan.

6.2. Methods used to reduce non-response at collection

Significant effort is spent trying to minimize non-response during collection. Methods used, among others, are interviewer techniques such as probing and persuasion, repeated re-scheduling and call-backs to obtain the information, and procedures dealing with how to handle non-compliant (refusal) respondents.

If data are unavailable at the time of collection, a respondent's best estimates are also accepted, and are subsequently revised once the actual data become available.

To minimize total non-response for all variables, partial responses are accepted. In addition, questionnaires are customized for the collection of certain variables, such as inventory, so that collection is timed for those months when the data are available.

Finally, to build trust and rapport between the interviewers and respondents, cases are generally assigned to the same interviewer each month. This action establishes a personal relationship between interviewer and respondent, and builds respondent trust.

7. Data collection and capture operations

Collection of the data is performed by Statistics Canada’s Regional Offices.

Table 1
Weighted response rates by NAICS, for all provinces and territories: December 2014
Table summary
This table displays the results of Table 1: Weighted response rates by NAICS Weighted Response Rates (appearing as column headers).
  Weighted Response Rates
Total Survey Administrative
NAICS - Canada  
Motor Vehicle and Parts Dealers 91.8 92.6 56.9
Automobile Dealers 93.1 93.5 55.7
New Car Dealers 94.3 94.3 Note ...: not applicable
Used Car Dealers 74.1 78.2 55.7
Other Motor Vehicle Dealers 71.8 71.1 76.5
Automotive Parts, Accessories and Tire Stores 86.0 91.6 44.8
Furniture and Home Furnishings Stores 85.1 89.3 38.6
Furniture Stores 88.8 90.2 60.1
Home Furnishings Stores 79.8 88.0 28.4
Electronics and Appliance Stores 88.3 88.3 85.8
Building Material and Garden Equipment Dealers 84.5 87.2 54.8
Food and Beverage Stores 88.2 91.7 43.8
Grocery Stores 92.8 97.0 45.4
Grocery (except Convenience) Stores 93.9 97.9 47.2
Convenience Stores 75.8 83.2 30.9
Specialty Food Stores 71.3 76.2 45.7
Beer, Wine and Liquor Stores 78.7 80.1 24.3
Health and Personal Care Stores 90.9 91.4 83.8
Gasoline Stations 79.0 79.5 70.0
Clothing and Clothing Accessories Stores 87.9 88.5 62.9
Clothing Stores 88.0 88.5 61.0
Shoe Stores 91.1 91.9 11.9
Jewellery, Luggage and Leather Goods Stores 85.5 86.4 71.8
Sporting Goods, Hobby, Book and Music Stores 89.8 92.6 49.2
General Merchandise Stores 97.4 97.9 23.8
Department Stores 100.0 100.0 Note ...: not applicable
Other general merchandise stores 94.9 96.0 23.8
Miscellaneous Store Retailers 81.9 87.0 39.5
Total 89.4 91.0 53.5
Regions  
Newfoundland and Labrador 82.7 83.8 33.7
Prince Edward Island 89.3 90.3 29.5
Nova Scotia 92.4 93.4 57.1
New Brunswick 89.4 91.7 44.6
Québec 87.6 89.2 64.0
Ontario 91.9 93.8 47.9
Manitoba 84.6 85.2 57.1
Saskatchewan 90.7 92.5 44.4
Alberta 90.3 91.9 54.7
British Columbia 85.5 87.1 47.2
Yukon Territory 87.0 87.0 Note ...: not applicable
Northwest Territories 89.7 89.7 Note ...: not applicable
Nunavut 70.9 70.9 Note ...: not applicable

Weighted Response Rates

Respondents are sent a questionnaire or are contacted by telephone to obtain their sales and inventory values, as well as to confirm the opening or closing of business trading locations. Collection of the data begins approximately 7 working days after the end of the reference month and continues for the duration of that month.

New entrants to the survey are introduced to the survey via an introductory letter that informs the respondent that a representative of Statistics Canada will be calling. This call is to introduce the respondent to the survey, confirm the respondent's business activity, establish and begin data collection, as well as to answer any questions that the respondent may have.

8. Editing

Data editing is the application of checks to detect missing, invalid or inconsistent entries or to point to data records that are potentially in error. In the survey process for the MRTS, data editing is done at two different time periods.

First of all, editing is done during data collection. Once data are collected via the telephone, or via the receipt of completed mail-in questionnaires, the data are captured using customized data capture applications. All data are subjected to data editing. Edits during data collection are referred to as field edits and generally consist of validity and some simple consistency edits. They are used to detect mistakes made during the interview by the respondent or the interviewer and to identify missing information during collection in order to reduce the need for follow-up later on. Another purpose of the field edits is to clean up responses. In the MRTS, the current month’s responses are edited against the respondent’s previous month’s responses and/or the previous year’s responses for the current month. Field edits are also used to identify problems with data collection procedures and the design of the questionnaire, as well as the need for more interviewer training.

Follow-up with respondents occurs to validate potential erroneous data following any failed preliminary edit check of the data. Once validated, the collected data is regularly transmitted to the head office in Ottawa.

Secondly, editing known as statistical editing is also done after data collection and this is more empirical in nature. Statistical editing is run prior to imputation in order to identify the data that will be used as a basis to impute non-respondents. Large outliers that could disrupt a monthly trend are excluded from trend calculations by the statistical edits. It should be noted that adjustments are not made at this stage to correct the reported outliers.

The first step in the statistical editing is to identify which responses will be subjected to the statistical edit rules. Reported data for the current reference month will go through various edit checks.

The first set of edit checks is based on the Hidiriglou-Berthelot method whereby a ratio of the respondent’s current month data over historical (last month, same month last year) or auxiliary data is analyzed. When the respondent’s ratio differs significantly from ratios of respondents who are similar in terms of industry and/or geography group, the response is deemed an outlier.

The second set of edits consists of an edit known as the share of market edit. With this method, one is able to edit all respondents, even those where historical and auxiliary data is unavailable. The method relies on current month data only. Therefore, within a group of respondents, that are similar in terms of industrial group and/or geography, if the weighted contribution of a respondent to the group’s total is too large, it will be flagged as an outlier.

For edit checks based on the Hidiriglou-Berthelot method, data that are flagged as an outlier will not be included in the imputation models (those based on ratios). Also, data that are flagged as outliers in the share of market edit will not be included in the imputation models where means and medians are calculated to impute for responses that have no historical responses.

In conjunction with the statistical editing after data collection of reported data, there is also error detection done on the extracted GST data. Modeled data based on the GST are also subject to an extensive series of processing steps which thoroughly verify each record that is the basis for the model as well as the record being modeled. Edits are performed at a more aggregate level (industry by geography level) to detect records which deviate from the expected range, either by exhibiting large month-to-month change, or differing significantly from the remaining units. All data which fail these edits are subject to manual inspection and possible corrective action.

9. Imputation

Imputation in the MRTS is the process used to assign replacement values for missing data. This is done by assigning values when they are missing on the record being edited to ensure that estimates are of high quality and that a plausible, internal consistency is created. Due to concerns of response burden, cost and timeliness, it is generally impossible to do all follow-ups with the respondents in order to resolve missing responses. Since it is desirable to produce a complete and consistent microdata file, imputation is used to handle the remaining missing cases.

In the MRTS, imputation is based on historical data or administrative data (GST sales). The appropriate method is selected according to a strategy that is based on whether historical data is available, auxiliary data is available and/or which reference month is being processed.

There are three types of historical imputation methods. The first type is a general trend that uses one historical data source (previous month, data from next month or data from same month previous year). The second type is a regression model where data from previous month and same month, previous year are used simultaneously. The third type uses the historical data as a direct replacement value for a non-respondent. Depending upon the particular reference month, there is an order of preference that exists so that top quality imputation can result. The historical imputation method that was labelled as the third type above is always the last option in the order for each reference month.

The imputation method using administrative data is automatically selected when historical information is unavailable for a non-respondent. Trends are then applied to the administrative data source (monthly size) depending on whether the structure is simple, e.g. enterprises with only one establishment, or the unit has a more complex structure.

10. Estimation

Estimation is a process that approximates unknown population parameters using only part of the population that is included in a sample. Inferences about these unknown parameters are then made, using the sample data and associated survey design. This stage uses Statistics Canada's Generalized Estimation System (GES).

For retail sales, the population is divided into a survey portion (take-all and take-some strata) and a non-survey portion (take-none stratum). From the sample that is drawn from the survey portion, an estimate for the population is determined through the use of a Horvitz-Thompson estimator where responses for sales are weighted by using the inverses of the inclusion probabilities of the sampled units. Such weights (called sampling weights) can be interpreted as the number of times that each sampled unit should be replicated to represent the entire population. The calculated weighted sales values are summed by domain, to produce the total sales estimates by each industrial group / geographic area combination. A domain is defined as the most recent classification values available from the BR for the unit and the survey reference period. These domains may differ from the original sampling strata because units may have changed size, industry or location. Changes in classification are reflected immediately in the estimates and do not accumulate over time. For the non-survey portion, the sales are estimated with statistical models using monthly GST sales.

For more information on the methodology for modeling sales from administrative data sources which also contributes to the estimates of the survey portion, refer to ‘Monthly Retail Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

The measure of precision used for the MRTS to evaluate the quality of a population parameter estimate and to obtain valid inferences is the variance. The variance from the survey portion is derived directly from a stratified simple random sample without replacement.

Sample estimates may differ from the expected value of the estimates. However, since the estimate is based on a probability sample, the variability of the sample estimate with respect to its expected value can be measured. The variance of an estimate is a measure of the precision of the sample estimate and is defined as the average, over all possible samples, of the squared difference of the estimate from its expected value.

11. Revisions and seasonal adjustment

Revisions in the raw data are required to correct known non-sampling errors. These normally include replacing imputed data with reported data, corrections to previously reported data, and estimates for new births that were not known at the time of the original estimates.

Raw data are revised, on a monthly basis, for the month immediately prior to the current reference month being published. That is, when data for December are being published for the first time, there will also be revisions, if necessary, to the raw data for November. In addition, revisions are made once a year, with the initial release of the February data, for all months in the previous years. The purpose is to correct any significant problems that have been found that apply for an extended period. The actual period of revision depends on the nature of the problem identified, but rarely exceeds three years. The revision period can be extended when historical revisions or restratitfication are done.

Retail trade data are seasonally adjusted using the X12-ARIMA method. This consists of extrapolating a year's worth of raw data with the ARIMA model (auto-regressive integrated moving average model), and of seasonally adjusting the raw time series. Finally, the annual totals of the seasonally adjusted series are forced to the annual totals of the original series.

The seasonally adjusted data also need to be revised. In part, they need to reflect the revisions identified for the raw data. Also, the seasonally adjusted estimates are calculated using X-12-ARIMA, and are sensitive to the most recent values reported in the raw data. For this reason, with the release of each month of new data, the seasonally adjusted values for the previous three months are revised.  A seasonally adjusted time series is a time series that has been modified to eliminate the effect of seasonal and calendar influences. For this reason, the seasonally adjusted data allows for more meaningful comparisons of economic conditions from month to month.

Once a year, seasonal adjustments options are reviewed to take into account the most recent data. Revised seasonally adjusted estimates for each month in the previous years are released at the same time as the annual revision to the raw data. The actual period of revision depends on the number years the raw data was revised.

12. Data quality evaluation

The methodology of this survey has been designed to control errors and to reduce their potential effects on estimates. However, the survey results remain subject to errors, of which sampling error is only one component of the total survey error. Sampling error results when observations are made only on a sample and not on the entire population. All other errors arising from the various phases of a survey are referred to as nonsampling errors. For example, these types of errors can occur when a respondent provides incorrect information or does not answer certain questions; when a unit in the target population is omitted or covered more than once; when GST data for records being modeled for a particular month are not representative of the actual record for various reasons; when a unit that is out of scope for the survey is included by mistake or when errors occur in data processing, such as coding or capture errors.

Prior to publication, combined survey results are analyzed for comparability; in general, this includes a detailed review of individual responses (especially for large businesses), general economic conditions and historical trends.

A common measure of data quality for surveys is the coefficient of variation (CV). The coefficient of variation, defined as the standard error divided by the sample estimate, is a measure of precision in relative terms. Since the coefficient of variation is calculated from responses of individual units, it also measures some non-sampling errors.

The formula used to calculate coefficients of variation (CV) as percentages is:

CV (X) = S(X) * 100% / X
where X denotes the estimate and S(X) denotes the standard error of X.

Confidence intervals can be constructed around the estimates using the estimate and the CV. Thus, for our sample, it is possible to state with a given level of confidence that the expected value will fall within the confidence interval constructed around the estimate. For example, if an estimate of $12,000,000 has a CV of 2%, the standard error will be $240,000 (the estimate multiplied by the CV). It can be stated with 68% confidence that the expected values will fall within the interval whose length equals the standard deviation about the estimate, i.e. between $11,760,000 and $12,240,000.

Alternatively, it can be stated with 95% confidence that the expected value will fall within the interval whose length equals two standard deviations about the estimate, i.e. between $11,520,000 and $12,480,000.

Finally, due to the small contribution of the non-survey portion to the total estimates, bias in the non-survey portion has a negligible impact on the CVs. Therefore, the CV from the survey portion is used for the total estimate that is the summation of estimates from the surveyed and non-surveyed portions.

13. Disclosure control

Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.

Confidentiality analysis includes the detection of possible "direct disclosure", which occurs when the value in a tabulation cell is composed of a few respondents or when the cell is dominated by a few companies.

 
 

Monthly Retail Trade Survey (MRTS) Data Quality Statement

Objectives, uses and users
Concepts, variables and classifications
Coverage and frames
Sampling
Questionnaire design
Response and non-response
Data collection and capture operations
Editing
Imputation
Estimation
Revisions and seasonal adjustment
Data quality evaluation
Disclosure control

1. Objectives, uses and users

1.1. Objective

The Monthly Retail Trade Survey (MRTS) provides information on the performance of the retail trade sector on a monthly basis, and when combined with other statistics, represents an important indicator of the state of the Canadian economy.

1.2. Uses

The estimates provide a measure of the health and performance of the retail trade sector. Information collected is used to estimate level and monthly trend for retail sales. At the end of each year, the estimates provide a preliminary look at annual retail sales and performance.

1.3. Users

A variety of organizations, sector associations, and levels of government make use of the information. Retailers rely on the survey results to compare their performance against similar types of businesses, as well as for marketing purposes. Retail associations are able to monitor industry performance and promote their retail industries. Investors can monitor industry growth, which can result in better access to investment capital by retailers. Governments are able to understand the role of retailers in the economy, which aids in the development of policies and tax incentives. As an important industry in the Canadian economy, governments are able to better determine the overall health of the economy through the use of the estimates in the calculation of the nation’s Gross Domestic Product (GDP).

2. Concepts, variables and classifications

2.1. Concepts

The retail trade sector comprises establishments primarily engaged in retailing merchandise, generally without transformation, and rendering services incidental to the sale of merchandise.

The retailing process is the final step in the distribution of merchandise; retailers are therefore organized to sell merchandise in small quantities to the general public. This sector comprises two main types of retailers, that is, store and non-store retailers. The MRTS covers only store retailers. Their main characteristics are described below. Store retailers operate fixed point-of-sale locations, located and designed to attract a high volume of walk-in customers. In general, retail stores have extensive displays of merchandise and use mass-media advertising to attract customers. They typically sell merchandise to the general public for personal or household consumption, but some also serve business and institutional clients. These include establishments such as office supplies stores, computer and software stores, gasoline stations, building material dealers, plumbing supplies stores and electrical supplies stores.

In addition to selling merchandise, some types of store retailers are also engaged in the provision of after-sales services, such as repair and installation. For example, new automobile dealers, electronic and appliance stores and musical instrument and supplies stores often provide repair services, while floor covering stores and window treatment stores often provide installation services. As a general rule, establishments engaged in retailing merchandise and providing after sales services are classified in this sector. Catalogue sales showrooms, gasoline service stations, and mobile home dealers are treated as store retailers.

2.2. Variables

Sales are defined as the sales of all goods purchased for resale, net of returns and discounts. This includes commission revenue and fees earned from selling goods and services on account of others, such as selling lottery tickets, bus tickets, and phone cards. It also includes parts and labour revenue from repair and maintenance; revenue from rental and leasing of goods and equipment; revenues from services, including food services; sales of goods manufactured as a secondary activity; and the proprietor’s withdrawals, at retail, of goods for personal use. Other revenue from rental of real estate, placement fees, operating subsidies, grants, royalties and franchise fees are excluded.

Trading Location is the physical location(s) in which business activity is conducted in each province and territory, and for which sales are credited or recognized in the financial records of the company. For retailers, this would normally be a store.

Constant Dollars: The value of retail trade is measured in two ways; including the effects of price change on sales and net of the effects of price change. The first measure is referred to as retail trade in current dollars and the latter as retail trade in constant dollars. The method of calculating the current dollar estimate is to aggregate the weighted value of sales for all retail outlets. The method of calculating the constant dollar estimate is to first adjust the sales values to a base year, using the Consumer Price Index, and then sum up the resulting values.

2.3. Classification

The Monthly Retail Trade Survey is based on the definition of retail trade under the NAICS (North American Industry Classification System). NAICS is the agreed upon common framework for the production of comparable statistics by the statistical agencies of Canada, Mexico and the United States. The agreement defines the boundaries of twenty sectors. NAICS is based on a production-oriented, or supply based conceptual framework in that establishments are groups into industries according to similarity in production processes used to produce goods and services.

Estimates appear for 21 industries based on special aggregations of the 2012 North American Industry Classification System (NAICS) industries. The 21 industries are further aggregated to 11 sub-sectors.

Geographically, sales estimates are produced for Canada and each province and territory.

3. Coverage and frames

Statistics Canada’s Business Register ( BR) provides the frame for the Monthly Retail Trade Survey. The BR is a structured list of businesses engaged in the production of goods and services in Canada. It is a centrally maintained database containing detailed descriptions of most business entities operating within Canada. The BR includes all incorporated businesses, with or without employees. For unincorporated businesses, the BR includes all employers with businesses, and businesses with no employees with annual sales that have a Goods and Services Tax (GST) or annual revenue that declares individual taxes.  annual sales greater than $30,000 that have a Goods and Services Tax (GST) account (the BR does not include unincorporated businesses with no employees and with annual sales less than $30,000).

The businesses on the BR are represented by a hierarchical structure with four levels, with the statistical enterprise at the top, followed by the statistical company, the statistical establishment and the statistical location. An enterprise can be linked to one or more statistical companies, a statistical company can be linked to one or more statistical establishments, and a statistical establishment to one or more statistical locations.

The target population for the MRTS consists of all statistical establishments on the BR that are classified to the retail sector using the North American Industry Classification System (NAICS) (approximately 200,000 establishments). The NAICS code range for the retail sector is 441100 to 453999. A statistical establishment is the production entity or the smallest grouping of production entities which: produces a homogeneous set of goods or services; does not cross provincial boundaries; and provides data on the value of output, together with the cost of principal intermediate inputs used, along with the cost and quantity of labour used to produce the output. The production entity is the physical unit where the business operations are carried out. It must have a civic address and dedicated labour.

The exclusions to the target population are ancillary establishments (producers of services in support of the activity of producing goods and services for the market of more than one establishment within the enterprise, and serves as a cost centre or a discretionary expense centre for which data on all its costs including labour and depreciation can be reported by the business), future establishments, establishments with a missing or a zero gross business income (GBI) value on the BR and establishments in the following non-covered NAICS:

  • 4541 (electronic shopping and mail-order houses)
  • 4542 (vending machine operators)
  • 45431 (fuel dealers)
  • 45439 (other direct selling establishments)

4. Sampling

The MRTS sample consists of 10,000 groups of establishments (clusters) classified to the Retail Trade sector selected from the Statistics Canada Business Register. A cluster of establishments is defined as all establishments belonging to a statistical enterprise that are in the same industrial group and geographical region. The MRTS uses a stratified design with simple random sample selection in each stratum. The stratification is done by industry groups (the mainly, but not only four digit level NAICS), and the geographical regions consisting of the provinces and territories, as well as three provincial sub-regions. We further stratify the population by size.

The size measure is created using a combination of independent survey data and three administrative variables: the annual profiled revenue, the GST sales expressed on an annual basis, and the declared tax revenue (T1 or T2). The size strata consist of one take-all (census), at most, two take-some (partially sampled) strata, and one take-none (non-sampled) stratum. Take-none strata serve to reduce respondent burden by excluding the smaller businesses from the surveyed population. These businesses should represent at most ten percent of total sales. Instead of sending questionnaires to these businesses, the estimates are produced through the use of administrative data.

The sample was allocated optimally in order to reach target coefficients of variation at the national, provincial/territorial, industrial, and industrial groups by province/territory levels. The sample was also inflated to compensate for dead, non-responding, and misclassified units.

MRTS is a repeated survey with maximisation of monthly sample overlap. The sample is kept month after month, and every month new units are added (births) to the sample.  MRTS births, i.e., new clusters of establishment(s), are identified every month via the BR’s latest universe. They are stratified according to the same criteria as the initial population. A sample of these births is selected according to the sampling fraction of the stratum to which they belong and is added to the monthly sample. Deaths occur on a monthly basis. A death can be a cluster of establishment(s) that have ceased their activities (out-of-business) or whose major activities are no longer in retail trade (out-of-scope). The status of these businesses is updated on the BR using administrative sources and survey feedback, including feedback from the MRTS. Methods to treat dead units and misclassified units are part of the sample and population update procedures.

5. Questionnaire design

The Monthly Retail Trade Survey incorporates the following sub-surveys:

Monthly Retail Trade Survey - R8

Monthly Retail Trade Survey (with inventories) – R8

Survey of Sales and Inventories of Alcoholic Beverages

The questionnaires collect monthly data on retail sales and the number of trading locations by province or territory and inventories of goods owned and intended for resale from a sample of retailers. The items on the questionnaires have remained unchanged for several years. For the 2004 redesign, the general questionnaires were subject to cosmetic changes only. The questionnaire for Sales and Inventories of Alcoholic Beverages underwent more extensive changes. The modifications were discussed withstakeholders and the respondents were given an opportunity to comment before the new questionnaire was finalized. If further changes are needed to any of the questionnaires, proposed changes would go through a review committee and a field test with respondents and data users to ensure its relevancy.

6. Response and non-response

6.1. Response and non-response

Despite the best efforts of survey managers and operations staff to maximize response in the MRTS, some non-response will occur. For statistical establishments to be classified as responding, the degree of partial response (where an accurate response is obtained for only some of the questions asked a respondent) must meet a minimum threshold level below which the response would be rejected and considered a unit non-response.  In such an instance, the business is classified as not having responded at all.

Non-response has two effects on data: first it introduces bias in estimates when non-respondents differ from respondents in the characteristics measured; and second, it contributes to an increase in the sampling variance of estimates because the effective sample size is reduced from that originally sought.

The degree to which efforts are made to get a response from a non-respondent is based on budget and time constraints, its impact on the overall quality and the risk of non-response bias.

The main method to reduce the impact of non-response at sampling is to inflate the sample size through the use of over-sampling rates that have been determined from similar surveys.

Besides the methods to reduce the impact of non-response at sampling and collection, the non-responses to the survey that do occur are treated through imputation. In order to measure the amount of non-response that occurs each month, various response rates are calculated. For a given reference month, the estimation process is run at least twice (a preliminary and a revised run). Between each run, respondent data can be identified as unusable and imputed values can be corrected through respondent data. As a consequence, response rates are computed following each run of the estimation process.

For the MRTS, two types of rates are calculated (un-weighted and weighted). In order to assess the efficiency of the collection process, un-weighted response rates are calculated. Weighted rates, using the estimation weight and the value for the variable of interest, assess the quality of estimation. Within each of these types of rates, there are distinct rates for units that are surveyed and for units that are only modeled from administrative data that has been extracted from GST files.

To get a better picture of the success of the collection process, two un-weighted rates called the ‘collection results rate’ and the ‘extraction results rate’ are computed. They are computed by dividing the number of respondents by the number of units that we tried to contact or tried to receive extracted data for them. Non-monthly reporters (respondents with special reporting arrangements where they do not report every month but for whom actual data is available in subsequent revisions) are excluded from both the numerator and denominator for the months where no contact is performed.

In summary, the various response rates are calculated as follows:

Weighted rates:

Survey Response rate (estimation) =
Sum of weighted sales of units with response status i / Sum of survey weighted sales

where i = units that have either reported data that will be used in estimation or are converted refusals, or have reported data that has not yet been resolved for estimation.

Admin Response rate (estimation) =
Sum of weighted sales of units with response status ii / Sum of administrative weighted sales

where ii = units that have data that was extracted from administrative files and are usable for estimation.

Total Response rate (estimation) =
Sum of weighted sales of units with response status i or response status ii / Sum of all weighted sales

Un-weighted rates:

Survey Response rate (collection) =
Number of questionnaires with response status iii/ Number of questionnaires with response status iv

where iii = units that have either reported data (unresolved, used or not used for estimation) or are converted refusals.

where iv = all of the above plus units that have refused to respond, units that were not contacted and other types of non-respondent units.

Admin Response rate (extraction) =
Number of questionnaires with response status vi/ Number of questionnaires with response status vii

where vi = in-scope units that have data (either usable or non-usable) that was extracted from administrative files

where vii = all of the above plus units that have refused to report to the administrative data source, units that were not contacted and other types of non-respondent units.

(% of questionnaire collected over all in-scope questionnaires)

Collection Results Rate =
Number of questionnaires with response status iii / Number of questionnaires with response status viii

where iii = same as iii defined above

where viii = same as iv except for the exclusion of units that were contacted because their response is unavailable for a particular month since they are non-monthly reporters.

Extraction Results Rate =
Number of questionnaires with response status ix / Number of questionnaires with response status vii

where ix = same as vi with the addition of extracted units that have been imputed or were out of scope

where vii = same as vii defined above

(% of questionnaires collected over all questionnaire in-scope we tried to collect)

All the above weighted and un-weighted rates are provided at the industrial group, geography and size group level or for any combination of these levels.

Use of Administrative Data

Managing response burden is an ongoing challenge for Statistics Canada. In an attempt to alleviate response burden and survey costs, especially for smaller businesses, the MRTS has reduced the number of simple establishments in the sample that are surveyed directly and instead derives sales data for these establishments from Goods and Service Tax (GST) files using a statistical model. The model accounts for differences between sales and revenue (reported for GST purposes) as well as for the time lag between the survey reference period and the reference period of the GST file.

For more information on the methodology used for modeling sales from administrative data sources, refer to ‘Monthly Retail Trade Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

Table 1 contains the weighted response rates for all industry groups as well as for total retail trade for each province and territory. For more detailed weighted response rates, please contact the Marketing and Dissemination Section at (613) 951-3549, toll free: 1-877-421-3067 or by e-mail at retailinfo@statcan.

6.2. Methods used to reduce non-response at collection

Significant effort is spent trying to minimize non-response during collection. Methods used, among others, are interviewer techniques such as probing and persuasion, repeated re-scheduling and call-backs to obtain the information, and procedures dealing with how to handle non-compliant (refusal) respondents.

If data are unavailable at the time of collection, a respondent's best estimates are also accepted, and are subsequently revised once the actual data become available.

To minimize total non-response for all variables, partial responses are accepted. In addition, questionnaires are customized for the collection of certain variables, such as inventory, so that collection is timed for those months when the data are available.

Finally, to build trust and rapport between the interviewers and respondents, cases are generally assigned to the same interviewer each month. This action establishes a personal relationship between interviewer and respondent, and builds respondent trust.

7. Data collection and capture operations

Collection of the data is performed by Statistics Canada’s Regional Offices.

Table 1
Weighted response rates by NAICS, for all provinces and territories: May 2014
Table summary
This table displays the results of Weighted response rates by NAICS Weighted Response Rates (appearing as column headers).
  Weighted Response Rates
Total Survey Administrative
NAICS - Canada  
Motor Vehicle and Parts Dealers 93.2 94.0 60.2
Automobile Dealers 95.1 95.5 55.0
New Car Dealers 96.7 96.7 Note ...: not applicable
Used Car Dealers 66.7 69.1 55.0
Other Motor Vehicle Dealers 75.1 76.7 63.9
Automotive Parts, Accessories and Tire Stores 87.8 91.1 60.9
Furniture and Home Furnishings Stores 88.8 93.0 54.1
Furniture Stores 93.1 94.9 59.8
Home Furnishings Stores 81.2 88.7 51.5
Electronics and Appliance Stores 85.4 86.7 46.0
Building Material and Garden Equipment Dealers 88.9 91.6 57.8
Food and Beverage Stores 83.0 85.9 46.7
Grocery Stores 88.8 92.3 48.2
Grocery (except Convenience) Stores 92.3 95.8 50.1
Convenience Stores 42.8 44.3 32.3
Specialty Food Stores 67.8 73.9 43.7
Beer, Wine and Liquor Stores 64.5 65.2 32.8
Health and Personal Care Stores 90.2 90.7 82.1
Gasoline Stations 72.5 72.9 65.3
Clothing and Clothing Accessories Stores 83.7 84.6 50.1
Clothing Stores 82.6 83.3 54.8
Shoe Stores 88.8 90.1 0.0
Jewellery, Luggage and Leather Goods Stores 86.2 88.8 49.7
Sporting Goods, Hobby, Book and Music Stores 86.9 92.7 24.8
General Merchandise Stores 98.5 99.1 20.0
Department Stores 100.0 100.0 Note ...: not applicable
Other general merchandise stores 97.5 98.5 20.0
Miscellaneous Store Retailers 79.9 86.6 28.5
Total 87.6 89.3 53.7
Regions  
Newfoundland and Labrador 88.4 89.6 42.1
Prince Edward Island 87.7 89.2 12.0
Nova Scotia 90.4 91.6 59.5
New Brunswick 85.8 88.0 46.9
Québec 87.6 89.8 53.7
Ontario 88.6 90.3 52.7
Manitoba 88.3 89.0 59.4
Saskatchewan 89.2 90.7 51.6
Alberta 87.2 88.2 67.4
British Columbia 84.2 86.3 40.3
Yukon Territory 83.9 83.9 0.0
Northwest Territories 83.6 83.6 0.0
Nunavut 72.9 72.9 0.0

Weighted Response Rates

Respondents are sent a questionnaire or are contacted by telephone to obtain their sales and inventory values, as well as to confirm the opening or closing of business trading locations. Collection of the data begins approximately 7 working days after the end of the reference month and continues for the duration of that month.

New entrants to the survey are introduced to the survey via an introductory letter that informs the respondent that a representative of Statistics Canada will be calling. This call is to introduce the respondent to the survey, confirm the respondent's business activity, establish and begin data collection, as well as to answer any questions that the respondent may have.

8. Editing

Data editing is the application of checks to detect missing, invalid or inconsistent entries or to point to data records that are potentially in error. In the survey process for the MRTS, data editing is done at two different time periods.

First of all, editing is done during data collection. Once data are collected via the telephone, or via the receipt of completed mail-in questionnaires, the data are captured using customized data capture applications. All data are subjected to data editing. Edits during data collection are referred to as field edits and generally consist of validity and some simple consistency edits. They are used to detect mistakes made during the interview by the respondent or the interviewer and to identify missing information during collection in order to reduce the need for follow-up later on. Another purpose of the field edits is to clean up responses. In the MRTS, the current month’s responses are edited against the respondent’s previous month’s responses and/or the previous year’s responses for the current month. Field edits are also used to identify problems with data collection procedures and the design of the questionnaire, as well as the need for more interviewer training.

Follow-up with respondents occurs to validate potential erroneous data following any failed preliminary edit check of the data. Once validated, the collected data is regularly transmitted to the head office in Ottawa.

Secondly, editing known as statistical editing is also done after data collection and this is more empirical in nature. Statistical editing is run prior to imputation in order to identify the data that will be used as a basis to impute non-respondents. Large outliers that could disrupt a monthly trend are excluded from trend calculations by the statistical edits. It should be noted that adjustments are not made at this stage to correct the reported outliers.

The first step in the statistical editing is to identify which responses will be subjected to the statistical edit rules. Reported data for the current reference month will go through various edit checks.

The first set of edit checks is based on the Hidiriglou-Berthelot method whereby a ratio of the respondent’s current month data over historical (last month, same month last year) or auxiliary data is analyzed. When the respondent’s ratio differs significantly from ratios of respondents who are similar in terms of industry and/or geography group, the response is deemed an outlier.

The second set of edits consists of an edit known as the share of market edit. With this method, one is able to edit all respondents, even those where historical and auxiliary data is unavailable. The method relies on current month data only. Therefore, within a group of respondents, that are similar in terms of industrial group and/or geography, if the weighted contribution of a respondent to the group’s total is too large, it will be flagged as an outlier.

For edit checks based on the Hidiriglou-Berthelot method, data that are flagged as an outlier will not be included in the imputation models (those based on ratios). Also, data that are flagged as outliers in the share of market edit will not be included in the imputation models where means and medians are calculated to impute for responses that have no historical responses.

In conjunction with the statistical editing after data collection of reported data, there is also error detection done on the extracted GST data. Modeled data based on the GST are also subject to an extensive series of processing steps which thoroughly verify each record that is the basis for the model as well as the record being modeled. Edits are performed at a more aggregate level (industry by geography level) to detect records which deviate from the expected range, either by exhibiting large month-to-month change, or differing significantly from the remaining units. All data which fail these edits are subject to manual inspection and possible corrective action.

9. Imputation

Imputation in the MRTS is the process used to assign replacement values for missing data. This is done by assigning values when they are missing on the record being edited to ensure that estimates are of high quality and that a plausible, internal consistency is created. Due to concerns of response burden, cost and timeliness, it is generally impossible to do all follow-ups with the respondents in order to resolve missing responses. Since it is desirable to produce a complete and consistent microdata file, imputation is used to handle the remaining missing cases.

In the MRTS, imputation is based on historical data or administrative data (GST sales). The appropriate method is selected according to a strategy that is based on whether historical data is available, auxiliary data is available and/or which reference month is being processed.

There are three types of historical imputation methods. The first type is a general trend that uses one historical data source (previous month, data from next month or data from same month previous year). The second type is a regression model where data from previous month and same month, previous year are used simultaneously. The third type uses the historical data as a direct replacement value for a non-respondent. Depending upon the particular reference month, there is an order of preference that exists so that top quality imputation can result. The historical imputation method that was labelled as the third type above is always the last option in the order for each reference month.

The imputation method using administrative data is automatically selected when historical information is unavailable for a non-respondent. Trends are then applied to the administrative data source (monthly size) depending on whether the structure is simple, e.g. enterprises with only one establishment, or the unit has a more complex structure.

10. Estimation

Estimation is a process that approximates unknown population parameters using only part of the population that is included in a sample. Inferences about these unknown parameters are then made, using the sample data and associated survey design. This stage uses Statistics Canada's Generalized Estimation System (GES).

For retail sales, the population is divided into a survey portion (take-all and take-some strata) and a non-survey portion (take-none stratum). From the sample that is drawn from the survey portion, an estimate for the population is determined through the use of a Horvitz-Thompson estimator where responses for sales are weighted by using the inverses of the inclusion probabilities of the sampled units. Such weights (called sampling weights) can be interpreted as the number of times that each sampled unit should be replicated to represent the entire population. The calculated weighted sales values are summed by domain, to produce the total sales estimates by each industrial group / geographic area combination. A domain is defined as the most recent classification values available from the BR for the unit and the survey reference period. These domains may differ from the original sampling strata because units may have changed size, industry or location. Changes in classification are reflected immediately in the estimates and do not accumulate over time. For the non-survey portion, the sales are estimated with statistical models using monthly GST sales.

For more information on the methodology for modeling sales from administrative data sources which also contributes to the estimates of the survey portion, refer to ‘Monthly Retail Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

The measure of precision used for the MRTS to evaluate the quality of a population parameter estimate and to obtain valid inferences is the variance. The variance from the survey portion is derived directly from a stratified simple random sample without replacement.

Sample estimates may differ from the expected value of the estimates. However, since the estimate is based on a probability sample, the variability of the sample estimate with respect to its expected value can be measured. The variance of an estimate is a measure of the precision of the sample estimate and is defined as the average, over all possible samples, of the squared difference of the estimate from its expected value.

11. Revisions and seasonal adjustment

Revisions in the raw data are required to correct known non-sampling errors. These normally include replacing imputed data with reported data, corrections to previously reported data, and estimates for new births that were not known at the time of the original estimates.

Raw data are revised, on a monthly basis, for the month immediately prior to the current reference month being published. That is, when data for December are being published for the first time, there will also be revisions, if necessary, to the raw data for November. In addition, revisions are made once a year, with the initial release of the February data, for all months in the previous years. The purpose is to correct any significant problems that have been found that apply for an extended period. The actual period of revision depends on the nature of the problem identified, but rarely exceeds three years. The revision period can be extended when historical revisions or restratitfication are done.

Retail trade data are seasonally adjusted using the X12-ARIMA method. This consists of extrapolating a year's worth of raw data with the ARIMA model (auto-regressive integrated moving average model), and of seasonally adjusting the raw time series. Finally, the annual totals of the seasonally adjusted series are forced to the annual totals of the original series.

The seasonally adjusted data also need to be revised. In part, they need to reflect the revisions identified for the raw data. Also, the seasonally adjusted estimates are calculated using X-12-ARIMA, and are sensitive to the most recent values reported in the raw data. For this reason, with the release of each month of new data, the seasonally adjusted values for the previous three months are revised.  A seasonally adjusted time series is a time series that has been modified to eliminate the effect of seasonal and calendar influences. For this reason, the seasonally adjusted data allows for more meaningful comparisons of economic conditions from month to month.

Once a year, seasonal adjustments options are reviewed to take into account the most recent data. Revised seasonally adjusted estimates for each month in the previous years are released at the same time as the annual revision to the raw data. The actual period of revision depends on the number years the raw data was revised.

12. Data quality evaluation

The methodology of this survey has been designed to control errors and to reduce their potential effects on estimates. However, the survey results remain subject to errors, of which sampling error is only one component of the total survey error. Sampling error results when observations are made only on a sample and not on the entire population. All other errors arising from the various phases of a survey are referred to as nonsampling errors. For example, these types of errors can occur when a respondent provides incorrect information or does not answer certain questions; when a unit in the target population is omitted or covered more than once; when GST data for records being modeled for a particular month are not representative of the actual record for various reasons; when a unit that is out of scope for the survey is included by mistake or when errors occur in data processing, such as coding or capture errors.

Prior to publication, combined survey results are analyzed for comparability; in general, this includes a detailed review of individual responses (especially for large businesses), general economic conditions and historical trends.

A common measure of data quality for surveys is the coefficient of variation (CV). The coefficient of variation, defined as the standard error divided by the sample estimate, is a measure of precision in relative terms. Since the coefficient of variation is calculated from responses of individual units, it also measures some non-sampling errors.

The formula used to calculate coefficients of variation (CV) as percentages is:

CV (X) = S(X) * 100% / X
where X denotes the estimate and S(X) denotes the standard error of X.

Confidence intervals can be constructed around the estimates using the estimate and the CV. Thus, for our sample, it is possible to state with a given level of confidence that the expected value will fall within the confidence interval constructed around the estimate. For example, if an estimate of $12,000,000 has a CV of 2%, the standard error will be $240,000 (the estimate multiplied by the CV). It can be stated with 68% confidence that the expected values will fall within the interval whose length equals the standard deviation about the estimate, i.e. between $11,760,000 and $12,240,000.

Alternatively, it can be stated with 95% confidence that the expected value will fall within the interval whose length equals two standard deviations about the estimate, i.e. between $11,520,000 and $12,480,000.

Finally, due to the small contribution of the non-survey portion to the total estimates, bias in the non-survey portion has a negligible impact on the CVs. Therefore, the CV from the survey portion is used for the total estimate that is the summation of estimates from the surveyed and non-surveyed portions.

13. Disclosure control

Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.

Confidentiality analysis includes the detection of possible "direct disclosure", which occurs when the value in a tabulation cell is composed of a few respondents or when the cell is dominated by a few companies.

Monthly Retail Trade Survey (MRTS) Data Quality Statement

Objectives, uses and users
Concepts, variables and classifications
Coverage and frames
Sampling
Questionnaire design
Response and non-response
Data collection and capture operations
Editing
Imputation
Estimation
Revisions and seasonal adjustment
Data quality evaluation
Disclosure control

1. Objectives, uses and users

1.1. Objective

The Monthly Retail Trade Survey (MRTS) provides information on the performance of the retail trade sector on a monthly basis, and when combined with other statistics, represents an important indicator of the state of the Canadian economy.

1.2. Uses

The estimates provide a measure of the health and performance of the retail trade sector. Information collected is used to estimate level and monthly trend for retail sales. At the end of each year, the estimates provide a preliminary look at annual retail sales and performance.

1.3. Users

A variety of organizations, sector associations, and levels of government make use of the information. Retailers rely on the survey results to compare their performance against similar types of businesses, as well as for marketing purposes. Retail associations are able to monitor industry performance and promote their retail industries. Investors can monitor industry growth, which can result in better access to investment capital by retailers. Governments are able to understand the role of retailers in the economy, which aids in the development of policies and tax incentives. As an important industry in the Canadian economy, governments are able to better determine the overall health of the economy through the use of the estimates in the calculation of the nation’s Gross Domestic Product (GDP).

2. Concepts, variables and classifications

2.1. Concepts

The retail trade sector comprises establishments primarily engaged in retailing merchandise, generally without transformation, and rendering services incidental to the sale of merchandise.

The retailing process is the final step in the distribution of merchandise; retailers are therefore organized to sell merchandise in small quantities to the general public. This sector comprises two main types of retailers, that is, store and non-store retailers. The MRTS covers only store retailers. Their main characteristics are described below. Store retailers operate fixed point-of-sale locations, located and designed to attract a high volume of walk-in customers. In general, retail stores have extensive displays of merchandise and use mass-media advertising to attract customers. They typically sell merchandise to the general public for personal or household consumption, but some also serve business and institutional clients. These include establishments such as office supplies stores, computer and software stores, gasoline stations, building material dealers, plumbing supplies stores and electrical supplies stores.

In addition to selling merchandise, some types of store retailers are also engaged in the provision of after-sales services, such as repair and installation. For example, new automobile dealers, electronic and appliance stores and musical instrument and supplies stores often provide repair services, while floor covering stores and window treatment stores often provide installation services. As a general rule, establishments engaged in retailing merchandise and providing after sales services are classified in this sector. Catalogue sales showrooms, gasoline service stations, and mobile home dealers are treated as store retailers.

2.2. Variables

Sales are defined as the sales of all goods purchased for resale, net of returns and discounts. This includes commission revenue and fees earned from selling goods and services on account of others, such as selling lottery tickets, bus tickets, and phone cards. It also includes parts and labour revenue from repair and maintenance; revenue from rental and leasing of goods and equipment; revenues from services, including food services; sales of goods manufactured as a secondary activity; and the proprietor’s withdrawals, at retail, of goods for personal use. Other revenue from rental of real estate, placement fees, operating subsidies, grants, royalties and franchise fees are excluded.

Trading Location is the physical location(s) in which business activity is conducted in each province and territory, and for which sales are credited or recognized in the financial records of the company. For retailers, this would normally be a store.

Constant Dollars: The value of retail trade is measured in two ways; including the effects of price change on sales and net of the effects of price change. The first measure is referred to as retail trade in current dollars and the latter as retail trade in constant dollars. The method of calculating the current dollar estimate is to aggregate the weighted value of sales for all retail outlets. The method of calculating the constant dollar estimate is to first adjust the sales values to a base year, using the Consumer Price Index, and then sum up the resulting values.

2.3. Classification

The Monthly Retail Trade Survey is based on the definition of retail trade under the NAICS (North American Industry Classification System). NAICS is the agreed upon common framework for the production of comparable statistics by the statistical agencies of Canada, Mexico and the United States. The agreement defines the boundaries of twenty sectors. NAICS is based on a production-oriented, or supply based conceptual framework in that establishments are groups into industries according to similarity in production processes used to produce goods and services.

Estimates appear for 21 industries based on special aggregations of the 2012 North American Industry Classification System (NAICS) industries. The 21 industries are further aggregated to 11 sub-sectors.

Geographically, sales estimates are produced for Canada and each province and territory.

3. Coverage and frames

Statistics Canada’s Business Register ( BR) provides the frame for the Monthly Retail Trade Survey. The BR is a structured list of businesses engaged in the production of goods and services in Canada. It is a centrally maintained database containing detailed descriptions of most business entities operating within Canada. The BR includes all incorporated businesses, with or without employees. For unincorporated businesses, the BR includes all employers with businesses, and businesses with no employees with annual sales that have a Goods and Services Tax (GST) or annual revenue that declares individual taxes.  annual sales greater than $30,000 that have a Goods and Services Tax (GST) account (the BR does not include unincorporated businesses with no employees and with annual sales less than $30,000).

The businesses on the BR are represented by a hierarchical structure with four levels, with the statistical enterprise at the top, followed by the statistical company, the statistical establishment and the statistical location. An enterprise can be linked to one or more statistical companies, a statistical company can be linked to one or more statistical establishments, and a statistical establishment to one or more statistical locations.

The target population for the MRTS consists of all statistical establishments on the BR that are classified to the retail sector using the North American Industry Classification System (NAICS) (approximately 200,000 establishments). The NAICS code range for the retail sector is 441100 to 453999. A statistical establishment is the production entity or the smallest grouping of production entities which: produces a homogeneous set of goods or services; does not cross provincial boundaries; and provides data on the value of output, together with the cost of principal intermediate inputs used, along with the cost and quantity of labour used to produce the output. The production entity is the physical unit where the business operations are carried out. It must have a civic address and dedicated labour.

The exclusions to the target population are ancillary establishments (producers of services in support of the activity of producing goods and services for the market of more than one establishment within the enterprise, and serves as a cost centre or a discretionary expense centre for which data on all its costs including labour and depreciation can be reported by the business), future establishments, establishments with a missing or a zero gross business income (GBI) value on the BR and establishments in the following non-covered NAICS:

  • 4541 (electronic shopping and mail-order houses)
  • 4542 (vending machine operators)
  • 45431 (fuel dealers)
  • 45439 (other direct selling establishments)

4. Sampling

The MRTS sample consists of 10,000 groups of establishments (clusters) classified to the Retail Trade sector selected from the Statistics Canada Business Register. A cluster of establishments is defined as all establishments belonging to a statistical enterprise that are in the same industrial group and geographical region. The MRTS uses a stratified design with simple random sample selection in each stratum. The stratification is done by industry groups (the mainly, but not only four digit level NAICS), and the geographical regions consisting of the provinces and territories, as well as three provincial sub-regions. We further stratify the population by size.

The size measure is created using a combination of independent survey data and three administrative variables: the annual profiled revenue, the GST sales expressed on an annual basis, and the declared tax revenue (T1 or T2). The size strata consist of one take-all (census), at most, two take-some (partially sampled) strata, and one take-none (non-sampled) stratum. Take-none strata serve to reduce respondent burden by excluding the smaller businesses from the surveyed population. These businesses should represent at most ten percent of total sales. Instead of sending questionnaires to these businesses, the estimates are produced through the use of administrative data.

The sample was allocated optimally in order to reach target coefficients of variation at the national, provincial/territorial, industrial, and industrial groups by province/territory levels. The sample was also inflated to compensate for dead, non-responding, and misclassified units.

MRTS is a repeated survey with maximisation of monthly sample overlap. The sample is kept month after month, and every month new units are added (births) to the sample.  MRTS births, i.e., new clusters of establishment(s), are identified every month via the BR’s latest universe. They are stratified according to the same criteria as the initial population. A sample of these births is selected according to the sampling fraction of the stratum to which they belong and is added to the monthly sample. Deaths occur on a monthly basis. A death can be a cluster of establishment(s) that have ceased their activities (out-of-business) or whose major activities are no longer in retail trade (out-of-scope). The status of these businesses is updated on the BR using administrative sources and survey feedback, including feedback from the MRTS. Methods to treat dead units and misclassified units are part of the sample and population update procedures.

5. Questionnaire design

The Monthly Retail Trade Survey incorporates the following sub-surveys:

Monthly Retail Trade Survey - R8

Monthly Retail Trade Survey (with inventories) – R8

Survey of Sales and Inventories of Alcoholic Beverages

The questionnaires collect monthly data on retail sales and the number of trading locations by province or territory and inventories of goods owned and intended for resale from a sample of retailers. The items on the questionnaires have remained unchanged for several years. For the 2004 redesign, the general questionnaires were subject to cosmetic changes only. The questionnaire for Sales and Inventories of Alcoholic Beverages underwent more extensive changes. The modifications were discussed withstakeholders and the respondents were given an opportunity to comment before the new questionnaire was finalized. If further changes are needed to any of the questionnaires, proposed changes would go through a review committee and a field test with respondents and data users to ensure its relevancy.

6. Response and non-response

6.1. Response and non-response

Despite the best efforts of survey managers and operations staff to maximize response in the MRTS, some non-response will occur. For statistical establishments to be classified as responding, the degree of partial response (where an accurate response is obtained for only some of the questions asked a respondent) must meet a minimum threshold level below which the response would be rejected and considered a unit non-response.  In such an instance, the business is classified as not having responded at all.

Non-response has two effects on data: first it introduces bias in estimates when non-respondents differ from respondents in the characteristics measured; and second, it contributes to an increase in the sampling variance of estimates because the effective sample size is reduced from that originally sought.

The degree to which efforts are made to get a response from a non-respondent is based on budget and time constraints, its impact on the overall quality and the risk of non-response bias.

The main method to reduce the impact of non-response at sampling is to inflate the sample size through the use of over-sampling rates that have been determined from similar surveys.

Besides the methods to reduce the impact of non-response at sampling and collection, the non-responses to the survey that do occur are treated through imputation. In order to measure the amount of non-response that occurs each month, various response rates are calculated. For a given reference month, the estimation process is run at least twice (a preliminary and a revised run). Between each run, respondent data can be identified as unusable and imputed values can be corrected through respondent data. As a consequence, response rates are computed following each run of the estimation process.

For the MRTS, two types of rates are calculated (un-weighted and weighted). In order to assess the efficiency of the collection process, un-weighted response rates are calculated. Weighted rates, using the estimation weight and the value for the variable of interest, assess the quality of estimation. Within each of these types of rates, there are distinct rates for units that are surveyed and for units that are only modeled from administrative data that has been extracted from GST files.

To get a better picture of the success of the collection process, two un-weighted rates called the ‘collection results rate’ and the ‘extraction results rate’ are computed. They are computed by dividing the number of respondents by the number of units that we tried to contact or tried to receive extracted data for them. Non-monthly reporters (respondents with special reporting arrangements where they do not report every month but for whom actual data is available in subsequent revisions) are excluded from both the numerator and denominator for the months where no contact is performed.

In summary, the various response rates are calculated as follows:

Weighted rates:

Survey Response rate (estimation) =
Sum of weighted sales of units with response status i / Sum of survey weighted sales

where i = units that have either reported data that will be used in estimation or are converted refusals, or have reported data that has not yet been resolved for estimation.

Admin Response rate (estimation) =
Sum of weighted sales of units with response status ii / Sum of administrative weighted sales

where ii = units that have data that was extracted from administrative files and are usable for estimation.

Total Response rate (estimation) =
Sum of weighted sales of units with response status i or response status ii / Sum of all weighted sales

Un-weighted rates:

Survey Response rate (collection) =
Number of questionnaires with response status iii/ Number of questionnaires with response status iv

where iii = units that have either reported data (unresolved, used or not used for estimation) or are converted refusals.

where iv = all of the above plus units that have refused to respond, units that were not contacted and other types of non-respondent units.

Admin Response rate (extraction) =
Number of questionnaires with response status vi/ Number of questionnaires with response status vii

where vi = in-scope units that have data (either usable or non-usable) that was extracted from administrative files

where vii = all of the above plus units that have refused to report to the administrative data source, units that were not contacted and other types of non-respondent units.

(% of questionnaire collected over all in-scope questionnaires)

Collection Results Rate =
Number of questionnaires with response status iii / Number of questionnaires with response status viii

where iii = same as iii defined above

where viii = same as iv except for the exclusion of units that were contacted because their response is unavailable for a particular month since they are non-monthly reporters.

Extraction Results Rate =
Number of questionnaires with response status ix / Number of questionnaires with response status vii

where ix = same as vi with the addition of extracted units that have been imputed or were out of scope

where vii = same as vii defined above

(% of questionnaires collected over all questionnaire in-scope we tried to collect)

All the above weighted and un-weighted rates are provided at the industrial group, geography and size group level or for any combination of these levels.

Use of Administrative Data

Managing response burden is an ongoing challenge for Statistics Canada. In an attempt to alleviate response burden and survey costs, especially for smaller businesses, the MRTS has reduced the number of simple establishments in the sample that are surveyed directly and instead derives sales data for these establishments from Goods and Service Tax (GST) files using a statistical model. The model accounts for differences between sales and revenue (reported for GST purposes) as well as for the time lag between the survey reference period and the reference period of the GST file.

For more information on the methodology used for modeling sales from administrative data sources, refer to ‘Monthly Retail Trade Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

Table 1 contains the weighted response rates for all industry groups as well as for total retail trade for each province and territory. For more detailed weighted response rates, please contact the Marketing and Dissemination Section at (613) 951-3549, toll free: 1-877-421-3067 or by e-mail at retailinfo@statcan.

6.2. Methods used to reduce non-response at collection

Significant effort is spent trying to minimize non-response during collection. Methods used, among others, are interviewer techniques such as probing and persuasion, repeated re-scheduling and call-backs to obtain the information, and procedures dealing with how to handle non-compliant (refusal) respondents.

If data are unavailable at the time of collection, a respondent's best estimates are also accepted, and are subsequently revised once the actual data become available.

To minimize total non-response for all variables, partial responses are accepted. In addition, questionnaires are customized for the collection of certain variables, such as inventory, so that collection is timed for those months when the data are available.

Finally, to build trust and rapport between the interviewers and respondents, cases are generally assigned to the same interviewer each month. This action establishes a personal relationship between interviewer and respondent, and builds respondent trust.

7. Data collection and capture operations

Collection of the data is performed by Statistics Canada’s Regional Offices.

Table 1: Weighted response rates by NAICS, for all provinces and territories: July 2016
Table summary
This table displays the results of Table 1: Weighted response rates by NAICS Weighted Response Rates, calculated using Total, Survey and Administrative units of measure (appearing as column headers).
  Weighted Response Rates
Total Survey Administrative
NAICS - Canada  
Motor Vehicle and Parts Dealers 90.8 91.1 73.2
Automobile Dealers 92.9 93.1 78.2
New Car Dealers 94.5 94.5 Note ...: not applicable
Used Car Dealers 66.7 64.2 78.2
Other Motor Vehicle Dealers 69.3 68.1 82.6
Automotive Parts, Accessories and Tire Stores 81.0 84.2 55.1
Furniture and Home Furnishings Stores 76.4 78.6 54.9
Furniture Stores 75.8 76.7 58.5
Home Furnishings Stores 77.5 82.5 52.7
Electronics and Appliance Stores 80.4 80.5 74.4
Building Material and Garden Equipment Dealers 89.0 89.9 79.5
Food and Beverage Stores 87.9 89.3 67.8
Grocery Stores 89.0 90.4 71.2
Grocery (except Convenience) Stores 91.6 92.8 74.5
Convenience Stores 56.1 57.4 46.4
Specialty Food Stores 54.4 57.0 41.5
Beer, Wine and Liquor Stores 90.7 91.1 70.8
Health and Personal Care Stores 78.2 77.3 92.3
Gasoline Stations 79.2 80.0 68.0
Clothing and Clothing Accessories Stores 83.8 84.8 35.0
Clothing Stores 84.2 85.3 21.1
Shoe Stores 86.1 86.0 93.3
Jewellery, Luggage and Leather Goods Stores 76.8 78.4 50.3
Sporting Goods, Hobby, Book and Music Stores 84.9 90.1 22.1
General Merchandise Stores 95.8 95.8 92.5
Department Stores 91.8 91.8 Note ...: not applicable
Other general merchandise stores 98.3 98.3 92.5
Miscellaneous Store Retailers 76.9 79.1 49.9
Total 87.2 88.1 68.7
Regions  
Newfoundland and Labrador 82.0 82.0 80.4
Prince Edward Island 80.5 80.8 51.4
Nova Scotia 87.2 87.2 85.1
New Brunswick 81.5 82.2 68.2
Québec 90.1 91.1 73.2
Ontario 87.4 88.6 60.4
Manitoba 83.8 84.3 63.6
Saskatchewan 88.7 89.0 81.8
Alberta 84.0 84.5 73.7
British Columbia 87.9 88.5 71.9
Yukon Territory 77.3 77.3 Note ...: not applicable
Northwest Territories 80.5 80.5 Note ...: not applicable
Nunavut 93.1 93.1 Note ...: not applicable


Weighted Response Rates

Respondents are sent a questionnaire or are contacted by telephone to obtain their sales and inventory values, as well as to confirm the opening or closing of business trading locations. Collection of the data begins approximately 7 working days after the end of the reference month and continues for the duration of that month.

New entrants to the survey are introduced to the survey via an introductory letter that informs the respondent that a representative of Statistics Canada will be calling. This call is to introduce the respondent to the survey, confirm the respondent's business activity, establish and begin data collection, as well as to answer any questions that the respondent may have.

8. Editing

Data editing is the application of checks to detect missing, invalid or inconsistent entries or to point to data records that are potentially in error. In the survey process for the MRTS, data editing is done at two different time periods.

First of all, editing is done during data collection. Once data are collected via the telephone, or via the receipt of completed mail-in questionnaires, the data are captured using customized data capture applications. All data are subjected to data editing. Edits during data collection are referred to as field edits and generally consist of validity and some simple consistency edits. They are used to detect mistakes made during the interview by the respondent or the interviewer and to identify missing information during collection in order to reduce the need for follow-up later on. Another purpose of the field edits is to clean up responses. In the MRTS, the current month’s responses are edited against the respondent’s previous month’s responses and/or the previous year’s responses for the current month. Field edits are also used to identify problems with data collection procedures and the design of the questionnaire, as well as the need for more interviewer training.

Follow-up with respondents occurs to validate potential erroneous data following any failed preliminary edit check of the data. Once validated, the collected data is regularly transmitted to the head office in Ottawa.

Secondly, editing known as statistical editing is also done after data collection and this is more empirical in nature. Statistical editing is run prior to imputation in order to identify the data that will be used as a basis to impute non-respondents. Large outliers that could disrupt a monthly trend are excluded from trend calculations by the statistical edits. It should be noted that adjustments are not made at this stage to correct the reported outliers.

The first step in the statistical editing is to identify which responses will be subjected to the statistical edit rules. Reported data for the current reference month will go through various edit checks.

The first set of edit checks is based on the Hidiriglou-Berthelot method whereby a ratio of the respondent’s current month data over historical (last month, same month last year) or auxiliary data is analyzed. When the respondent’s ratio differs significantly from ratios of respondents who are similar in terms of industry and/or geography group, the response is deemed an outlier.

The second set of edits consists of an edit known as the share of market edit. With this method, one is able to edit all respondents, even those where historical and auxiliary data is unavailable. The method relies on current month data only. Therefore, within a group of respondents, that are similar in terms of industrial group and/or geography, if the weighted contribution of a respondent to the group’s total is too large, it will be flagged as an outlier.

For edit checks based on the Hidiriglou-Berthelot method, data that are flagged as an outlier will not be included in the imputation models (those based on ratios). Also, data that are flagged as outliers in the share of market edit will not be included in the imputation models where means and medians are calculated to impute for responses that have no historical responses.

In conjunction with the statistical editing after data collection of reported data, there is also error detection done on the extracted GST data. Modeled data based on the GST are also subject to an extensive series of processing steps which thoroughly verify each record that is the basis for the model as well as the record being modeled. Edits are performed at a more aggregate level (industry by geography level) to detect records which deviate from the expected range, either by exhibiting large month-to-month change, or differing significantly from the remaining units. All data which fail these edits are subject to manual inspection and possible corrective action.

9. Imputation

Imputation in the MRTS is the process used to assign replacement values for missing data. This is done by assigning values when they are missing on the record being edited to ensure that estimates are of high quality and that a plausible, internal consistency is created. Due to concerns of response burden, cost and timeliness, it is generally impossible to do all follow-ups with the respondents in order to resolve missing responses. Since it is desirable to produce a complete and consistent microdata file, imputation is used to handle the remaining missing cases.

In the MRTS, imputation is based on historical data or administrative data (GST sales). The appropriate method is selected according to a strategy that is based on whether historical data is available, auxiliary data is available and/or which reference month is being processed.

There are three types of historical imputation methods. The first type is a general trend that uses one historical data source (previous month, data from next month or data from same month previous year). The second type is a regression model where data from previous month and same month, previous year are used simultaneously. The third type uses the historical data as a direct replacement value for a non-respondent. Depending upon the particular reference month, there is an order of preference that exists so that top quality imputation can result. The historical imputation method that was labelled as the third type above is always the last option in the order for each reference month.

The imputation method using administrative data is automatically selected when historical information is unavailable for a non-respondent. Trends are then applied to the administrative data source (monthly size) depending on whether the structure is simple, e.g. enterprises with only one establishment, or the unit has a more complex structure.

10. Estimation

Estimation is a process that approximates unknown population parameters using only part of the population that is included in a sample. Inferences about these unknown parameters are then made, using the sample data and associated survey design. This stage uses Statistics Canada's Generalized Estimation System (GES).

For retail sales, the population is divided into a survey portion (take-all and take-some strata) and a non-survey portion (take-none stratum). From the sample that is drawn from the survey portion, an estimate for the population is determined through the use of a Horvitz-Thompson estimator where responses for sales are weighted by using the inverses of the inclusion probabilities of the sampled units. Such weights (called sampling weights) can be interpreted as the number of times that each sampled unit should be replicated to represent the entire population. The calculated weighted sales values are summed by domain, to produce the total sales estimates by each industrial group / geographic area combination. A domain is defined as the most recent classification values available from the BR for the unit and the survey reference period. These domains may differ from the original sampling strata because units may have changed size, industry or location. Changes in classification are reflected immediately in the estimates and do not accumulate over time. For the non-survey portion, the sales are estimated with statistical models using monthly GST sales.

For more information on the methodology for modeling sales from administrative data sources which also contributes to the estimates of the survey portion, refer to ‘Monthly Retail Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

The measure of precision used for the MRTS to evaluate the quality of a population parameter estimate and to obtain valid inferences is the variance. The variance from the survey portion is derived directly from a stratified simple random sample without replacement.

Sample estimates may differ from the expected value of the estimates. However, since the estimate is based on a probability sample, the variability of the sample estimate with respect to its expected value can be measured. The variance of an estimate is a measure of the precision of the sample estimate and is defined as the average, over all possible samples, of the squared difference of the estimate from its expected value.

11. Revisions and seasonal adjustment

Revisions in the raw data are required to correct known non-sampling errors. These normally include replacing imputed data with reported data, corrections to previously reported data, and estimates for new births that were not known at the time of the original estimates.

Raw data are revised, on a monthly basis, for the month immediately prior to the current reference month being published. That is, when data for December are being published for the first time, there will also be revisions, if necessary, to the raw data for November. In addition, revisions are made once a year, with the initial release of the February data, for all months in the previous years. The purpose is to correct any significant problems that have been found that apply for an extended period. The actual period of revision depends on the nature of the problem identified, but rarely exceeds three years. The revision period can be extended when historical revisions or restratitfication are done.

Retail trade data are seasonally adjusted using the X12-ARIMA method. This consists of extrapolating a year's worth of raw data with the ARIMA model (auto-regressive integrated moving average model), and of seasonally adjusting the raw time series. Finally, the annual totals of the seasonally adjusted series are forced to the annual totals of the original series.

The seasonally adjusted data also need to be revised. In part, they need to reflect the revisions identified for the raw data. Also, the seasonally adjusted estimates are calculated using X-12-ARIMA, and are sensitive to the most recent values reported in the raw data. For this reason, with the release of each month of new data, the seasonally adjusted values for the previous three months are revised.  A seasonally adjusted time series is a time series that has been modified to eliminate the effect of seasonal and calendar influences. For this reason, the seasonally adjusted data allows for more meaningful comparisons of economic conditions from month to month.

Once a year, seasonal adjustments options are reviewed to take into account the most recent data. Revised seasonally adjusted estimates for each month in the previous years are released at the same time as the annual revision to the raw data. The actual period of revision depends on the number years the raw data was revised.

12. Data quality evaluation

The methodology of this survey has been designed to control errors and to reduce their potential effects on estimates. However, the survey results remain subject to errors, of which sampling error is only one component of the total survey error. Sampling error results when observations are made only on a sample and not on the entire population. All other errors arising from the various phases of a survey are referred to as nonsampling errors. For example, these types of errors can occur when a respondent provides incorrect information or does not answer certain questions; when a unit in the target population is omitted or covered more than once; when GST data for records being modeled for a particular month are not representative of the actual record for various reasons; when a unit that is out of scope for the survey is included by mistake or when errors occur in data processing, such as coding or capture errors.

Prior to publication, combined survey results are analyzed for comparability; in general, this includes a detailed review of individual responses (especially for large businesses), general economic conditions and historical trends.

A common measure of data quality for surveys is the coefficient of variation (CV). The coefficient of variation, defined as the standard error divided by the sample estimate, is a measure of precision in relative terms. Since the coefficient of variation is calculated from responses of individual units, it also measures some non-sampling errors.

The formula used to calculate coefficients of variation (CV) as percentages is:

CV (X) = S(X) * 100% / X
where X denotes the estimate and S(X) denotes the standard error of X.

Confidence intervals can be constructed around the estimates using the estimate and the CV. Thus, for our sample, it is possible to state with a given level of confidence that the expected value will fall within the confidence interval constructed around the estimate. For example, if an estimate of $12,000,000 has a CV of 2%, the standard error will be $240,000 (the estimate multiplied by the CV). It can be stated with 68% confidence that the expected values will fall within the interval whose length equals the standard deviation about the estimate, i.e. between $11,760,000 and $12,240,000.

Alternatively, it can be stated with 95% confidence that the expected value will fall within the interval whose length equals two standard deviations about the estimate, i.e. between $11,520,000 and $12,480,000.

Finally, due to the small contribution of the non-survey portion to the total estimates, bias in the non-survey portion has a negligible impact on the CVs. Therefore, the CV from the survey portion is used for the total estimate that is the summation of estimates from the surveyed and non-surveyed portions.

13. Disclosure control

Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.

Confidentiality analysis includes the detection of possible "direct disclosure", which occurs when the value in a tabulation cell is composed of a few respondents or when the cell is dominated by a few companies.

Monthly Retail Trade Survey (MRTS) Data Quality Statement

Objectives, uses and users
Concepts, variables and classifications
Coverage and frames
Sampling
Questionnaire design
Response and non-response
Data collection and capture operations
Editing
Imputation
Estimation
Revisions and seasonal adjustment
Data quality evaluation
Disclosure control

1. Objectives, uses and users

1.1. Objective

The Monthly Retail Trade Survey (MRTS) provides information on the performance of the retail trade sector on a monthly basis, and when combined with other statistics, represents an important indicator of the state of the Canadian economy.

1.2. Uses

The estimates provide a measure of the health and performance of the retail trade sector. Information collected is used to estimate level and monthly trend for retail sales. At the end of each year, the estimates provide a preliminary look at annual retail sales and performance.

1.3. Users

A variety of organizations, sector associations, and levels of government make use of the information. Retailers rely on the survey results to compare their performance against similar types of businesses, as well as for marketing purposes. Retail associations are able to monitor industry performance and promote their retail industries. Investors can monitor industry growth, which can result in better access to investment capital by retailers. Governments are able to understand the role of retailers in the economy, which aids in the development of policies and tax incentives. As an important industry in the Canadian economy, governments are able to better determine the overall health of the economy through the use of the estimates in the calculation of the nation’s Gross Domestic Product (GDP).

2. Concepts, variables and classifications

2.1. Concepts

The retail trade sector comprises establishments primarily engaged in retailing merchandise, generally without transformation, and rendering services incidental to the sale of merchandise.

The retailing process is the final step in the distribution of merchandise; retailers are therefore organized to sell merchandise in small quantities to the general public. This sector comprises two main types of retailers, that is, store and non-store retailers. The MRTS covers only store retailers. Their main characteristics are described below. Store retailers operate fixed point-of-sale locations, located and designed to attract a high volume of walk-in customers. In general, retail stores have extensive displays of merchandise and use mass-media advertising to attract customers. They typically sell merchandise to the general public for personal or household consumption, but some also serve business and institutional clients. These include establishments such as office supplies stores, computer and software stores, gasoline stations, building material dealers, plumbing supplies stores and electrical supplies stores.

In addition to selling merchandise, some types of store retailers are also engaged in the provision of after-sales services, such as repair and installation. For example, new automobile dealers, electronic and appliance stores and musical instrument and supplies stores often provide repair services, while floor covering stores and window treatment stores often provide installation services. As a general rule, establishments engaged in retailing merchandise and providing after sales services are classified in this sector. Catalogue sales showrooms, gasoline service stations, and mobile home dealers are treated as store retailers.

2.2. Variables

Sales are defined as the sales of all goods purchased for resale, net of returns and discounts. This includes commission revenue and fees earned from selling goods and services on account of others, such as selling lottery tickets, bus tickets, and phone cards. It also includes parts and labour revenue from repair and maintenance; revenue from rental and leasing of goods and equipment; revenues from services, including food services; sales of goods manufactured as a secondary activity; and the proprietor’s withdrawals, at retail, of goods for personal use. Other revenue from rental of real estate, placement fees, operating subsidies, grants, royalties and franchise fees are excluded.

Trading Location is the physical location(s) in which business activity is conducted in each province and territory, and for which sales are credited or recognized in the financial records of the company. For retailers, this would normally be a store.

Constant Dollars: The value of retail trade is measured in two ways; including the effects of price change on sales and net of the effects of price change. The first measure is referred to as retail trade in current dollars and the latter as retail trade in constant dollars. The method of calculating the current dollar estimate is to aggregate the weighted value of sales for all retail outlets. The method of calculating the constant dollar estimate is to first adjust the sales values to a base year, using the Consumer Price Index, and then sum up the resulting values.

2.3. Classification

The Monthly Retail Trade Survey is based on the definition of retail trade under the NAICS (North American Industry Classification System). NAICS is the agreed upon common framework for the production of comparable statistics by the statistical agencies of Canada, Mexico and the United States. The agreement defines the boundaries of twenty sectors. NAICS is based on a production-oriented, or supply based conceptual framework in that establishments are groups into industries according to similarity in production processes used to produce goods and services.

Estimates appear for 21 industries based on special aggregations of the 2012 North American Industry Classification System (NAICS) industries. The 21 industries are further aggregated to 11 sub-sectors.

Geographically, sales estimates are produced for Canada and each province and territory.

3. Coverage and frames

Statistics Canada’s Business Register ( BR) provides the frame for the Monthly Retail Trade Survey. The BR is a structured list of businesses engaged in the production of goods and services in Canada. It is a centrally maintained database containing detailed descriptions of most business entities operating within Canada. The BR includes all incorporated businesses, with or without employees. For unincorporated businesses, the BR includes all employers with businesses, and businesses with no employees with annual sales that have a Goods and Services Tax (GST) or annual revenue that declares individual taxes.  annual sales greater than $30,000 that have a Goods and Services Tax (GST) account (the BR does not include unincorporated businesses with no employees and with annual sales less than $30,000).

The businesses on the BR are represented by a hierarchical structure with four levels, with the statistical enterprise at the top, followed by the statistical company, the statistical establishment and the statistical location. An enterprise can be linked to one or more statistical companies, a statistical company can be linked to one or more statistical establishments, and a statistical establishment to one or more statistical locations.

The target population for the MRTS consists of all statistical establishments on the BR that are classified to the retail sector using the North American Industry Classification System (NAICS) (approximately 200,000 establishments). The NAICS code range for the retail sector is 441100 to 453999. A statistical establishment is the production entity or the smallest grouping of production entities which: produces a homogeneous set of goods or services; does not cross provincial boundaries; and provides data on the value of output, together with the cost of principal intermediate inputs used, along with the cost and quantity of labour used to produce the output. The production entity is the physical unit where the business operations are carried out. It must have a civic address and dedicated labour.

The exclusions to the target population are ancillary establishments (producers of services in support of the activity of producing goods and services for the market of more than one establishment within the enterprise, and serves as a cost centre or a discretionary expense centre for which data on all its costs including labour and depreciation can be reported by the business), future establishments, establishments with a missing or a zero gross business income (GBI) value on the BR and establishments in the following non-covered NAICS:

  • 4541 (electronic shopping and mail-order houses)
  • 4542 (vending machine operators)
  • 45431 (fuel dealers)
  • 45439 (other direct selling establishments)

4. Sampling

The MRTS sample consists of 10,000 groups of establishments (clusters) classified to the Retail Trade sector selected from the Statistics Canada Business Register. A cluster of establishments is defined as all establishments belonging to a statistical enterprise that are in the same industrial group and geographical region. The MRTS uses a stratified design with simple random sample selection in each stratum. The stratification is done by industry groups (the mainly, but not only four digit level NAICS), and the geographical regions consisting of the provinces and territories, as well as three provincial sub-regions. We further stratify the population by size.

The size measure is created using a combination of independent survey data and three administrative variables: the annual profiled revenue, the GST sales expressed on an annual basis, and the declared tax revenue (T1 or T2). The size strata consist of one take-all (census), at most, two take-some (partially sampled) strata, and one take-none (non-sampled) stratum. Take-none strata serve to reduce respondent burden by excluding the smaller businesses from the surveyed population. These businesses should represent at most ten percent of total sales. Instead of sending questionnaires to these businesses, the estimates are produced through the use of administrative data.

The sample was allocated optimally in order to reach target coefficients of variation at the national, provincial/territorial, industrial, and industrial groups by province/territory levels. The sample was also inflated to compensate for dead, non-responding, and misclassified units.

MRTS is a repeated survey with maximisation of monthly sample overlap. The sample is kept month after month, and every month new units are added (births) to the sample.  MRTS births, i.e., new clusters of establishment(s), are identified every month via the BR’s latest universe. They are stratified according to the same criteria as the initial population. A sample of these births is selected according to the sampling fraction of the stratum to which they belong and is added to the monthly sample. Deaths occur on a monthly basis. A death can be a cluster of establishment(s) that have ceased their activities (out-of-business) or whose major activities are no longer in retail trade (out-of-scope). The status of these businesses is updated on the BR using administrative sources and survey feedback, including feedback from the MRTS. Methods to treat dead units and misclassified units are part of the sample and population update procedures.

5. Questionnaire design

The Monthly Retail Trade Survey incorporates the following sub-surveys:

Monthly Retail Trade Survey - R8

Monthly Retail Trade Survey (with inventories) – R8

Survey of Sales and Inventories of Alcoholic Beverages

The questionnaires collect monthly data on retail sales and the number of trading locations by province or territory and inventories of goods owned and intended for resale from a sample of retailers. The items on the questionnaires have remained unchanged for several years. For the 2004 redesign, the general questionnaires were subject to cosmetic changes only. The questionnaire for Sales and Inventories of Alcoholic Beverages underwent more extensive changes. The modifications were discussed withstakeholders and the respondents were given an opportunity to comment before the new questionnaire was finalized. If further changes are needed to any of the questionnaires, proposed changes would go through a review committee and a field test with respondents and data users to ensure its relevancy.

6. Response and non-response

6.1. Response and non-response

Despite the best efforts of survey managers and operations staff to maximize response in the MRTS, some non-response will occur. For statistical establishments to be classified as responding, the degree of partial response (where an accurate response is obtained for only some of the questions asked a respondent) must meet a minimum threshold level below which the response would be rejected and considered a unit non-response.  In such an instance, the business is classified as not having responded at all.

Non-response has two effects on data: first it introduces bias in estimates when non-respondents differ from respondents in the characteristics measured; and second, it contributes to an increase in the sampling variance of estimates because the effective sample size is reduced from that originally sought.

The degree to which efforts are made to get a response from a non-respondent is based on budget and time constraints, its impact on the overall quality and the risk of non-response bias.

The main method to reduce the impact of non-response at sampling is to inflate the sample size through the use of over-sampling rates that have been determined from similar surveys.

Besides the methods to reduce the impact of non-response at sampling and collection, the non-responses to the survey that do occur are treated through imputation. In order to measure the amount of non-response that occurs each month, various response rates are calculated. For a given reference month, the estimation process is run at least twice (a preliminary and a revised run). Between each run, respondent data can be identified as unusable and imputed values can be corrected through respondent data. As a consequence, response rates are computed following each run of the estimation process.

For the MRTS, two types of rates are calculated (un-weighted and weighted). In order to assess the efficiency of the collection process, un-weighted response rates are calculated. Weighted rates, using the estimation weight and the value for the variable of interest, assess the quality of estimation. Within each of these types of rates, there are distinct rates for units that are surveyed and for units that are only modeled from administrative data that has been extracted from GST files.

To get a better picture of the success of the collection process, two un-weighted rates called the ‘collection results rate’ and the ‘extraction results rate’ are computed. They are computed by dividing the number of respondents by the number of units that we tried to contact or tried to receive extracted data for them. Non-monthly reporters (respondents with special reporting arrangements where they do not report every month but for whom actual data is available in subsequent revisions) are excluded from both the numerator and denominator for the months where no contact is performed.

In summary, the various response rates are calculated as follows:

Weighted rates:

Survey Response rate (estimation) =
Sum of weighted sales of units with response status i / Sum of survey weighted sales

where i = units that have either reported data that will be used in estimation or are converted refusals, or have reported data that has not yet been resolved for estimation.

Admin Response rate (estimation) =
Sum of weighted sales of units with response status ii / Sum of administrative weighted sales

where ii = units that have data that was extracted from administrative files and are usable for estimation.

Total Response rate (estimation) =
Sum of weighted sales of units with response status i or response status ii / Sum of all weighted sales

Un-weighted rates:

Survey Response rate (collection) =
Number of questionnaires with response status iii/ Number of questionnaires with response status iv

where iii = units that have either reported data (unresolved, used or not used for estimation) or are converted refusals.

where iv = all of the above plus units that have refused to respond, units that were not contacted and other types of non-respondent units.

Admin Response rate (extraction) =
Number of questionnaires with response status vi/ Number of questionnaires with response status vii

where vi = in-scope units that have data (either usable or non-usable) that was extracted from administrative files

where vii = all of the above plus units that have refused to report to the administrative data source, units that were not contacted and other types of non-respondent units.

(% of questionnaire collected over all in-scope questionnaires)

Collection Results Rate =
Number of questionnaires with response status iii / Number of questionnaires with response status viii

where iii = same as iii defined above

where viii = same as iv except for the exclusion of units that were contacted because their response is unavailable for a particular month since they are non-monthly reporters.

Extraction Results Rate =
Number of questionnaires with response status ix / Number of questionnaires with response status vii

where ix = same as vi with the addition of extracted units that have been imputed or were out of scope

where vii = same as vii defined above

(% of questionnaires collected over all questionnaire in-scope we tried to collect)

All the above weighted and un-weighted rates are provided at the industrial group, geography and size group level or for any combination of these levels.

Use of Administrative Data

Managing response burden is an ongoing challenge for Statistics Canada. In an attempt to alleviate response burden and survey costs, especially for smaller businesses, the MRTS has reduced the number of simple establishments in the sample that are surveyed directly and instead derives sales data for these establishments from Goods and Service Tax (GST) files using a statistical model. The model accounts for differences between sales and revenue (reported for GST purposes) as well as for the time lag between the survey reference period and the reference period of the GST file.

For more information on the methodology used for modeling sales from administrative data sources, refer to ‘Monthly Retail Trade Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

Table 1 contains the weighted response rates for all industry groups as well as for total retail trade for each province and territory. For more detailed weighted response rates, please contact the Marketing and Dissemination Section at (613) 951-3549, toll free: 1-877-421-3067 or by e-mail at retailinfo@statcan.

6.2. Methods used to reduce non-response at collection

Significant effort is spent trying to minimize non-response during collection. Methods used, among others, are interviewer techniques such as probing and persuasion, repeated re-scheduling and call-backs to obtain the information, and procedures dealing with how to handle non-compliant (refusal) respondents.

If data are unavailable at the time of collection, a respondent's best estimates are also accepted, and are subsequently revised once the actual data become available.

To minimize total non-response for all variables, partial responses are accepted. In addition, questionnaires are customized for the collection of certain variables, such as inventory, so that collection is timed for those months when the data are available.

Finally, to build trust and rapport between the interviewers and respondents, cases are generally assigned to the same interviewer each month. This action establishes a personal relationship between interviewer and respondent, and builds respondent trust.

7. Data collection and capture operations

Collection of the data is performed by Statistics Canada’s Regional Offices.

Table 1: Weighted response rates by NAICS, for all provinces and territories: June 2016
Table summary
This table displays the results of Table 1: Weighted response rates by NAICS Weighted Response Rates, calculated using Total, Survey and Administrative units of measure (appearing as column headers).
  Weighted Response Rates
Total Survey Administrative
NAICS - Canada  
Motor Vehicle and Parts Dealers 91.2 91.9 57.1
Automobile Dealers 93.2 93.5 56.7
New Car Dealers 94.8 94.8 Note ...: not applicable
Used Car Dealers 66.6 68.1 56.7
Other Motor Vehicle Dealers 70.1 70.6 64.5
Automotive Parts, Accessories and Tire Stores 84.6 89.6 49.7
Furniture and Home Furnishings Stores 72.3 75.1 42.6
Furniture Stores 69.2 69.4 64.3
Home Furnishings Stores 77.8 86.8 31.4
Electronics and Appliance Stores 89.8 90.3 69.2
Building Material and Garden Equipment Dealers 86.9 89.7 43.6
Food and Beverage Stores 85.5 87.9 50.7
Grocery Stores 90.5 93.4 53.1
Grocery (except Convenience) Stores 92.0 94.6 56.4
Convenience Stores 70.6 76.6 29.7
Specialty Food Stores 53.7 55.4 44.1
Beer, Wine and Liquor Stores 75.1 76.1 33.2
Health and Personal Care Stores 83.8 84.3 75.8
Gasoline Stations 74.5 75.8 55.3
Clothing and Clothing Accessories Stores 87.1 88.0 46.3
Clothing Stores 88.3 89.1 48.6
Shoe Stores 82.0 83.1 9.7
Jewellery, Luggage and Leather Goods Stores 82.8 84.4 53.0
Sporting Goods, Hobby, Book and Music Stores 84.8 90.5 28.6
General Merchandise Stores 98.6 99.2 24.4
Department Stores 100.0 100.0 Note ...: not applicable
Other general merchandise stores 97.8 98.7 24.4
Miscellaneous Store Retailers 63.4 66.0 35.6
Total 87.1 88.6 51.2
Regions  
Newfoundland and Labrador 85.5 86.1 62.0
Prince Edward Island 80.7 81.1 50.2
Nova Scotia 89.0 90.1 53.2
New Brunswick 83.8 85.2 53.6
Québec 83.8 85.7 49.6
Ontario 88.3 90.4 43.7
Manitoba 80.4 80.6 68.1
Saskatchewan 89.2 89.6 76.7
Alberta 88.2 89.3 61.2
British Columbia 89.7 90.9 57.0
Yukon Territory 73.0 73.0 Note ...: not applicable
Northwest Territories 87.3 87.3 Note ...: not applicable
Nunavut 92.9 92.9 Note ...: not applicable


Weighted Response Rates

Respondents are sent a questionnaire or are contacted by telephone to obtain their sales and inventory values, as well as to confirm the opening or closing of business trading locations. Collection of the data begins approximately 7 working days after the end of the reference month and continues for the duration of that month.

New entrants to the survey are introduced to the survey via an introductory letter that informs the respondent that a representative of Statistics Canada will be calling. This call is to introduce the respondent to the survey, confirm the respondent's business activity, establish and begin data collection, as well as to answer any questions that the respondent may have.

8. Editing

Data editing is the application of checks to detect missing, invalid or inconsistent entries or to point to data records that are potentially in error. In the survey process for the MRTS, data editing is done at two different time periods.

First of all, editing is done during data collection. Once data are collected via the telephone, or via the receipt of completed mail-in questionnaires, the data are captured using customized data capture applications. All data are subjected to data editing. Edits during data collection are referred to as field edits and generally consist of validity and some simple consistency edits. They are used to detect mistakes made during the interview by the respondent or the interviewer and to identify missing information during collection in order to reduce the need for follow-up later on. Another purpose of the field edits is to clean up responses. In the MRTS, the current month’s responses are edited against the respondent’s previous month’s responses and/or the previous year’s responses for the current month. Field edits are also used to identify problems with data collection procedures and the design of the questionnaire, as well as the need for more interviewer training.

Follow-up with respondents occurs to validate potential erroneous data following any failed preliminary edit check of the data. Once validated, the collected data is regularly transmitted to the head office in Ottawa.

Secondly, editing known as statistical editing is also done after data collection and this is more empirical in nature. Statistical editing is run prior to imputation in order to identify the data that will be used as a basis to impute non-respondents. Large outliers that could disrupt a monthly trend are excluded from trend calculations by the statistical edits. It should be noted that adjustments are not made at this stage to correct the reported outliers.

The first step in the statistical editing is to identify which responses will be subjected to the statistical edit rules. Reported data for the current reference month will go through various edit checks.

The first set of edit checks is based on the Hidiriglou-Berthelot method whereby a ratio of the respondent’s current month data over historical (last month, same month last year) or auxiliary data is analyzed. When the respondent’s ratio differs significantly from ratios of respondents who are similar in terms of industry and/or geography group, the response is deemed an outlier.

The second set of edits consists of an edit known as the share of market edit. With this method, one is able to edit all respondents, even those where historical and auxiliary data is unavailable. The method relies on current month data only. Therefore, within a group of respondents, that are similar in terms of industrial group and/or geography, if the weighted contribution of a respondent to the group’s total is too large, it will be flagged as an outlier.

For edit checks based on the Hidiriglou-Berthelot method, data that are flagged as an outlier will not be included in the imputation models (those based on ratios). Also, data that are flagged as outliers in the share of market edit will not be included in the imputation models where means and medians are calculated to impute for responses that have no historical responses.

In conjunction with the statistical editing after data collection of reported data, there is also error detection done on the extracted GST data. Modeled data based on the GST are also subject to an extensive series of processing steps which thoroughly verify each record that is the basis for the model as well as the record being modeled. Edits are performed at a more aggregate level (industry by geography level) to detect records which deviate from the expected range, either by exhibiting large month-to-month change, or differing significantly from the remaining units. All data which fail these edits are subject to manual inspection and possible corrective action.

9. Imputation

Imputation in the MRTS is the process used to assign replacement values for missing data. This is done by assigning values when they are missing on the record being edited to ensure that estimates are of high quality and that a plausible, internal consistency is created. Due to concerns of response burden, cost and timeliness, it is generally impossible to do all follow-ups with the respondents in order to resolve missing responses. Since it is desirable to produce a complete and consistent microdata file, imputation is used to handle the remaining missing cases.

In the MRTS, imputation is based on historical data or administrative data (GST sales). The appropriate method is selected according to a strategy that is based on whether historical data is available, auxiliary data is available and/or which reference month is being processed.

There are three types of historical imputation methods. The first type is a general trend that uses one historical data source (previous month, data from next month or data from same month previous year). The second type is a regression model where data from previous month and same month, previous year are used simultaneously. The third type uses the historical data as a direct replacement value for a non-respondent. Depending upon the particular reference month, there is an order of preference that exists so that top quality imputation can result. The historical imputation method that was labelled as the third type above is always the last option in the order for each reference month.

The imputation method using administrative data is automatically selected when historical information is unavailable for a non-respondent. Trends are then applied to the administrative data source (monthly size) depending on whether the structure is simple, e.g. enterprises with only one establishment, or the unit has a more complex structure.

10. Estimation

Estimation is a process that approximates unknown population parameters using only part of the population that is included in a sample. Inferences about these unknown parameters are then made, using the sample data and associated survey design. This stage uses Statistics Canada's Generalized Estimation System (GES).

For retail sales, the population is divided into a survey portion (take-all and take-some strata) and a non-survey portion (take-none stratum). From the sample that is drawn from the survey portion, an estimate for the population is determined through the use of a Horvitz-Thompson estimator where responses for sales are weighted by using the inverses of the inclusion probabilities of the sampled units. Such weights (called sampling weights) can be interpreted as the number of times that each sampled unit should be replicated to represent the entire population. The calculated weighted sales values are summed by domain, to produce the total sales estimates by each industrial group / geographic area combination. A domain is defined as the most recent classification values available from the BR for the unit and the survey reference period. These domains may differ from the original sampling strata because units may have changed size, industry or location. Changes in classification are reflected immediately in the estimates and do not accumulate over time. For the non-survey portion, the sales are estimated with statistical models using monthly GST sales.

For more information on the methodology for modeling sales from administrative data sources which also contributes to the estimates of the survey portion, refer to ‘Monthly Retail Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

The measure of precision used for the MRTS to evaluate the quality of a population parameter estimate and to obtain valid inferences is the variance. The variance from the survey portion is derived directly from a stratified simple random sample without replacement.

Sample estimates may differ from the expected value of the estimates. However, since the estimate is based on a probability sample, the variability of the sample estimate with respect to its expected value can be measured. The variance of an estimate is a measure of the precision of the sample estimate and is defined as the average, over all possible samples, of the squared difference of the estimate from its expected value.

11. Revisions and seasonal adjustment

Revisions in the raw data are required to correct known non-sampling errors. These normally include replacing imputed data with reported data, corrections to previously reported data, and estimates for new births that were not known at the time of the original estimates.

Raw data are revised, on a monthly basis, for the month immediately prior to the current reference month being published. That is, when data for December are being published for the first time, there will also be revisions, if necessary, to the raw data for November. In addition, revisions are made once a year, with the initial release of the February data, for all months in the previous years. The purpose is to correct any significant problems that have been found that apply for an extended period. The actual period of revision depends on the nature of the problem identified, but rarely exceeds three years. The revision period can be extended when historical revisions or restratitfication are done.

Retail trade data are seasonally adjusted using the X12-ARIMA method. This consists of extrapolating a year's worth of raw data with the ARIMA model (auto-regressive integrated moving average model), and of seasonally adjusting the raw time series. Finally, the annual totals of the seasonally adjusted series are forced to the annual totals of the original series.

The seasonally adjusted data also need to be revised. In part, they need to reflect the revisions identified for the raw data. Also, the seasonally adjusted estimates are calculated using X-12-ARIMA, and are sensitive to the most recent values reported in the raw data. For this reason, with the release of each month of new data, the seasonally adjusted values for the previous three months are revised.  A seasonally adjusted time series is a time series that has been modified to eliminate the effect of seasonal and calendar influences. For this reason, the seasonally adjusted data allows for more meaningful comparisons of economic conditions from month to month.

Once a year, seasonal adjustments options are reviewed to take into account the most recent data. Revised seasonally adjusted estimates for each month in the previous years are released at the same time as the annual revision to the raw data. The actual period of revision depends on the number years the raw data was revised.

12. Data quality evaluation

The methodology of this survey has been designed to control errors and to reduce their potential effects on estimates. However, the survey results remain subject to errors, of which sampling error is only one component of the total survey error. Sampling error results when observations are made only on a sample and not on the entire population. All other errors arising from the various phases of a survey are referred to as nonsampling errors. For example, these types of errors can occur when a respondent provides incorrect information or does not answer certain questions; when a unit in the target population is omitted or covered more than once; when GST data for records being modeled for a particular month are not representative of the actual record for various reasons; when a unit that is out of scope for the survey is included by mistake or when errors occur in data processing, such as coding or capture errors.

Prior to publication, combined survey results are analyzed for comparability; in general, this includes a detailed review of individual responses (especially for large businesses), general economic conditions and historical trends.

A common measure of data quality for surveys is the coefficient of variation (CV). The coefficient of variation, defined as the standard error divided by the sample estimate, is a measure of precision in relative terms. Since the coefficient of variation is calculated from responses of individual units, it also measures some non-sampling errors.

The formula used to calculate coefficients of variation (CV) as percentages is:

CV (X) = S(X) * 100% / X
where X denotes the estimate and S(X) denotes the standard error of X.

Confidence intervals can be constructed around the estimates using the estimate and the CV. Thus, for our sample, it is possible to state with a given level of confidence that the expected value will fall within the confidence interval constructed around the estimate. For example, if an estimate of $12,000,000 has a CV of 2%, the standard error will be $240,000 (the estimate multiplied by the CV). It can be stated with 68% confidence that the expected values will fall within the interval whose length equals the standard deviation about the estimate, i.e. between $11,760,000 and $12,240,000.

Alternatively, it can be stated with 95% confidence that the expected value will fall within the interval whose length equals two standard deviations about the estimate, i.e. between $11,520,000 and $12,480,000.

Finally, due to the small contribution of the non-survey portion to the total estimates, bias in the non-survey portion has a negligible impact on the CVs. Therefore, the CV from the survey portion is used for the total estimate that is the summation of estimates from the surveyed and non-surveyed portions.

13. Disclosure control

Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.

Confidentiality analysis includes the detection of possible "direct disclosure", which occurs when the value in a tabulation cell is composed of a few respondents or when the cell is dominated by a few companies.

Monthly Retail Trade Survey (MRTS) Data Quality Statement

Objectives, uses and users
Concepts, variables and classifications
Coverage and frames
Sampling
Questionnaire design
Response and non-response
Data collection and capture operations
Editing
Imputation
Estimation
Revisions and seasonal adjustment
Data quality evaluation
Disclosure control

1. Objectives, uses and users

1.1. Objective

The Monthly Retail Trade Survey (MRTS) provides information on the performance of the retail trade sector on a monthly basis, and when combined with other statistics, represents an important indicator of the state of the Canadian economy.

1.2. Uses

The estimates provide a measure of the health and performance of the retail trade sector. Information collected is used to estimate level and monthly trend for retail sales. At the end of each year, the estimates provide a preliminary look at annual retail sales and performance.

1.3. Users

A variety of organizations, sector associations, and levels of government make use of the information. Retailers rely on the survey results to compare their performance against similar types of businesses, as well as for marketing purposes. Retail associations are able to monitor industry performance and promote their retail industries. Investors can monitor industry growth, which can result in better access to investment capital by retailers. Governments are able to understand the role of retailers in the economy, which aids in the development of policies and tax incentives. As an important industry in the Canadian economy, governments are able to better determine the overall health of the economy through the use of the estimates in the calculation of the nation’s Gross Domestic Product (GDP).

2. Concepts, variables and classifications

2.1. Concepts

The retail trade sector comprises establishments primarily engaged in retailing merchandise, generally without transformation, and rendering services incidental to the sale of merchandise.

The retailing process is the final step in the distribution of merchandise; retailers are therefore organized to sell merchandise in small quantities to the general public. This sector comprises two main types of retailers, that is, store and non-store retailers. The MRTS covers only store retailers. Their main characteristics are described below. Store retailers operate fixed point-of-sale locations, located and designed to attract a high volume of walk-in customers. In general, retail stores have extensive displays of merchandise and use mass-media advertising to attract customers. They typically sell merchandise to the general public for personal or household consumption, but some also serve business and institutional clients. These include establishments such as office supplies stores, computer and software stores, gasoline stations, building material dealers, plumbing supplies stores and electrical supplies stores.

In addition to selling merchandise, some types of store retailers are also engaged in the provision of after-sales services, such as repair and installation. For example, new automobile dealers, electronic and appliance stores and musical instrument and supplies stores often provide repair services, while floor covering stores and window treatment stores often provide installation services. As a general rule, establishments engaged in retailing merchandise and providing after sales services are classified in this sector. Catalogue sales showrooms, gasoline service stations, and mobile home dealers are treated as store retailers.

2.2. Variables

Sales are defined as the sales of all goods purchased for resale, net of returns and discounts. This includes commission revenue and fees earned from selling goods and services on account of others, such as selling lottery tickets, bus tickets, and phone cards. It also includes parts and labour revenue from repair and maintenance; revenue from rental and leasing of goods and equipment; revenues from services, including food services; sales of goods manufactured as a secondary activity; and the proprietor’s withdrawals, at retail, of goods for personal use. Other revenue from rental of real estate, placement fees, operating subsidies, grants, royalties and franchise fees are excluded.

Trading Location is the physical location(s) in which business activity is conducted in each province and territory, and for which sales are credited or recognized in the financial records of the company. For retailers, this would normally be a store.

Constant Dollars: The value of retail trade is measured in two ways; including the effects of price change on sales and net of the effects of price change. The first measure is referred to as retail trade in current dollars and the latter as retail trade in constant dollars. The method of calculating the current dollar estimate is to aggregate the weighted value of sales for all retail outlets. The method of calculating the constant dollar estimate is to first adjust the sales values to a base year, using the Consumer Price Index, and then sum up the resulting values.

2.3. Classification

The Monthly Retail Trade Survey is based on the definition of retail trade under the NAICS (North American Industry Classification System). NAICS is the agreed upon common framework for the production of comparable statistics by the statistical agencies of Canada, Mexico and the United States. The agreement defines the boundaries of twenty sectors. NAICS is based on a production-oriented, or supply based conceptual framework in that establishments are groups into industries according to similarity in production processes used to produce goods and services.

Estimates appear for 21 industries based on special aggregations of the 2012 North American Industry Classification System (NAICS) industries. The 21 industries are further aggregated to 11 sub-sectors.

Geographically, sales estimates are produced for Canada and each province and territory.

3. Coverage and frames

Statistics Canada’s Business Register ( BR) provides the frame for the Monthly Retail Trade Survey. The BR is a structured list of businesses engaged in the production of goods and services in Canada. It is a centrally maintained database containing detailed descriptions of most business entities operating within Canada. The BR includes all incorporated businesses, with or without employees. For unincorporated businesses, the BR includes all employers with businesses, and businesses with no employees with annual sales that have a Goods and Services Tax (GST) or annual revenue that declares individual taxes.  annual sales greater than $30,000 that have a Goods and Services Tax (GST) account (the BR does not include unincorporated businesses with no employees and with annual sales less than $30,000).

The businesses on the BR are represented by a hierarchical structure with four levels, with the statistical enterprise at the top, followed by the statistical company, the statistical establishment and the statistical location. An enterprise can be linked to one or more statistical companies, a statistical company can be linked to one or more statistical establishments, and a statistical establishment to one or more statistical locations.

The target population for the MRTS consists of all statistical establishments on the BR that are classified to the retail sector using the North American Industry Classification System (NAICS) (approximately 200,000 establishments). The NAICS code range for the retail sector is 441100 to 453999. A statistical establishment is the production entity or the smallest grouping of production entities which: produces a homogeneous set of goods or services; does not cross provincial boundaries; and provides data on the value of output, together with the cost of principal intermediate inputs used, along with the cost and quantity of labour used to produce the output. The production entity is the physical unit where the business operations are carried out. It must have a civic address and dedicated labour.

The exclusions to the target population are ancillary establishments (producers of services in support of the activity of producing goods and services for the market of more than one establishment within the enterprise, and serves as a cost centre or a discretionary expense centre for which data on all its costs including labour and depreciation can be reported by the business), future establishments, establishments with a missing or a zero gross business income (GBI) value on the BR and establishments in the following non-covered NAICS:

  • 4541 (electronic shopping and mail-order houses)
  • 4542 (vending machine operators)
  • 45431 (fuel dealers)
  • 45439 (other direct selling establishments)

4. Sampling

The MRTS sample consists of 10,000 groups of establishments (clusters) classified to the Retail Trade sector selected from the Statistics Canada Business Register. A cluster of establishments is defined as all establishments belonging to a statistical enterprise that are in the same industrial group and geographical region. The MRTS uses a stratified design with simple random sample selection in each stratum. The stratification is done by industry groups (the mainly, but not only four digit level NAICS), and the geographical regions consisting of the provinces and territories, as well as three provincial sub-regions. We further stratify the population by size.

The size measure is created using a combination of independent survey data and three administrative variables: the annual profiled revenue, the GST sales expressed on an annual basis, and the declared tax revenue (T1 or T2). The size strata consist of one take-all (census), at most, two take-some (partially sampled) strata, and one take-none (non-sampled) stratum. Take-none strata serve to reduce respondent burden by excluding the smaller businesses from the surveyed population. These businesses should represent at most ten percent of total sales. Instead of sending questionnaires to these businesses, the estimates are produced through the use of administrative data.

The sample was allocated optimally in order to reach target coefficients of variation at the national, provincial/territorial, industrial, and industrial groups by province/territory levels. The sample was also inflated to compensate for dead, non-responding, and misclassified units.

MRTS is a repeated survey with maximisation of monthly sample overlap. The sample is kept month after month, and every month new units are added (births) to the sample.  MRTS births, i.e., new clusters of establishment(s), are identified every month via the BR’s latest universe. They are stratified according to the same criteria as the initial population. A sample of these births is selected according to the sampling fraction of the stratum to which they belong and is added to the monthly sample. Deaths occur on a monthly basis. A death can be a cluster of establishment(s) that have ceased their activities (out-of-business) or whose major activities are no longer in retail trade (out-of-scope). The status of these businesses is updated on the BR using administrative sources and survey feedback, including feedback from the MRTS. Methods to treat dead units and misclassified units are part of the sample and population update procedures.

5. Questionnaire design

The Monthly Retail Trade Survey incorporates the following sub-surveys:

Monthly Retail Trade Survey - R8

Monthly Retail Trade Survey (with inventories) – R8

Survey of Sales and Inventories of Alcoholic Beverages

The questionnaires collect monthly data on retail sales and the number of trading locations by province or territory and inventories of goods owned and intended for resale from a sample of retailers. The items on the questionnaires have remained unchanged for several years. For the 2004 redesign, the general questionnaires were subject to cosmetic changes only. The questionnaire for Sales and Inventories of Alcoholic Beverages underwent more extensive changes. The modifications were discussed withstakeholders and the respondents were given an opportunity to comment before the new questionnaire was finalized. If further changes are needed to any of the questionnaires, proposed changes would go through a review committee and a field test with respondents and data users to ensure its relevancy.

6. Response and non-response

6.1. Response and non-response

Despite the best efforts of survey managers and operations staff to maximize response in the MRTS, some non-response will occur. For statistical establishments to be classified as responding, the degree of partial response (where an accurate response is obtained for only some of the questions asked a respondent) must meet a minimum threshold level below which the response would be rejected and considered a unit non-response.  In such an instance, the business is classified as not having responded at all.

Non-response has two effects on data: first it introduces bias in estimates when non-respondents differ from respondents in the characteristics measured; and second, it contributes to an increase in the sampling variance of estimates because the effective sample size is reduced from that originally sought.

The degree to which efforts are made to get a response from a non-respondent is based on budget and time constraints, its impact on the overall quality and the risk of non-response bias.

The main method to reduce the impact of non-response at sampling is to inflate the sample size through the use of over-sampling rates that have been determined from similar surveys.

Besides the methods to reduce the impact of non-response at sampling and collection, the non-responses to the survey that do occur are treated through imputation. In order to measure the amount of non-response that occurs each month, various response rates are calculated. For a given reference month, the estimation process is run at least twice (a preliminary and a revised run). Between each run, respondent data can be identified as unusable and imputed values can be corrected through respondent data. As a consequence, response rates are computed following each run of the estimation process.

For the MRTS, two types of rates are calculated (un-weighted and weighted). In order to assess the efficiency of the collection process, un-weighted response rates are calculated. Weighted rates, using the estimation weight and the value for the variable of interest, assess the quality of estimation. Within each of these types of rates, there are distinct rates for units that are surveyed and for units that are only modeled from administrative data that has been extracted from GST files.

To get a better picture of the success of the collection process, two un-weighted rates called the ‘collection results rate’ and the ‘extraction results rate’ are computed. They are computed by dividing the number of respondents by the number of units that we tried to contact or tried to receive extracted data for them. Non-monthly reporters (respondents with special reporting arrangements where they do not report every month but for whom actual data is available in subsequent revisions) are excluded from both the numerator and denominator for the months where no contact is performed.

In summary, the various response rates are calculated as follows:

Weighted rates:

Survey Response rate (estimation) =
Sum of weighted sales of units with response status i / Sum of survey weighted sales

where i = units that have either reported data that will be used in estimation or are converted refusals, or have reported data that has not yet been resolved for estimation.

Admin Response rate (estimation) =
Sum of weighted sales of units with response status ii / Sum of administrative weighted sales

where ii = units that have data that was extracted from administrative files and are usable for estimation.

Total Response rate (estimation) =
Sum of weighted sales of units with response status i or response status ii / Sum of all weighted sales

Un-weighted rates:

Survey Response rate (collection) =
Number of questionnaires with response status iii/ Number of questionnaires with response status iv

where iii = units that have either reported data (unresolved, used or not used for estimation) or are converted refusals.

where iv = all of the above plus units that have refused to respond, units that were not contacted and other types of non-respondent units.

Admin Response rate (extraction) =
Number of questionnaires with response status vi/ Number of questionnaires with response status vii

where vi = in-scope units that have data (either usable or non-usable) that was extracted from administrative files

where vii = all of the above plus units that have refused to report to the administrative data source, units that were not contacted and other types of non-respondent units.

(% of questionnaire collected over all in-scope questionnaires)

Collection Results Rate =
Number of questionnaires with response status iii / Number of questionnaires with response status viii

where iii = same as iii defined above

where viii = same as iv except for the exclusion of units that were contacted because their response is unavailable for a particular month since they are non-monthly reporters.

Extraction Results Rate =
Number of questionnaires with response status ix / Number of questionnaires with response status vii

where ix = same as vi with the addition of extracted units that have been imputed or were out of scope

where vii = same as vii defined above

(% of questionnaires collected over all questionnaire in-scope we tried to collect)

All the above weighted and un-weighted rates are provided at the industrial group, geography and size group level or for any combination of these levels.

Use of Administrative Data

Managing response burden is an ongoing challenge for Statistics Canada. In an attempt to alleviate response burden and survey costs, especially for smaller businesses, the MRTS has reduced the number of simple establishments in the sample that are surveyed directly and instead derives sales data for these establishments from Goods and Service Tax (GST) files using a statistical model. The model accounts for differences between sales and revenue (reported for GST purposes) as well as for the time lag between the survey reference period and the reference period of the GST file.

For more information on the methodology used for modeling sales from administrative data sources, refer to ‘Monthly Retail Trade Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

Table 1 contains the weighted response rates for all industry groups as well as for total retail trade for each province and territory. For more detailed weighted response rates, please contact the Marketing and Dissemination Section at (613) 951-3549, toll free: 1-877-421-3067 or by e-mail at retailinfo@statcan.

6.2. Methods used to reduce non-response at collection

Significant effort is spent trying to minimize non-response during collection. Methods used, among others, are interviewer techniques such as probing and persuasion, repeated re-scheduling and call-backs to obtain the information, and procedures dealing with how to handle non-compliant (refusal) respondents.

If data are unavailable at the time of collection, a respondent's best estimates are also accepted, and are subsequently revised once the actual data become available.

To minimize total non-response for all variables, partial responses are accepted. In addition, questionnaires are customized for the collection of certain variables, such as inventory, so that collection is timed for those months when the data are available.

Finally, to build trust and rapport between the interviewers and respondents, cases are generally assigned to the same interviewer each month. This action establishes a personal relationship between interviewer and respondent, and builds respondent trust.

7. Data collection and capture operations

Collection of the data is performed by Statistics Canada’s Regional Offices.

Table 1: Weighted response rates by NAICS, for all provinces and territories: May 2016
Table summary
This table displays the results of Table 1: Weighted response rates by NAICS Weighted Response Rates, calculated using Total, Survey and Administrative units of measure (appearing as column headers).
  Weighted Response Rates
Total Survey Administrative
NAICS - Canada  
Motor Vehicle and Parts Dealers 90.4 91.1 66.7
Automobile Dealers 91.9 92.2 69.0
New Car Dealers 93.3 93.3 Note ...: not applicable
Used Car Dealers 69.7 69.9 69.0
Other Motor Vehicle Dealers 76.7 78.7 65.7
Automotive Parts, Accessories and Tire Stores 84.1 86.6 64.7
Furniture and Home Furnishings Stores 87.8 90.8 59.7
Furniture Stores 91.4 92.3 75.0
Home Furnishings Stores 81.2 87.7 52.9
Electronics and Appliance Stores 82.9 83.3 65.6
Building Material and Garden Equipment Dealers 87.5 90.0 48.6
Food and Beverage Stores 83.8 86.1 51.4
Grocery Stores 88.2 90.8 54.3
Grocery (except Convenience) Stores 91.1 93.6 57.7
Convenience Stores 48.4 51.2 29.4
Specialty Food Stores 59.0 63.1 38.0
Beer, Wine and Liquor Stores 73.8 74.5 38.6
Health and Personal Care Stores 88.4 88.4 87.1
Gasoline Stations 76.9 78.6 52.7
Clothing and Clothing Accessories Stores 88.1 89.3 42.8
Clothing Stores 88.9 89.9 46.2
Shoe Stores 88.2 89.3 13.6
Jewellery, Luggage and Leather Goods Stores 80.7 83.2 42.0
Sporting Goods, Hobby, Book and Music Stores 87.9 92.6 31.1
General Merchandise Stores 99.3 99.4 90.2
Department Stores 100.0 100.0 Note ...: not applicable
Other general merchandise stores 98.9 99.0 90.2
Miscellaneous Store Retailers 77.9 82.9 28.5
Total 88.0 89.4 56.5
Regions  
Newfoundland and Labrador 82.0 82.9 48.2
Prince Edward Island 82.3 83.3 7.0
Nova Scotia 92.7 93.8 54.9
New Brunswick 84.3 85.5 52.8
Québec 89.7 91.5 57.7
Ontario 89.2 91.0 54.3
Manitoba 81.8 82.1 64.1
Saskatchewan 88.6 89.6 65.2
Alberta 86.9 88.1 64.4
British Columbia 85.1 86.4 48.9
Yukon Territory 78.4 78.4 Note ...: not applicable
Northwest Territories 82.5 82.5 Note ...: not applicable
Nunavut 94.0 94.0 Note ...: not applicable


Weighted Response Rates

Respondents are sent a questionnaire or are contacted by telephone to obtain their sales and inventory values, as well as to confirm the opening or closing of business trading locations. Collection of the data begins approximately 7 working days after the end of the reference month and continues for the duration of that month.

New entrants to the survey are introduced to the survey via an introductory letter that informs the respondent that a representative of Statistics Canada will be calling. This call is to introduce the respondent to the survey, confirm the respondent's business activity, establish and begin data collection, as well as to answer any questions that the respondent may have.

8. Editing

Data editing is the application of checks to detect missing, invalid or inconsistent entries or to point to data records that are potentially in error. In the survey process for the MRTS, data editing is done at two different time periods.

First of all, editing is done during data collection. Once data are collected via the telephone, or via the receipt of completed mail-in questionnaires, the data are captured using customized data capture applications. All data are subjected to data editing. Edits during data collection are referred to as field edits and generally consist of validity and some simple consistency edits. They are used to detect mistakes made during the interview by the respondent or the interviewer and to identify missing information during collection in order to reduce the need for follow-up later on. Another purpose of the field edits is to clean up responses. In the MRTS, the current month’s responses are edited against the respondent’s previous month’s responses and/or the previous year’s responses for the current month. Field edits are also used to identify problems with data collection procedures and the design of the questionnaire, as well as the need for more interviewer training.

Follow-up with respondents occurs to validate potential erroneous data following any failed preliminary edit check of the data. Once validated, the collected data is regularly transmitted to the head office in Ottawa.

Secondly, editing known as statistical editing is also done after data collection and this is more empirical in nature. Statistical editing is run prior to imputation in order to identify the data that will be used as a basis to impute non-respondents. Large outliers that could disrupt a monthly trend are excluded from trend calculations by the statistical edits. It should be noted that adjustments are not made at this stage to correct the reported outliers.

The first step in the statistical editing is to identify which responses will be subjected to the statistical edit rules. Reported data for the current reference month will go through various edit checks.

The first set of edit checks is based on the Hidiriglou-Berthelot method whereby a ratio of the respondent’s current month data over historical (last month, same month last year) or auxiliary data is analyzed. When the respondent’s ratio differs significantly from ratios of respondents who are similar in terms of industry and/or geography group, the response is deemed an outlier.

The second set of edits consists of an edit known as the share of market edit. With this method, one is able to edit all respondents, even those where historical and auxiliary data is unavailable. The method relies on current month data only. Therefore, within a group of respondents, that are similar in terms of industrial group and/or geography, if the weighted contribution of a respondent to the group’s total is too large, it will be flagged as an outlier.

For edit checks based on the Hidiriglou-Berthelot method, data that are flagged as an outlier will not be included in the imputation models (those based on ratios). Also, data that are flagged as outliers in the share of market edit will not be included in the imputation models where means and medians are calculated to impute for responses that have no historical responses.

In conjunction with the statistical editing after data collection of reported data, there is also error detection done on the extracted GST data. Modeled data based on the GST are also subject to an extensive series of processing steps which thoroughly verify each record that is the basis for the model as well as the record being modeled. Edits are performed at a more aggregate level (industry by geography level) to detect records which deviate from the expected range, either by exhibiting large month-to-month change, or differing significantly from the remaining units. All data which fail these edits are subject to manual inspection and possible corrective action.

9. Imputation

Imputation in the MRTS is the process used to assign replacement values for missing data. This is done by assigning values when they are missing on the record being edited to ensure that estimates are of high quality and that a plausible, internal consistency is created. Due to concerns of response burden, cost and timeliness, it is generally impossible to do all follow-ups with the respondents in order to resolve missing responses. Since it is desirable to produce a complete and consistent microdata file, imputation is used to handle the remaining missing cases.

In the MRTS, imputation is based on historical data or administrative data (GST sales). The appropriate method is selected according to a strategy that is based on whether historical data is available, auxiliary data is available and/or which reference month is being processed.

There are three types of historical imputation methods. The first type is a general trend that uses one historical data source (previous month, data from next month or data from same month previous year). The second type is a regression model where data from previous month and same month, previous year are used simultaneously. The third type uses the historical data as a direct replacement value for a non-respondent. Depending upon the particular reference month, there is an order of preference that exists so that top quality imputation can result. The historical imputation method that was labelled as the third type above is always the last option in the order for each reference month.

The imputation method using administrative data is automatically selected when historical information is unavailable for a non-respondent. Trends are then applied to the administrative data source (monthly size) depending on whether the structure is simple, e.g. enterprises with only one establishment, or the unit has a more complex structure.

10. Estimation

Estimation is a process that approximates unknown population parameters using only part of the population that is included in a sample. Inferences about these unknown parameters are then made, using the sample data and associated survey design. This stage uses Statistics Canada's Generalized Estimation System (GES).

For retail sales, the population is divided into a survey portion (take-all and take-some strata) and a non-survey portion (take-none stratum). From the sample that is drawn from the survey portion, an estimate for the population is determined through the use of a Horvitz-Thompson estimator where responses for sales are weighted by using the inverses of the inclusion probabilities of the sampled units. Such weights (called sampling weights) can be interpreted as the number of times that each sampled unit should be replicated to represent the entire population. The calculated weighted sales values are summed by domain, to produce the total sales estimates by each industrial group / geographic area combination. A domain is defined as the most recent classification values available from the BR for the unit and the survey reference period. These domains may differ from the original sampling strata because units may have changed size, industry or location. Changes in classification are reflected immediately in the estimates and do not accumulate over time. For the non-survey portion, the sales are estimated with statistical models using monthly GST sales.

For more information on the methodology for modeling sales from administrative data sources which also contributes to the estimates of the survey portion, refer to ‘Monthly Retail Survey: Use of Administrative Data’ under ‘Documentation’ of the IMDB.

The measure of precision used for the MRTS to evaluate the quality of a population parameter estimate and to obtain valid inferences is the variance. The variance from the survey portion is derived directly from a stratified simple random sample without replacement.

Sample estimates may differ from the expected value of the estimates. However, since the estimate is based on a probability sample, the variability of the sample estimate with respect to its expected value can be measured. The variance of an estimate is a measure of the precision of the sample estimate and is defined as the average, over all possible samples, of the squared difference of the estimate from its expected value.

11. Revisions and seasonal adjustment

Revisions in the raw data are required to correct known non-sampling errors. These normally include replacing imputed data with reported data, corrections to previously reported data, and estimates for new births that were not known at the time of the original estimates.

Raw data are revised, on a monthly basis, for the month immediately prior to the current reference month being published. That is, when data for December are being published for the first time, there will also be revisions, if necessary, to the raw data for November. In addition, revisions are made once a year, with the initial release of the February data, for all months in the previous years. The purpose is to correct any significant problems that have been found that apply for an extended period. The actual period of revision depends on the nature of the problem identified, but rarely exceeds three years. The revision period can be extended when historical revisions or restratitfication are done.

Retail trade data are seasonally adjusted using the X12-ARIMA method. This consists of extrapolating a year's worth of raw data with the ARIMA model (auto-regressive integrated moving average model), and of seasonally adjusting the raw time series. Finally, the annual totals of the seasonally adjusted series are forced to the annual totals of the original series.

The seasonally adjusted data also need to be revised. In part, they need to reflect the revisions identified for the raw data. Also, the seasonally adjusted estimates are calculated using X-12-ARIMA, and are sensitive to the most recent values reported in the raw data. For this reason, with the release of each month of new data, the seasonally adjusted values for the previous three months are revised.  A seasonally adjusted time series is a time series that has been modified to eliminate the effect of seasonal and calendar influences. For this reason, the seasonally adjusted data allows for more meaningful comparisons of economic conditions from month to month.

Once a year, seasonal adjustments options are reviewed to take into account the most recent data. Revised seasonally adjusted estimates for each month in the previous years are released at the same time as the annual revision to the raw data. The actual period of revision depends on the number years the raw data was revised.

12. Data quality evaluation

The methodology of this survey has been designed to control errors and to reduce their potential effects on estimates. However, the survey results remain subject to errors, of which sampling error is only one component of the total survey error. Sampling error results when observations are made only on a sample and not on the entire population. All other errors arising from the various phases of a survey are referred to as nonsampling errors. For example, these types of errors can occur when a respondent provides incorrect information or does not answer certain questions; when a unit in the target population is omitted or covered more than once; when GST data for records being modeled for a particular month are not representative of the actual record for various reasons; when a unit that is out of scope for the survey is included by mistake or when errors occur in data processing, such as coding or capture errors.

Prior to publication, combined survey results are analyzed for comparability; in general, this includes a detailed review of individual responses (especially for large businesses), general economic conditions and historical trends.

A common measure of data quality for surveys is the coefficient of variation (CV). The coefficient of variation, defined as the standard error divided by the sample estimate, is a measure of precision in relative terms. Since the coefficient of variation is calculated from responses of individual units, it also measures some non-sampling errors.

The formula used to calculate coefficients of variation (CV) as percentages is:

CV (X) = S(X) * 100% / X
where X denotes the estimate and S(X) denotes the standard error of X.

Confidence intervals can be constructed around the estimates using the estimate and the CV. Thus, for our sample, it is possible to state with a given level of confidence that the expected value will fall within the confidence interval constructed around the estimate. For example, if an estimate of $12,000,000 has a CV of 2%, the standard error will be $240,000 (the estimate multiplied by the CV). It can be stated with 68% confidence that the expected values will fall within the interval whose length equals the standard deviation about the estimate, i.e. between $11,760,000 and $12,240,000.

Alternatively, it can be stated with 95% confidence that the expected value will fall within the interval whose length equals two standard deviations about the estimate, i.e. between $11,520,000 and $12,480,000.

Finally, due to the small contribution of the non-survey portion to the total estimates, bias in the non-survey portion has a negligible impact on the CVs. Therefore, the CV from the survey portion is used for the total estimate that is the summation of estimates from the surveyed and non-surveyed portions.

13. Disclosure control

Statistics Canada is prohibited by law from releasing any data which would divulge information obtained under the Statistics Act that relates to any identifiable person, business or organization without the prior knowledge or the consent in writing of that person, business or organization. Various confidentiality rules are applied to all data that are released or published to prevent the publication or disclosure of any information deemed confidential. If necessary, data are suppressed to prevent direct or residual disclosure of identifiable data.

Confidentiality analysis includes the detection of possible "direct disclosure", which occurs when the value in a tabulation cell is composed of a few respondents or when the cell is dominated by a few companies.