The Treasury

Global Navigation

Personal tools

Treasury
Publication

The Elasticity of Taxable Income in New Zealand WP 12/03

Appendix A: The Data

This appendix describes the data used in the analysis. The database was constructed by randomly sampling Inland Revenue's individual taxpayer population. It covers the period from 1994 to 2008. The number of taxpayers in the random sample rises from 128,440 in 1994 to 162,651 in 2008. The random sample is weighted to match the individual taxpayer population, which increased from 2,761,000 taxpayers in 1994 to 3,423,421 in 2008. The database includes people with wage/salary income (including taxable welfare benefits) and people who filed an IR5 or IR3 tax return or received a personal tax summary (PTS). It excludes people with no personal taxable income unless they filed.

The requirement for wage and salary earners to file an IR5 tax return was in part based on earnings over a threshold. This threshold was eased during the 1990s from $20,000 in the early 1990s, to $38,000 by 1999. However a significant administrative change in 2000 removed the IR5 tax return entirely and replaced it with the PTS, a pre-populated taxpayer square-up based on data collected from employers during the year. The income threshold was removed, with a consequential reduction in the number of taxpayers required to square-up. This has caused a structural break in the income tax data collected, especially on dividend and interest income. Taxpayers who previously filed an IR5 were not required to square up via a PTS if their only income was from salary and wages or from investments with the correct amount of tax deducted at source, or where the investment income was below a certain threshold.

Individual taxpayer information is gathered from the following sources/returns:

  1. Client registration
  2. Individual tax return IR3
  3. Personal tax summary (PTS) from 2000 onwards
  4. Salary/wage earner income tax return IR5 (pre-PTS) from 1994 to 1999
  5. Employer monthly schedule (EMS) from 2000 onwards
  6. Annual tax deduction certificate (TDC) from 1994 to 1999

Taxpayers are broadly categorised into two groups based on a taxpayer's 'entity class'; a client registration feature: (a) salary/wage earner with a salary/wage (SW) entity class; and (b) other taxpayers with a non-SW entity class (for example, self-employed or salary/wage earners with other income from rental properties or overseas investments). The selection method is as follows:

  1. A random two per cent of total salary/wage earners – with the random sample selected from the last two digits of their IRD number;
  2. A random ten per cent of total other individual taxpayers – also based on the last two digits of the IRD number with the chosen range including the two per cent sample above.

A taxpayer generally has one IRD number and the same entity class. However, a minority of taxpayers could have a second IRD number due to bankruptcy or they might retain the same IRD number but change entity class over time. This means that the above selection method will have some 'missing' taxpayers if a taxpayer has an entity class change from non-SW to SW, or a taxpayer is issued a new IRD number because of bankruptcy.

The dataset contains the following main variables:

  1. General variables – unique IRD number, date of birth, gender
  2. Income variables – salary/wage income, business income, estate or trust beneficiary income, interest income, dividend income, overseas income, rental income, shareholder-employee salary, partnership income, other income, taxable income
  3. Other variables – expenses, losses claimed, loss attributing qualifying company (LAQC) losses claimed

The gender variable is determined based on the 'title' of a taxpayer (e.g. Mr, Miss) so some imputation is required. When no title is present, or the title is ambiguous, gender is randomly assigned.

Some of the variables are unique to IR3 filers and not available for non-filers. Differences in the return financial variables are summarised in Table A1.

Table A1 – Income Variables
Table 1 – Income Variables.

Taxable income is the sum of all incomes less LAQC losses claimed, less expenses and less losses claimed (in that order). Taxable income is zero if a taxpayer has negative taxable income (i.e. a loss). As the focus is on income, rebates (such as child care, housekeeping and donations) are ignored.

The expenses variable is different from the business expenses that can be claimed in the general set of financial accounts. The expenses variable can be defined as 'fees' paid for professional services:

  • A fee to someone for completing a tax return.
  • Commission on interest or dividend income.
  • Expenses incurred in earning income that has had withholding tax deducted.
  • Additional expenses incurred in earning partnership income, for example, interest on capital borrowed to purchase a share in the partnership.
  • Interest on money borrowed to buy shares or to invest.
  • Premiums on loss of earnings insurance (income protection), provided the benefit from the insurance policy is taxable.
Page top