International Regional Science Review OnlineFirst, published on May 11, 2009 as doi:10.1177/0160017609336536

Understanding Spatial Variation in Tax Sheltering: The Role of Demographics, Ideology, and Taxes

International Regional Science Review Volume 000 Number 00 Month 2009 1-24 # 2009 SAGE Publications 10.1177/0160017609336536 http://irsr.sagepub.com hosted at http://online.sagepub.com

William M. Gentry Williams College, Williamstown, Massachusetts, e-mail: [email protected]

Matthew E. Kahn University of California at Los Angeles, e-mail: [email protected]

Taxpayers shelter income from taxation both through illegal evasion and legal avoidance. This tax sheltering creates a difference between a household’s actual income and what it reports to the tax authorities. While tax sheltering is a central concern for designing a tax system, the private nature of this behavior complicates evaluating the magnitude and determinants of such behavior. In this article, we combine zip-code level data on reported income from the Internal Revenue Service and the Census Bureau to examine three types of determinants of tax sheltering: (1) tax policy variables, including tax rates (2) political attitudes toward taxation; and (3) demographics. Our estimates suggest that higher tax rates increase the amount of tax sheltering. In terms of political support, our results suggest that places with voters who are either more conservative or less supportive of tax increases actually shelter less income. Keywords:

tax sheltering; elasticity of taxable income; political support for taxation

1. Introduction Tax sheltering is a combination of tax evasion and tax avoidance that reduces income reported on households’ tax returns relative to their true incomes. Tax evasion is the illegal underreporting of taxable income; tax avoidance refers to Authors’ Note: We thank Sandesh Dhungana for excellent research assistance. We appreciate comments from Alan Plumley, an anonymous referee, and participants at the conference ‘‘Understanding Place and the Economics of Space: Celebrating the Career of Roger Bolton.’’ All errors are solely our responsibility. 1

2 International Regional Science Review

deviations between taxable income and true income because households are not taxed on some forms of income (e.g., interest on tax-exempt bonds). The paucity of data on tax compliance impedes measuring the determinants of tax sheltering behavior. One reason for the lack of data is obvious: tax evasion is illegal so people are reluctant to reveal such behaviors. While tax avoidance behavior is legal, tax returns do not include information on all possible forms of tax sheltering. We combine Internal Revenue Service (IRS) data for 2001 reported at the zip-code level with zip-code level data from the 2000 Decennial Census to examine a measure of tax sheltering. Our main variable of interest is the ratio of adjusted gross income per tax return to average household income reported in the Census in a given zip-code area. From the outset, we want to be clear that these data cannot separate evasion from avoidance so that our results do not refer only to tax evasion. We use the geographic aspect of this measure of tax sheltering to examine the determinants of tax sheltering. Tax sheltering could depend on both financial incentives and attitudes toward taxation. The financial incentives to shelter income depend on the parameters of the tax code, including tax rates and enforcement efforts by the government. Theoretical predictions about the effect of higher tax rates on tax evasion depend on how potential fines interact with the tax rate. In addition, the opportunity to evade taxes may differ across types of income; for example, selfemployment may provide opportunities to underreport income. One would expect that higher marginal tax rates should induce more tax avoidance behavior. We focus on interstate variation in income tax rates as an exogenous source of variation in the tax incentives for tax sheltering. Political attitudes toward taxation can also affect tax sheltering behavior. We measure political attitudes with several variables. For California, we have zip-code level data on party affiliation of voters as well as voting patterns on several taxrelated ballot initiatives. Because these data are not available nationally and the meaning of party affiliation varies by region, we use the Poole-Rosenthal measures of political ideology, based on congressional voting records, to capture regional variation in political attitudes. We examine voter turnout to capture differences across place in political engagement. In addition to political attitudes, demographic differences, such as income, education, age, and race, may also influence attitudes toward tax sheltering. The difference between income reported to the tax authorities and true household income has important implications for the amount of tax revenues and the efficacy with which these revenues are collected. While in some instances tax avoidance behavior reflects households responding to incentives built into the tax system, both tax evasion and tax avoidance affect tax collections. Understanding tax sheltering can help policy makers improve the design of the tax system and tax administration. In terms of the economic distortions created by the tax system, Chetty (2008) shows that the deadweight loss of taxation can depend on a combination of the elasticity of

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering 3

taxable income and the elasticity of total income with respect to the tax rate. Tax sheltering creates a difference between these two elasticities; thus, estimating the elasticity of tax sheltering with respect to the tax rate has implications for the efficiency effects of the tax system. We organize our data in two ways to focus on different elements of tax sheltering. First, we focus on zip-code level data, both within California and nationally. Second, we compare tax sheltering in counties that border each other but are in adjacent states. By focusing on these bordered pairs, we hope to control for many factors that are the same across counties (e.g., they are in the same or similar labor markets) and isolated variables that differ across counties (e.g., marginal tax rates differ due to differences in state laws). To preview our results, our estimates suggest that higher tax rates increase the amount of tax sheltering; places with more self-employed people also shelter more income. In terms of political support, our results suggest that places with voters who are either more conservative or less supportive of tax increases actually shelter less income. We also find that minorities shelter more income from taxation and both older and more highly educated households shelter less income from taxation. The article is organized as follows. Section 2 provides some background on tax sheltering and efficiency measures of income taxes. Section 3 describes the advantages and disadvantages of the data that we use to explore tax sheltering. In section 4, we outline our empirical approach. Section 5 presents our results and section 6 offers a brief conclusion.

2. Background and Theory This section provides background information on several literatures that are related to our study of the geography of tax sheltering. After briefly distinguishing tax evasion and tax avoidance, we discuss the canonical models of the determinants of each behavior. We then turn to the implications of our analysis for the measurement of the efficiency cost of taxation.

2A. Overview of Tax Evasion and Avoidance Households can evade income taxes by underreporting income, overstating deductions or exemptions, or by not filing a tax return when required by law. Slemrod (2007) reports, based on tabulations for 2001 from the IRS, that underreporting of income accounts for 80 percent of the individual tax gap (the difference between the legally obligated tax liability and what people actually pay) as opposed to overstating deductions, exemptions, or credits. Individual income taxes account for about two-thirds of the aggregate tax gap of US$290 billion (after adjusting for expected

4 International Regional Science Review

collections from audits) or 13.7 percent of the statutory tax liability. Due to administrative features such as employer withholding of taxes on wages and informational reports by payees of some types of income, the estimated compliance rate varies considerably by type of income. Self-employment, rental income, and farm income tend to have considerably lower compliance rates. As with tax evasion, the goal of tax avoidance is to reduce the household’s tax liability without causing a significant distortion to the household’s consumption bundle (other than through the effect of having more cash). The critical distinction between the two concepts is that evasion is an illegal activity such that the authorities can compel taxpayers to pay the evaded tax; avoidance behavior is a legal activity that permits a taxpayer to reduce his or her tax liability. Examples of tax avoidance include investing in tax-exempt state and local bonds and deferring the realization of capital gains income; in these examples, alternative investment strategies would have yielded a higher tax burden but the tax authorities cannot dictate investment strategy. Of course, there are gray areas between evasion and avoidance because tax law is open to interpretation. An example is the restriction against deducting interest payments on debt if the proceeds to the debt are invested in tax-exempt bonds; while this tracing rule is easy to write down, it is difficult to implement because money is fungible. Economists tend not to estimate the overall magnitude of tax avoidance because the precise definition of what behaviors constitute avoidance is not possible. At what point does a behavior move from the category of pure tax avoidance into the category of real behavior that reduces the income tax liability? For example, increasing the level of savings in response to tax preferences for household saving changes real behavior by changing the timing of consumption; in contrast, merely shifting the form of savings in response to such incentives seems like tax avoidance. As stressed by Slemrod and Yitzhaki (2002), evasion and avoidance are central concepts for public finance. Models that assume that the government can costlessly collect taxes without fear of evasion and avoidance emphasize what Slemrod and Yitzhaki refer to as the real substitution responses to taxation. However, without understanding the magnitude of taxpayer’s evasion and avoidance possibilities, measuring real responses to taxation is difficult. Our research design of focusing on two different measures of income—one reported to the IRS and the other reported to the Census—takes the real responses to the tax system as given because they will affect both measures of income. Conditional on income reported to the Census, we ask what factors influence income reported to the IRS.

2B. Incentives for Tax Evasion and Tax Avoidance The canonical economic model of tax evasion is the model of taxpayer by Allingham and Sandmo (1972), who chooses an amount of tax to evade, given a fixed probability of detection and a proportional penalty based on the amount of tax evaded.

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering 5

Evading taxes creates a gamble with a pay off that depends on the tax rate, the probability of detection, and the fine upon being detected. Taxpayers with higher risk aversion are less likely to evade taxes. Depending on how risk aversion changes with income, the level of income can affect tax evasion. If relative risk aversion is increasing, decreasing, or stays constant with income, then evasion as a fraction of income will decrease, increase, or stay constant with income. The effect of the tax rate depends on the specific formulation of the penalty. Allingham and Sandmo assume that the penalty is proportional to the understatement of income, which leads to the prediction that higher tax rates create a substitution effect that increases evasion; as pointed out by Yitzhaki (1974), if the penalty is proportional to the tax evaded, then the tax rate will not have this substitution effect because the tax rate affects both the costs and benefits of evasion. While models of tax evasion emphasize risk aversion and detection probabilities, these factors do not affect the tax avoidance decision because the activity is legal. Instead of relying on risk and punishment as limiting factors, models of tax avoidance (see Slemrod 2001) appeal to the notion of taxpayers having access to an ‘‘avoidance technology.’’ The avoidance technology might include the ability to take compensation in a tax-advantaged form, such as untaxed fringe benefits, or to borrow with tax deductible interest to invest in lightly taxed assets. Stiglitz (1985) presents a model of tax avoidance that suggests that taxpayers have more opportunity to avoid taxes on capital income than labor income. If the marginal cost of avoidance increases with the amount of avoidance, taxpayers do not want to avoid all taxation. Because the marginal tax rate captures the marginal benefit to tax avoidance, an increase in the tax rate should increase the amount of tax avoidance. Empirical studies of tax evasion have focused on several types of data. Macroeconomic data have been used for aggregate measures of the tax gap (e.g., the measures mentioned above that Slemrod reports on overall tax evasion). Individual-level data on audited tax returns have been the subject of a number of studies (see Andreoni, Erard, and Feinstein 1998, for a survey of such studies). The advantage of using data on audited tax returns is that measured evasion is more likely to capture willful noncompliance with the tax laws. The disadvantage of such studies is that the evasion measure only captures evasion that can be detected by a tax auditor. Studies of audited tax returns have yielded mixed evidence on the effects of both tax rates and income on the amount of tax evasion. Using data on audited tax returns, Christian (1994) finds some evidence that higher-income households report a higher fraction of their income than lower-income households report. Given the difficulties in measuring tax evasion, other studies have resorted to experimental and survey data on tax evasion. The closest antecedent to our empirical strategy is the work of Dubin and Wilde (1988) and Beron, Tauchen, and Witte (1992; hereafter, BTW). Both of these studies use a special data set created by the IRS for tax returns from 1969, which are aggregated to the three-digit zip-code level (a strategy that yields a cross-sectional data set

6 International Regional Science Review

with roughly 800 observations). For each three-digit zip-code area, tax returns are broken into different audit classes (e.g., low-income household without business income and uses the standard deduction). For each audit class in each region, the data set includes (among other variables): (1) an estimated measure of voluntary compliance (the central focus of Dubin and Wilde 1988); (2) audit rates for 1968; (3) adjusted gross income (a focus of BTW 1992); (4) tax liability (also studied by BTW 1992); and (5) the number of tax returns. Both studies combine these data with socioeconomic variables from the 1970 Census. Dubin and Wilde find that audits deter noncompliance, especially for low- and middle-income households; even controlling for audit probabilities, self-employment and unemployment reduce compliance and manufacturing employment increases compliance. They report that for several audit classes, a higher percentage of non-white households reduces compliance. BTW (1992) examine the determinants of reported income and tax liability, controlling for income reported to the Census as a measure of ‘‘true’’ income. Our empirical specifications will follow a similar spirit; however, our data are not broken down by audit class and we do not have information on audit probabilities. They report evidence that higher education is associated with lower reported income and that an increase in female-headed households increases compliance. They do not find evidence of racial characteristics mattering for reported income or tax liability. BTW (1992) narrowly interpret their results on reported income as being about tax evasion; however, evasion is only one behavior that could affect reported income after controlling for true income.

2C. Social Identity, Geography, and Tax Evasion While the theoretical literature that has grown out of the Allingham and Sandmo (1972) framework has yielded many interesting insights into the potential determinants of tax sheltering, Andreoni, Erard, and Feinstein (1998) report a general consensus among researchers that these models predict much less compliance than observed in the tax systems in many developed countries. The argument is that audit probabilities are too low and the fines too small to generate the high rate of compliance observed in the data. As Slemrod (2007) points out, the overall audit probability does not necessarily reflect the audit probability of returns with underreported tax liability so the theory may have more explanatory power than the consensus suggests. Nonetheless, the common perception that tax compliance depends on more than just expected pay offs to gambling has lead to a search for other social factors that explain tax compliance. Tax evasion and tax avoidance reduces the amount of total revenue that government collects. Some people may justify this activity convincing themselves that government cannot be trusted to spend the money wisely on public goods. Others may view tax collection as collecting revenue that will be redistributed to others. A recent empirical literature has examined what factors determine support for government

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering 7

redistribution. Luttmer (2001) and Alesina and Glaeser (2004) argue that group altruism increases support for redistribution: people are more likely to support redistribution when the beneficiaries look like them. In an attempt to address this issue, in our empirical specifications we include various demographic characteristics. In addition to social identity, political ideology may affect tax sheltering. Our main measures of ideology are based on voting (of either individuals or representatives). The relationship between voting for higher taxes and sheltering income could come from two distinct channels. Ideology provides one reason to expect voting and tax sheltering to be related. People who vote for higher taxes may have an ideological bent that favors larger government and this ideology may influence their personal decisions. In other words, perhaps, liberals are willing to put their money where their mouths (or, at least, their votes) are. This mechanism suggests a positive correlation between the income ratio and voting for higher taxes. A more cynical mechanism for this relationship would be that it is less costly to vote for a higher tax rate if one has access to more tax sheltering opportunities. This self-interested mechanism suggests that causality runs from tax sheltering to voting: a higher reported income ratio will induce voting against higher taxes. The geography of tax sheltering can affect the ability of the federal government to redistribute through a progressive income tax. Albouy (2008) emphasizes that a federal income taxed based on nominal incomes places a heavier burden on people in high-cost areas than it places on residents of low-cost areas, holding the real incomes constant across place. While he focuses on how these tax differences create incentives for migration and spatial mismatch of people and places, the effective difference in taxation across place depends on propensities for tax sheltering as well as the interaction between the federal tax schedule and incomes in different places. However, we know relatively little about the distribution of tax sheltering across place.

2D. Taxable Income Elasticity, Efficiency, and Tax Sheltering A substantial literature on the elasticity of taxable income has developed following Feldstein’s (1995) work on how the Tax Reform Act of 1986 affected taxable incomes. One attractive feature of focusing on the taxable income elasticity is the claim that it is a sufficient statistic for calculating the deadweight loss of the tax system; the intuition is that individuals change all margins of behaviors—labor supply, investment, evasion, and avoidance—such that the private marginal cost of the change is the same across all behaviors (see Feldstein 1999). Chetty (2008) develops more detailed conditions under which the elasticity of taxable income measures the deadweight loss of the tax system. Building on earlier work by Slemrod (see, e.g., Slemrod and Yitzhaki 2002), Chetty’s insight is that not all behavioral adjustments have the same social cost. The social marginal cost of sheltering can differ from the private marginal cost of sheltering when the sheltering behavior creates a transfer of

8 International Regional Science Review

resources between agents. For example, some forms of tax sheltering require paying fees to financial planners; these fees are a cost to the taxpayer but not to society because the financial planner benefits from collecting the fee. In contrast, the social and private costs of distortions of real behavior are arguably relatively similar; a simple example is the lost output from distorting someone’s labor supply decision. Chetty (2008) develops a deadweight loss formula that incorporates a weighted average of the elasticity of taxable income (capturing both real and sheltering responses) with respect to the tax rate and the elasticity of total income (capturing real behaviors but not sheltering activity). The weights on the different elasticities depend on the magnitude of transfer costs associated with tax sheltering. If sheltering involves no transfer costs so that the social and private costs of sheltering are the same, then the elasticity of taxable income is sufficient for measuring the deadweight loss of the income tax. Our estimates of the responsiveness of tax sheltering to the tax rate identify the elasticity of sheltering with respect to the tax rate. The sheltering response captures the difference between the real response and the taxable income response so our estimates provide a sense of the importance of separating the two types of elasticities.

3. Data Challenges and Description Our approach to exploring the tax gap is relatively straightforward. We compare the zip-code aggregate household income reported on tax returns with the zip-code aggregate household income reported to the Census bureau. The critical assumption is that the difference between the two income measures represents tax sheltering behavior. This approach has several advantages over previous measures of the tax gap. First, relative to studies that rely on special tax return audits, our sample includes a broader set of households and our measure of the tax gap does not depend on whether the auditor can detect the income discrepancy. Second, we can include data on household characteristics that might be of interest for explaining behavior. Third, relative to experimental work, our data reflect actual behavior which avoids many of the criticisms endemic to using a laboratory setting to study economic decisions. Fourth, compared to aggregate measures of the tax gap, we use disaggregated data that allow us to explore some of the determinants of the tax gap. To fix ideas, consider the following relationship between reported income for tax purposes and true income: AGI ¼ Y  A  E þ e

where AGI is adjusted gross income for tax purposes, Y is true income, A is avoidance activity, E is tax evasion, and e is a random component that contributes to the difference in the income measures. Tax sheltering is the sum of avoidance (A) and evasion (E). We focus on the ratio of AGI to Y so that our dependent variable

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering 9

captures the fraction of income that is reported to the IRS. One can think of one minus this ratio as the sheltering ratio for each location. Any measure of overall tax sheltering has its problems and our measure is no exception to this rule. One issue for interpreting the measures of income is that filling out a tax return may frame the definition of income for some Census respondents. In responding to the Census, some people might simply report the same income that they reported to the IRS. Some people might never tally their income from sources that they do not report to the IRS. This sort of framing would systematically reduce our measure of the tax sheltering. In the notation from above, this framing concern suggests that some sheltering may be misclassified as true income; this misclassification will lead us to understate the amount of actual tax sheltering. Retirement savings, in its many different forms, provide an example of how these framing issues might affect any comparison of incomes reported to the IRS and Census. The tax return does not include contributions to tax-advantaged savings accounts, such as 401(k) plans, and excludes the capital income earned inside these accounts; however, all withdrawals from such accounts are included in taxable income. One could imagine households reporting income gross of contributions to these accounts to the Census bureau but forgetting to include the year-to-year investment returns in their income. Of course, some financially savvy households might include these investment returns as investment income. If either the contributions or the investment returns are included in the income that is reported to the Census Bureau, then there is a risk of double counting of income if households also include the withdrawals from these accounts as taxable income. Alternatively, some taxadvantaged savings accounts (so-called Roth-styled accounts) allow households to save after-tax earnings without having to pay tax on future investment returns. The natural reporting tendency for these accounts would be the same as tax-exempt bond interest: people may not report the returns on these accounts to the Census so that our measure of tax sheltering understates the true amount of tax sheltering. Compensation that comes through fringe benefits could also create differences between a household’s true income and what it reports to the IRS. Employer-provided health insurance is the canonical example. While such coverage has a substantial value, most households probably exclude its value when reporting income. Traditional defined benefit pensions create a host of measurement issues, similar to those discussed with personal retirement accounts. In some cases, the definition of income may vary for policy reasons without resulting in any behavioral change or even predict a change in other sources of income (as would be the case when fringe benefits substitute for cash compensation). One example is the income tax treatment of Social Security benefits. For many households, Social Security benefits are excluded from AGI but are likely to be reported to the Census Bureau as income. As should be apparent from our focus on AGI, our measure does not capture tax sheltering that occurs through deductions. Thus, tax sheltering strategies involving

10 International Regional Science Review

borrowing with tax deductible interest are outside the scope of our study. As reported above in section 2A, Slemrod (2007) reports that 80 percent of the tax gap comes from understating income rather than overstating deductions. While this statistic offers some comfort for focusing on income before deductions as a measure of tax sheltering, we note that it refers specifically to tax evasion without reference to tax avoidance. These data are only available at the zip-code aggregate level, instead of the individual level. Working with aggregated data creates a number of concerns. One particular problem with zip codes is that they are designed merely for the convenience of postal delivery. Other than respecting state lines, zip codes do not necessarily correspond with other political designations (e.g., towns or counties); of course, these political jurisdictions probably have considerable influence on the drawing of zipcode borders. They are not necessarily stable over time, though we are focusing on cross-sectional data and combining data from two relatively similar years, so we expect such changes to be of minor importance for our purposes. By the nature of the Census Bureau’s mandate, the Census Bureau is quite concerned with accurately measuring where people live (i.e., their ‘‘usual’’ residence) at the time of the Census survey. A precise measure of where people live is less important for the IRS and one might worry that location is not reported as accurately in the IRS data. One source of concern is that some people, especially high-income taxpayers, file tax returns using business addresses or their tax preparer’s address. This type of misclassification creates some outliers in our data. We address this misclassification concern in two ways. First, our dependent variable is the ratio of average AGI per tax return to the average Census income per household instead of simply the ratio of aggregate AGI in a zip code to aggregate Census income in a zip code. The ratio of aggregate incomes is influenced by whether the same households report being in the same zip code area in both data sets. If the misclassified households have the same average income (in both data sets) as the households that report living in the zip code, then by taking the ratio of the averages, we eliminate the influence of people reporting in different locations across the two data sets. Second, the misclassification of where households live may create outliers in the data. For this reason, along with a general concern about influential outliers,we trim the data such that observations in the top and bottom 1 percent of the distribution are assigned the value of the 99th and 1st percentile, respectively. An important aggregation issue for our purposes is that the Census and the IRS define a ‘‘household’’ differently. One option for dealing with this aggregation issue is to just use total income for the zip code from each data source. The total income has two drawbacks. First, the data are for two different years (data from the 2000 Census asks about income from 1999 and the IRS data are for 2001) so that migration and growth issues may heavily influence the ratio of the total values for each zip code. Zip-code level IRS data are not available for 1999. Second, while the Census is universal, not every household must file a tax return. Such nonfiling creates some

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering

11

measurement error in the ratio of the totals, providing another argument for focusing on the ratio of AGI per tax return in the zip code to Census income per household. If migrants into and out of the zip code between 1999 and 2001 have similar incomes to existing residents, then using the ratio of the averages should not be affected much by using data from two different years. A downside to using the ratio of averages is that some Census households may file multiple tax returns; examples include unrelated parties living together in a Census household, married couples who choose the married filing separately (a quite small number of returns for married couples living together), and children who file separately from their parents. Overall, these issues suggest that comparing income reported to the IRS and to Census is an imperfect measure of tax sheltering. However, as with any measurement error in a dependent variable, the important question is how the imperfections in measurement might bias the results. For example, while the average level of tax sheltering might be influenced by the measurement issues, it is more difficult to know how the measurement error affects the relationship between the marginal tax rate (for which we rely heavily on interstate variation as a source of identification) and tax sheltering. In interpreting our results, it will be important to ask whether a particular result may be influenced by measurement error instead of reflecting a behavioral relationship.

4. Empirical Approach We concentrate on two different cuts at the data that highlight specific hypotheses regarding the determinants of tax sheltering: (1) zip-code level data either within California or nationally; and (2) matched pairs of counties that share a common border but are in different states. The California data allow us to focus on a particular (large) state and include a broader set of political variables; however, we do not include the tax rate because the tax system does not vary within California. The national zip-code level sample allows for more date and we can include the marginal state tax rate; the disadvantage of the national data is that the political variables are noisy because they are only measured at the congressional district level. While focusing on bordering counties in different states reduces the size of our sample, this sample provides a cleaner test for tax effects because the variation in our measure of the tax rate comes from interstate variation in tax law. Our core regressions take the form: Income Ratiojl ¼ geographyl þ B1  Demographicsjl þ B2  Xjl þ Ujl

where the income ratio is the ratio of average AGI per tax return to average household income from the Census on various control variables and a set of approachspecific variables. As we discuss below, the geographic controls vary by specification. Demographics for zip code j in geographical area l is a vector of zip-code level

12 International Regional Science Review

demographic data from the year 2000 Census. Perhaps more novel, X represents a vector of place-specific attributes that will range from state tax rates to measures of the zip code’s political ideology. The control variables that are common across the approaches include: (1) the ratio of tax returns in the region to Census households in the region; (2) the percentage of the population that is self-employed based on Census information; (3) the percentage of households in the region in each of the fifteen different income groups based on Census reported income; (4) the percentage of residents aged 4060 and older than 60 years (with the percentage of residents under age 40 as the omitted category); (5) the percentage of the population born outside of the United States; (6) the percentage of the population that is Hispanic; (7) the percentage of the population that is African American; and (8) the percentage of households that own their homes. For the zip code level regressions, we include a dummy variable for whether the center of the zip code is within 30 miles of the central business district of a metropolitan area (i.e., an Metropolitan Statistical Areas [MSA]). Many of these variables capture standard demographic controls that might reflect attitudes toward the tax system or differential access to sheltering opportunities. The ratio of tax returns to the number of Census households is the ratio of the denominators of the two variables used to construct the dependent variable. The rationale for including this variable is that it might reflect places that have an unusually high or low number of tax returns per Census household. For example, this variable depends on the number of children who file separate tax returns and the number of Census households with multiple families. One would expect the estimated effect of this ratio to be negative from the mechanical effect of having more tax returns for the same amount of total households. This variable also captures (albeit roughly) variation in whether households must file tax returns. Because nonfiling households have low income, an increase in such households in a zip code (which is a decrease in the ratio of tax returns to the households) will tend to increase the reported income ratio. The percentage of the workforce that is self-employed should capture an increase in the opportunities for tax sheltering if the self-employed have more tax sheltering opportunities. We recognize that the key regressions reported in this article are ‘‘ecological regressions.’’ Ideally, we would have access to micro data with geographical identifiers. Instead, we have access to zip-code level data and we seek to interpret the results as if we can recover ‘‘micro parameters’’ based on these regressions. Our ability to make inferences about household behavior based on such data hinges on the absence of demographic interaction effects. For example, if the true data-generating process is that highly educated Hispanics engage in tax avoidance while less-educated Hispanics do not and highly educated blacks do not engage in tax avoidance while less-educated blacks do engage in such activity, we could not recover such interactions terms between ethnicity and education. In order for our aggregated approach to recover micro relationships, we need the demographic effects in equation (1) to enter as linearly separable regressors.

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering

13

4A. Variation in Tax Sheltering within California Focusing on California has the advantage of allowing us to measure political attitudes more precisely than we can in other states. First, we have data on the share of all votes in the year 2000 Presidential Election that were cast for the Republican candidate (George W. Bush). Second, we can measure political attitudes from data on voting results from California’s ballot initiative system. These initiatives allow for direct democracy; these propositions can become state law by a simple majority vote (Matususaka 2005). We use Census-tract level data on the number of votes in favor and against specific propositions to construct zip-code level variables that measure the share of the zip code’s voters who voted in favor of higher taxes in each initiative. We use three California ballot initiatives to identify tax ideology: (1) Proposition 167 in 1992; (2) Proposition 186 in 1994; and (3) Proposition 217 in 1996. None of these propositions received a majority vote. While the three propositions differed in specifics, they all would have increased income tax rates, especially for high-income households: (1) (2)

(3)

Proposition 167 in November 1992, raises top income tax rates, and repeals the 1991 sales tax increase. Proposition 186 in November 1994, the California Health Security Act, would have instituted a ‘‘single payer’’ health care system for California. This measure promises generous medical, dental, vision, mental health, and long-term care benefits to all state residents. Benefits would be financed by income tax increases (2.5 percent for all individuals and an additional 2.5 percent for those with incomes above US$250,000; US$500,000 for couples), payroll taxes of between 4.4 percent and 8.9 percent per employer, depending on the number of persons employed, and US$1 per pack surcharge on cigarettes. Proposition 217 in November 1996, would have reinstated for 1996 and subsequent tax years the 10 percent and 11 percent income tax rates on individuals’ taxable income over US$115,000 and US$230,000, respectively, and joint taxpayers’ income over US$230,000 and US$460,000, respectively. The state would have apportioned about half the revenue from these increases to local governments and about half to schools and community colleges. The state would have been prohibited from further reducing the local agencies’ proportionate share of local property taxes. This proposition would have prevented the state from lowering the tax rates on higher-income brackets, without a vote of the people.

A key feature of these various votes is that they are highly positively correlated. The pairwise correlations between the votes all exceed 0.80. To create a parsimonious way to include the information from these votes in the regression, we use the average vote in favor of the proposition as an overall measure of political support for higher taxes. The effect of voting on ballot initiatives that support taxes on whether households shelter income may be affected by reverse causality: people who are able to shelter

14 International Regional Science Review

more income may be more willing to support higher taxes. To address this concern, we run a separate set of California regressions that measure political attitudes and political engagement with a more general set of variables. First, we include the percentage of voters in the zip code who voted for the Republican candidate in the year 2000. Unlike the ballot propositions, which come from several years before our income data, the 2000 vote share is contemporaneous with the income data, so demographic shifts are less likely to create noise in this measure of political ideology. Second, we have county-level voter turnout based on the 1992 election as a measure of overall political engagement (though not as a measure of political views); we create a turnout measure by comparing voting with the 1990 population that was old enough to vote. Third, we use the Poole-Rosenthal factors that analyze members of the House of Representatives voting records in the 108th Congress. Poole and Rosenthal (1997) summarize voting records by two variables: (1) DWNOMINATE_1, captures a measure of liberal-moderate-conservative views; and (2) DWNOMINATE_2, captures views on race and civil rights. For each zip code in a congressional district, we assign the values associated with the district’s representative. Unfortunately, these factors only vary at the congressional district level so they provide less disaggregated information on ideology than the ballot initiatives. We are mainly interested in the effects along the liberal-conservative dimension; a higher value of this factor is associated with a more conservative member of Congress.

4B. National Sample at the Zip-Code Level and Tax Sheltering A downside of focusing on California is that the results might not generalize to other states. Unfortunately, we do not have ballot initiative votes or party affiliation at the zip-code level for other states. We can, however, estimate a model that focuses on demographic variables and the Poole-Rosenthal measures of ideology. The problem with this variable is that the level of aggregation for ideology (435 congressional districts) is much higher than the number of zip codes used in the regressions (31,075). While we can cluster standard errors so as to correct this effect in drawing statistical inference, the variable is still quite noisy as a measure of ideology because views can vary greatly within a congressional district. Unlike in the California regressions, we can include estimates of the state level marginal tax rate in the interstate regressions. State income tax policy provides a source of variation for identifying how tax sheltering responds to differences in tax rates. Federal tax policy is not especially useful in this regard because all zip codes face the same tax code. While marginal federal tax rates may vary across individuals, they mainly vary due to differences in income. This correlation creates a classic issue in empirical public finance of distinguishing tax effects from nonlinearities in income effects. We calculate zip-code level state marginal tax rates using the National Bureau of Economic Research’s TAXSIM calculator (www.nber.org/ *taxsim). We use average household characteristics for each zip code from the

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering

15

Census data to create a marginal tax rate. To focus on interstate variation in tax rates, we use the state level tax rate rather than the combined state and federal tax rate.

4C. Border Pairs Approach to Measuring Tax Sheltering Within a state, tax sheltering behavior could vary for many reasons that we cannot control for in our specifications. A correlation between these unobservable factors would induce a bias in the estimated tax effects. To maximize the value of interstate variation in the data, we follow a similar empirical strategy to Holmes (1998), who exploits a regression discontinuity approach to study the role of state labor regulation in determining manufacturing agglomeration patterns. He studies trends in county manufacturing employment growth in counties that border other counties in another state. By focusing on bordered pairs, the regression discontinuity approach attempts to minimize the impact of unobservable factors by assuming that the regional proximity reduces the importance of such factors. In a similar spirit, we compare our income ratio for adjacent counties where the two counties lie in different states. In these regressions, the unit of analysis is a county: we take our zip-code level data and aggregate it up to the county level. The sample is the set of counties that are adjacent to a county in another state. For each county pair, we include a border-pair fixed effect. This matched-pair research design leads to counties that border more than one county in another state being included in the regression more than once; we cluster our standard errors to adjust for counties appearing multiple times in the regression. The bordered-pair fixed effect absorbs the average effect of common regional attributes that the counties share (e.g., common or similar labor markets or industrial mix). The estimated coefficients rely on within pair differences in the observable variables. In the case of the tax rate variable, these differences derive mainly from differences in state tax policy.

5. Results Before turning to the regression results, a few observations from the summary statistics presented in table 1 are helpful. The mean ratio of AGI per tax return to income per Census household is quite similar across the three samples (0.79 in the California zip-code sample and 0.80 in the other two samples). While it is tempting to draw aggregate conclusions about tax sheltering from this simple mean, we caution against such conclusions for two reasons. First, many of the differences in reported income are not from behavioral decisions (e.g., the fact that Social Security benefits are excluded from AGI). Second, aggregate tax sheltering also depends on the number of tax returns filed per household. Some households will file multiple returns (e.g., parents and children file separately) and others will not be required

16 International Regional Science Review

Table 1 Summary Statistics California Zip Codes

National Zip Codes

County Border Pairs

Variable

Mean

SD

Mean

SD

Mean

SD

Income ratio State marginal tax rate Average ‘‘Yes’’ on ballot propositions Republican vote share Poole-Rosenthal Factor 1 Poole-Rosenthal Factor 2 Voter turnout Tax returns/households Self-employed Owner-occupied Housing Born outside United States Near an MSA Ages 40–60 Ages 60þ African American Hispanic Asian High school Some college College degree Advanced degree

0.786

0.151

0.797 4.423

0.124 2.2775

0.800 4.491

0.069 2.164

0.395 0.397 –0.040 –0.226 0.510 1.219 11.395 56.813 1.104 0.885 25.637 14.694 6.344 28.796 10.920 19.991 23.076 24.762 9.942

0.088 0.158 0.481 0.344 0.0769 3.252 4.987 18.260 0.541 0.319 4.872 6.098 9.825 21.782 11.529 5.927 5.553 10.335 7.990

0.049 –0.041 0.568 1.16 9.83 66.36 1.20 0.773 26.38 16.80 11.91 10.77 3.46 28.76 21.12 21.86 8.90

0.442 0.421 0.0945 0.258 4.26 17.67 1.72 0.419 4.05 6.76 18.99 16.56 6.45 9.25 5.04 9.47 7.09

0.0061 –0.0916 0.561 1.17 9.29 65.22 1.45

0.391 0.403 0.0952 0.115 2.77 13.68 1.48

26.51 16.89 13.84 9.76 3.08 29.60 20.08 21.17 9.22

2.01 3.28 14.24 11.98 3.43 6.56 4.10 5.94 4.83

Notes: MSA ¼ metropolitan statistical areas. For the California zip-code sample, there are 1,607 observations. In the national sample there are 31,075 observations and in the county border pair sample there are 1,142 observations.

to file at all. The mean number of tax returns per Census household is about 1.2 in each of the samples. The product of our tax sheltering ratio and the ratio of tax returns per household is equal to the ratio of aggregate AGI in the zip code to aggregate Census income in the zip code. Evaluated at the means, this product is 0.93 in the national sample, suggesting less tax sheltering than one might expect from the mean of the tax sheltering ratio. As discussed above, we use the ratio of ratios in our analysis to minimize outliers.

5A. California Results As discussed above, we estimate several variants of our model for the California data. Table 2 reports the results for the California sample. The first two columns have

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering

17

Table 2 Regression Results for California Zip Codes Dependent Variable: Ratio of Average AGI to Average Household Census Income Variable Average ‘‘yes’’ on ballot propositions

(1)

(2)

–0.130*** (0.0506)

–0.485*** (0.0852)

Republican vote share Poole-Rosenthal Factor 1 Poole-Rosenthal Factor 2 Voter turnout Near an MSA Tax returns/households Self-employed Owner-occupied housing Born outside United States Ages 60þ Ages 40–60 African American Hispanic Asian High school Some college College degree Advanced degree County fixed effects Adjusted R2

0.0010 (0.0063) –0.0027 (0.0093) –0.0050*** (0.0012) –0.0025*** (0.0004) 0.0056 (0.0043) 0.0041*** (0.0007) 0.0005 (0.0013) –0.0023*** (0.0004) –0.0017*** (0.0005) –0.0017*** (0.0004) 0.0029* (0.0016) 0.0030** (0.0013) 0.0002 (0.0013) 0.0072*** (0.0016) No 0.683

0.0107 (0.0073) –0.0003 (0.0090) –0.0014 (0.0011) –0.0023*** (0.0005) –0.0002 (0.0051) 0.0033*** (0.0008) 0.0006 (0.0016) –0.0012*** (0.0004) –0.0005 (0.0005) –0.0014*** (0.0004) 0.0017 (0.0017) 0.0040*** (0.0013) 0.0028** (0.0014) 0.0078*** (0.0016) Yes 0.697

(3)

(4)

0.119*** (0.0380) –0.0272*** (0.0074) –0.00694 (0.0079) 0.0129 (0.0438) 0.0060 (0.0065) –0.0047 (0.0096) –0.0048*** (0.0011) –0.0024*** (0.0004) 0.0020 (0.0045) 0.0040*** (0.0007) 0.0001 (0.0014) –0.0022*** (0.0004) –0.0015*** (0.0005) –0.0016*** (0.0004) 0.0028* (0.0016) 0.0034*** (0.0013) 0.0005 (0.0013) 0.0072*** (0.0015) No 0.685

0.214** (0.0925) –0.0269** (0.0121) –0.0183 (0.0122)

0.0106 (0.0107) –0.0027 (0.0145) –0.0024** (0.0011) –0.0022*** (0.0007) –0.0039 (0.0054) 0.0037*** (0.0013) 0.0000 (0.0017) –0.0015** (0.0006) –0.0008 (0.0007) –0.0014*** (0.0005) 0.0018 (0.0029) 0.0036** (0.0014) 0.0023 (0.0019) 0.0068*** (0.0025) Yes 0.688

Notes: AGI ¼ adjusted gross income; MSA ¼ metropolitan statistical areas. All the regressions are based on 1607 observations. The regressions are weighted by the number of households in the zip code. The regressions include the percentage of households in 15 different income ranges and a constant. *, **, *** represent statistical significance at the 90 percent, 95 percent, and 99 percent confidence levels, respectively. Standard errors (clustered at the county level) are reported in parentheses.

18 International Regional Science Review

results using the ballot propositions to measure political ideology; the last two columns use party-line voting, the Poole-Rosenthal factors, and voter turnout as measures of political ideology and engagement. The other difference across columns is that the second and fourth columns include county level fixed effects to control for possible regional differences within California. Controlling for county fixed effects, we exploit within-county variation in zip-code income ratios, demographics, and politics. In terms of political ideology, both specifications that use the ballot propositions yield estimated coefficients that suggest that places with more support for higher taxes report relatively less income to the IRS (conditional on the income they report to the Census Bureau). Thus, supporting tax increases and reporting more taxable income do not go hand-in-hand. The magnitude of the estimated effect varies depending on the inclusion of county level fixed effects with the estimated effect being almost four times higher (0.49 vs. 0.13) in the specification with fixed effects. In the larger of these two cases, a one standard deviation increase in the measure of tax support for the ballot propositions increases the income ratio by slightly more than one-quarter of a standard deviation. As discussed above, while ballot initiative voting is obviously related to people’s views of taxation, one concern is that the amount of sheltering that people undertake may influence their voting on ballot initiatives. That is, voting behavior may be endogenous to tax sheltering. In fact, this endogeneity story is consistent with our results using the ballot initiatives: places that support the ballot initiatives also have more tax sheltering. A natural way to deal with this problem econometrically would be to use an instrumental variables technique. However, a valid instrument needs to be correlated with voting on ballot initiatives without having a direct effect on attitudes toward tax sheltering. Lacking a good instrument it is not possible to sort out the relationship between voting and tax sheltering. As an alternative strategy for measuring the effects on political attitudes on tax sheltering, we estimate specifications that include relatively general measures of political attitudes—as captured by party affiliation, voter turnout, and the Poole-Rosenthal measures of ideology. We argue that these measures might be less susceptible to reverse causation: while being able to shelter income from taxation might make someone more likely to vote in favor of higher taxes, the effect on general political ideology is probably smaller. The third and fourth columns of table 2 present results from specifications that include party affiliation, voter turnout, and the Poole-Rosenthal measures of ideology as proxies for political attitudes. Because we have multiple variables that measure the effects of political concerns, the results are somewhat tricky to interpret. Specifically, both Republican vote share and the first Poole-Rosenthal factor capture conservatism: these variables have a correlation of 0.70. Because the PooleRosenthal variable is calculated at a more geographically aggregated level (i.e., a congressional district), it provides a coarser measure of ideology.

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering

19

Both specifications indicate that an increase in Republican voters is associated with a higher reported income ratio (less sheltering) and these results are statistically significant. However, the estimated effect of being in a congressional district with a more conservative member of Congress is negative and statistically significant. Based on the specification that includes the county fixed effects, the net effect of being in a place that is both one standard deviation more conservative and one standard deviation more Republican is that the income ratio is roughly 0.021 higher (less income sheltering). In unreported regressions, we estimated separate models using party voting and the Poole-Rosenthal measures. The estimated coefficient on the Republican variable is positive (but not statistically significant) in a regression without the Poole-Rosenthal variables; the estimated coefficient on the first PooleRosenthal factor is negative (but not statistically significant) in a regression without the Republican variable. Our conclusion is that these variables are capturing different aspects of political ideology and that controlling for these different aspects is important for estimating the overall effect. In terms of tax policy, we do not include the marginal tax rate in these regressions because variation in the tax rate would be solely due to differences in income levels across places. However, the regression includes the percentage of households who are self-employed as a test of whether the self-employed engage in more tax sheltering. With the exception of second column in table 2, the estimated coefficient is negative and statistically significant, consistent with the hypothesis that the selfemployed have more opportunities to shelter income from taxation. Turning to demographics, several patterns emerge from the results that are consistent across specifications. First, having more people own their own homes reduces the reported income ratio. While home ownership creates a form of income (i.e., imputed rent) that is not captured by AGI (which would suggest that home ownership is a form of tax sheltering), this story seems implausible because the Census definition is unlikely to capture imputed rent. Another possibility is that home ownership is correlated with aspects of the income distribution that are not completely controlled for by the income controls. For example, homeownership increases with income so that a larger fraction of households in areas with high homeownership also file taxes; if places with more nonfilers have a higher reported income ratio (because the Census average income reflects the lower-income households but the IRS data do not), then this indirect effect could explain the result. Second, places with more households with heads over the age of 60 also have less tax sheltering. This result is somewhat surprising, given that a substantial fraction of Social Security income is excluded from adjusted gross income (which would show up as a form of unintentional tax sheltering). One explanation is that older households realize more capital gains income that is not captured by Census income. Third, minorities appear to shelter more income from taxation (though this result is not statistically significant for Hispanics in the specification with county fixed effects). The regressions include income controls based on the fraction of households

20 International Regional Science Review

in different income ranges; however, if these controls do not completely capture the effects of income distribution on reporting, then the demographic variables might be picking up differences in the need to file tax returns. Fourth, the amount of sheltering decreases with the level of education of the head of household: places with more people with advanced degrees tend to report more income to the IRS relative to what they tell the Census. One possible explanation for this result is that, controlling for income, more highly educated people tend to have jobs with more informational reporting which makes tax evasion more difficult.

5B. National Zip Code Level Results Moving beyond California, we can estimate a national sample in which we include the Poole-Rosenthal ideology measures and voter turnout. Table 3 provides results from an ordinary least squares regression using the national sample of zip codes. The estimated effect of being in a more conservative congressional district on the reported income ratio is positive (0.0063), though only at the 90 percent significance level. A one standard deviation increase in first Poole-Rosenthal factor is associated with a 0.0027 increase in the reported income ratio; this effect is an order of magnitude smaller than the net effect of a California zip code being both more conservative and more republican. Unlike the California results in which voter turnout did not have a statistically significant impact on tax sheltering, places with higher voter turnout tend to have more tax sheltering. With respect to demographics, the national sample results are broadly consistent with the results reported for California. Table 3 also provides our first evidence on the effects of tax rates on tax sheltering. The estimated coefficient on the state marginal tax rate is 0.0020 and statistically significant at the 99 percent confidence level. A two percentage point increase in the state marginal tax rate decreases the reported income ratio by 0.0040, which is about onehalf of a percentage of the average income ratio. We return to the economic significance of the estimated tax effects in our discussion of the county border pair results.

5C. National County Border Pairs Results Table 4 reports results from estimating equation (1) with data from border pairs of counties. These regressions include a fixed effect for each border pair. By including this border pair fixed effect, we seek to exploit within border pair variation in state tax rates, political variables, and demographics to tease out effects on the tax income ratio. The regression discontinuity approach is most useful for estimating the effects of variables that are dissimilar across the boundary (e.g., the tax rate effect) because if the characteristic is similar across places, its effect will be subsumed into the fixed effect for the pair. We report two specifications. The first specification only has the marginal tax rate and the political variables, under the assumption that the border pair fixed

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering

21

Table 3 Regression Results for National Sample of Zip Codes Dependent Variable: Ratio of Average AGI to Average Household Census Income Variable State marginal tax rate Poole-Rosenthal Factor 1 Poole-Rosenthal Factor 2 Voter turnout Near an MSA Tax returns/households Self-employed Owner-occupied housing Born outside United States Ages 60þ Ages 40–60 African American Hispanic Asian High school Some college College degree Advanced degree Adjusted R2

(1) 0.00199*** (0.0006) 0.0063* (0.0035) 0.0041 (0.0036) –0.0453* (0.0268) –0.0019 (0.0033) –0.0258*** (0.0071) –0.0009** (0.0004) –0.0013*** (0.0002) –0.0049*** (0.0007) 0.0031*** (0.0003) 0.0005 (0.0005) –0.0020*** (0.0001) –0.0015*** (0.0001) –0.0023*** (0.0004) –0.0005 (0.0003) 0.0008** (0.0004) 0.0030*** (0.0004) 0.0036*** (0.0006) 0.533

Notes: AGI ¼ adjusted gross income; MSA ¼ metropolitan statistical areas. The regression is based on 31,075 observations. The regression is weighted by the number of households in the zip code. The regression includes the percentage of households in 15 different income ranges and a constant. *, **, *** represent statistical significance at the 90 percent, 95 percent, and 99 percent confidence levels, respectively. Standard errors (clustered by state) are reported in parentheses.

effects control for other factors. The second specification (which turns out to be our preferred specification) includes the covariates used in the zip-code level analysis. In both specifications, the estimated effect of state income tax rates on the income ratio is negative: counties in high-tax states have more tax sheltering than adjacent counties located in low tax states. Using the specification with the full set of demographics, the results indicate that a two percentage point increase in the tax rate decreases the income ratio by 0.0054 (i.e., 2  0.0027). To convert this into something akin to a price elasticity, consider that the ‘‘tax price’’ of behavior is one minus the tax rate. If someone starts with an overall income tax rate of 20 percent (federal plus state), they face a tax price of 0.80. An increase in the state marginal tax rate that increases the tax rate to 22 percent lowers the tax price to 0.78. Thus, a 2.5 percent decrease in the tax price causes a 0.68 percent (i.e., 0.0054 divided by the mean income ratio of 0.79) decrease in reported income—an elasticity of reported income of 0.27 (i.e., 0.68/2.5) with respect to the tax price. Relative to overall taxable income

22 International Regional Science Review

Table 4 Regression Results for County Border Pairs Dependent Variable: Ratio of Average AGI to Average Household Census Income Variable State marginal tax rate Poole-Rosenthal Factor 1 Poole-Rosenthal Factor 2 Voter turnout:county level Tax returns/households Owner-occupied housing Self-employed Born outside United States Ages 60þ Ages 40–60 African American Hispanic Asian High school Some college College degree Advanced degree Adjusted R2

(1)

(2)

–0.00449*** (0.0011) –0.0142 (0.0106) –0.0177* (0.0107) 0.307*** (0.0967) –0.157*** (0.0590)

–0.00269*** (0.00088) –0.00654 (0.00682) –0.00134 (0.00704) –0.119** (0.0490) –0.115*** (0.0434) –0.00142 (0.0010) –0.00156 (0.00124) –0.00502 (0.00373) 0.00303*** (0.00099) 0.00198 (0.00222) –0.00241*** (0.00063) –0.00167** (0.00072) –0.00606*** (0.00156) –0.00151 (0.00099) –0.00037 (0.00131) –0.00021 (0.00168) 0.00800*** (0.00297) 0.797

0.552

Notes: AGI ¼ adjusted gross income. All the regressions are based on 1142 observations. The regressions are weighted by the number of households in the county. The regressions include the percentage of households in 15 different income ranges and a constant. *, **, *** represent statistical significance at the 90 percent, 95 percent, and 99 percent confidence levels, respectively. Standard errors (clustered by county) are reported in parentheses.

elasticities (see discussion in Chetty 2008), this elasticity of tax sheltering is on the lower end of the reported range. However, because the overall taxable income elasticity is the sum of the elasticity of tax sheltering with respect to the tax price and the elasticity of economic income with respect to the tax price, this result suggests that sheltering plays a large role in the overall taxable income elasticity. In terms of the estimated effects of the other factors, most of the estimated effects are consistent with the results reported in the previous tables. The exceptions to this pattern are the estimated effects of self-employment and home ownership, which are no longer statistically significant.

6. Conclusion Tax sheltering activity has important public policy implications because government revenue collection hinges on whether taxpayers engage in this activity. Because

Gentry, Kahn / Understanding Spatial Variation in Tax Sheltering

23

tax sheltering encompasses both legal activity (tax avoidance) and illegal activity (tax evasion), empiricists have faced fundamental data challenges that have limited their ability to study these phenomena. In this article, we have used two independent residential zip-code level data sets that have enabled us to test several hypotheses concerning tax avoidance behavior. Our maintained assumption throughout this article is that households accurately report their incomes to the Census. Our justification for this assumption is that households have no economic incentive to systematically misreport this information. Under the assumption that households truthfully report their incomes to the Census, then the ratio of reported income to the IRS divided by reported income to the Census provides a metric of tax sheltering activity (albeit an imperfect measure) at the zipcode level. In our empirical work, we have used this ratio as our key dependent variable. Based on this variable, our results suggest the following: (1) in the California sample and the national zip-code samples, liberal communities engage in more tax avoidance than the average community; (2) higher tax rates encourage more tax sheltering; (3) consistent with the idea that the self-employed have more opportunities for tax sheltering, we find that the self-employed do indeed report relatively less income to the IRS than other households (conditional on what they report to the Census); and (4) certain demographic groups such as the young and minorities may engage in more tax sheltering behavior; however, more highly educated households may engage in less tax sheltering. While the results related to tax policy—the tax rate effects and the self-employment effect—are quite consistent with standard predictions from public finance, the results from the political and demographic variables open up a host of questions about the mechanism that is at work. One question that we cannot resolve with our data is the question of tax avoidance versus tax evasion. Another issue is whether opportunities to shelter taxes differ across groups or whether some groups are more aggressive in seeking out such opportunities. Does social capital or other ideas of social identity play a role in tax sheltering decisions? Furthermore, the relationships between the variables could be driven by measurement issues (e.g., the fact that the IRS does not have income for households that are not required to file tax returns) that do not have any direct behavioral implications. We hope future research helps sort through these issues.

References Albouy, David. 2008. The unequal geographic burden of federal taxation. University of Michigan, Department of Economics, Mimeo. Alesina, Alberto, and Edward Glaeser. 2004. Fighting poverty in the United States and Europe: A world of difference. Oxford: Oxford University Press. Allingham, Michael G., and Agnar Sandmo. 1972. Income tax evasion: A theoretical analysis. Journal of Public Economics 1:323-38.

24 International Regional Science Review

Andreoni, James, Brian Erard, and Jonathan Feinstein. 1998. Tax compliance. Journal of Economic Literature 36:818–60. Beron, Kurt J., Helen V. Tauchen, and Ann Dryden Witte. 1992. The effect of audits and socioeconomic Variables on Compliance. In Why people pay taxes: Tax compliance and enforcement, edited by Joel Slemrod, 6789. Ann Arbor: University of Michigan Press. Chetty, Raj. 2008. Is the taxable income elasticity sufficient to calculate deadweight loss? The Implications of evasion and avoidance. Working Paper No. 13844, National Bureau of Economic Research. Dubin, Jeffrey A., and Louis L. Wilde. 1988. An empirical analysis of federal income tax auditing and compliance. National Tax Journal 41:61–74. Feldstein, Martin. 1995. The effect of marginal tax rates on taxable income: A panel study of the 1986 Tax Reform Act. Journal of Political Economy 103:551-72. Feldstein, Martin. 1999. Tax avoidance and the deadweight loss of the income tax. Review of Economics and Statistics 81:674–80. Holmes, Thomas J. 1998. The effect of state policies on the location of manufacturing: Evidence from state borders. Journal of Political Economy 106:667–705. Luttmer, Erzo. 2001. Group loyalty and the taste for redistribution. Journal of Political Economy 109:500-28. Matususaka, John. 2005. Direct democracy works. Journal of Economic Perspectives 19:185-206. Poole, Keith, and Howard Rosenthal. (1997). Congress: a political-economic history of roll call voting. Oxford: Oxford University Press. Slemrod, Joel. 2001. A general model of the behavioral response to taxation. International Tax and Public Finance 8:119-128. ———. 2007. Cheating ourselves: The economics of tax evasion. Journal of Economic Perspectives 21:25-48. Slemrod, Joel, and Shlomo Yitzhaki. 2002. Tax avoidance, evasion, and administration. In Handbook of public economics, edited by Alan J. Auerbach and Martin Feldstein, 142370, 1st ed. 3 vols, chap. 22. Stiglitz, Joseph. 1985. A general theory of tax avoidance. National Tax Journal 38:325-37. Yitzhaki, Shlomo. 1974. Income tax evasion: A theoretical analysis. Journal of Public Economics 3:201-02.

Understanding Spatial Variation in Tax Sheltering: The ...

May 11, 2009 - Williams College, Williamstown, Massachusetts, e-mail: ... behavior. One reason for the lack of data is obvious: tax evasion is illegal so peo- ... First, we focus on zip-code level data, both within California and nationally. ..... tax years the 10 percent and 11 percent income tax rates on individuals' taxable.

154KB Sizes 0 Downloads 159 Views

Recommend Documents

Understanding Spatial Variation in Tax Sheltering: The ...
May 11, 2009 - Our estimates suggest that higher tax rates increase the amount of ... taxed on some forms of income (e.g., interest on tax-exempt bonds). ..... advantaged savings accounts (so-called Roth-styled accounts) allow households to.

Small-scale spatial variation in the interactions between ...
dispersal services (Snow 1971; McKey 1975; Howe and Estabrook 1977). However, the ..... Chavez-Ramirez F. and Slack R.D. 1994. Effects of avian foraging.

McGill University Working Paper Does Spatial Variation ...
lishing the district, BID taxes are binding upon all members, thus resolving ..... online to find whether that city had any BIDs and what the earliest year of BID adoption ... economic census I use data on the amount of retail sales; from the 1977, .

Spatial variation of output-input elasticities: Evidence ...
Mar 1, 2007 - county-level dataset along with associated geographic information. County- ... (1998) find that under intensive rice production systems in eastern. China, pesticide ...... Center for International Earth Science Information Network.

Spatial variation of output-input elasticities - Wiley Online Library
Mar 1, 2007 - ing spatial variation in the parameters improves model fit of the agricultural production function, and provides valuable insights into the relative ...

Spatial variation in post-dispersal seed removal in an ...
cal forests (Terborgh et al., 1990), which is suggestive of their possible ... Gá lia (22 240 S, 49 420 W), in the State of Sa˜o Paulo, southeast- ern Brazil. ...... University of Chicago Press, Chicago. Érard, C. .... Freeman, San Francisco. Sork