Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

Theoretical framework

This section develops a model of labor supply under tax misperception. A worker who misperceives her marginal tax rate chooses a suboptimal quantity of labor, generating a welfare loss that depends on the elasticity of labor supply, the variance of the misperception, and the level of the tax rate itself. The central result is a closed-form expression for the expected deadweight loss as a fraction of earnings.

Setup

Consider a population of workers indexed by ii. Worker ii earns hourly wage wiw_i and faces a linear marginal tax rate τi\tau_i. Due to the complexity of the tax code---overlapping brackets, phase-ins, phase-outs, credits, and the interaction of federal, state, and payroll taxes---the worker does not observe τi\tau_i directly. Instead, she perceives a noisy signal

τ^i=τi+δi,δiN(0,σ2),\hat{\tau}_i = \tau_i + \delta_i, \qquad \delta_i \sim N(0, \sigma^2),

where σ\sigma measures the severity of misperception---the misperception error has a standard deviation of σ\sigma (e.g., 12 percentage points when σ=0.12\sigma = 0.12). The noise δi\delta_i captures the gap between the worker’s mental model of her tax rate and the actual statutory-plus-effective rate she faces. This formulation nests perfect information as the special case σ=0\sigma = 0.

The normality assumption is a modeling convenience that delivers tractable closed-form results. The DWL formula depends only on E[δ2]=σ2E[\delta^2] = \sigma^2, so it holds for any symmetric distribution with variance σ2\sigma^2, not just the Gaussian. Departures from normality (e.g., heavier tails or discreteness in actual misperception) would affect higher-order corrections to the Taylor approximation but not the leading term. The Monte Carlo validation in the approximation accuracy subsection confirms that the second-order formula is robust to the clipping of extreme draws at the [0, 1] boundary.

The mean-zero assumption (E[δi]=0E[\delta_i] = 0) is a simplification. Rees-Jones & Taubinsky (2020) document that misperception is partly systematic: “ironing” taxpayers tend to perceive their marginal rate as closer to their average rate, which in a progressive system means E[δi]<0E[\delta_i] < 0 (underestimation). Decomposing misperception into bias μδ=E[δi]\mu_\delta = E[\delta_i] and noise σ2=Var(δi)\sigma^2 = \text{Var}(\delta_i), the DWL formula generalizes to include a bias term: E[DWL]12εwh(μδ2+σ2)/(1τ)E[\text{DWL}] \approx \frac{1}{2}\varepsilon w h^* (\mu_\delta^2 + \sigma^2)/(1-\tau). If systematic bias is present, the total welfare cost would be larger than the mean-zero case studied here. The calibrated σ=0.12\sigma = 0.12 already incorporates both systematic and idiosyncratic components of the cross-sectional dispersion in perceived rates, so it partially captures the bias channel. Specifically, the calibrated σ=0.12\sigma = 0.12 is best interpreted as the root-mean-square error (RMSE) of the misperception distribution, since it is derived from the cross-sectional dispersion of (perceived - actual) rates which includes any systematic component. Under this interpretation, σ2=μδ2+Var(δi)\sigma^2 = \mu_\delta^2 + \text{Var}(\delta_i), and the mean-zero DWL formula 12εwhσ2/(1τ)\frac{1}{2}\varepsilon w h^* \sigma^2 / (1-\tau) remains correct with σ\sigma interpreted as the RMSE rather than the standard deviation of mean-zero noise.

Preferences are quasilinear in consumption with isoelastic labor disutility:

U(C,h)=Cψh1+1/ε1+1/ε,U(C, h) = C - \psi \frac{h^{1 + 1/\varepsilon}}{1 + 1/\varepsilon},

where CC is consumption, hh is hours of labor, ε>0\varepsilon > 0 is the Frisch elasticity of labor supply, and ψ>0\psi > 0 is a scale parameter governing the disutility of work. Quasilinear preferences eliminate income effects on labor supply, so the Frisch elasticity coincides with the uncompensated and compensated elasticities. This tractability isolates the substitution-effect channel through which tax misperception distorts behavior, as discussed below.

The worker’s budget constraint is

C=w(1τ)h+v,C = w(1 - \tau) h + v,

where vv is a lump-sum transfer (or non-labor income). Because utility is linear in CC, the transfer vv does not affect the labor supply decision---it shifts consumption one-for-one but leaves the first-order condition for hours unchanged. This separation is essential: it means that misperception of τ\tau affects only the hours margin, not the marginal utility of income.

Optimal labor supply

A worker who correctly perceives her tax rate τ\tau maximizes UU by setting the marginal benefit of an hour of work equal to its marginal cost:

w(1τ)=ψh1/ε.w(1 - \tau) = \psi \, h^{1/\varepsilon}.

Solving for optimal hours:

h(τ)=(w(1τ)ψ)ε.h^*(\tau) = \left( \frac{w(1 - \tau)}{\psi} \right)^\varepsilon.

Hours are decreasing in the tax rate and increasing in the wage, with the Frisch elasticity ε\varepsilon governing the responsiveness of labor supply to the net-of-tax wage.

Misperceived labor supply

A worker who perceives the tax rate to be τ^=τ+δ\hat{\tau} = \tau + \delta solves the same first-order condition but evaluated at τ^\hat{\tau}:

h(τ^)=(w(1τ^)ψ)ε=(w(1τδ)ψ)ε.h(\hat{\tau}) = \left( \frac{w(1 - \hat{\tau})}{\psi} \right)^\varepsilon = \left( \frac{w(1 - \tau - \delta)}{\psi} \right)^\varepsilon.

When δ>0\delta > 0 the worker overestimates her tax rate and undersupplies labor; when δ<0\delta < 0 she underestimates the rate and oversupplies labor. In both cases, utility evaluated at the true tax rate τ\tau is lower than it would be at the optimum h(τ)h^*(\tau).

Individual deadweight loss

Define the individual deadweight loss from a misperception of magnitude δ\delta as the utility gap between optimal and misperceived behavior, with consumption evaluated at the true tax rate in both cases:

DWL(δ)=U(h(τ))U(h(τ+δ)).\text{DWL}(\delta) = U\bigl(h^*(\tau)\bigr) - U\bigl(h(\tau + \delta)\bigr).

Here U(h)U(h) is shorthand for indirect utility at hours hh given the true budget constraint: U(h)=w(1τ)h+vψh1+1/ε/(1+1/ε)U(h) = w(1-\tau)h + v - \psi h^{1+1/\varepsilon}/(1+1/\varepsilon). Since h(τ)h^*(\tau) maximizes this expression, DWL(δ)0\text{DWL}(\delta) \geq 0 for all δ\delta, with equality only at δ=0\delta = 0.

Second-order approximation

To obtain a closed-form expression for expected DWL, expand U(h)U(h) around the optimum hh^*. Write Δh=h(τ^)h(τ)\Delta h = h(\hat{\tau}) - h^*(\tau) for the hours distortion.

The first derivative of indirect utility with respect to hours is

Uh=w(1τ)ψh1/ε,U_h = w(1 - \tau) - \psi \, h^{1/\varepsilon},

which equals zero at h=hh = h^* by the first-order condition. The second derivative is

Uhh=ψεh(1/ε1)=w(1τ)εh,U_{hh} = -\frac{\psi}{\varepsilon} h^{* \, (1/\varepsilon - 1)} = -\frac{w(1-\tau)}{\varepsilon \, h^*},

where the second equality substitutes ψ=w(1τ)/h1/ε\psi = w(1-\tau) / h^{*\,1/\varepsilon} from the first-order condition. Since Uh=0U_h = 0 at the optimum, the Taylor expansion gives

DWL(δ)12Uhh(Δh)2.\text{DWL}(\delta) \approx \frac{1}{2} |U_{hh}| (\Delta h)^2.

Next, compute Δh\Delta h. Since h(τ^)=(w(1τδ)/ψ)εh(\hat{\tau}) = (w(1 - \tau - \delta)/\psi)^\varepsilon, a first-order expansion around δ=0\delta = 0 yields

Δhεhδ1τ.\Delta h \approx -\varepsilon \, h^* \frac{\delta}{1 - \tau}.

Substituting both expressions:

DWL(δ)12w(1τ)εhε2h2δ2(1τ)2=12εwhδ21τ.\text{DWL}(\delta) \approx \frac{1}{2} \cdot \frac{w(1-\tau)}{\varepsilon \, h^*} \cdot \varepsilon^2 h^{*2} \frac{\delta^2}{(1-\tau)^2} = \frac{1}{2} \varepsilon \, w \, h^* \frac{\delta^2}{1-\tau}.

The deadweight loss is quadratic in the misperception δ\delta: small errors are relatively harmless, but large errors are disproportionately costly. This convexity is central to the welfare analysis.

Expected deadweight loss

Taking expectations over δN(0,σ2)\delta \sim N(0, \sigma^2) and using E[δ2]=σ2E[\delta^2] = \sigma^2:

E[DWL]=12εwhσ21τ.E[\text{DWL}] = \frac{1}{2} \varepsilon \cdot w \cdot h^* \cdot \frac{\sigma^2}{1-\tau}.

Since whw \cdot h^* equals earnings, this can be expressed as a fraction of earnings:

E[DWL]earnings12εσ21τ.\boxed{\frac{E[\text{DWL}]}{\text{earnings}} \approx \frac{1}{2} \varepsilon \cdot \frac{\sigma^2}{1-\tau}.}

This is the paper’s core formula. It gives the welfare cost of tax misperception as a share of labor income, depending on three key parameters: the Frisch elasticity, the variance of misperception, and the tax rate.

Approximation accuracy

Because the formula is a second-order Taylor expansion of utility around the optimum, it is exact only for infinitesimal δ\delta and may diverge for large misperceptions. To assess its accuracy over the parameter ranges used in this paper, I compare the analytical approximation to a Monte Carlo estimate (500,000 draws of δN(0,σ2)\delta \sim N(0, \sigma^2), evaluating exact utility at each draw). At the baseline calibration (ε=0.33\varepsilon = 0.33, σ=0.12\sigma = 0.12, τ=0.30\tau = 0.30), the analytical formula understates the Monte Carlo estimate by 1.1%. At the most extreme corner of the sensitivity grid (ε=0.50\varepsilon = 0.50, σ=0.15\sigma = 0.15, τ=0.43\tau = 0.43), where σ/(1τ)=0.26\sigma/(1-\tau) = 0.26, the approximation understates the true expected DWL by 4.7%. The closed-form expression is therefore reliable across the entire parameter space considered here.

Discussion of the formula

The formula reveals several comparative statics.

Quadratic in σ\sigma. Doubling the standard deviation of misperception quadruples the expected welfare cost. This convexity implies that reducing misperception from σ=0.15\sigma = 0.15 to σ=0.10\sigma = 0.10 (a 56% reduction in σ2\sigma^2) lowers DWL by more than reducing from σ=0.10\sigma = 0.10 to σ=0.05\sigma = 0.05 (a 75% reduction in σ2\sigma^2 but from a lower base).

Linear in ε\varepsilon. More elastic labor supply amplifies the cost of misperception because workers with elastic supply distort their hours more in response to a given perceived price change. If labor supply were perfectly inelastic (ε=0\varepsilon = 0), misperception would have no welfare cost at all---workers would supply the same hours regardless of their beliefs about tax rates. The empirical magnitude of ε\varepsilon is therefore a key input to the calibration.

A note on elasticity concepts is warranted. The Frisch elasticity ε\varepsilon in the formula governs the responsiveness of hours to the net-of-tax wage, holding the marginal utility of wealth constant. Under quasilinear preferences, the Frisch, compensated (Hicksian), and uncompensated (Marshallian) elasticities all coincide because there is no income effect. The elasticity of taxable income (ETI) is a broader concept that captures responses on all margins---hours, effort, avoidance, and evasion---and is therefore typically larger than the hours elasticity. The baseline calibration uses ε=0.33\varepsilon = 0.33, which corresponds to the hours-only Frisch elasticity from Chetty (2012). If misperception also distorts avoidance and reporting decisions, the relevant elasticity would be closer to the ETI (approximately 0.25--0.50 per Saez et al. (2012)), and the welfare costs could be correspondingly larger or smaller depending on the specific margin. The present analysis isolates the labor supply channel for transparency.

Denominator (1τ)(1 - \tau). Higher tax rates amplify the welfare cost of misperception. This occurs because the same absolute misperception δ\delta represents a larger proportional change in the net-of-tax wage when τ\tau is high. A 10-percentage-point misperception shifts the net-of-tax rate from 70% to 60% (a 14% change) when τ=0.30\tau = 0.30, but from 40% to 30% (a 25% change) when τ=0.60\tau = 0.60.

Connection to Harberger triangles. The formula is a direct application of the Harberger approximation to the deadweight loss of taxation, except that the distortionary “wedge” is not the tax itself but the misperception δ\delta. Just as the traditional DWL of a tax τ\tau is approximately 12ετ2/(1τ)\frac{1}{2}\varepsilon \tau^2 / (1-\tau) times earnings, the DWL of misperception replaces τ2\tau^2 with σ2\sigma^2. The structural parallel clarifies that misperception generates a welfare loss with the same second-order form as a tax distortion, though unlike a tax, misperception generates no offsetting revenue.

Social planner extension

Consider a utilitarian social planner who chooses a linear tax rate τ\tau and finances a uniform lump-sum transfer (demogrant) vv from the resulting revenue. The government budget constraint requires

v=τNi=1Nwihi.v = \frac{\tau}{N} \sum_{i=1}^{N} w_i \, h_i.

The social welfare function uses inverse-wage weights ωi=wˉ/wi\omega_i = \bar{w}/w_i to create a redistribution motive, so W=(1/N)iωiUiW = (1/N) \sum_i \omega_i U_i. Workers with below-average wages receive higher weight, reflecting the standard utilitarian motive for progressive taxation. Under perfect information, these weights generate an interior optimal tax rate that balances redistribution gains against efficiency losses.

The disutility scale ψ\psi is normalized to one. Since rescaling ψ\psi by any constant λ\lambda multiplies all hours, all revenues, and all utility levels by the same factor (1/λ)ε(1/\lambda)^\varepsilon, the optimal tax rate τ\tau^* is invariant to the choice of ψ\psi. The calibrated optimal tax therefore depends only on the wage distribution and the elasticity, not on the normalization of hours.

Under perfect information (σ=0\sigma = 0), each worker chooses hi=h(τi)h_i = h^*(\tau_i) and the planner selects τ\tau to balance the equity gains from redistribution against the efficiency losses from labor supply distortion.

When workers misperceive the tax rate (σ>0\sigma > 0), the planner faces an additional cost. Workers choose h(τ^i)h(\hat{\tau}_i) instead of h(τi)h^*(\tau_i), reducing both private welfare and government revenue (since misallocated labor reduces the tax base). The expected per-worker welfare loss from misperception, 12εwhσ2/(1τ)\frac{1}{2}\varepsilon w h^* \sigma^2 / (1-\tau), is increasing in τ\tau through the (1τ)(1-\tau) denominator. This means that at the margin, raising the tax rate imposes a larger misperception cost when rates are already high.

The optimal tax rate τ\tau^* is therefore decreasing in σ\sigma. As misperception worsens, the planner optimally reduces the tax rate, accepting less redistribution in exchange for lower misperception-induced deadweight loss. In the extreme of perfect information (σ=0\sigma = 0), the planner chooses the standard optimal tax; as σ\sigma grows large, the optimal rate converges toward zero because the misperception cost eventually dominates the redistribution benefit.

The model predicts that a reduction in σ\sigma permits higher τ\tau at constant total DWL. Whether real-world measures to reduce misperception are compatible with maintaining or increasing progressivity is an empirical question beyond the scope of this model.

Why quasilinear preferences, not Cobb-Douglas?

A natural alternative specification is Cobb-Douglas utility over leisure LL and consumption CC:

U(L,C)=LαCβ,U(L, C) = L^\alpha C^\beta,

with time constraint L+h=TL + h = T. Under this specification, optimal leisure is L=αT/(α+β)L^* = \alpha T / (\alpha + \beta) when the worker receives no transfers, and this expression is independent of the net-of-tax wage w(1τ)w(1-\tau). The income and substitution effects of a wage change exactly cancel, producing a knife-edge result: hours are invariant to the perceived tax rate, so misperception has zero welfare cost.

This invariance is a well-known property of Cobb-Douglas preferences Block et al., 1980, and while it provides a useful theoretical benchmark, it is empirically implausible as a description of labor supply responses to taxation. A large body of evidence reviewed by Chetty (2012) documents positive compensated labor supply elasticities, meaning that the substitution effect dominates the income effect on the relevant margins. The quasilinear specification isolates this substitution effect by eliminating the income effect entirely, making it the natural workhorse for analyzing how tax misperception distorts labor supply.

More precisely, the welfare cost of misperception operates through the substitution effect: a worker who overestimates her tax rate reduces labor supply because she perceives a lower return to work, not because she feels wealthier. By shutting down income effects, the quasilinear model isolates this channel cleanly. Any utility specification with positive compensated elasticity would generate qualitatively similar results; the quasilinear form simply yields the most transparent closed-form expressions.

A further limitation of the model is its focus on the intensive margin of labor supply. Most workers cannot continuously adjust their hours; the majority of employees work fixed schedules set by employers Keane & Rogerson, 2012. However, the intensive-margin framework can be reinterpreted as capturing responses at the extensive margin (employment entry and exit) or along the effort margin, both of which are governed by similar elasticities. Moreover, the minority of workers with flexible hours---the self-employed, gig workers, and those choosing between part-time and full-time positions---may have substantially higher effective elasticities than the population average, partially offsetting the lower responsiveness of fixed-hours workers. The calibrated ε=0.33\varepsilon = 0.33 represents a blend of intensive and extensive margin responses, as discussed by Chetty (2012).

Misperception versus inattention

The model assumes that workers actively optimize against a perceived tax rate τ^\hat{\tau}, which may differ from the true rate. An alternative behavioral model posits that workers are inattentive to the tax rate---they assign zero or near-zero weight to the marginal rate in their labor supply decisions and instead follow heuristics or social norms (e.g., working 40 hours because the employer expects it). Under the inattention model, the welfare cost depends on the degree of under-reaction to tax incentives (the “attention parameter” θ\theta in the framework of Chetty et al. (2009)), not on the variance of perceived rates.

The distinction matters empirically. Survey evidence on perceived rates Rees-Jones & Taubinsky, 2020, Gideon, 2017 reveals that taxpayers do form beliefs about their marginal rates---they are not ignoring the rate but rather estimating it with error. The behavioral responses documented by Chetty & Saez (2013), in which EITC recipients adjusted their earnings after receiving information about marginal incentives, suggest that workers do optimize against perceived rates rather than ignoring them entirely. The misperception framework adopted here is therefore the more appropriate model for the population segments that form and act on tax rate beliefs, while the inattention framework may apply to workers who delegate all tax decisions to accountants or employers. The two models are complements rather than substitutes, and a complete welfare accounting would include both channels.

References
  1. Rees-Jones, A., & Taubinsky, D. (2020). Measuring “Schmeduling.” Review of Economic Studies, 87(5), 2399–2438. 10.1093/restud/rdz045
  2. Chetty, R. (2012). Bounds on elasticities with optimization frictions: A synthesis of micro and macro evidence on labor supply. Econometrica, 80(3), 969–1018. 10.3982/ECTA9043
  3. Saez, E., Slemrod, J., & Giertz, S. H. (2012). The elasticity of taxable income with respect to marginal tax rates: A critical review. Journal of Economic Literature, 50(1), 3–50. 10.1257/jel.50.1.3
  4. Block, M. K., Kramer, J. M., & Wagner, J. M. (1980). Labor supply under wage uncertainty. Economic Inquiry, 18(2), 261–269. 10.1111/j.1465-7295.1980.tb00573.x
  5. Keane, M., & Rogerson, R. (2012). Micro and macro labor supply elasticities: A reassessment of conventional wisdom. Journal of Economic Literature, 50(2), 464–476. 10.1257/jel.50.2.464
  6. Chetty, R., Looney, A., & Kroft, K. (2009). Salience and taxation: Theory and evidence. American Economic Review, 99(4), 1145–1177. 10.1257/aer.99.4.1145
  7. Gideon, M. (2017). Do individuals perceive income tax rates correctly? Public Finance Review, 45(1), 97–117. 10.1177/1091142115615670
  8. Chetty, R., & Saez, E. (2013). Teaching the tax code: Earnings responses to an experiment with EITC recipients. American Economic Journal: Applied Economics, 5(1), 1–31. 10.1257/app.5.1.1