Powerless: gains from trade when firm productivity is not Pareto distributed Marco Bee∗

Stefano Schiavo†

August, 2014

Abstract Most trade models featuring heterogeneous firms assume a Pareto (or power-law) productivity distribution, on the basis that it provides a reasonable representation of the data and, moreover, because of its analytical tractability. On the other hand, both theoretical and empirical works show that the characteristics of the productivity distribution crucially affect the estimated gains from trade in presence of firm heterogeneity. This paper investigates what happens to gains from trade when one departs from the assumption of a Pareto productivity distribution and opts instead for a longnormal distribution. In so doing, we complement recent evidence by Head et al. (2014) by offering a thorough comparison between results obtained with the two alternative assumptions, and their sensitivity to a number of key parameters. We find that thicker tails in the productivity distribution are associated with larger welfare gains from trade liberalization; the welfare gains estimated under a Pareto distribution tend to be both larger and more sensitive to variations in key parameters of the model, such as the elasticity of substitution. As for the additional contribution of heterogeneity to welfare, we find that the more heterogeneous is firm productivity, the smaller is the difference with respect to a standard model where all firms are identical. Keywords: lognormal, Pareto, trade, welfare, firm heterogeneity JEL Codes: F10, F12

∗ †

University of Trento; e-mail: [email protected] University of Trento and OFCE-DRIC; e-mail: [email protected]

1

1

Introduction

This paper investigates what happens to estimated gains from trade when one departs from the standard assumption of a Pareto productivity distribution that characterizes most of the existing literature. In so doing, we complement recent evidence by Head et al. (2014) by offering a thorough comparison between results obtained assuming either a Pareto or a lognormal distribution, and their sensitivity to a number of key parameters. The evaluation of the gains from trade accruing from models featuring firm heterogeneity (in productivity, as in Melitz 2003) has been capturing quite a lot of attention in the recent literature (Arkolakis et al. 2012, di Giovanni & Levchenko 2013, Melitz & Redding 2013, Head et al. 2014, see for instance). The issue has to do both with the comparison between standard (homogeneous firm) models and the more recent vintage of new-new trade theory, i.e. with the measurement of the additional gains brought about by heterogeneity (Arkolakis et al. 2012, Melitz & Redding 2013), and with the sensitivity of gains to the degree (and the shape) of heterogeneity (di Giovanni & Levchenko 2013, Head et al. 2014). Our notional journey starts from Arkolakis et al. (2012) and Melitz & Redding (2013). The former assess the additional gains from trade associated with firm heterogeneity. On top of the standard welfare gains already present in the new-trade literature Krugman (1980), heterogeneity adds an additional source of gains in the form of within-industry reallocation of market shares, forcing low productivity firms to exit and more efficient ones to expand, thus lifting aggregate productivity. Arkolakis et al. (2012) conclude that, in fact, this new insight does not alter much the evaluation of the benefits of trade, as the additional benefit associated with heterogeneity are small. Melitz & Redding (2013) investigate the issue further, and show that the dismal result by Arkolakis et al. (2012) depends on the specific way these authors use to compare models with homogeneous or heterogeneous productivity distributions. Using a different approach, they find that the differences in aggregate welfare are quantitatively important.1 Another interesting take at the issue of the welfare gains from trade is provided by di Giovanni & Levchenko (2013). They use a multi-country model to investigate the welfare impact of a series of reductions in fixed and variable trade costs under different degrees of heterogeneity in firm productivity. In particular, they shows that the degree of heterogeneity —the shape parameter of the Pareto distribution in their paper— significantly affects both the magnitude and the composition of gains from trade, as well as the benefits accruing from a reduction in fixed versus variable trade costs. The reason 1 More specifically, the “macro” calibration by Arkolakis et al. (2012) requires the two models to have the same trade elasticity with respect to trade costs, and the same domestic trade share, whereas the “micro” approach taken by Melitz & Redding (2013) only changes the degree of heterogeneity in the models, taking the homogeneous case as a limit (degenerate) case of the more general heterogeneous firm specification. See Melitz & Redding (2013) for more details on the two approaches.

2

for this is that productivity dispersion affects export participation and thus the importance of changes at the extensive margin of trade. A reduction in trade costs shifts the export productivity threshold and the number of firms that can successfully export: when the upper tail of the productivity distribution is thick, then marginal new exporters are much smaller than firms already exporting and thus have a very limited impact on welfare. Indeed, when comparing a model featuring a Zipf’s productivity distribution with a counterfactual economy where the distribution is less dispersed (the shape parameter moves from 1 to 2), they find that —in the latter case— gains from a reduction in fixed costs are an order of magnitude larger, while the impact of a reduction in variable costs an order of magnitude smaller. The question then arises as to whether choosing a different type of productivity distribution may push the argument by di Giovanni & Levchenko (2013) even further. Rephrasing, we are interested in understanding whether, when it comes to quantifying welfare gains and the potential benefits accruing from a reduction in fixed and/or variable trade costs, the assumption that productivity follows a Pareto distribution may no longer be an innocuous simplifying assumption, but rather crucially affect the results. The issue is rooted into the old question of the most appropriate distributional assumption to model firm size and productivity (the two being closely related in the standard monopolistic competition cum CES preferences that represents the backbone of Melitztype models). This is a classical theme in applied industrial organization, which dates back at least to Gibrat (1931), who claims that a lognormal distribution provides a very good representation of data on French firms, and postulates a model of multiplicative growth (where growth is independent of size) that generates such a distribution. Indeed, the lognormal and the Pareto distribution have been the two main competing alternatives, with the consensus among the academic community somehow swinging over the years (Steindl 1965, Ijiri & Simon 1974, 1977, Gabaix 1999, Axtell 2001, Luttmer 2007, Bee et al. 2014, see for instance). In the trade literature, most if not all papers assume a Pareto or Zipf’s productivity distribution. This choice is motivated on two main bases: first that the Pareto is tractable and very convenient from a modeling point of view, allowing for closed-form solutions; second, that it provides a good approximation of the data, at least for US firms. To substantiate this latter claim, reference is made to the work by Axtell (2001), who find how a Zipf’s law provides a good representation for the entire distribution of firms in his sample. More recent evidence, however, suggests that the Pareto distribution does not fit very well the whole size/productivity distribution, but only the upper tail. And even this is debatable both from a methodological (Virkar & Clauset 2012) and an empirical point of view (Rossi-Hansberg & Wright 2007, Bee et al. 2014, Head et al. 2014).2 2

The main methodological issue has to do with the common practice of binning the data before fitting a distribution. Virkar & Clauset (2012) forcefully show that this is an important source of bias.

3

Head et al. (2014) provides convincing evidence in favor of the lognormal distribution and, building on this, explore what happens to gains from trade when abandoning the Pareto assumption in favor of lognormality. They claim that the welfare effect can be twice as large as under the Pareto assumption: given that lognormal is normally though of as featuring lighter tails than the Pareto, this result by Head et al. (2014) seems at odds with the evidence put forward by di Giovanni & Levchenko (2013). This paper complement the analysis by i) providing a thorough comparison of the welfare effect of trade liberalization under alternative distributional assumptions; ii) looking at the additional welfare effect of heterogeneity; iii) investigating the sensitivity of results to key parameters of the model such as the elasticity of substitution, the degree of heterogeneity in firm size, the magnitude of (fixed and variable) trade costs. The rest of the paper is organized as follows: the next section presents a brief overview of the theoretical setup we refer to, largely based on the work by Melitz & Redding (2013); Section 3 represents the core of the paper and discusses our results. Section 5 link our findings to the existing literature and concludes.

2

Theoretical background

Melitz & Redding (2013) presents a simple two-country model from which they derive a series of general results relative to the welfare gains of trade liberalization that are independent of any distributional assumption. The paper assumes two symmetric countries populated by a continuum of heterogeneous firms that incur a sunk entry cost fe before they can discover their productivity φ, which is sampled from a common and invariant distribution g(φ). Production entails fixed costs (fd ) and a constant marginal cost that depends on productivity. If international trade is allowed, i.e. when countries move from autarky to free trade, then exporting firms face also a fixed export cost (fx ) and an iceberg variable trade cost (τ ). All costs are expressed in unit of labor, which is the sole factor of production. Demand is modeled by means of the usual CES preferences giving rise to the standard pricing rule for firms, namely a markup over marginal costs. The zero profit conditions in each market define the productivity cutoffs for serving domestic and foreign consumers: operating profits must cover fixed costs. Melitz & Redding (2013) compute the gains from trade (GFT) by comparing welfare in autarky W A with welfare under free trade W T , and show that irrespective of any assumption on the shape of the productivity distribution, GFT can be computed as the ratio between the productivity cutoffs for serving the domestic market under the different regimes:

φTd WT = WA φA d 4

where the superscript T and A identify the trade and autarky regimes, and φd is the minimum productivity level required to successfully serve the domestic market. We exploit this result to investigate what happens to GFT when one assumes different productivity distributions g(φ), and different values for parameters within the same distribution. In particular, we focus on comparing the results obtained assuming a Pareto versus a lognormal distribution for firm productivity. From a practical point of view, the main difference is that while the Pareto case allows one to get closed-form solutions for the threshold productivity levels, this is not possible in the lognormal case, so that we have to revert to numeric methods. Appendix ?? provides the readers with the full derivation of the model in the lognormal case. As for the homogeneous case, Melitz & Redding (2013) treat it as a special (degenerate) case of the more general model featuring heterogeneous productivity. In particular, they assume that after paying the sunk entry cost fe firms draw a productivity that is either zero or positive (φ¯d ) with exogenous probabilities. The calibration of the homogeneous productivity model is such that the autarky equilibria in the two models are equivalent. ¯ d) This means equating both the probability of successful entry in the homogeneous (1 − G ¯ ˜A and heterogeneous case (1 − G(φA d )), and the average productivity levels (φd = φd , where φ˜A is a weighted average of firm productivity in the heterogeneous case). d

3 3.1

Empirical analysis Selection of model parameters

To evaluate the GFT we need to make assumptions about the distribution of productivity and its key parameters (the shape parameter k and the minimum productivity φmin for the Pareto case, the standard deviation σ for the lognormal distribution), as well as about the parameters that determine the equilibrium, namely sunk and entry costs (fe ,fd ,f, x), transport costs (τ ), and the elasticity of substitution (ǫ). For the sake of comparability with existing works, we start from values of the parameters equal or similar to those adopted by Melitz (2003) and Head et al. (2014), and subsequently investigate the robustness of results to different values. The first thing to note is that the standard combination of CES preferences and monopolistic competition cum yields a strict relationship between firm size (as measured by sales) and productivity. In fact, sales of a firm with productivity φ can be expressed as: s(φ) = RP ǫ−1p(φ)1−ǫ where R is total expenditures, P the ideal price index, p(φ) is the equilibrium price set by a firm of productivity φ. Hence, sales are a power function of productivity. As noted by Head et al. (2014), this means that, for both the Pareto and the lognormal 5

distributions, sales follow the same distribution as productivity with minor changes to the parameters. This relationship is exploited in the literature because data on firm sales are more reliable that data on productivity; hence, one can estimate the parameters of the Pareto or lognormal distribution using sales data, and then derive the relevant parameters for the productivity distribution using the following simple relationships: k = k sales · ǫ − 1 (Pareto);

σ=

σ sales (ǫ − 1)

(lognormal)

. Melitz & Redding (2013) set k = 4.25 and ǫ = 4, implying a shape parameter for the distribution of firm size k sales = 1.42 roughly halfway from the two cases investigated by di Giovanni & Levchenko (2013), who compare GFT when k sales = 1 Vs k sales = 2. Other papers, namely di Giovanni & Levchenko (2013) and Head et al. (2014), take the estimated parameters of the Pareto and longormal distribution of firm sales as given, setting k sales = 1.06 as in Axtell (2001), and σ sales = 2.39 as in Head et al. (2014).3 From these values, one derives the parameters of the productivity distribution depending on the choice of the elasticity of substitution, which therefore plays a very important role. Both Melitz & Redding (2013) and Head et al. (2014) set ǫ = 4, although the sensitivity of the results to this parameter is one of the main points we address below. For the remaining parameters, Melitz & Redding (2013) choose values allowing them to match the stylized facts about the intensive and the extensive margin of trade in the US economy reported by Bernard et al. (2007). In particular, setting fe = 0.0145 and fx = 0.545 yields an export participation rate equal to 18%, while setting τ = 1.83 allows them to match the average fraction of exports in firm sales (14%). The latter calibration τ 1−ǫ exploits the relationship 1+τ 1−ǫ = export intensity. The Pareto scale parameter φmin and the fixed production cost fd are both set equal to 1, but the value has no bearing on the results. The same applies to the mean of the lognormal distribution, which does not enter the computations. Head et al. (2014) show that, under lognormality, the magnitude of the GFT are not invariant to the value of fe : as a consequence, we also experiment with their preferred value (0.5).

4

Gains from trade

We start our analysis by replicating the baseline exercises performed by Melitz & Redding (2013), in which GFT are computed for different values of τ and fx comparing the homogeneous and heterogeneous cases: in Figures 1 and 4 show two Pareto distributions 3

The former value is by far the most popular point estimate in the literature that assumes a Pareto distribution. On the other hand, Head et al. (2014) provide convincing evidence that the lognormal provides a very good fit to their data on export sales by French manufacturing firms. Similar results are reported also by Bee et al. (2014).

6

Welfare gains as a function of τ (σ = 0.797, ε = 4, k = 3.18, k = 4.25) 1

2

1.25 Logn het. Logn hom. Pareto het. (k=4.25) Pareto hom. (k=4.25) Pareto het. (k=3.18) Pareto hom. (k=3.18)

τ=1.83 1.2

1.15

1.1

1.05 X: 1.868 Y: 1.018

1 1

1.5

2

2.5

variable trade cost (τ)

Figure 1: Welfare gains as a function of τ in the lognormal and Pareto case. with different shape parameters, and a lognormal case. We use two values for the Pareto shape parameter, k1 = 4.25 (as in Melitz & Redding 2013), and k2 = 3.18 (resulting from assuming k sales = 1.06 and ǫ = 4). Figure 1 shows the welfare impact of moving from autarky to a free case scenario characterized by a range of finite variable cost values. Results for k = 4.25 are in line with Melitz & Redding (2013): the heterogeneous models always produce larger GFT, and in the heterogeneous case, large trade costs imply that that the representative firm does not find it profitable export. When comparing the three distributions we see the emergence of a ranking among the three cases: fatter tails (more heterogeneity) are associated with larger GFT, whereas the lognormal and the Pareto with k = 4.25 are very close. Furthermore, we see that fatter tails in the productivity distribution yields GFT that are more sensitive to variations in iceberg transport costs. In fact, as detailed in Table 1, both the range of GFT and their standard deviations are smaller when moving from the Pareto with the heaviest tail, to the second Pareto distribution, to the lognormal. Table 1 also reports the GFT computed for a specific value of iceberg transport costs (τ = 1.83) in the homogeneous and heterogeneous versions of the model for the three productivity distributions. The additional GFT associated with firm heterogeneity are extremely small in the case of the Pareto distribution with thicker tails (k = 3.18), whereas they are relatively important (1.8–2.1%) in the other two cases. This is consistent with the notion that a reduction in transport costs mainly affects the intensive margin of trade, so that heterogeneity has little effect on GFT (di Giovanni & Levchenko 2013). Indeed, the level of GFT is larger when the distribution of firms is more skewed, since a reduction

7

Table 1: Gains from trade (relative to autarky) generated by a reduction of variable trade costs: comparison between heterogeneous and homogeneous case and across different degrees of heterogeneity. autarky to τ = 1.83 ∆τ : heterog. case homog. Pareto (k = 3.18) Pareto (k = 4.25) lognormal Values in percentage

heterog.

range

st. dev.

4.5 2.2 1.9

23.34 20.89 19.67

6.73 5.91 5.89

4.1 0.1 0 points.

in τ mainly works through boosting trade of the few very large firms that populate the economy. Figure 4 replicates the same exercise mimicking the effect of a trade liberalization that moves the system from autarky to a free trade equilibrium characterized by τ = 1.83 and a varying degree of fixed costs, ranging between 0 and 1 (as in Melitz & Redding 2013). We can see that the levels of GFT are much smaller than before, roughly ranging between 1 and 5 percent in the three heterogeneous models, but also that the impact of heterogeneity is more clear cut. As opposed as in Figure 1 above, here fatter tails in the distribution of productivity are associated with less sensitivity of GFT to fixed costs. In fact, as noted by di Giovanni & Levchenko (2013), fatter tails imply a bigger difference between the largest exporters firms and the marginal firm which represent the extensive trade margin of a variation in fixed costs. The figure also confirms a clear-cut hierarchy for the magnitude of GFT, with larger welfare effects associated with fattertailed productivity distributions. Hence, estimations of GFT seem to vary significantly depending on the distributional assumptions. Sensitivity of the lognormal model to variations in σ. So far, we have assumed that the lognormal distribution is characterized by a specific dispersion parameter (standard deviation), setting σ = 0.797 as in in Head et al. (2014). Now, we explore the sensitivity of GFT to variation in the degree of firm heterogeneity as measured by σ, letting it varying between 0.2 and 1.8.4 Results are summarized in Figure 3: computations are performed keeping all the other parameters at their benchmark values (τ = 1.83, fe = 0.0145, fx = 0.545, ǫ = 4). As one could expect from the previous discussion, GFT increase monotonically with the degree of productivity dispersion σ, which in the case of a lognormal distribution 4

Given the value of other parameters, it turns out that for σ ' 2, finding the equilibrium productivity threshold becomes numerically difficult, so that the estimated welfare gains are unreliable. The range of values taken byσ is however reasonably large to provide us with a good representation of the model behavior.

8

Welfare gains as a function of f (σ = 0.797, k = 3.18, k = 4.25) x

1

2

1.06 fx = 0.545 1.05 Pareto (k=3.18) − heterogeneous

Pareto (k=3.18) − homogeneous

1.04

1.03 Pareto (k=4.25) − heterogeneous 1.02

1

Pareto (k=4.25) − homogeneous

Logn het. Logn hom. Pareto het. (k=4.25) Pareto hom. (k=4.25) Pareto het. (k=3.18) Pareto hom. (k=3.18)

1.01

0

0.1

0.2

logN − heterogeneous

logN − homogeneous

0.3

0.4

0.5

0.6

0.7

0.8

0.9

1

fixed export cost (fx)

Figure 2: Welfare gains as a function of fx in the lognormal and Pareto case. Transport costs τ = 1.83.

implies thicker tails. Variations can be important, as GFT pass from zero in the case of a very concentrated distribution (σ = 0.2) to 5% when σ = 1.8. As a reference, consider that GFT in the Pareto case are estimated in the range of 2.2% (k = 4.25) and 4.5% (k = 3.18). Sensitivity of GFT to variations in ǫ. Once we consider that, in both the Pareto and the lognormal case, the parameter governing the degree of heterogeneity in productivity (k and σ) depend on the value taken by the elasticity of substitution (because of the relationship between sales and productivity is mediated by it), and given that the latter is notoriously difficult to pin down, the question arises as to how sensitive are GFT to the specific choice of ǫ. In fact, (Behrens et al. 2012, footnote 8) note that “estimation results for ǫ depend both on the level of aggregation and the estimation method, and vary widely. For example, Hanson (2005) using aggregate U.S. data, obtains about 7 with non-linear least squares and about 2 with GMM. Estimates in Hummels (1999) vary from 2 to 5.26. Using extremely disaggregated data, Broda & Weinstein (2006) estimate several thousand elasticities of substitution, which range, depending on the industry and the level of aggregation, from 1.3 (telecommunication equipments) to 22.1 (crude oil).” 9

Lognormal welfare gains as a function of σ (τ = 1.83, ε = 4, fe=0.0145, fx=0.545) heterogeneous homogenous σ=0.797

1.05

welfare gains

1.04

1.03

1.02

1.01

1 0.2

0.4

0.6

0.8

1 σ

1.2

1.4

1.6

1.8

Figure 3: Welfare gains as a function of σ in the lognormal case. di Giovanni & Levchenko (2013) touch upon this issue by using three different values for ǫ: 4, 6 (their benchmark), and 8. They report larger welfare gains under a lower ǫ, although their are mainly interested with the fact that their main results are unaffected from a qualitative point of view. Figure 4 compares GFT under the Pareto and lognormal assumption when we let ǫ vary and, with it, we modify the degree of firm heterogeneity in the model. The exercise is performed by giving the baseline values to all parameters (τ, fe , fd , fx ), fixing the relevant parameters of the distribution of firm sales (k sales = 1.06 under Pareto, σ sales = 2.39 in the longnormal case) and exploiting the relationship that links the parameters of the distribution of firm size to those of the associated productivity distribution (k = 1.06 · ǫ − 1 and σ = 2.39/(ǫ − 1)). Hence, when the elasticity of substitution varies, so does the degree of heterogeneity.5 We do find that a lower elasticity of substitution is associated with higher GFT (as it implies thicker tails in the productivity distribution), and also find that the Pareto always dominates the lognormal model in term of welfare gains. GFT are very sensible to the choice of ǫ, moving from a 98% (Pareto) or 68% (lognormal) welfare increase when ǫ = 1.7 to virtually no effect when the elasticity of substitution goes above 7 or 8, well 5

Appendix B, which contains additional results not included in the paper, shows what happens to GFT when we let ǫ vary without affecting the degree of heterogeneity in firm productivity.

10

Lognormal and Pareto welfare gains as a function of ε with k/(ε−1)=1.06 and σ(ε−1)= 2.39 lognormal Pareto

1.9 1.8 1.7

welfare gains

1.6 1.5 1.4 1.3 1.2 1.1 1 2

4

6

8 10 elasticity of substitution (ε)

12

14

Figure 4: Lognormal and Pareto welfare gains as a function of ǫ. Variations in ǫ affect the degree of productivity dispersion (k or σ). Fixed and variable trade costs at their baseline values τ = 1.83, fx = 0.545. in the range of plausible values still. The assumption of a Pareto distribution appears to make results more sensitive to the elasticity of substitution: on the one hand GFT rise more steeply when ǫ goes down, on the other hand, GFT remain positive (i.e. non-zero) for a larger span of values. The comparison between the homogeneous and heterogeneous models shows that as ǫ gets smaller and the GFT increase in magnitude, the additional welfare gains due to firm heterogeneity, mainly driven by the extensive trade margin, get less and less relevant (see Figure B2 in the Appendix). Hence, a bit paradoxically, the more heterogeneous is productivity, the less is its contribution to GFT. Simultaneous variation in all model parameters. Variations in the elasticity of substitution not only affect the degree of heterogeneity in firm productivity, but also trade costs.6 In fact, Melitz & Redding (2013) set τ = 1.83 by exploiting the relationship τ 1−ǫ 1+τ 1−ǫ

= 0.14, where the right-hand-side of the equation equals the average share of export

6

Hummels (1999, as quoted by Behrens et al. (2012)) notes that “without knowing ǫ we cannot infer the size of the trade barrier, and without knowing the size of the barrier we cannot infer ǫ.

11

on total sales among US manufacturing firms. Hence, keeping τ fixed while letting ǫ vary, may introduce a bias in the analysis. Figure 4 shows what happens to GFT when we let τ change alongside k and σ when the elasticity of substitution varies. Conceptually, this means we are estimating the welfare effects of a switch from autarky to a free trade equilibrium in which the share of export on total sales is the one we observe in the data. In the analysis we have set export intensity equal to 21.6%, which is the level observed for French manufacturing in 2003 (the same data on which the lognormal distribution is estimated), although this specific choice has little impact on the results. Clearly, if τ adjusts to match the observed share of exporting firms, the magnitude of GFT get substantially reduced, as we are putting a brake on the working of the extensive margin. This is especially clear for low values of ǫ: the maximum gain one get is now in the range of 30–40% in the Pareto case, down from the 98% recorded previously. Head et al. (2014) show that in the lognormal model, fe also has a bearing on the GFT, while this is not the case under Pareto. The dotted line in Figure 4 therefore represents GFT computed setting fe = 0.5. This change, by making entry more difficult, increases the average productivity of operating firms and thus boosts GFT for each level of ǫ.7 Contrary to Head et al. (2014), we do not find an inversion in the ranking between the Pareto and the lognormal model when setting fe = 0.5: this is due to the choice of the shape parameter of the Pareto distribution. In fact, while we stick to the assumption of a nearly Zipf’s law for firm sales (k sales = 1.06), Head et al. use (much) larger values ranging from 1.37 to 4.85. Larger shape parameters, that imply thinner tails in the size and productivity distributions, yields lower GFT. In fact, changing the value of the sunk entry cost fe is likely to impact on export participation, by altering the composition of the population of successful entrants. As a result, if one wants to match the degree of export participation seen in the data, the magnitude of fixed exporting costs has to be adjusted accordingly. More specifically, to (approximately) match an export participation of 21.58% as in the French data, we need to set fx = 1.5 when fe = 0.5. This p Results do not change significantly with respect to Figure 4, although of course now the Pareto model is no longer invariant to a change in fe because fx also changes. In both the lognormal and the Pareto models higher costs are associated with large GFT, but the magnitude and the ranking among GFT delivered by different models remain unaltered (see Figure B3 in the Appendix). 7

A comparison of GFT in the homogeneous and heterogeneous case under lognormality of the productivity distribution (computed at τ = 1.83) shows that the additional benefits associated with firm heterogeneity are greatly reduced when fe grows. In fact, in the baseline case (fe = 0.0145) the welfare effect of trade liberalization goes from 2% in the homogeneous model to 3.2% in the heterogeneous one; when fe = 0.5, on the other hand, GFT move from 3.8 to 4.2%.

12

Lognormal and Pareto welfare gains as a function of ε, f = 0.5 Vs f =0.0145 e

e

1.4 Lognormal (fe=0.0145) Lognormal (fe=0.5) Pareto

welfare gains

1.3

1.2

1.1

1 2

4

6

8

10

12

14

ε

Figure 5: Lognormal and Pareto welfare gains as a function of ǫ: k and σ vary with ǫ; moreover, we also adjust τ in order to match data on export intensity for French manufacturing firms. Impact of a 50% reduction in iceberg transport costs. The last exercise we perform is to evaluate the effect of a 50% reduction in iceberg transport costs (as opposed to a mover from autarky to free trade), when ǫ varies, and k and σ adjust accordingly.8 GFT under Pareto always dominate those under a lognormal productivity distribution. What is more, in the Pareto case we find a monotonic upward-sloping relation linking GFT to ǫ, whereas the relationship is hump-shaped in the longnormal case. Hence, as the elasticity of substitution increases, the distance between the domestic productivity thresholds under the two values of τ first increase, yielding the upward sloping part of the curve, then, for values of ǫ ≅ 3.5 start decreasing thus reducing the welfare impact of a reduction in variable trade costs. The fact that the lognormal model does not yield closed-form solutions makes it difficult to understand the source of this peculiar behavior: one possibility, which we would like to explore in the future, is to use an approximation for the normal CDF that enters the equilibrium condition of the lognormal model and defines the threshold productivity levels. 8

To compute this exploit the fact that

W1T W0T

=

W1T /WA . W0T /WA

13

welfare gains (percentage points)

60

50

40

30

20

10 Lognormal Pareto 0 2

3

4

5 6 7 elasticity of substitution (ε)

8

9

10

Figure 6: Welfare effect of a 50% reduction in iceberg transport costs (τ ) in the lognormal and Pareto cases.

5

Conclusion

Large and persistent heterogeneity among firms has become a central tenet of present-day trade literature, but most existing works have crystallized around a specific shape of firm heterogeneity, postulating that the latter is well described by a Pareto distribution. We investigates what happens to the magnitude of gains from trade when one departs from the standard assumption of a Pareto productivity distribution. We take stock of the existing literature showing that the degree of heterogeneity in firm size and productivity matters a lot for both the magnitude and the composition of the welfare effect of trade liberalization di Giovanni & Levchenko (2013). The reason for this is the relative importance of marginal firms that represent the extensive margin of trade, versus large infra-marginal enterprises. We push the argument one step further and evaluate the effect of choosing a different type of productivity distribution, namely a lognormal distribution. In so doing, we complement recent evidence by Head et al. (2014) by offering a thorough comparison between GFT obtained under a Pareto and a lognormal distribution, as well as the sensitivity of the results to a number of key parameters. We find that GFT increase the fatter the tails of the productivity distribution, and this result carries over to the lognormal case. Our parametrization returns a clear-cut and stable ranking among the various models, with GFT under Pareto always dominating those obtained assuming a lognormal distribution.9 The Pareto assumption yields results that appear both larger and more sensitive to 9

This could change in case we compare a lognormal with large variance with a Pareto distribution characterized by a large shape parameter.

14

the key parameters of the model, most notably the elasticity of substitution, which is notoriously difficult to pin down. On the other hand, a model postulating that firm productivity follows a lognormal distribution does not yield closed-form solutions for the threshold levels of productivity, which are the main determinants of welfare gains. As for the additional welfare effect of heterogeneity relative to classical models where all firms are equal, we find that —maybe paradoxically— the more heterogeneous is firm productivity, the smaller the contribution of heterogeneity to the gains from trade.

References Arkolakis, C., Costinot, A. & Rodriguez-Clare, A. (2012), ‘New trade models, same old gains?’, American Economic Review 102(1), 94–130. Axtell, R. L. (2001), ‘Zipf distribution of U.S. firm sizes’, Science 293(5536), 1818–1820. Bee, M., Riccaboni, M. & Schiavo, S. (2014), Where gibrat meets zipf: Scale and scope of french firms, Discussion Paper Series 2014/3, Department of Economics and Management - University of Trento. Behrens, K., Ertur, C. & Koch, W. (2012), ‘"dual" gravity: Using spatial econometrics to control for multilateral resistance.’, Journal of Applied Econometrics 27(5), 773–794. Bernard, A., Jensen, B., Redding, S. & Schott, P. (2007), ‘Firms in international trade’, Journal of Economic Perspectives 21(3), 105–130. Broda, C. & Weinstein, D. E. (2006), ‘Globalization and the Gains from Variety’, The Quarterly Journal of Economics 121(2), 541–585. di Giovanni, J. & Levchenko, A. A. (2013), ‘Firm entry, trade, and welfare in Zipf’s world’, Journal of International Economics 89(2), 283–296. Gabaix, X. (1999), ‘Zipf’s law for cities: An explanation’, Quarterly Journal of Economics 114(3), 739–67. Gibrat, R. (1931), Les Inegalites Economiques, Sirey, Paris. Hanson, G. (2005), ‘Market potential, increasing returns and geographic concentration’, Journal of International Economics 67, 1–24. Head, K., Mayer, T. & Thoenig, M. (2014), ‘Welfare and trade without Pareto’, Papers and Proceedings of the American Economic Review 104(5), 3010–316. Hummels, D. (1999), Toward a geography of trade costs, GTAP Working Papers 1162, Center for Global Trade Analysis, Purdue University. 15

Ijiri, Y. & Simon, H. (1977), Skew Distributions and the Sizes of Business Firms, North Holland, Amsterdam. Ijiri, Y. & Simon, H. A. (1974), ‘Interpretations of departures from the Pareto curve firm-size distributions’, Journal of Political Economy 82(2), 315–331. Krugman, P. (1980), ‘Scale Economies, Product Differentiation, and the Pattern of Trade’, American Economic Review 70(5), 950–59. Luttmer, E. G. J. (2007), ‘Selection, growth, and the size distribution of firms’, The Quartely Journal of Economis 122(03), 1103–1144. Melitz, M. J. (2003), ‘The impact of trade on intra-industry reallocations and aggregate industry productivity’, Econometrica 71(6), 1695–1725. Melitz, M. J. & Redding, S. J. (2013), Firm heterogeneity and aggregate welfare, Working Paper 18919, NBER. Rossi-Hansberg, E. & Wright, M. L. (2007), ‘Establishment size dynamics in the aggregate economy’, American Economic Review 97(5), 1639. Steindl, J. (1965), Random Processes and the Growth of Firms; A Study of the Pareto Law, Hafner Publishing Company, New York, NY. Virkar, Y. & Clauset, A. (2012), Power-law distributions in binned empirical data. Preprint, arXiv:1208.3524.

16

Appendices A

Model derivation under lognormality

The threshold productivity levels needed to compute welfare gains are implicitly defined by the free entry condition of the model, requiring the expected value of profits (conditional   on surviving) to be equal to the sunk entry costs, 1 − G(φTd ) π ¯ = wfe , where G(φ) is   the cumulative distribution (CDF) of productivity, and 1 − G(φTd ) gives the proportion of firms that successfully enter the market. The zero profit condition implies that, both in the domestic and the foreign market, the marginal firm featuring the threshold value of productivity earns just enough to pay for its fixed costs (of production and, possibly, export). Combining the zero profit conditions for the domestic and foreign market, one can establish a relationship between the domestic and export productivity cutoffs: φTx





fx fd

1  ǫ−1

φTd .

(1)

Combining these elements, and choosing a specific distribution of productivity G(φ), it is then possible to compute GFT.  k φm in In the Pareto case, [1 − G(φ)] = : this greatly simplifies the computations, φ and yields a closed-form solutions for the productivity cutoffs and average productivity.10 Here below we detail the lognormal case.

A.1

Closed economy

We have to solve for φA d the free entry condition (??), which can be written as: 

˜A fe  φd = [1 − Gµ,σ2 (φA d )] fd φA d

!ǫ−1



− 1 .

(2)

Under the assumption of lognormality, Gµ,σ2 (x) = Φµ,σ2 (log(x)) is the Logn(µ, σ 2 ) CDF, ǫ−1 while (φ˜A is given by d) ǫ−1 (φ˜A = d)

ˆ



φǫ−1

φA d

gµ,σ2 (φ) dφ 1 − Gµ,σ2 (φA d)

(3)

with gµ,σ2 being the Logn(µ, σ 2) probability distribution function (PDF). The integral (3) is equal to: φ˜A = E(φǫ−1 |φ > φA ), (4) d

d

10 We refer the interested reader to the excellent Web Appending of the paper by Melitz & Redding (2013).

17

where φ ∼ Logn(µ, σ 2). The expected value can be computed explicitly and is given by ǫ−1 E(Y ǫ−1 |Y > φA ) d ) = E(Y

Φ((ǫ − 1)σ − a0 ) , Φ(−a0 )

(5)

where a0 = (log(φA d ) − µ)/σ. Note that, for integer ǫ, equation (5) is the (ǫ − 1) − th 2 A moment of φ|φ > φA d , i.e. the Logn(µ, σ ) distribution right-truncated at φd . It is well known that E(φǫ−1) = exp{(ǫ − 1)µ + (ǫ − 1)2 σ 2 /2}. Thus, equation (5) can be rewritten as Φ((ǫ − 1)σ − a0 ) 2 2 . E(φǫ−1 |φ > φA d ) = exp{(ǫ − 1)µ + (ǫ − 1) σ /2} Φ(−a0 ) From (4) and (5) it readily follows that ǫ−1 (φ˜A = exp{(ǫ − 1)µ + (ǫ − 1)2 σ 2 /2} d)

Φ((ǫ − 1)σ − a0 ) . Φ(−a0 )

(6)

˜A Hence, the problem consists in solving (2) for φA d with φd given by (6), namely:   fe exp{(ǫ − 1)µ + (ǫ − 1)2 σ 2 /2}Φ((ǫ − 1)σ − a0 )/Φ(−a0 ) A = [1 − Gµ,σ2 (φd )] −1 ǫ−1 fd (φA d) The solution of this equation must be found numerically.

A.2

Open economy

Similar results hold in open economy. If τ (fx /fd )1/(ǫ−1) > 1, only the most productive firms export. Given the relationship between the domestic and export productivity threshold defined by equation (1) above, we also have that φTx > φTd .11 The free entry condition to solve for φTd can be written as: 

φ˜T fe = fd [1 − Gµ,σ2 (φTd )]  dT φd

!ǫ−1





φ˜T − 1 + fx [1 − Gµ,σ2 (φTx )]  Tx φx

!ǫ−1



− 1 ,

(7)

where φ˜Tx is average productivity in the export market, and (φ˜Tx )ǫ−1 =

ˆ



φT x

φǫ−1 gµ,σ2 (φ) dφ. 1 − Gµ,σ2 (φTx )

(8)

Similarly to the closed-economy case, lognormality implies that equation (8) is given by (φ˜Tx )ǫ−1 = E(φǫ−1 |φ > φTx ) = E(Y ǫ−1 ) 11

Φ((ǫ − 1)σ − a1 ) , Φ(−a1 )

On the other hand, if τ (fx /fd )1/(ǫ−1) ≤ 1, then all firms export and φTx = φTd .

18

(9)

where a1 = (log(φTx ) − µ)/σ. As for the ratio W T /W A , we get:12 T

W WA

 T   φAd φ = d 1−σ 1/(σ−1)   1+τ fd

if τ φT d φA d

fd +fx

B

if τ

 1/(ǫ−1) fx fd

 1/(ǫ−1) fx fd

> 1;

(10)

≤ 1.

Additional Evidence

This section contains some additional evidence that we not presented in the main text. Figure B1 shows how GFT change when we let the elasticity of substitution ǫ vary, but keep k or σ unchanged to their baseline values. The assumption here is that we can directly calibrate the degree of heterogeneity in productivity without relying on the relationship between the latter and sales. Lognormal and Pareto welfare gains as a function of ε (k = 3.18, k =4.25, σ=0.797) 1

2

Lognormal Pareto k=4.25 Pareto k=3.18

1.25

welfare gains

1.2

1.15

1.1

1.05 X: 3.542 Y: 1.026

1 2

4

6

8

ε

10

12

14

Figure B1: Lognormal and Pareto welfare gains as a function of ǫ. The shape parameter k and the standard deviation of the lognormal distribution σ are kept fixed. Trade costs have their baseline values τ = 1.83, fx = 0.545). Here we see that the magnitude of GFT is reduced with respect to Figure 4 above, where ǫ affects the degree of heterogeneity as well. Also, while in case k = 3.18 GFT under Pareto are always larger than those under lognormality, for the case or a Pareto 12

See equations (11) and (27) in the Web Appendix of Melitz & Redding (2013).

19

productivity distribution with a shape parameter k = 4.25, values of the ǫ < 3.5 yield an inversion in the ranking, with the lognormal model delivering higher GFT. Lognormal and Pareto welfare gains as a function of ε (k/(ε−1) = 1.06, σ(ε−1)=2.39 lognormal het. lognormal hom. Pareto het. Pareto hom.

1.9 1.8

welfare gains

1.7 1.6 1.5 1.4 1.3 1.2 1.1 1 2

3

4

ε

5

6

7

8

Figure B2: Lognormal and Pareto welfare gains as a function of ǫ: homogeneous Vs heterogeneous models. Figure B2 compares GFT in the homogeneous and heterogeneous case under the alternative assumptions of a Pareto or lognormal productivity distribution. Differences are small compared with the magnitude of GFT, especially for small values of ǫ. Figure B3 compares GFT obtained from the lognormal and Pareto model for different levels of the sunk entry cost fe and corresponding fx so as to match the export participation rate (21.58%) found among French manufacturing firms in 2003.

20

Lognormal and Pareto welfare gains as a function of ε: f = 0.0145, f = 0.545 Vs f = 0.5, f = 1.5 e

x

e

x

Lognormal (fe=0.0145,fx=0.545) Lognormal (fe=0.5,fx=1.5) Pareto (fe=0.0145,fx=0.545)

1.25

Pareto(fe=0.5,fx=1.5)

welfare gains

1.2

1.15

1.1

1.05

1 2

4

6

8

ε

10

12

14

Figure B3: Lognormal and Pareto welfare gains as a function of ǫ: τ, kandσ vary with ǫ, fx is adjusted to changes in fe so to match export participation observed in the data on French manufacturing firms.

21

gains from trade when firm productivity is not Pareto ...

of the data and, moreover, because of its analytical tractability. On the other hand, both theoretical and empirical works show that the characteristics of the ...

235KB Sizes 0 Downloads 201 Views

Recommend Documents

Gains from Trade
Nov 11, 2015 - Any SCF not satisfying this property will necessarily require thor- ...... c. ' Oa. 1. 2. 1. 2 r. ωK. Ob. Ia. Ib. E. T c. ' Oa. 3. 4. 3. 4 r. ωKc. Ob. Ia. Ib.

Gains from Trade
Aug 8, 2013 - very least, no agent can be made worse off than her initial holding. However, this basic. ∗. Chambers: Department of Economics, University of California, San Diego. email: [email protected]. Hayashi: Adam Smith Business School, Univ

productivity gains, technology spillovers and trade ...
The importance of R&D spillovers for productivity growth has been well docu- ... albeit with varying levels of impact at different degrees of foreign ownership ... classes (i.e. they use data on a single year or on the beginning year and end year, ..

Trade Patterns, Income Differences and Gains from Trade
Nov 12, 2012 - To do this, we analyze highly disaggregated bilateral trade data and ...... for p1 that arises from substituting the free entry condition and the first.

Product Mix and Firm Productivity Responses to Trade Competition
Sections 5 and 6 take these predictions to the data and measure large responses in ...... “The Quantitative Analytics of the Basic Neomonetarist Model.” Journal.

Product Mix and Firm Productivity Responses to Trade Competition
demand shocks and the productivity of multi-product firms exporting to those ...... accounting then implies that this normalized number of consumers Lc ...

trade and productivity - CREI
institutional quality as well as geography and take into account the ... financial support from the CREI and CREA research institutes, the European Fund for .... Purchasing power parity (PPP) GDP differs from GDP in that the production of each good i

Gains from Financial Integration and Trade Openness ...
and Foreign countries can freely trade goods, but individuals are not able to invest abroad. ... Section four presents the quantitative results for the open economy: first under ... Moreover, capital account openness is associated with higher output

Energy efficiency gains from trade: greenhouse gas ...
Feb 2, 2012 - nini Hall, Berkeley, CA 94720 (email: [email protected]) ... output and increase exports, effectively allowing wealthy countries to outsource ..... Although the Melitz and Bustos models do not directly address the issue of.

Energy efficiency gains from trade: greenhouse gas ...
Feb 2, 2012 - Clean Development Mechanism energy efficiency and renewable energy projects combined. .... With firm-level data, I am able to reduce these sources of bias. .... II. Why trade liberalization would favor energy-efficient firms.

trade and productivity - CREI
Our empirical approach accounts for the endogeneity of trade and ... of openness as a measure of trade can be illustrated using a small open economies.

Imported Inputs and the Gains from Trade
a local elasticity of welfare with respect to the domestic share (the coeffi cient −η ε. ) that is about 3 times larger (in absolute value) than the one implied by the ...

Imported Inputs and the Gains from Trade
1Kasahara and Lapham (2007) have documented these facts for Chile, while similar facts can be found in Kurz (2006) .... 4About three quarters of manufacturing plants in Chile do not use any imported inputs. Excluding these ...... Eaton, J., and S. Ko

When one Sample is not Enough: Improving Text Database Selection ...
ABSTRACT. Database selection is an important step when searching over large numbers of distributed text databases. The database selection task relies on ...

pdf-12105\when-love-is-not-enough-unconditional-love ...
Connect more apps... Try one of the apps below to open or edit this item. pdf-12105\when-love-is-not-enough-unconditional-love-book-1-by-wade-kelly.pdf.

When one Sample is not Enough: Improving Text ... - Semantic Scholar
For illustration purposes, Table 2 reports the computed ...... Zipf, power-laws, and Pareto – A ranking tutorial. Available at http://ginger.hpl.hp.com/shl/-.

When Physics is Not “Just Physics”: Complexity ...
cognitive phenomena with a suspicion that biological and cognitive systems, capable of ..... ena. Although Euclid's fifth axiom itself has little to do the neurobiological sciences, the revolution ..... Westerhoff HV, Winder C, Messiha H, Simeonidis.

When one Sample is not Enough: Improving Text ... - Semantic Scholar
Computer Science Department. Columbia .... contains articles about Computer Science. .... ment classification [22], each category level might be assigned.

Pareto Improving Social Security Reform when ...
Apr 15, 2005 - tion of an unfunded social security system leads to a Pareto improve- ment. When .... Each agent lives for two periods, earns wage w in the first period on ..... niques we de-trend the economy by deterministic population growth and ...

GEOGRAPHY, PRODUCTIVITY AND TRADE: DOES ...
contrary, once trade costs are allowed to differ across locations, it can be shown that selection effects will be more .... one-to-one technology; this implies that one unit of labor is needed to produce one unit of this kind of good. ..... added, fi

Market size, division of labor, and firm productivity
Market size. Division of labor. Firm productivity. Technology transfer. We generalize Krugman's (1979) 'new trade' model by allowing for an explicit production chain in which a ... labor which brings about an increase in firm productivity. .... 4 A s

Gains from International Monetary Policy Coordination - CiteSeerX
Jun 20, 2007 - intermediate goods produced in two sectors, a traded good sector, and a non-traded ... Labor is mobile across sectors, but not across countries.