20. Investment

Note

本章主题:投资的实证。 1980 年代后发达经济体投资/GDP 下降(即便资本成本下降)。从金融视角看投资。§20.1 Hayashi (1982) Q 理论基准:边际 \(q\) 模型,一阶条件 \(C_{K'}+p=\) 边际 \(q\) (20.2);简化假设下 \(V=Kq(A)\) (20.6) 解出 \(\frac IK=-\frac p\gamma+\frac\beta\gamma\mathbb E[q(A')\mid A]\) (20.8),可检验预测 \(\boldsymbol\psi_i=\mathbf 0\) (20.10)(但边际 \(q\) 不可观测、模型过简、有摩擦时变形)。§20.2 Fazzari et al. (1988):现金流影响投资(内部融资成本低);按股利/收入分四类,越缺现金(Class 1)现金流-投资敏感性越高(图 20.1–20.2);但 Class 1 公司小、\(Q\) 测量误差大(衰减偏误)使敏感性虚高。§20.3 Kaplan-Zingales (1997) 反驳:高投资-现金敏感性 ≠ 更受约束;单期模型 \(\frac{dI}{dW}=\frac{C_{EE}}{F_{II}-C_{EE}}\) (20.15)(仅市场不完美时为正),但「敏感性随内部资金递减」的单调性 (20.16) 未必成立;实证用更好的约束指标证之。§20.4 Lenzu-Manaresi (2018/19):直接估一阶条件中的楔子 \(\tau_{i,t}=\chi_{i,t}(1-\lambda_{i,t})\)(影子资金成本),可估非上市小公司;缺点是只含数量配给、不含价格配给。§20.5 Rauh (2006) 用 DB 养老金强制缴款的现金流断点(图 20.8)估投资-现金流效应 (20.17)。§20.6 Zwick-Mahon (2017) 用奖励折旧 \(z=\theta+(1-\theta)z^B\) 的税基现金流冲击 DiD,长久期项目受益更大、集约+广延边际都正。§20.7 Whited (1992) 结构法:比较有/无借贷约束的欧拉方程 (20.22)/(20.25),GMM 检验,约束模型不被拒(公司确受融资约束,无债券评级者更甚)。§20.8 Matvos-Seru (2014) 内部资本市场权衡:经理效用 (20.29) 含「公司社会主义」负效用 \(\lambda\sum(z_{tj}-z_t^*)k_{tj}\),两步法估 \(\boldsymbol\theta\);集团总低于独立公司(暗面),但高利率期相对更高(亮面)。

Note

Chapter theme: the empirics of investment. Post-1980s, developed economies' investment/GDP fell (even as capital cost fell). We view investment from a finance angle. §20.1 Hayashi (1982) benchmark Q theory: a marginal-\(q\) model, f.o.c. \(C_{K'}+p=\) marginal \(q\) (20.2); under simplifying assumptions \(V=Kq(A)\) (20.6) gives \(\frac IK=-\frac p\gamma+\frac\beta\gamma\mathbb E[q(A')\mid A]\) (20.8), with testable prediction \(\boldsymbol\psi_i=\mathbf 0\) (20.10) (but marginal \(q\) is unobservable, the model too simple, and friction changes the form). §20.2 Fazzari et al. (1988): cash flow affects investment (cheaper internal financing); by dividend/income into four classes, the more cash-short (Class 1) the higher the cash-flow-investment sensitivity (Figures 20.1–20.2); but Class 1 firms are small with large \(Q\) measurement error (attenuation bias) inflating the sensitivity. §20.3 Kaplan-Zingales (1997) rebuts: high investment-cash sensitivity \(\ne\) more constrained; a one-period model gives \(\frac{dI}{dW}=\frac{C_{EE}}{F_{II}-C_{EE}}\) (20.15) (positive only when the market is imperfect), but the monotonicity "sensitivity decreasing in internal funds" (20.16) need not hold; shown empirically with better constraint measures. §20.4 Lenzu-Manaresi (2018/19): directly estimate the wedge in the f.o.c., \(\tau_{i,t}=\chi_{i,t}(1-\lambda_{i,t})\) (shadow cost of funds), estimable for small non-public firms; drawback: only quantity rationing, not price rationing. §20.5 Rauh (2006) uses the cash-flow discontinuity from mandatory DB pension contributions (Figure 20.8) to estimate the investment-cash-flow effect (20.17). §20.6 Zwick-Mahon (2017) uses the tax-based cash-flow shock from bonus depreciation \(z=\theta+(1-\theta)z^B\) in a DiD; longer-duration projects benefit more, both intensive and extensive margins positive. §20.7 Whited (1992) structural: compares Euler equations with/without a borrowing constraint (20.22)/(20.25), GMM tests, the constrained model is not rejected (firms are financially constrained, more so without bond ratings). §20.8 Matvos-Seru (2014) internal-capital-market trade-offs: the manager's utility (20.29) includes a "corporate socialism" disutility \(\lambda\sum(z_{tj}-z_t^*)k_{tj}\), \(\boldsymbol\theta\) estimated by a two-step method; conglomerates are always below stand-alones (dark side) but relatively higher in high-rate periods (bright side).

20.1 Benchmark Q Theory: Hayashi (1982)

1980 年代后,多数发达经济体投资/GDP 下降,即便利率(资本成本)下降、投资更便宜。无金融背景的替代解释:(1) 已达新技术上限;(2) 公司市场力更强、缺乏高效投资激励;(3) 无形资产更重要、未计入传统投资计算。下面从金融视角看投资。Hayashi (1982) 的边际 \(q\) 模型(简化版)。

20.1.1 设定. 公司用技术 \(A\) 与资本 \(K\) 生产价值 \(\pi(A,K)\) 的计价品(\(A\) 受随机冲击);折旧率 \(\delta\in(0,1)\)、贴现因子 \(\beta\in(0,1)\);调整成本 \(C(K',A,K)\);资本品本期价 \(p\)、下期 \(p'\)。投资 \(I=K'-(1-\delta)K\)。记 \(\pi_K\equiv\partial\pi/\partial K\)、\(C_{K'}\equiv\partial C/\partial K'\)、\(V_{K'}\equiv\partial V/\partial K'\)。

20.1.2 公司的最大化问题.

$$V(A,K,p)=\max_{K'}\pi(A,K)-C(K',A,K)-p(K'-(1-\delta)K)+\beta\mathbb{E}[V(A',K',p')\mid A] \tag{20.1}$$

对 \(K'\) 的一阶条件:

$$C_{K'}(K',A,K)+p=\underbrace{\beta\mathbb{E}[V_{K'}(A',K',p')\mid A]}_{\text{marginal }q} \tag{20.2}$$

右端为边际 \(q\)(多投一元的边际收益);(20.2) 即投资到边际成本=边际收益。

20.1.3 简化假设. 假设 1 \(\pi(A,K)=AK\) (20.3);假设 2 \(C(K',A,K)=\frac\gamma2\left(\frac{K'-(1-\delta)K}{K}\right)^2 K\) (20.4);假设 3 \(A'=\rho A+\varepsilon'\) (20.5)(\(|\rho|<1\),\(\varepsilon'\sim\mathcal N(0,1)\))。猜测 \(V(A,K,p)=Kq(A)\) (20.6)。由 (20.4)、(20.5) 改写 (20.2):

$$\frac IK=\underbrace{-\frac p\gamma+\frac\beta\gamma\mathbb{E}[q(A')\mid A]}_{\equiv z(A)} \tag{20.7}$$

代入 (20.1) 验证 \(V=Kq(A)\) 确解此动态规划(\(q(A)=A-\frac\gamma2 z(A)^2-pz(A)+\beta[z(A)+(1-\delta)]\mathbb E[q(A')\mid A]\))。关键结论

$$\frac IK=-\frac p\gamma+\frac\beta\gamma\mathbb{E}[q(A')\mid A] \tag{20.8}$$

可检验预测:据 (20.8) 设回归 \(\left(\frac IK\right)_{it}=\alpha_{it}+\frac\beta\gamma\mathbb E[q(A')\mid A]+\boldsymbol\psi_i\cdot\mathbf X_{it}+\varepsilon_{it}\) (20.9),预测

$$\boldsymbol\psi_i=\mathbf 0 \tag{20.10}$$

但此预测得自诸多简化假设、不宜直接检验;即便假设成立,边际 \(q\) 不可观测(只能见平均 \(q=V/K\),二者一般不等),小公司平均 \(q\) 也难观测;且推导无金融摩擦,有摩擦时边际 \(q\) 含摩擦、未必同形。

Post-1980s, most developed economies' investment/GDP fell, even as the interest rate (cost of capital) fell and investment became cheaper. Alternative explanations with no finance background: (1) reached an upper limit of new technology; (2) firms have stronger market power, less incentive to invest efficiently; (3) intangible assets are more important, not in traditional investment calculation. We view investment from a finance angle. Hayashi's (1982) marginal-\(q\) model (simplified).

20.1.1 Setup. A firm uses technology \(A\) and capital \(K\) to produce a numeraire good of value \(\pi(A,K)\) (\(A\) subject to random shocks); depreciation \(\delta\in(0,1)\), discount factor \(\beta\in(0,1)\); adjustment cost \(C(K',A,K)\); capital-good price \(p\) this period, \(p'\) next. Investment \(I=K'-(1-\delta)K\). Denote \(\pi_K\equiv\partial\pi/\partial K\), \(C_{K'}\equiv\partial C/\partial K'\), \(V_{K'}\equiv\partial V/\partial K'\).

20.1.2 Firm's maximization.

$$V(A,K,p)=\max_{K'}\pi(A,K)-C(K',A,K)-p(K'-(1-\delta)K)+\beta\mathbb{E}[V(A',K',p')\mid A] \tag{20.1}$$

The f.o.c. w.r.t. \(K'\):

$$C_{K'}(K',A,K)+p=\underbrace{\beta\mathbb{E}[V_{K'}(A',K',p')\mid A]}_{\text{marginal }q} \tag{20.2}$$

The RHS is marginal \(q\) (the marginal benefit of investing one more dollar); (20.2) says invest until MC = MB.

20.1.3 Simplifying assumptions. Assumption 1 \(\pi(A,K)=AK\) (20.3); Assumption 2 \(C(K',A,K)=\frac\gamma2\left(\frac{K'-(1-\delta)K}{K}\right)^2 K\) (20.4); Assumption 3 \(A'=\rho A+\varepsilon'\) (20.5) (\(|\rho|<1\), \(\varepsilon'\sim\mathcal N(0,1)\)). Guess \(V(A,K,p)=Kq(A)\) (20.6). Rewriting (20.2) with (20.4), (20.5):

$$\frac IK=\underbrace{-\frac p\gamma+\frac\beta\gamma\mathbb{E}[q(A')\mid A]}_{\equiv z(A)} \tag{20.7}$$

Plugging into (20.1) verifies \(V=Kq(A)\) solves this dynamic program (\(q(A)=A-\frac\gamma2 z(A)^2-pz(A)+\beta[z(A)+(1-\delta)]\mathbb E[q(A')\mid A]\)). The key conclusion:

$$\frac IK=-\frac p\gamma+\frac\beta\gamma\mathbb{E}[q(A')\mid A] \tag{20.8}$$

Testable prediction: from (20.8), specify \(\left(\frac IK\right)_{it}=\alpha_{it}+\frac\beta\gamma\mathbb E[q(A')\mid A]+\boldsymbol\psi_i\cdot\mathbf X_{it}+\varepsilon_{it}\) (20.9), predicting

$$\boldsymbol\psi_i=\mathbf 0 \tag{20.10}$$

But this comes from many simplifying assumptions and isn't ready to test directly; even if they hold, marginal \(q\) is unobservable (we see only average \(q=V/K\), generally unequal), small firms' average \(q\) is hard to observe, and the derivation has no financial friction (with friction marginal \(q\) includes frictions, not necessarily the same form).

20.2 Cash Flow and Investment: Fazzari et al. (1988) — and 20.3 Kaplan and Zingales (1997)

20.2 Fazzari et al. (1988). 现金流是决定投资的重要因素,因内部融资成本低于外部;有现金流的公司有更多内部融资、可更便宜地投资。实证检验:预测 1——现金流对投资支出有正效应;预测 2——内部融资更受约束(更缺现金)的公司更依赖现金流、其投资对现金流增长反应更强。数据:1970–1984 上市制造业(Value Line),按缺现金程度分四类(Class 1:股利/收入$<0.1$ 持续 \(\ge10\) 年,最缺现金;…;Class 4:其余,最不缺;图 20.1 摘要统计)。结果(图 20.2,\(Q_{it}\)=市值/账面值代理平均 \(q\)):现金流(及滞后)对投资有显著正效应(证预测 1);Class 1 现金流效应强于 Class 4(证预测 2);表面与 (20.10) 矛盾、但 (20.10) 源自过简模型、未被真正否定。评论:各类公司资本规模不可比;Class 1 公司更小、市价反映市值更不准 → \(Q_{it}\) 衰减偏误更重 → Class 1 的 \(Q\) 系数更偏向 0 → 其现金流系数反更向上偏,故预测 2 未必成立。

20.3 Kaplan-Zingales (1997) 反驳:高投资-现金敏感性意味更受融资约束的结论是错的。理论:K-Z 以内外资本成本之楔子定义融资约束(广义、几乎所有公司都可能受约束)。一期模型:公司选投资 \(I\) 解 \(\max_I F(I)-C(E,k)-I\) (20.11) s.t. \(I=W+E\) (20.12)(\(F'>0,F''<0\);外部资金死重成本 \(C(E,k)\) 凸于 \(E\),\(k\) 为楔子度量)。代入得 f.o.c.:

$$F_I(I)-C_E(E,k)=1 \tag{20.13}$$

全微分并令 \(dk=0\):

$$\frac{dI}{dW}=\frac{C_{EE}(E,k)}{F_{II}(I)-C_{EE}(E,k)} \tag{20.15}$$

市场不完美 \(C(E,k)>0\) 时 \(\frac{dI}{dW}>0\)、完美市场 \(C=0\) 时 $=0$——故投资-现金敏感性仅在市场不完美、公司受约束时为正。令 \(dW=0\) 得 \(\frac{dI}{dk}=\frac{C_{Ek}}{F_{II}-C_{EE}}\)(若 \(C_{Ek}>0\) 则为负)。但 Fazzari 等所需的「敏感性随内部资金递减」即 \(\frac{d^2I}{(dW)^2}<0\),经代数(并用三阶条件 \(F_{III}=C_{EEE}\))化为

$$\frac{F_{III}(C_{EE}^2-F_{II}^2)}{(F_{II}-C_{EE})^3}<0 \tag{20.16}$$

不必然成立(取决于 \(C\) 与 \(F\) 的相对曲率)。实证:用同样样本,从 SEC 文件收集 49 家 Fazzari 归为 Class 1 的低股利公司的细致信息,按定性年报+定量财报分 1–5 组(约束指标:流动性可得、股利限制、未限留存收益、现金/资本比;图 20.3 序数 Logit 显示这些指标显著、支持 5 组分类);图 20.4 显示现金流系数从「从不受约束」到「可能受约束」无递减模式——故基于高投资-现金敏感性的融资约束结论在更好度量约束时不成立。

20.2 Fazzari et al. (1988). Cash flow is an important factor determining investment, since internal financing is cheaper than external; a firm with cash flow has more internal financing and can invest more cheaply. Empirical test: Prediction 1 — cash flow has a positive effect on investment; Prediction 2 — firms more constrained on internal financing (more cash-short) rely more on cash flow, their investment reacting more strongly. Data: 1970–1984 public manufacturing firms (Value Line), four classes by cash shortage (Class 1: dividend/income $<0.1$ for \(\ge10\) years, most cash-short; …; Class 4: others, least short; Figure 20.1 summary stats). Results (Figure 20.2, \(Q_{it}\) = market/book value proxying average \(q\)): cash flow (and lagged) has a significant positive effect (confirms Prediction 1); Class 1 has stronger cash-flow effects than Class 4 (confirms Prediction 2); seemingly contradicting (20.10), but (20.10) comes from a too-simplified model and isn't really denied. Comments: classes are not comparable in capital size; Class 1 firms are smaller, with market price reflecting value less accurately → \(Q_{it}\) suffers more attenuation bias → Class 1's \(Q\) coefficient more biased toward 0 → its cash-flow coefficient more biased upward, so Prediction 2 need not hold.

20.3 Kaplan-Zingales (1997) rebut: the conclusion that high investment-cash sensitivity means more financially constrained is wrong. Theory: K-Z define financial constraint by the wedge between internal and external capital cost (broad; almost all firms could be constrained). A one-period model: the firm chooses \(I\) to solve \(\max_I F(I)-C(E,k)-I\) (20.11) s.t. \(I=W+E\) (20.12) (\(F'>0,F''<0\); external-fund deadweight cost \(C(E,k)\) convex in \(E\), \(k\) the wedge measure). Substituting gives the f.o.c.:

$$F_I(I)-C_E(E,k)=1 \tag{20.13}$$

Total-differentiating with \(dk=0\):

$$\frac{dI}{dW}=\frac{C_{EE}(E,k)}{F_{II}(I)-C_{EE}(E,k)} \tag{20.15}$$

With imperfect markets \(C(E,k)>0\), \(\frac{dI}{dW}>0\); with perfect markets \(C=0\), $=0$ — so investment-cash sensitivity is positive only when the market is imperfect and the firm constrained. With \(dW=0\), \(\frac{dI}{dk}=\frac{C_{Ek}}{F_{II}-C_{EE}}\) (negative if \(C_{Ek}>0\)). But the Fazzari requirement that sensitivity decrease in internal funds, \(\frac{d^2I}{(dW)^2}<0\), reduces (using a third-order condition \(F_{III}=C_{EEE}\)) to

$$\frac{F_{III}(C_{EE}^2-F_{II}^2)}{(F_{II}-C_{EE})^3}<0 \tag{20.16}$$

which need not hold (it depends on the relative curvature of \(C\) and \(F\)). Empirically: using the same sample, they collect detailed info from SEC filings on 49 low-dividend firms Fazzari classed as Class 1, classify each into groups 1–5 by qualitative reports + quantitative statements (constraint measures: liquidity access, dividend restriction, unrestricted retained earnings, cash/capital ratio; Figure 20.3 ordered logit shows these are significant, justifying the 5-group classification); Figure 20.4 shows the cash-flow coefficient has no decreasing pattern from never-constrained to probably-constrained — so the financial-constraint conclusion based on high investment-cash sensitivity doesn't hold with better-measured constraints.

20.4 Wedge in the First-Order Condition as a Measure of Financial Constraint: Lenzu and Manaresi (2018)

Lenzu-Manaresi (2019) 提出更根本的方法:直接计算公司利润最大化一阶条件中的楔子,而非估约束的代理/指标——直接显示公司是否有楔子(融资约束)。需要:边际借贷成本数据;资本边际收益产品 \(MRP^K\) 的估计(用生产估计文献的工具)。

理论:异质公司最大化对风险中性股东的未来现金流现值,随机性来自公司特定生产率 \(\omega_{i,t}\);每期观测 \(\omega_{i,t}\) 后经理决定还债或违约退出,\(\bar\omega\) 为内生违约阈值;不违约则选下期资本 \(K_{i,t+1}\)、劳动、中间投入与融资计划(银行债 \(B_{i,t+1}\)、内部现金或新股权);违约则被债权人接管、当期后以成本 \(X\ge0\) 比例资产清算。贷方对一组同质公司给单一利率 \(\tilde r_{t+1}\)。信贷限同 Kiyotaki-Moore (1997):\(B_{i,t+1}\le\lambda_{i,t}K_{i,t+1}\)(\(\lambda\) 越低可质押比例越小)。所得一阶条件:

$$\rho\int_{\bar\omega}^\infty[MRP_{i,t+1}^K-(r_{t+1}+\delta)]\,d\Phi(\omega_{i,t+1}\mid\omega_{i,t})=\psi_2^K(K_{i,t},K_{i,t+1})+\rho\int_{\bar\omega}^\infty\psi_1^K(K_{i,t+1},K_{i,t+2})\,d\Phi(\omega_{i,t+1}\mid\omega_{i,t})+\underbrace{\chi_{i,t}(1-\lambda_{i,t})}_{\equiv\tau_{i,t}\text{ Gap}}$$

其中 \(\rho\) 风险中性贴现、\(\delta\) 资本贴现率、\(\Phi\) 条件 cdf、\(\psi^K\) 调整成本(\(\psi_j^K\) 对第 \(j\) 参数求导);\(\tau_{i,t}\equiv\chi_{i,t}(1-\lambda_{i,t})\) = 资金影子成本、闭合多筹一元债的边际收益与成本之间的缺口。所关心的 \(\tau_{i,t}\) 取决于调整成本 \(\psi^K\) 与借贷约束 \(\lambda_{i,t}\)。

实证:优点——理论上有依据、可估非上市小公司。用意大利数据估每公司每期缺口:

$$\hat\tau_{i,t}=\rho\left(1-\hat{\mathbb{P}}(\text{Exit}_{i,t+1}\mid\mathcal F_i)\right)\cdot\left[\widehat{MRP}^K_{i,t+1}-(r_{t+1}+\delta_i)\right]$$

结果(图 20.5 偏离目标资本的百分比):许多公司未在最优水平投资;投资不足者多受约束(\(\tau\) 越高越严);边际 \(Q\) 与缺口正相关、缺口与融资可得(年龄/规模)负相关;关系借贷使缺口随关系延长而降(图 20.6);广延边际重要——首次获信贷的公司缺口 \(\tau_{i,t}\) 不连续下降(图 20.7)。评论:关键假设是融资约束经数量配给(无法按指定利率借)。但贷方或经把利率提到足够高来阻止借贷(价格配给);若接受价格配给也是约束,则 L-M 漏了它——被预测为不受约束的公司可能实际受价格配给约束。

Lenzu-Manaresi (2019) propose a more fundamental approach: directly compute the wedge in the firm's profit-maximization f.o.c., rather than estimating proxies/indicators — directly showing whether a firm has a wedge (financial constraint). Needs: marginal-borrowing-cost data; estimation of the marginal revenue product of capital \(MRP^K\) (using the production-estimation toolkit).

Theory: heterogeneous firms maximize the PV of future cash flows to risk-neutral shareholders, with randomness from firm-specific productivity \(\omega_{i,t}\); each period after observing \(\omega_{i,t}\) the manager decides to repay or default and exit, \(\bar\omega\) the endogenous default threshold; without default, the manager chooses next-period capital \(K_{i,t+1}\), labor, intermediate input, and financing (bank debt \(B_{i,t+1}\), internal cash or new equity); on default the creditor takes over and liquidates after the current period at cost \(X\ge0\) fraction of assets. Lenders offer a single rate \(\tilde r_{t+1}\) to a group of similar firms. The credit limit, as in Kiyotaki-Moore (1997): \(B_{i,t+1}\le\lambda_{i,t}K_{i,t+1}\) (lower \(\lambda\) = smaller pledgeable fraction). The resulting f.o.c.:

$$\rho\int_{\bar\omega}^\infty[MRP_{i,t+1}^K-(r_{t+1}+\delta)]\,d\Phi(\omega_{i,t+1}\mid\omega_{i,t})=\psi_2^K(K_{i,t},K_{i,t+1})+\rho\int_{\bar\omega}^\infty\psi_1^K(K_{i,t+1},K_{i,t+2})\,d\Phi(\omega_{i,t+1}\mid\omega_{i,t})+\underbrace{\chi_{i,t}(1-\lambda_{i,t})}_{\equiv\tau_{i,t}\text{ Gap}}$$

where \(\rho\) is the risk-neutral discount, \(\delta\) the capital discount rate, \(\Phi\) the conditional cdf, \(\psi^K\) the adjustment cost (\(\psi_j^K\) the derivative w.r.t. the \(j\)th argument); \(\tau_{i,t}\equiv\chi_{i,t}(1-\lambda_{i,t})\) = the shadow cost of funds, closing the gap between the marginal gain and cost of raising one more dollar of debt. The \(\tau_{i,t}\) of interest depends on the adjustment cost \(\psi^K\) and the borrowing constraint \(\lambda_{i,t}\).

Empirics: advantages — theoretically justified, estimable for small non-public firms. Using Italian data to estimate each firm's gap:

$$\hat\tau_{i,t}=\rho\left(1-\hat{\mathbb{P}}(\text{Exit}_{i,t+1}\mid\mathcal F_i)\right)\cdot\left[\widehat{MRP}^K_{i,t+1}-(r_{t+1}+\delta_i)\right]$$

Results (Figure 20.5, percentage deviation from target capital): many firms don't invest at the optimum; under-investors are more constrained (higher \(\tau\) more severe); marginal \(Q\) positively correlated with the gap, the gap negatively correlated with financing availability (age/size); relationship lending lowers the gap as the relationship lengthens (Figure 20.6); the extensive margin matters — firms gaining credit access for the first time have a discontinuous drop in \(\tau_{i,t}\) (Figure 20.7). Comment: the crucial assumption is that financial constraint works through quantity rationing (can't borrow at the specified rate). But lenders might raise the rate high enough to block borrowing (price rationing); if we accept price rationing as a constraint, L-M omits it — a firm predicted as unconstrained could actually be price-rationed.

20.5 Exogenous Cash Flow Discontinuity: Rauh (2006) — and 20.6 Tax Policy: Zwick and Mahon (2017)

20.5 Rauh (2006). 利用定额受益 (DB) 养老金强制缴款引致的内部资金不连续下降来研究现金流对投资的效应(控制内生性)。制度细节:养老基金市值$>$负债时不需缴款、市值$<$负债时须缴;缴款额按复杂非线性函数计算,在缴款阈值附近产生急剧不连续变化(图 20.8:强制缴款 = 最低注资缴款与赤字削减缴款之最大值)。设计:把阈值附近的不连续视为对现金流的外生负冲击,消除养老金注资状态与投资机会的内生关系。规范:

$$\frac{I_{i,t}}{A_{i,t-1}}=\alpha_t+\alpha_i+\beta_1 Q_{i,t-1}+\beta_2\frac{\text{Non-Pension Cash Flow}_{i,t}}{A_{i,t-1}}+\beta_3\frac{Z_{i,t}}{A_{i,t-1}}+\boldsymbol\gamma'\mathbf X_{i,t}+\varepsilon_{i,t} \tag{20.17}$$

\(Z_{i,t}\) 为强制养老缴款、\(\beta_3\) 为兴趣参数。结果(图 20.9):平均 \(Q\) 显著稳健;外生现金流变异 \(Z_{i,t}\) 即便加funding变量高阶项仍显著影响投资(系数负、因 \(Z\) 度量现金流下降、故对投资的现金流效应为正)。评论:核函数回归中资本支出与养老缴款的比例本应在阈值处不连续,但图 20.10 显示连续→有问题;Bakke-Whited (2012) 指出 Rauh 结果主要来自重度注资不足的小部分样本。Remark 20.1:此类简约式估计只描述效应、不论效率;如 Lenzu-Manaresi (2019) 则有清晰的效率观。

20.6 Zwick-Mahon (2017). 利用奖励折旧 (bonus depreciation) 形式的税基现金流冲击。奖励折旧是政府刺激投资的政策工具、加速折旧表;在 \(\theta\) 奖励表下(\(\theta\in(0,1)\)),公司在第 0 期可折旧

$$z=\theta+(1-\theta)z^B$$

比例的总值(\(z^B\) 为无奖励折旧时第 0 期折旧比例;奖励仅在购置时一次性适用)。更长久期的投资从奖励折旧获益更大(早折旧 = 现金流即时跳升、正 NPV,因税收利益不被贴现)。数据 12 万家公私公司、政策在行业-年层变异。DiD:\(\ln I_{i,t}=\alpha_i+\beta z_{N,t}+\boldsymbol\gamma'\mathbf X_{i,t}+\delta_t+\varepsilon_{i,t}\)(\(z_{N,t}\) = 合资格投资每平均一元扣除的现值、\(N\) 四位 NAICS 行业码);识别假设:平行趋势。结果(图 20.12,赔率比 \(\ln\frac{\mathbf P_N(I>0)}{1-\mathbf P_N(I>0)}\) = 广延边际):集约与广延边际都显著为正;小/中公司投资反应强得多(加权弹性比大公司高 27%,图 20.13)。

20.5 Rauh (2006). Exploits the discontinuous drop in internal funding from mandatory defined-benefit (DB) pension contributions to study cash flow's effect on investment (controlling endogeneity). Institutional detail: when the pension fund's market value $>$ liability no contribution is required; when $<$ liability it is; the amount follows a complex non-linear function, generating a sharp discontinuity around the threshold (Figure 20.8: mandatory contribution = max of the minimum funding contribution and the deficit-reduction contribution). Design: treat the discontinuity as an exogenous negative cash-flow shock, removing the endogenous relation between funding status and investment opportunities. Specification:

$$\frac{I_{i,t}}{A_{i,t-1}}=\alpha_t+\alpha_i+\beta_1 Q_{i,t-1}+\beta_2\frac{\text{Non-Pension Cash Flow}_{i,t}}{A_{i,t-1}}+\beta_3\frac{Z_{i,t}}{A_{i,t-1}}+\boldsymbol\gamma'\mathbf X_{i,t}+\varepsilon_{i,t} \tag{20.17}$$

\(Z_{i,t}\) the mandatory pension contribution, \(\beta_3\) the parameter of interest. Results (Figure 20.9): average \(Q\) significant and robust; the exogenous cash-flow variation \(Z_{i,t}\) significantly affects investment even with higher-order powers of funding variables (coefficient negative, since \(Z\) measures the cash-flow drop, so the cash-flow effect on investment is positive). Comment: the percentages of capex and pension contribution from a kernel regression should be discontinuous at the threshold, but Figure 20.10 shows them continuous → problematic; Bakke-Whited (2012) note Rauh's results come mainly from heavily underfunded firms (a small, sharply different fraction). Remark 20.1: such reduced-form estimation only describes an effect, with no stance on efficiency; papers like Lenzu-Manaresi (2019) have a clear notion of efficiency.

20.6 Zwick-Mahon (2017). Exploits the tax-based cash-flow shock from bonus depreciation. Bonus depreciation is a government tool to spur investment by accelerating the depreciation schedule; under a \(\theta\) bonus schedule (\(\theta\in(0,1)\)), the firm can depreciate

$$z=\theta+(1-\theta)z^B$$

of the total value in period 0 (\(z^B\) the period-0 depreciation fraction without the bonus; the bonus applies once upon purchase). Longer-duration investment benefits more (early depreciation = an instantaneous cash-flow spike, positive NPV since the tax benefit isn't discounted). Data: 120,000 public and private firms, with policy variation at the industry-year level. DiD: \(\ln I_{i,t}=\alpha_i+\beta z_{N,t}+\boldsymbol\gamma'\mathbf X_{i,t}+\delta_t+\varepsilon_{i,t}\) (\(z_{N,t}\) = the present value of one average dollar deduction for eligible investment, \(N\) a four-digit NAICS code); identifying assumption: parallel trend. Results (Figure 20.12, the odds ratio \(\ln\frac{\mathbf P_N(I>0)}{1-\mathbf P_N(I>0)}\) = extensive margin): both intensive and extensive margins are significantly positive; small/medium firms respond much more strongly (weighted elasticity 27% higher than large firms, Figure 20.13).

20.7 Structural Approach: Whited (1992)

难以厘清内外融资楔子对投资的效应,因选择偏误:内部融资状况很可能与投资机会相关(内部资金多的公司即便在反事实里也想投更多)。Whited (1992) 考虑两个欧拉方程:无借贷约束、有借贷约束,检验哪个更合数据。

模型:价值函数 \(V_{i0}=d_{i0}+\mathbb{E}_0\left[\sum_{t=1}^\infty\left(\prod_{\tau=0}^{t-1}\beta_{i\tau}\right)d_{it}\right]\) (20.18)(\(d_{it}\) 税后股利)。公司每期选投资 \(I_{it}\)(或资本 \(K_{it}\))最大化价值,约束:资本积累 \(K_{it}=I_{it}+(1-\delta)K_{i,t-1}\) (20.19);现金流约束给出股利

$$d_{it}=(1-\tau)[F(K_{i,t-1},\mathbf N_{it})-\mathbf w_t\cdot\mathbf N_t-\psi(I_{it},K_{i,t-1})-i_{t-1}B_{t,t-1}]+B_{it}-(1-\pi_t^e)B_{i,t-1}+p_{it}I_{it} \tag{20.20}$$

(\(\tau\) 公司税、\(F\) 生产函数、\(\psi\) 资本调整实成本、\(B_{it}\) 净债务、\(i_{t-1}-\pi_t^e\) 实利率、\(p_{it}\) 资本品实价);股利非负 \(d_{it}\ge0\) (20.21)(乘子 \(\lambda_{it}\));横截条件 (20.24) 防无限债。无约束时对 \(K_{i,t+1}\) 的欧拉方程:

$$\beta_{it}\mathbb{E}_t\left[\frac{1+\lambda_{i,t+1}}{1+\lambda_{it}}\Big[F_K-\psi_K+(1-\delta)\big(\psi_I+\tfrac{p_{i,t+1}}{1-\tau}\big)\Big]\right]=\psi_I(I_{it},K_{i,t-1})+\frac{p_{it}}{1-\tau} \tag{20.22}$$

(折现的预期未来投资边际收益 = 当前边际成本);对 \(B_{it}\) 的 f.o.c. (20.23)。加借贷约束 \(B_{it}\le B_{it}^*\)(乘子 \(\gamma_{it}\)),\(K_{it}\) 的欧拉方程变为含 \(\frac{\beta_{it}}{1+\gamma_{it}+\mathbb E_t[\lambda_{i,t+1}]}\) 因子的 (20.25)。假设 \(F_K=\frac{\eta Y_{it}-\mu C_{it}}{K_{i,t-1}}\) (20.26)、\(\psi=\frac\alpha2\left(\frac{I_{it}}{K_{i,t-1}}-\nu\right)^2 K_{i,t-1}\) (20.27),代入得期望误差方程 (20.28)(含 \(\Lambda_{it}=1-\frac{1+\lambda_{i,t+1}}{1+\lambda_{it}}\)、公司/时间 FE)。实证:以债务资产比 \(DAR_{it}\) 与利息覆盖率 \(COV_{it}\) 的二阶多项式估 \(\Lambda_{it}=c_0+c_1 DAR_{it}+c_2 DAR_{it}^2+c_3 COV_{it}+c_4 COV_{it}^2\);用 GMM 检验 \(\mathbb{E}_t[e_{i,t+1}X_t]=0\)(模型指 \(e_{i,t+1}\) 与 \(t\) 期信息正交、\(\ne0\) 则拒绝模型),分别在有/无 \(\Lambda_{it}=0\) 约束(等价 \(\lambda_{i,t+1}=0\)、不受约束)下检验。结果(图 20.14、20.15,有/无债券评级):\(\Lambda_{it}=0\) 时模型易被拒、不设此约束时不被拒 → 公司确受融资约束;无债券评级公司模型更易被拒 → 其融资约束更严重。Remark 20.2:结构法优点是经济直觉好、解释清晰;缺点是难以justify建模假设、难以实证测某些结构参数。

It is hard to tease out the internal-external wedge's effect on investment because of selection bias: internal financing is likely correlated with investment opportunities (firms with more internal funds want to invest more even counterfactually). Whited (1992) considers two Euler equations — without and with a borrowing constraint — testing which matches the data better.

Model: the value function \(V_{i0}=d_{i0}+\mathbb{E}_0\left[\sum_{t=1}^\infty\left(\prod_{\tau=0}^{t-1}\beta_{i\tau}\right)d_{it}\right]\) (20.18) (\(d_{it}\) after-tax dividend). The firm chooses \(I_{it}\) (or \(K_{it}\)) each period to maximize value, subject to: capital accumulation \(K_{it}=I_{it}+(1-\delta)K_{i,t-1}\) (20.19); a cash-flow constraint giving the dividend

$$d_{it}=(1-\tau)[F(K_{i,t-1},\mathbf N_{it})-\mathbf w_t\cdot\mathbf N_t-\psi(I_{it},K_{i,t-1})-i_{t-1}B_{t,t-1}]+B_{it}-(1-\pi_t^e)B_{i,t-1}+p_{it}I_{it} \tag{20.20}$$

(\(\tau\) corporate tax, \(F\) production function, \(\psi\) real adjustment cost, \(B_{it}\) net debt, \(i_{t-1}-\pi_t^e\) real rate, \(p_{it}\) real capital-good price); non-negative dividend \(d_{it}\ge0\) (20.21) (multiplier \(\lambda_{it}\)); a transversality condition (20.24) preventing infinite debt. Without the constraint, the Euler equation w.r.t. \(K_{i,t+1}\):

$$\beta_{it}\mathbb{E}_t\left[\frac{1+\lambda_{i,t+1}}{1+\lambda_{it}}\Big[F_K-\psi_K+(1-\delta)\big(\psi_I+\tfrac{p_{i,t+1}}{1-\tau}\big)\Big]\right]=\psi_I(I_{it},K_{i,t-1})+\frac{p_{it}}{1-\tau} \tag{20.22}$$

(discounted expected future marginal benefit = current marginal cost); the f.o.c. w.r.t. \(B_{it}\) is (20.23). Adding the borrowing constraint \(B_{it}\le B_{it}^*\) (multiplier \(\gamma_{it}\)), the \(K_{it}\) Euler equation becomes (20.25) with the factor \(\frac{\beta_{it}}{1+\gamma_{it}+\mathbb E_t[\lambda_{i,t+1}]}\). Assuming \(F_K=\frac{\eta Y_{it}-\mu C_{it}}{K_{i,t-1}}\) (20.26), \(\psi=\frac\alpha2\left(\frac{I_{it}}{K_{i,t-1}}-\nu\right)^2 K_{i,t-1}\) (20.27), substituting gives the expectation-error equation (20.28) (with \(\Lambda_{it}=1-\frac{1+\lambda_{i,t+1}}{1+\lambda_{it}}\), firm/time FE). Empirics: estimate \(\Lambda_{it}=c_0+c_1 DAR_{it}+c_2 DAR_{it}^2+c_3 COV_{it}+c_4 COV_{it}^2\) (a second-order polynomial of the debt-asset ratio \(DAR_{it}\) and interest coverage \(COV_{it}\)); use GMM testing \(\mathbb{E}_t[e_{i,t+1}X_t]=0\) (the model says \(e_{i,t+1}\) is orthogonal to time-\(t\) info, \(\ne0\) rejects the model), with and without the restriction \(\Lambda_{it}=0\) (equivalent to \(\lambda_{i,t+1}=0\), unconstrained). Results (Figures 20.14, 20.15, with/without bond ratings): with \(\Lambda_{it}=0\) the model is easily rejected, without it it isn't → firms are financially constrained; the model is rejected more clearly for firms without bond ratings → they face more serious constraints. Remark 20.2: the structural approach's advantage is better economic intuition and clearer interpretation; its disadvantage is the difficulty of justifying modeling assumptions and empirically measuring some structural parameters.

20.8 Internal Capital Markets Trade-Offs: Matvos and Seru (2014)

Matvos-Seru (2014) 实证量化内部资本市场的权衡:好处是避开时变的外部融资成本;成本是「公司社会主义」导致的效率损失。实证用外生油价冲击构造集团各部门生产率离散度的外生变异,识别参数。

模型:每公司经理的每期效用 \(u_t\)((20.29))含核心项

$$u_t=\sum_j z_{tj}k_{tj}-\underbrace{\lambda\sum_j(z_{tj}-z_t^*)k_{tj}}_{\text{corporate socialism}}-\sum_j I_{tj}-(\text{investment / financing / cash cost terms})$$

其中 \(z_{tj}\) 部门 \(j\) 在 \(t\) 的生产率、\(z_t^*=\frac1n\sum_{j=1}^n z_{tj}\) 为横截面均值;\(\lambda\sum(z_{tj}-z_t^*)k_{tj}\) 度量公司社会主义的负效用(经理厌恶某些部门跑赢均值)。还含投资成本(\(\phi_0,\phi_1,\phi_2\))、外部融资成本(\(c_0,\dots,c_8\)、依 TED 状态 \(\zeta_t\))、现金持有惩罚(\(j_0,j_1\)、\(p_t\) 为现金)。经理解 \(V(\mathbf s_t,\boldsymbol\sigma,\boldsymbol\theta)=\max_{\boldsymbol\sigma}\mathbb E_t[\sum_{\tau\ge t}\beta^{\tau-t}u_\tau]\) (20.30),s.t. 资本积累与现金转移。目标:用真实数据估参数向量 \(\boldsymbol\theta\)。

两步估计第 1 步(策略函数) 估 \(I_{tj}=\max\{0,Q_2(\mathbf s_t;\boldsymbol\beta_f)+e_{I_{tj}}\}\) (20.31)、\(f_t=Q_2(\cdot)+e_{f_t}\) (20.32)(\(Q_2\) 二阶多项式)及状态转移:资本 (20.33)、部门生产率 \(z_{t+1,j}=G_s(z_{tj})+\varepsilon\) (20.34)(\(G_s\) 9 节点线性样条、\(z\) 先由 \(Q\) 后由 ROA 代理)、TED \(\zeta_{t+1}=\alpha_\zeta+\beta_\zeta\zeta_t+\varepsilon\) (20.35)、现金 (20.36)。第 2 步(效用参数) 用策略函数(\(e=0\))正向模拟状态序列 \(\to\) 代入 (20.29)、(20.30) 算隐含 \(\hat V^{(j)}(\boldsymbol\theta)\);扰动策略函数(\(e\) 从经验分布抽)得 \(\hat V^{(j)(h)}(\boldsymbol\theta)\);定义偏离最优策略之利 \(g=\max\{0,\hat V^{(j)(h)}-\hat V^{(j)}\}\),理想期望为 0(价值已最大化);蒙特卡洛平均 \(g(\boldsymbol\theta)^2\),解 \(\boldsymbol\theta=\arg\min_{\boldsymbol\theta}g(\boldsymbol\theta)^2\)。Remark 20.3:估计依赖难justify的假设——(1) 经理真有 (20.29) 形式的效用;(2) 公司真按 (20.29) 最大化价值。

结果:估得 \(\boldsymbol\theta\)(图 20.16,\(Q\) 与 ROA 两种生产率度量);超样本正向模拟得图 20.17(多元化公司相对独立公司的超额价值演化):集团总低于独立公司(公司社会主义的效率损失 = 内部融资的暗面);高银行利率期(整体信用风险高)集团相对价值更高(虽仍为负)——外部融资更贵时内部融资之益 = 亮面。

Matvos-Seru (2014) empirically quantify the trade-offs of the internal capital market: the benefit is avoiding time-varying external-financing costs; the cost is efficiency loss from "corporate socialism." Empirically they use exogenous oil-price shocks to construct exogenous variation in productivity dispersion among conglomerate divisions, identifying the parameters.

Model: each firm's manager has per-period utility \(u_t\) ((20.29)) with the core term

$$u_t=\sum_j z_{tj}k_{tj}-\underbrace{\lambda\sum_j(z_{tj}-z_t^*)k_{tj}}_{\text{corporate socialism}}-\sum_j I_{tj}-(\text{investment/financing/cash cost terms})$$

where \(z_{tj}\) is division \(j\)'s productivity at \(t\), \(z_t^*=\frac1n\sum_{j=1}^n z_{tj}\) the cross-sectional mean; \(\lambda\sum(z_{tj}-z_t^*)k_{tj}\) measures the disutility of corporate socialism (the manager dislikes some divisions outperforming the average). It also includes investment costs (\(\phi_0,\phi_1,\phi_2\)), external-financing costs (\(c_0,\dots,c_8\), depending on TED state \(\zeta_t\)), and a cash-holding penalty (\(j_0,j_1\), \(p_t\) cash). The manager solves \(V(\mathbf s_t,\boldsymbol\sigma,\boldsymbol\theta)=\max_{\boldsymbol\sigma}\mathbb E_t[\sum_{\tau\ge t}\beta^{\tau-t}u_\tau]\) (20.30), s.t. capital accumulation and cash transition. The goal: estimate the parameter vector \(\boldsymbol\theta\) from real data.

Two-step estimation: Step 1 (policy function) estimate \(I_{tj}=\max\{0,Q_2(\mathbf s_t;\boldsymbol\beta_f)+e_{I_{tj}}\}\) (20.31), \(f_t=Q_2(\cdot)+e_{f_t}\) (20.32) (\(Q_2\) a second-degree polynomial), and state transitions: capital (20.33), division productivity \(z_{t+1,j}=G_s(z_{tj})+\varepsilon\) (20.34) (\(G_s\) a 9-knot linear spline, \(z\) proxied first by \(Q\) then ROA), TED \(\zeta_{t+1}=\alpha_\zeta+\beta_\zeta\zeta_t+\varepsilon\) (20.35), cash (20.36). Step 2 (utility parameters) simulate state sequences forward with the policy functions (\(e=0\)) → plug into (20.29), (20.30) for the implied \(\hat V^{(j)}(\boldsymbol\theta)\); perturb the policy functions (\(e\) drawn from the empirical distribution) for \(\hat V^{(j)(h)}(\boldsymbol\theta)\); define the profit from deviating, \(g=\max\{0,\hat V^{(j)(h)}-\hat V^{(j)}\}\), ideally with expectation 0 (value already maximized); Monte-Carlo average \(g(\boldsymbol\theta)^2\), solve \(\boldsymbol\theta=\arg\min_{\boldsymbol\theta}g(\boldsymbol\theta)^2\). Remark 20.3: the estimation depends on hard-to-justify assumptions — (1) managers really have the (20.29)-form utility; (2) firms actually maximize value per (20.29).

Results: the estimated \(\boldsymbol\theta\) (Figure 20.16, \(Q\) and ROA productivity measures); out-of-sample forward simulation gives Figure 20.17 (evolution of a diversified firm's excess value relative to stand-alones): conglomerates are always below stand-alones (the efficiency loss from corporate socialism = the dark side of internal financing); in high-bank-rate periods (high overall credit risk) conglomerates have higher relative value (though still negative) — the benefit of internal financing when external financing is more costly = the bright side.

References