26. Bayesian Learning with Subjective Beliefs
本章用贝叶斯学习为投资者的主观信念提供微观基础。设股利增长 \(\Delta d_t=\mu+\varepsilon_t\),代理人不知真值 \(\mu\),只能从数据中学习,得到时变的主观估计 \(\tilde\mu_t\)。三条主线:(1) 收益可预测性(§26.2/26.3.6)——Campbell-Shiller 分解给出 \(\mathbb E_t[r_{t+1}]-r_f\) 由"误判 \(\mu-\tilde\mu_t\)"与"信念动态 \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t\)"两部分驱动,即便代理人完全理性地贝叶斯学习,收益仍可预测;(2) 贝叶斯更新(§26.3)——共轭正态先验 \(\mathcal N(\tilde\mu_0,\nu_0\sigma^2)\) 经更新得后验 \(\mathcal N(\tilde\mu_t,\nu_t\sigma^2)\),增益 \(\nu_t\) 递减(递减增益),且主观信念是鞅 \(\tilde\mu_t=\tilde{\mathbb E}_t[\tilde\mu_{t+1}]\) (26.20);(3) 资产定价(§26.4)——在 Epstein-Zin 偏好(\(\psi=1\))下分两情形:情形 1 预期效用 (anticipated utility) 把当前估计 \(\tilde\mu_t\) 当作永久真值、忽略未来不确定性(主观风险溢价恒为 \(\gamma\sigma^2\),学习对价格无影响);情形 2 定价主观增长不确定性 把 \(\mu\) 的不确定性 \(\nu_t\) 也定价进 SDF。最后 (§26.5) 介绍 Nagel-Xu (2019) 渐忘记忆 (fading memory) 模型——常增益更新 (constant-gain) \(\tilde\mu_{t+1}=\tilde\mu_t+\nu(\Delta d_{t+1}-\tilde\mu_t)\),预测客观股权溢价高且强反周期、主观股权溢价恒定。
This chapter uses Bayesian learning to micro-found investors' subjective beliefs. With dividend growth \(\Delta d_t=\mu+\varepsilon_t\), the agent does not know the true \(\mu\) and learns it from data, obtaining a time-varying subjective estimate \(\tilde\mu_t\). Three threads: (1) return predictability (§26.2/26.3.6) — the Campbell-Shiller decomposition gives \(\mathbb E_t[r_{t+1}]-r_f\) driven by "misperception \(\mu-\tilde\mu_t\)" and "belief dynamics \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t\)", so returns are predictable even when the agent is fully rational and Bayesian-learning; (2) Bayesian updating (§26.3) — a conjugate normal prior \(\mathcal N(\tilde\mu_0,\nu_0\sigma^2)\) updates to the posterior \(\mathcal N(\tilde\mu_t,\nu_t\sigma^2)\), with decreasing gain \(\nu_t\) (decreasing-gain updating), and the subjective belief is a martingale \(\tilde\mu_t=\tilde{\mathbb E}_t[\tilde\mu_{t+1}]\) (26.20); (3) asset pricing (§26.4) — under Epstein-Zin preference (\(\psi=1\)), two cases: Case 1 anticipated utility treats the current estimate \(\tilde\mu_t\) as a permanent truth and ignores future uncertainty (subjective risk premium constant at \(\gamma\sigma^2\), learning has no price effect); Case 2 priced subjective growth uncertainty also prices the uncertainty \(\nu_t\) about \(\mu\) into the SDF. Finally (§26.5) the Nagel-Xu (2019) fading-memory model — constant-gain updating \(\tilde\mu_{t+1}=\tilde\mu_t+\nu(\Delta d_{t+1}-\tilde\mu_t)\) predicts a high, strongly counter-cyclical objective equity premium and a constant subjective equity premium.
26.1 Setup
经济中有一资产,\(t\) 时支付股利 \(D_t\)、价格 \(P_t\)(可解释为整体股市或个股;个股因学习涉及同业横截面更复杂,故假设整体市场)。记 \(d_t\equiv\ln D_t\)、\(\Delta d_t=d_t-d_{t-1}\),股利增长 (26.1):\(\Delta d_t=\mu+\varepsilon_t\),\(\varepsilon_t\overset{iid}\sim\mathcal N(0,\sigma^2)\),\(\mu\) 为客观(真)均值。\(p_t\equiv\ln P_t\)。Campbell-Shiller 分解 (7.5)(\(\kappa_0\to k\)、\(\kappa_1\to\rho\))(26.2):
The economy has an asset paying dividend \(D_t\) at \(t\), price \(P_t\) (interpretable as the whole equity market or an individual stock; an individual stock is more complex due to cross-sectional learning across the same industry, so assume the aggregate market). Denote \(d_t\equiv\ln D_t\), \(\Delta d_t=d_t-d_{t-1}\), dividend growth (26.1): \(\Delta d_t=\mu+\varepsilon_t\), \(\varepsilon_t\overset{iid}\sim\mathcal N(0,\sigma^2)\), \(\mu\) the objective (true) mean. \(p_t\equiv\ln P_t\). The Campbell-Shiller decomposition (7.5) (\(\kappa_0\to k\), \(\kappa_1\to\rho\)) (26.2):
$$p_t-d_t=\frac{k}{1-\rho}+\sum_{j=0}^{\infty}\rho^j\left[(\Delta d_{t+1+j})-r_{t+1+j}\right]\tag{26.2}$$
\(r_t\) 为对数毛收益,无风险利率 \(r_f\) 常数。\(\tilde{\mathbb E}_t[\cdot]\) 为 \(t\) 时主观条件期望(基于主观概率测度);另设计量经济学家用客观测度 \(\mathbb E_t[\cdot]\) 且知真值 \(\mu\)。
关键假设:代理人有常数主观风险溢价 \(\tilde\theta\) (26.3):\(\tilde{\mathbb E}_t[r_{t+j}]=\tilde\theta+r_f\);及对(常数)无条件股利增长率的时变主观估计(对所有未来期)(26.4):\(\tilde{\mathbb E}_t[\Delta d_{t+j}]=\tilde\mu_t\),\(\forall t\ge1\)。\(\Delta d\) 真无条件均值为 (26.1) 的 \(\mu\);时变下标 \(\tilde\mu_t\) 仅表示估计随信息变。(也可设 \(\tilde\theta\) 时变,此处聚焦单一主观信念来源。)
26.2 Return Predictability
\(t\) 时 \(p_t,d_t\) 被双方观测。由 (26.3)/(26.4) 对 (26.2) 取主观期望两端 → (26.5):
\(r_t\) is the log gross return, the risk-free rate \(r_f\) constant. \(\tilde{\mathbb E}_t[\cdot]\) is the time-\(t\) subjective conditional expectation (under the subjective measure); separately the econometrician uses the objective measure \(\mathbb E_t[\cdot]\) and knows the true \(\mu\).
Crucial assumption: the agent has constant subjective risk premium \(\tilde\theta\) (26.3): \(\tilde{\mathbb E}_t[r_{t+j}]=\tilde\theta+r_f\); and a time-varying subjective estimate of the (constant) unconditional dividend growth rate (for all future periods) (26.4): \(\tilde{\mathbb E}_t[\Delta d_{t+j}]=\tilde\mu_t\), \(\forall t\ge1\). The true unconditional mean of \(\Delta d\) is \(\mu\) in (26.1); the time subscript on \(\tilde\mu_t\) just indicates the estimate varies with information. (One could also let \(\tilde\theta\) vary; here we focus on a single source of subjective beliefs.)
26.2 Return Predictability
At \(t\), \(p_t,d_t\) are observed by both. Taking the subjective expectation of (26.2) using (26.3)/(26.4) gives (26.5):
$$p_t-d_t=\frac{k}{1-\rho}+\frac1{1-\rho}\left(\tilde\mu_t-\tilde\theta-r_f\right)\tag{26.5}$$
价格今天 \(p_t\) 随主观股利增长预期 \(\tilde\mu_t\) 变。计量经济学家视角:(26.2)≡(7.4) → \(r_{t+1}=d_t-p_t+k+\rho(p_{t+1}-d_{t+1})+\Delta d_{t+1}\) (26.6)。把 (26.5) 代入 (26.6)、再取客观期望两端,得收益可预测性 (26.8):
Today's price \(p_t\) varies with the subjective dividend-growth expectation \(\tilde\mu_t\). Econometrician's view: (26.2)≡(7.4) → \(r_{t+1}=d_t-p_t+k+\rho(p_{t+1}-d_{t+1})+\Delta d_{t+1}\) (26.6). Plugging (26.5) into (26.6), then taking the objective expectation of both sides, gives return predictability (26.8):
$$\mathbb E_t[r_{t+1}]-r_f=\underbrace{\tilde\theta+\frac{\rho}{1-\rho}\left(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t\right)+(\mu-\tilde\mu_t)}_{\text{subjective premium }\tilde\theta\ \ne\ \text{objective }\theta}\tag{26.8}$$
证明 / Proof:(26.7) 与 (26.8)
把 (26.5)(及其 \(t+1\) 版)代入 (26.6) 整理得 (26.7): $$r_{t+1}=\tilde\theta+r_f-\frac1{1-\rho}\tilde\mu_t+\frac{\rho}{1-\rho}\tilde\mu_{t+1}+\Delta d_{t+1}\tag{26.7}$$ 两端取客观期望 \(\mathbb E_t\),\(\mathbb E_t[\Delta d_{t+1}]=\mu\),整理即得 (26.8)。\(\blacksquare\)
Substituting (26.5) (and its \(t+1\) version) into (26.6) and simplifying gives (26.7): $$r_{t+1}=\tilde\theta+r_f-\frac1{1-\rho}\tilde\mu_t+\frac{\rho}{1-\rho}\tilde\mu_{t+1}+\Delta d_{t+1}\tag{26.7'}$$ Taking the objective expectation \(\mathbb E_t\) of both sides, with \(\mathbb E_t[\Delta d_{t+1}]=\mu\), gives (26.8). \(\blacksquare\)
(26.8) 的可预测性来自两部分:Part 1 \(\mu-\tilde\mu_t\)(下期股利增长的可预测误判——若代理人高估 \(\mu-\tilde\mu_t<0\),计量经济学家知客观风险溢价下期更低);Part 2 \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t\)(代理人主观信念的可预测动态——若计量经济学家知代理人下期更乐观 \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t>0\),客观风险溢价下期更高)。
26.3 Bayesian Learning
26.3.1 Assumptions
代理人想知道的参数是 (26.1) 中的 \(\mu\),\(\varepsilon_t\overset{iid}\sim\mathcal N(0,\sigma^2)\),\(\Delta d_t\) 的 p.d.f. \(f(\Delta d_t\mid\mu)\) 属指数族 (26.9)。设代理人知 \(\sigma^2\) 但不知 \(\mu\)(合理:大样本/高频数据改善 \(\sigma^2\) 而非 \(\mu\),见第 39 章)。\(t\) 时累积信息 \(\mathcal H_t=\{\Delta d_t,\Delta d_{t-1},\dots,\Delta d_1\}\)。贝叶斯视角:代理人视 \(\mu\) 为随机变量(对它有主观不确定性)。
26.3.2 Conjugate Prior
设代理人在观测任何数据前有共轭先验 \(\pi_0(\mu)\)(He 2019a §7.3)(26.10)。改写分子 (26.11) 可见共轭先验为正态分布,均值 \(\tilde\mu_0\)、方差 \(\nu_0\sigma^2\),\(\tilde\mu_0,\nu_0>0\) 任意 (26.12):
The predictability in (26.8) comes from two parts: Part 1 \(\mu-\tilde\mu_t\) (predictable misperception of next period's dividend growth — if the agent over-predicts, \(\mu-\tilde\mu_t<0\), the econometrician knows the objective risk premium is lower next period); Part 2 \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t\) (predictable dynamics of the agent's subjective beliefs — if the econometrician knows the agent will be more optimistic next period, \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t>0\), the objective risk premium is higher next period).
26.3 Bayesian Learning
26.3.1 Assumptions
The parameter the agent wants to know is \(\mu\) in (26.1), \(\varepsilon_t\overset{iid}\sim\mathcal N(0,\sigma^2)\), and the p.d.f. \(f(\Delta d_t\mid\mu)\) is an exponential family (26.9). The agent knows \(\sigma^2\) but not \(\mu\) (reasonable: large-sample/high-frequency data improves \(\sigma^2\) but not \(\mu\), see Chapter 39). Accumulated information at \(t\): \(\mathcal H_t=\{\Delta d_t,\Delta d_{t-1},\dots,\Delta d_1\}\). Bayesian view: the agent treats \(\mu\) as a random variable (subjective uncertainty about it).
26.3.2 Conjugate Prior
The agent has a conjugate prior \(\pi_0(\mu)\) before observing any data (He 2019a §7.3) (26.10). Rewriting the numerator (26.11) shows the conjugate prior is a normal distribution with mean \(\tilde\mu_0\), variance \(\nu_0\sigma^2\), \(\tilde\mu_0,\nu_0>0\) arbitrary (26.12):
$$\text{Prior Distribution: }\mathcal N(\tilde\mu_0,\nu_0\sigma^2)\tag{26.12}$$
若 \(\nu_0=+\infty\),代理人无任何先验信息(平坦先验、对任意 \(\mu\) 赋等零权重)。
26.3.3 Bayesian Updating
Proposition 26.1(正态贝叶斯更新) 设 \(x\mid\theta\sim\mathcal N(\theta,\sigma^2)\),先验 \(\theta\sim\mathcal N(\mu,\tau^2)\),则后验 \(\theta\mid x\sim\mathcal N(\bar\mu,\bar\tau^2)\),其中 \(\bar\tau^{-2}=\tau^{-2}+\sigma^{-2}\),\(\bar\mu=\frac{\sigma^{-2}}{\sigma^{-2}+\tau^{-2}}x+\frac{\tau^{-2}}{\sigma^{-2}+\tau^{-2}}\mu\)。(证:He 2019a Prop 7.8。)
应用:观测 \(\mathcal H_1\) 后后验 \(\mu\sim\mathcal N(\tilde\mu_1,\sigma_1^2)\)。由 Prop 26.1 归纳,\(t\) 时后验 (26.13):
If \(\nu_0=+\infty\), the agent has no prior information (flat prior, equal zero weights on any \(\mu\)).
26.3.3 Bayesian Updating
Proposition 26.1 (Normal Bayesian updating) If \(x\mid\theta\sim\mathcal N(\theta,\sigma^2)\), prior \(\theta\sim\mathcal N(\mu,\tau^2)\), then the posterior \(\theta\mid x\sim\mathcal N(\bar\mu,\bar\tau^2)\), where \(\bar\tau^{-2}=\tau^{-2}+\sigma^{-2}\), \(\bar\mu=\frac{\sigma^{-2}}{\sigma^{-2}+\tau^{-2}}x+\frac{\tau^{-2}}{\sigma^{-2}+\tau^{-2}}\mu\). (Proof: He 2019a Prop 7.8.)
Application: after observing \(\mathcal H_1\), the posterior \(\mu\sim\mathcal N(\tilde\mu_1,\sigma_1^2)\). By Proposition 26.1 and induction, the posterior at \(t\) (26.13):
$$\text{Posterior after }\mathcal H_t:\ \mathcal N(\tilde\mu_t,\nu_t\sigma^2),\quad\tilde\mu_t=\tilde\mu_{t-1}+\nu_t(\Delta d_t-\tilde\mu_{t-1}),\quad\nu_t=\frac{\nu_{t-1}}{1+\nu_{t-1}}\tag{26.13–15}$$
- 若 \(\nu_0=+\infty\)(无先验信息):\(\nu_t=\frac1t\)(简单归纳),代理人实质取 \(\mathcal H_t\) 信息的样本平均估 \(\mu\)(直觉:无先验则等权对待每条新信息)。
- 对任意 \(\nu_0\),\(\nu_t\) 随时间递减(因 \(\frac1{1+\nu_{t-1}}\in(0,1)\)),故从观测 \(\Delta d_t\) 获得的信息增益随时间递减,称递减增益更新 (decreasing gain updating)。
Remark 26.1(MAP 估计) 有时不需刻画整个后验,只需后验的点估计。最大后验 (MAP) 估计 (26.16):\(\hat\mu_{\text{MAP},t}\equiv\arg\max_\mu\pi(\mu\mid\mathcal H_t)\)(给定先验与 \(\mathcal H_t\) 下"最可能"的 \(\mu\))。两特例:(1) \(\pi(\mu\mid\mathcal H_t)\) 正态(如 26.13)→ \(\hat\mu_{\text{MAP},t}=\tilde\mu_t\) 后验均值;(2) 先验方差为无穷 → MAP = MLE(先验无信息)。
26.3.4 Dynamic Subjective Distribution of Dividend Growth
\(\Delta d_{t+1}=\mu_t+\varepsilon_{t+1}\),\(\varepsilon\overset{iid}\sim\mathcal N(0,\sigma^2)\),均值 \(\mu_t\) 服从主观分布 \(\mu_t\sim\mathcal N(\tilde\mu_t,\nu_t\sigma^2)\)。\(\varepsilon_{t+1}\)(\(t+1\) 信息)⊥ \(\mu_t\)(\(t\) 信息)→ \(\tilde{\mathbb E}_t[\Delta d_{t+1}]=\tilde\mu_t\)、\(\widetilde{\text{Var}}_t(\Delta d_{t+1})=(1+\nu_t)\sigma^2\),即 (26.17):
- If \(\nu_0=+\infty\) (no prior information): \(\nu_t=\frac1t\) (simple induction), so the agent effectively takes the sample average of \(\mathcal H_t\) to estimate \(\mu\) (intuitive: with no prior, weight each new piece equally).
- For any \(\nu_0\), \(\nu_t\) is decreasing in time (since \(\frac1{1+\nu_{t-1}}\in(0,1)\)), so the information gain from observing \(\Delta d_t\) decreases over time — decreasing-gain updating.
Remark 26.1 (MAP estimator) Sometimes we needn't characterize the whole posterior, only its point estimate. The maximum-a-posteriori (MAP) estimator (26.16): \(\hat\mu_{\text{MAP},t}\equiv\arg\max_\mu\pi(\mu\mid\mathcal H_t)\) (the "most likely" \(\mu\) given the prior and \(\mathcal H_t\)). Two special cases: (1) \(\pi(\mu\mid\mathcal H_t)\) normal (e.g. 26.13) → \(\hat\mu_{\text{MAP},t}=\tilde\mu_t\) the posterior mean; (2) prior variance infinity → MAP = MLE (prior provides no information).
26.3.4 Dynamic Subjective Distribution of Dividend Growth
\(\Delta d_{t+1}=\mu_t+\varepsilon_{t+1}\), \(\varepsilon\overset{iid}\sim\mathcal N(0,\sigma^2)\), with mean \(\mu_t\) following the subjective distribution \(\mu_t\sim\mathcal N(\tilde\mu_t,\nu_t\sigma^2)\). \(\varepsilon_{t+1}\) (time \(t+1\) info) \(\perp\mu_t\) (time \(t\) info) → \(\tilde{\mathbb E}_t[\Delta d_{t+1}]=\tilde\mu_t\), \(\widetilde{\text{Var}}_t(\Delta d_{t+1})=(1+\nu_t)\sigma^2\), i.e. (26.17):
$$\Delta d_{t+1}\sim\mathcal N\!\left(\tilde\mu_t,(1+\nu_t)\sigma^2\right)\tag{26.17}$$
26.3.5 Martingale Posterior Beliefs
重写 (26.14) 为 \(t\to t+1\):\(\tilde\mu_{t+1}=\tilde\mu_t+\nu_{t+1}(\Delta d_{t+1}-\tilde\mu_t)\) (26.18),唯一随机部分是 \(\Delta d_{t+1}-\tilde\mu_t\)(\(t\) 时后验、\(t+1\) 时先验),\(\nu_{t+1}\) 由 (26.15) 确定。由 (26.17),\(\Delta d_{t+1}-\tilde\mu_t\sim\mathcal N(0,(1+\nu_t)\sigma^2)\),故 \(t\) 时 \(\tilde\mu_{t+1}\) 的分布 \(\mathcal N(\tilde\mu_t,\nu_{t+1}^2(1+\nu_t)\sigma^2)\) (26.19)。等价地 (26.20)/(26.21):
26.3.5 Martingale Posterior Beliefs
Rewrite (26.14) for \(t\to t+1\): \(\tilde\mu_{t+1}=\tilde\mu_t+\nu_{t+1}(\Delta d_{t+1}-\tilde\mu_t)\) (26.18), the only random part being \(\Delta d_{t+1}-\tilde\mu_t\) (posterior at \(t\), prior at \(t+1\)), \(\nu_{t+1}\) deterministic by (26.15). By (26.17), \(\Delta d_{t+1}-\tilde\mu_t\sim\mathcal N(0,(1+\nu_t)\sigma^2)\), so the time-\(t\) distribution of \(\tilde\mu_{t+1}\) is \(\mathcal N(\tilde\mu_t,\nu_{t+1}^2(1+\nu_t)\sigma^2)\) (26.19). Equivalently (26.20)/(26.21):
$$\tilde\mu_{t+1}=\tilde\mu_t+\nu_{t+1}\sqrt{(1+\nu_t)}\sigma\tilde\epsilon_{t+1},\qquad\tilde\epsilon_{t+1}\equiv\frac{\Delta d_{t+1}-\tilde\mu_t}{\sqrt{(1+\nu_t)}\sigma}\sim\mathcal N(0,1)\tag{26.20–21}$$
\(\tilde\epsilon_{t+1}\) 是代理人视角的 i.i.d. 冲击。(26.20) 表明代理人信念是鞅:\(\tilde\mu_t=\tilde{\mathbb E}_t[\tilde\mu_{t+1}]\)(直觉:代理人用尽 \(t\) 时全部信息,无残留的预期非零差,否则会立即调整 \(\tilde\mu_t\) 使鞅性成立)。
26.3.6 Revisit Return Predictability
(26.8) 在贝叶斯学习下:先算 \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t=\nu_{t+1}(\mathbb E_t[\Delta d_{t+1}]-\tilde\mu_t)=\nu_{t+1}(\mu-\tilde\mu_t)\) (26.23)(\(\nu_{t+1}\) 确定,\(\mathbb E_t[\Delta d_{t+1}]=\mu\) 客观)。代入 (26.22) 得 (26.24):
\(\tilde\epsilon_{t+1}\) is the i.i.d. shock in the agent's perspective. (26.20) shows the agent's belief is a martingale: \(\tilde\mu_t=\tilde{\mathbb E}_t[\tilde\mu_{t+1}]\) (intuitive: the agent uses up all time-\(t\) information, with no expected nonzero difference remaining, else they would immediately adjust \(\tilde\mu_t\) to make martingality hold).
26.3.6 Revisit Return Predictability
(26.8) under Bayesian learning: first compute \(\mathbb E_t[\tilde\mu_{t+1}]-\tilde\mu_t=\nu_{t+1}(\mathbb E_t[\Delta d_{t+1}]-\tilde\mu_t)=\nu_{t+1}(\mu-\tilde\mu_t)\) (26.23) (\(\nu_{t+1}\) deterministic, \(\mathbb E_t[\Delta d_{t+1}]=\mu\) objective). Substituting into (26.22) gives (26.24):
$$\mathbb E_t[r_{t+1}]-r_f=\tilde\theta+\left(\frac{\rho}{1-\rho}\nu_{t+1}+1\right)(\mu-\tilde\mu_t)\tag{26.24}$$
即便代理人完全理性、动态贝叶斯学习并用尽每条信息更新信念,收益(风险溢价)仍可预测,可预测性来自 \(\mu-\tilde\mu_t\)。关键假设是计量经济学家知客观真值 \(\mu\)。
26.4 Asset Pricing with Subjective Learning
26.4.1 Environment
禀赋经济,代表性代理人,消费 \(C_t\) 唯一来源是资产股利。\(c_t\equiv\ln C_t\)。市场出清 \(C_{t+1}=D_{t+1}\),故对数消费增长 = 对数股利增长 \(\Delta c_{t+1}=\Delta d_{t+1}\)。由 (26.1) (26.25):\(\Delta c_{t+1}=\mu+\varepsilon_{t+1}\),\(\varepsilon\overset{iid}\sim\mathcal N(0,\sigma^2)\)。
26.4.2 Epstein-Zin Preference
代表性代理人有 Epstein-Zin 偏好 (9.1) 即 (26.26):\(V_t=[(1-\delta)C_t^{1-1/\psi}+\delta(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}])^{\frac{1-1/\psi}{1-\gamma}}]^{\frac1{1-1/\psi}}\)。\(V_t,\delta,\psi\) 对应 (9.1) 的 \(U_t,\beta,1/\rho\);用主观测度评估期望(代理人无客观测度信息);\(\gamma\) 相对风险厌恶、\(\psi\) 跨期替代弹性 (IES)。财富组合 \(W_{t+1}=R_{w,t+1}(W_t-C_t)\) (26.27),由 (9.15) \(W_t=\frac1{1-\delta}V_t^{1-1/\psi}C_t^{1/\psi}\) (26.28)。EZ 主观 SDF (9.12) (26.29):\(\tilde M_{t+1}=\delta(\frac{C_{t+1}}{C_t})^{-1/\psi}(\frac{V_{t+1}}{(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}])^{1/(1-\gamma)}})^{1/\psi-\gamma}\)。
Remark 26.2 (26.29) 的推导(§9.3)基于关键假设:\(C_{t+1}\) 与 \(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}]^{1/(1-\gamma)}\) 独立(或至少 \(\partial(\cdot)/\partial C_{t+1}=0\))。这对 §26.4.3 预期效用成立(未来股利冲击 i.i.d.,\(C_{t+1}\) 信息不改 \(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}]\) 的评估)。对 §26.4.4 定价主观增长不确定性也成立:\(V_{t+1}\) 与 \(\tilde{\mathbb E}_{t+1}[V_{t+2}^{1-\gamma}]^{1/(1-\gamma)}\) 虽不完全独立,但 \(t+1\) 时主观测度的变动在 \(t\) 时不受代理人控制,那个额外边际效用渠道的 \(t\) 期期望为零,故仍退化为标准 SDF (26.29)。
下文聚焦 \(\psi=1\)、\(\gamma\neq1\)。(26.28) → \(\frac{W_t}{C_t}=\frac1{1-\delta}\) 常数 (26.30)。由 (9.3),(26.26) → \(V_t=(C_t)^{1-\delta}(R_t)^\delta\) (26.31),\(R_t=\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}]^{\frac1{1-\gamma}}\)。\(V_{t+1}\) 关于全部未来消费一阶齐次 → 关于 \(W_{t+1}\) 一阶齐次 \(V_{t+1}=\alpha W_{t+1}=\alpha C_{t+1}\frac1{1-\delta}\) (26.32)。代入 (26.29) 并取对数得对数主观 SDF (26.34):
Even when the agent is fully rational, dynamically Bayesian-learning, and uses every piece of information to update beliefs, returns (the risk premium) are still predictable, with predictability from \(\mu-\tilde\mu_t\). The crucial assumption is that the econometrician knows the objective true \(\mu\).
26.4 Asset Pricing with Subjective Learning
26.4.1 Environment
Endowment economy, representative agent, consumption \(C_t\) whose only source is the asset dividend. \(c_t\equiv\ln C_t\). Market clearing \(C_{t+1}=D_{t+1}\), so log consumption growth = log dividend growth \(\Delta c_{t+1}=\Delta d_{t+1}\). By (26.1) (26.25): \(\Delta c_{t+1}=\mu+\varepsilon_{t+1}\), \(\varepsilon\overset{iid}\sim\mathcal N(0,\sigma^2)\).
26.4.2 Epstein-Zin Preference
The representative agent has Epstein-Zin preference (9.1), i.e. (26.26): \(V_t=[(1-\delta)C_t^{1-1/\psi}+\delta(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}])^{\frac{1-1/\psi}{1-\gamma}}]^{\frac1{1-1/\psi}}\). \(V_t,\delta,\psi\) map to \(U_t,\beta,1/\rho\) in (9.1); the subjective measure evaluates the expectation (agent has no objective-measure info); \(\gamma\) relative risk aversion, \(\psi\) IES. Wealth portfolio \(W_{t+1}=R_{w,t+1}(W_t-C_t)\) (26.27), by (9.15) \(W_t=\frac1{1-\delta}V_t^{1-1/\psi}C_t^{1/\psi}\) (26.28). EZ subjective SDF (9.12) (26.29): \(\tilde M_{t+1}=\delta(\frac{C_{t+1}}{C_t})^{-1/\psi}(\frac{V_{t+1}}{(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}])^{1/(1-\gamma)}})^{1/\psi-\gamma}\).
Remark 26.2 The derivation (§9.3) of (26.29) relies on the crucial assumption that \(C_{t+1}\) and \(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}]^{1/(1-\gamma)}\) are independent (or at least \(\partial(\cdot)/\partial C_{t+1}=0\)). This holds for §26.4.3 anticipated utility (future dividend shocks i.i.d., \(C_{t+1}\) info doesn't alter the evaluation of \(\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}]\)). It also applies to §26.4.4 priced subjective growth uncertainty: although \(V_{t+1}\) and \(\tilde{\mathbb E}_{t+1}[V_{t+2}^{1-\gamma}]^{1/(1-\gamma)}\) are not completely independent, the change in the time-\(t+1\) subjective measure is out of the agent's control at \(t\), so the time-\(t\) expectation of that additional marginal-utility channel is zero, and it degenerates to the standard SDF (26.29).
Below we focus on \(\psi=1\), \(\gamma\neq1\). (26.28) → \(\frac{W_t}{C_t}=\frac1{1-\delta}\) constant (26.30). By (9.3), (26.26) → \(V_t=(C_t)^{1-\delta}(R_t)^\delta\) (26.31), \(R_t=\tilde{\mathbb E}_t[V_{t+1}^{1-\gamma}]^{\frac1{1-\gamma}}\). \(V_{t+1}\) is h.o.d. 1 in all future consumption → h.o.d. 1 in \(W_{t+1}\), \(V_{t+1}=\alpha W_{t+1}=\alpha C_{t+1}\frac1{1-\delta}\) (26.32). Substituting into (26.29) and taking logs gives the log subjective SDF (26.34):
$$\tilde m_{t+1}=\ln\delta-\gamma\Delta c_{t+1}-\ln\left(\tilde{\mathbb E}_t\!\left[e^{(1-\gamma)\Delta c_{t+1}}\right]\right)\tag{26.34}$$
26.4.3 Scenario 1: Anticipated Utility
代理人把 \(t\) 时后验 \(\tilde\mu_t\) 代入 (26.25),完全忽略未来不确定性与 \(\mu\) 的更新:\(\Delta c_{t+1}=\tilde\mu_t+\varepsilon_{t+1}\),\(\varepsilon\overset{iid}\sim\mathcal N(0,\sigma^2)\) (26.35)(\(\Delta c\sim\mathcal N(\tilde\mu_t,\sigma^2)\),代理人认定 \(\tilde\mu_t\) 为永久真值)。称预期效用 (anticipated utility)。由对数正态 \(\tilde{\mathbb E}_t[e^{(1-\gamma)\Delta c_{t+1}}]=e^{(1-\gamma)\tilde\mu_t+\frac12(1-\gamma)^2\sigma^2}\) (26.36),代入 (26.34):
The agent plugs the time-\(t\) posterior \(\tilde\mu_t\) into (26.25), completely ignoring future uncertainty and updating of \(\mu\): \(\Delta c_{t+1}=\tilde\mu_t+\varepsilon_{t+1}\), \(\varepsilon\overset{iid}\sim\mathcal N(0,\sigma^2)\) (26.35) (\(\Delta c\sim\mathcal N(\tilde\mu_t,\sigma^2)\), the agent fixes \(\tilde\mu_t\) as a permanent truth). Call this anticipated utility. By log-normality \(\tilde{\mathbb E}_t[e^{(1-\gamma)\Delta c_{t+1}}]=e^{(1-\gamma)\tilde\mu_t+\frac12(1-\gamma)^2\sigma^2}\) (26.36), substituting into (26.34):
$$\tilde m_{t+1}=\tilde\mu_m-\gamma\tilde\mu_t-\gamma\varepsilon_{t+1},\qquad\tilde\mu_m=\ln\delta-\tfrac12(1-\gamma)^2\sigma^2\tag{26.37}$$
无风险利率 \(\frac1{R_{f,t}}=\tilde{\mathbb E}_t[\tilde M_{t+1}]\) (26.38) → \(r_{f,t}=-\tilde\mu_m+\tilde\mu_t-\frac12\gamma^2\sigma^2\) (26.39)。财富组合:禀赋经济中财富组合即产生总股利的证券价格,\(R_{w,t+1}=\frac1\delta\frac{C_{t+1}}{C_t}\) (26.40),\(r_{w,t+1}=-\ln\delta+\Delta c_{t+1}\) (26.41),主观方差 \(\widetilde{\text{Var}}(r_{w,t+1})=\sigma^2\)(忽略 \(\mu\) 不确定性)。主观对数风险溢价 \(\tilde{rp}_{t+1}\equiv\ln\tilde{\mathbb E}_t[R_{w,t+1}]-\ln R_{f,t}\) (26.42),及客观 \(rp_{t+1}\equiv\ln\mathbb E_t[R_{w,t+1}]-\ln R_{f,t}\) (26.44):
Risk-free rate \(\frac1{R_{f,t}}=\tilde{\mathbb E}_t[\tilde M_{t+1}]\) (26.38) → \(r_{f,t}=-\tilde\mu_m+\tilde\mu_t-\frac12\gamma^2\sigma^2\) (26.39). Wealth portfolio: in an endowment economy the wealth portfolio is the price of the security generating the aggregate dividends, \(R_{w,t+1}=\frac1\delta\frac{C_{t+1}}{C_t}\) (26.40), \(r_{w,t+1}=-\ln\delta+\Delta c_{t+1}\) (26.41), subjective variance \(\widetilde{\text{Var}}(r_{w,t+1})=\sigma^2\) (ignoring \(\mu\) uncertainty). Subjective log risk-premium \(\tilde{rp}_{t+1}\equiv\ln\tilde{\mathbb E}_t[R_{w,t+1}]-\ln R_{f,t}\) (26.42), and objective \(rp_{t+1}\equiv\ln\mathbb E_t[R_{w,t+1}]-\ln R_{f,t}\) (26.44):
$$\tilde{rp}_{t+1}=\gamma\sigma^2\tag{26.43}$$
$$rp_{t+1}=\gamma\sigma^2+(\mu-\tilde\mu_t)\tag{26.45}$$
差 \(rp-\tilde{rp}=\mu-\tilde\mu_t\):代理人过乐观 \(\mu-\tilde\mu_t<0\) 时客观风险溢价小于主观。
Remark 26.3 贝叶斯学习对资产价格无影响(主观对数风险溢价恒为常数 \(\gamma\sigma^2\))。因为代理人学习到 \(t\) 后突然认定所有未来期都用同一参数 \(\tilde\mu_t\),机械地剔除了所有学习性质(除参数被更新外),问题退化为无学习情形。
The difference \(rp-\tilde{rp}=\mu-\tilde\mu_t\): when the agent is over-optimistic (\(\mu-\tilde\mu_t<0\)), the objective risk premium is smaller than the subjective.
Remark 26.3 Bayesian learning has no effect on asset prices (the subjective log risk premium is always the constant \(\gamma\sigma^2\)). Because the agent learns up until \(t\) and then suddenly decides all future periods use the same parameter \(\tilde\mu_t\), mechanically removing all learning properties (except that the parameter is updated), so the problem degenerates to the non-learning case.
26.4.4 Scenario 2: Priced Subjective Growth Uncertainty
代理人把 \(\tilde\mu_t\) 代入 (26.25),但充分意识到 \(\mu\) 不确定、未来对 \(\mu\) 分布的看法会变。\(\Delta c_{t+1}=\tilde\mu_t+\varepsilon_{t+1}\) (26.46) 配合 (26.17) \(\Delta c_{t+1}\sim\mathcal N(\tilde\mu_t,(1+\nu_t)\sigma^2)\)。\(\mu\) 不确定性通过 \(\nu_t\) 动态被定价,称定价主观增长不确定性 (priced subjective growth uncertainty)。对数 SDF:\(\tilde{\mathbb E}_t[e^{m(\Delta c_{t+1})}]=e^{m\tilde\mu_t+\frac12m^2(1+\nu_t)\sigma^2}\) (26.47)。由 Remark 26.2,SDF (26.29) 仍成立,但 \(R_t\) 须计入 \(V_{t+1}\) 在 \(t+1\) 时所有可能主观分布下的 \(t\) 期期望。经价值函数求解(见折叠推导)得对数主观 SDF (26.58):
The agent plugs \(\tilde\mu_t\) into (26.25) but is fully aware that \(\mu\) is uncertain and views on its distribution will change in the future. \(\Delta c_{t+1}=\tilde\mu_t+\varepsilon_{t+1}\) (26.46) with (26.17) \(\Delta c_{t+1}\sim\mathcal N(\tilde\mu_t,(1+\nu_t)\sigma^2)\). The uncertainty about \(\mu\) is priced through the dynamics of \(\nu_t\) — priced subjective growth uncertainty. Log SDF: \(\tilde{\mathbb E}_t[e^{m(\Delta c_{t+1})}]=e^{m\tilde\mu_t+\frac12m^2(1+\nu_t)\sigma^2}\) (26.47). By Remark 26.2, the SDF (26.29) still holds, but \(R_t\) must take into account the time-\(t\) expectation of \(V_{t+1}\) under all possible subjective distributions at \(t+1\). Solving the value function (see the collapsible derivation) gives the log subjective SDF (26.58):
$$\tilde m_{t+1}=\tilde\mu_{m,t}-\tilde\mu_t-\xi_t\sigma\tilde\epsilon_{t+1}\tag{26.58}$$
$$\tilde\mu_{m,t}=\ln\delta-\tfrac12(1-\gamma)^2(1+U_v\nu_{t+1})^2(1+\nu_t)\sigma^2,\quad\xi_t=[1-(1-\gamma)(U_v\nu_{t+1}+1)]\sqrt{1+\nu_t},\quad U_v=\tfrac{\delta}{1-\delta}\tag{26.59}$$
证明 / Proof:价值函数 (26.49)–(26.53) 与 SDF (26.54)–(26.58)
价值函数:定义 \(v_t\equiv\ln\frac{V_t}{C_t}\)。由 (26.31) \(\psi=1\),\(v_t=\delta\ln\frac{R_t}{C_t}=\frac{\delta}{1-\gamma}\ln(\tilde{\mathbb E}_t[e^{(1-\gamma)(v_{t+1}+\Delta c_{t+1})}])\) (26.48–49)。猜测 \(v_t=\mu_{v,t}+U_v\tilde\mu_t\) (26.50)。代入 (26.49),用 (26.20)(26.21)(26.47),匹配系数:\(\delta(U_v+1)=U_v\Rightarrow U_v=\frac\delta{1-\delta}\) (26.52);\(\frac1\delta\mu_{v,t}=\mu_{v,t+1}+\frac12(1-\gamma)(1+U_v\nu_{t+1})^2(1+\nu_t)\sigma^2\) (26.53)。\(\mu_{v,t}\) 由确定性运动律 (26.53) 钉住(无需显式解)。(26.52)+(26.53) 验证 (26.50) 解 (26.49)。
SDF:重写 (26.29) \(\psi=1\):\(\tilde M_{t+1}=\delta(\frac{C_{t+1}}{C_t})^{-1}(\frac{V_{t+1}}{R_t})^{1-\gamma}\) (26.54)。由 (26.48) \(R_t=e^{\frac1\delta v_t+c_t}\) (26.55),\(V_{t+1}=e^{v_{t+1}+c_{t+1}}\) (26.56),(26.21) \(\Delta c_{t+1}=\sqrt{1+\nu_t}\sigma\tilde\epsilon_{t+1}+\tilde\mu_t\) (26.57)。把 (26.55)(26.56)(26.57) 与 \(v_{t+1}=\mu_{v,t+1}+U_v\tilde\mu_{t+1}\)、(26.20) 代入 \(\tilde m_{t+1}=\ln\tilde M_{t+1}\),整理(\(\tilde\mu_{t+1}\) 的系数合并、\(\mu_{v,t}\) 项用 (26.53) 抵消)得 (26.58)。\(\blacksquare\)
Value function: define \(v_t\equiv\ln\frac{V_t}{C_t}\). By (26.31) \(\psi=1\), \(v_t=\delta\ln\frac{R_t}{C_t}=\frac{\delta}{1-\gamma}\ln(\tilde{\mathbb E}_t[e^{(1-\gamma)(v_{t+1}+\Delta c_{t+1})}])\) (26.48–49). Guess \(v_t=\mu_{v,t}+U_v\tilde\mu_t\) (26.50). Substituting into (26.49), using (26.20)(26.21)(26.47), matching coefficients: \(\delta(U_v+1)=U_v\Rightarrow U_v=\frac\delta{1-\delta}\) (26.52); \(\frac1\delta\mu_{v,t}=\mu_{v,t+1}+\frac12(1-\gamma)(1+U_v\nu_{t+1})^2(1+\nu_t)\sigma^2\) (26.53). \(\mu_{v,t}\) is pinned by the deterministic law of motion (26.53) (no explicit solution needed). (26.52)+(26.53) verify (26.50) solves (26.49).
SDF: rewrite (26.29) \(\psi=1\): \(\tilde M_{t+1}=\delta(\frac{C_{t+1}}{C_t})^{-1}(\frac{V_{t+1}}{R_t})^{1-\gamma}\) (26.54). By (26.48) \(R_t=e^{\frac1\delta v_t+c_t}\) (26.55), \(V_{t+1}=e^{v_{t+1}+c_{t+1}}\) (26.56), (26.21) \(\Delta c_{t+1}=\sqrt{1+\nu_t}\sigma\tilde\epsilon_{t+1}+\tilde\mu_t\) (26.57). Substituting (26.55)(26.56)(26.57) with \(v_{t+1}=\mu_{v,t+1}+U_v\tilde\mu_{t+1}\) and (26.20) into \(\tilde m_{t+1}=\ln\tilde M_{t+1}\), simplifying (the \(\tilde\mu_{t+1}\) coefficients combine, the \(\mu_{v,t}\) terms cancel via (26.53)) gives (26.58). \(\blacksquare\)
\(\tilde\epsilon_{t+1}=\frac{\Delta c_{t+1}-\tilde\mu_t}{\sqrt{1+\nu_t}\sigma}\sim\mathcal N(0,1)\)。
Remark 26.4 对比 SDF (26.58)(随机来自 \(\tilde\epsilon_{t+1}\),定价主观增长不确定性)与 SDF (26.37)(随机来自 \(\varepsilon_{t+1}\),预期效用),可见 SDF 只允许在代理人视角下完全不受控的随机性。
无风险利率 \(r_{f,t}=-\tilde\mu_{m,t}+\tilde\mu_t-\frac12\xi_t^2\sigma^2\) (26.61)。财富-消费比仍常数((26.30) 成立,因 (9.15) 只依赖 SDF 形式 26.29),\(R_{w,t+1}=\frac1\delta\frac{C_{t+1}}{C_t}\) (26.62),\(r_{w,t+1}=-\ln\delta+\Delta c_{t+1}\) (26.63),主观方差 \(\widetilde{\text{Var}}(r_{w,t+1})=(1+\nu_t)\sigma^2\)(计入 \(\mu\) 不确定性)。主观与客观对数风险溢价 (26.65)/(26.67):
\(\tilde\epsilon_{t+1}=\frac{\Delta c_{t+1}-\tilde\mu_t}{\sqrt{1+\nu_t}\sigma}\sim\mathcal N(0,1)\).
Remark 26.4 Comparing SDF (26.58) (randomness from \(\tilde\epsilon_{t+1}\), priced subjective growth uncertainty) with SDF (26.37) (randomness from \(\varepsilon_{t+1}\), anticipated utility), the SDF only allows randomness that is completely out of control in the agent's perspective.
Risk-free rate \(r_{f,t}=-\tilde\mu_{m,t}+\tilde\mu_t-\frac12\xi_t^2\sigma^2\) (26.61). The wealth-consumption ratio is still constant ((26.30) holds, since (9.15) only depends on the SDF form (26.29)), \(R_{w,t+1}=\frac1\delta\frac{C_{t+1}}{C_t}\) (26.62), \(r_{w,t+1}=-\ln\delta+\Delta c_{t+1}\) (26.63), subjective variance \(\widetilde{\text{Var}}(r_{w,t+1})=(1+\nu_t)\sigma^2\) (taking \(\mu\) uncertainty into account). Subjective and objective log risk-premia (26.65)/(26.67):
$$\tilde{rp}_{t+1}=\xi_t\sqrt{1+\nu_t}\,\sigma^2\tag{26.65}$$
$$rp_{t+1}=-\frac{\nu_t}2\sigma^2+(\mu-\tilde\mu_t)+\xi_t\sqrt{1+\nu_t}\,\sigma^2\tag{26.67}$$
由 (26.59)(26.15),\(t\to\infty\) 时 \(\nu_t\to0\)、\(\xi_t\to\gamma\),故 (26.65) 趋近预期效用的 (26.43) \(\gamma\sigma^2\)。收益可预测性来自 \(\mu-\tilde\mu_t\)。主客观差 \(rp-\tilde{rp}=(\mu-\tilde\mu_t)-\frac{\nu_t}2\sigma^2\) (26.68)。
26.5 Learning From Fading Memory: Nagel and Xu (2019)
26.5.1 Key Points
代表性代理人以渐忘记忆 (fading memory) 学习禀赋增长的无条件均值。假设:跨期基本面 i.i.d.;代理人 Epstein-Zin 效用;风险厌恶常数;记忆随时间衰减。模型预测:客观股权溢价高且强反周期;主观股权溢价恒定。实证证据一致:经验支付增长率(主观增长预期 = 过去增长率的加权平均)与未来股市超额收益负相关;与调查中主观预期误差(主客观楔子)负相关;与分析师对长期盈利增长的预测(主观增长预期代理)正相关。
26.5.2 Constant-Gain Updating
对数股市支付 \(\Delta d_{t+1}=\mu+\varepsilon_{t+1}\),\(\varepsilon\) i.i.d.。同 §26.3.3 的贝叶斯学习,但记忆渐忘。\(t\) 时后验 (26.69):\(\pi_t(\mu)\propto\pi_0(\mu)\prod_{j=0}^\infty[e^{-\frac12((\Delta d_{t-j}-\mu)/\sigma)^2}]^{(1-\nu)^j}\),\(\nu>0\) 渐忘记忆参数。代理人始于无信息(\(\pi_t(0)\) 平坦),(26.69)→(26.70) 给后验均值 (26.71):\(\tilde\mu_t=\nu\sum_{j=0}^\infty\Delta d_{t-j}(1-\nu)^j\),后验方差 \(\nu\sigma^2\) 常数。由 (26.71) 递推得常增益更新 (constant-gain updating) (26.72):
By (26.59)(26.15), as \(t\to\infty\), \(\nu_t\to0\), \(\xi_t\to\gamma\), so (26.65) approaches the anticipated-utility (26.43) \(\gamma\sigma^2\). Return predictability comes from \(\mu-\tilde\mu_t\). Subjective-objective difference \(rp-\tilde{rp}=(\mu-\tilde\mu_t)-\frac{\nu_t}2\sigma^2\) (26.68).
26.5 Learning From Fading Memory: Nagel and Xu (2019)
26.5.1 Key Points
The representative agent learns the unconditional mean of endowment growth with fading memory. Assumptions: fundamentals i.i.d. across periods; agent has Epstein-Zin utility; risk aversion constant; memory decays over time. Model predicts: objective equity premium high and strongly counter-cyclical; subjective equity premium constant. Consistent evidence: the experienced payout growth rate (subjective growth expectations as a weighted average of past growth rates) is negatively related to future stock-market excess returns; negatively related to subjective expectation errors (the objective-subjective wedge) in surveys; positively related to analyst forecasts of long-run earnings growth (a proxy for subjective growth expectations).
26.5.2 Constant-Gain Updating
Log stock-market payout \(\Delta d_{t+1}=\mu+\varepsilon_{t+1}\), \(\varepsilon\) i.i.d. Same Bayesian learning as §26.3.3 but with fading memory. Time-\(t\) posterior (26.69): \(\pi_t(\mu)\propto\pi_0(\mu)\prod_{j=0}^\infty[e^{-\frac12((\Delta d_{t-j}-\mu)/\sigma)^2}]^{(1-\nu)^j}\), \(\nu>0\) the fading-memory parameter. The agent starts with no information (\(\pi_t(0)\) flat), (26.69)→(26.70) giving the posterior mean (26.71): \(\tilde\mu_t=\nu\sum_{j=0}^\infty\Delta d_{t-j}(1-\nu)^j\), posterior variance \(\nu\sigma^2\) constant. Recursing (26.71) gives constant-gain updating (26.72):
$$\tilde\mu_{t+1}=\tilde\mu_t+\nu(\Delta d_{t+1}-\tilde\mu_t)\tag{26.72}$$
更新 \(\tilde\mu\) 时从新信息 \(\Delta d_{t+1}\) 获得的增益是常数 \(\nu\),故称常增益更新;\(\frac1\nu\) 是估 \(\mu\) 的有效样本量(Nagel-Xu 用 \(\nu=0.018\))。两种动机:(1) 聚合论据(Malmendier-Nagel 2016,§27.2:尽管各年龄队列增益参数异质,聚合增益参数 = 等权平均为常数;聚合参数对资产定价重要 → 代表性代理人做常增益学习合理);(2) 个体组合选择的心理学论据(近因偏误 recency bias:对近期事件赋更多权重 → 对久远历史递减权重,是好假设)。
26.5.3 Empirical Analysis
数据:\(\Delta d\)(CRSP 1926 起季度;更早:Piketty et al. 2018 家庭股利税收数据 1913–1926、Wright 2004 公司非农非金融股利 1900–1913、Barro-Ursua 2008 人均实际 GDP 增长 1871–1900 作代理);市场指数收益(CRSP 价值加权 1926 起;Shiller 2005 S&P 综合 1871–1926,CPI 平减);调查(定量:UBS/Gallup 1998–2007、Vanguard/Ameriks 2019、Lease et al. 1974;定性:Michigan Survey 1987–2016、Roper 1974–1977);分析师预测(I/B/E/S 长期中位数 EPS 增长 1981 起;SPF 通胀预测)。
可检验假设:由 Campbell-Shiller (26.2) 导出 (26.73);代入常数主观风险溢价 (26.3) 化简得三组回归 (26.76)/(26.77)/(26.78):
The gain from new information \(\Delta d_{t+1}\) when updating \(\tilde\mu\) is the constant \(\nu\), hence constant-gain updating; \(\frac1\nu\) is the effective sample size for estimating \(\mu\) (Nagel-Xu use \(\nu=0.018\)). Two motivations: (1) aggregate argument (Malmendier-Nagel 2016, §27.2: despite heterogeneous gain parameters across age cohorts, the aggregate gain parameter = equally weighted average is constant; the aggregate parameter matters for asset pricing → reasonable for a representative agent to do constant-gain learning); (2) psychology argument for individual portfolio choice (recency bias: more emphasis on recent events → decreasing weights for further history is a good assumption).
26.5.3 Empirical Analysis
Data: \(\Delta d\) (CRSP quarterly since 1926; earlier: Piketty et al. 2018 household dividend from tax 1913–1926, Wright 2004 corporate non-farm non-financial dividends 1900–1913, Barro-Ursua 2008 per-capita real GDP growth 1871–1900 as proxy); market index returns (CRSP value-weighted since 1926; Shiller 2005 S&P Composite 1871–1926, CPI-deflated); surveys (quantitative: UBS/Gallup 1998–2007, Vanguard/Ameriks 2019, Lease et al. 1974; qualitative: Michigan Survey 1987–2016, Roper 1974–1977); analyst forecasts (I/B/E/S long-term median EPS growth since 1981; SPF inflation forecast).
Testable hypotheses: from Campbell-Shiller (26.2) derive (26.73); substituting the constant subjective risk premium (26.3) and simplifying gives three regressions (26.76)/(26.77)/(26.78):
$$\mathbb E_t[r_{t+1}]-r_f=\tilde\theta+\left(1+\tfrac{\rho}{1-\rho}\nu\right)(\mu-\tilde\mu_t)\tag{26.76}$$
$$\mathbb E_t[r_{t+1}]-\tilde{\mathbb E}_t[r_{t+1}]=\left(1+\tfrac{\rho}{1-\rho}\nu\right)(\mu-\tilde\mu_t)\tag{26.77}$$
$$\tilde{\mathbb E}_t[r_{t+1}]-r_f=\tilde\theta\quad(\text{constant})\tag{26.78}$$
(26.76)/(26.77) 中 \(\tilde\mu_t\) 对总股利支付(\(\tilde\mu_{d,t}\))与总收益(\(\tilde\mu_{r,t}\))两种构造都可跑;\(\tilde\mu_{r,t}\) 比 \(\tilde\mu_{d,t}\) 含更多噪声(可能含与市场基本面正交的投资者情绪)。实证结果(图 26.1–26.3 为回归系数表):
- 图 26.1(回归 26.76;Panel A 用 \(\tilde\mu_{d,t}\)、Panel B 用 \(\tilde\mu_{r,t}\),被解释变量为 CRSP 价值加权对数季度收益减 3 月期国库券):\(\tilde\mu_t\) 系数(两 Panel)显著为负,验证 (26.76)。
- 图 26.2(Panel A 回归 26.78、Panel B 回归 26.77,被解释变量为调查受访者季度 \(t\) 主观预期股票收益减上季末一年期国债收益):Panel A \(\tilde\mu_t\) 系数与零无显著差异(验证 26.78);Panel B \(\tilde\mu_{d,t}\) 系数显著为负(验证 26.77)。
- 图 26.3(回归 26.72/26.79 \(\tilde\mu_{d,t+1}=(1-\nu)\tilde\mu_{d,t}+\nu\Delta d_{t+1}\),被解释变量为 I/B/E/S 长期 EPS 增长分析师预测经 SPF 通胀平减):\(\tilde\mu_{d,t}\) 系数显著为正,验证 (26.72)。用分析师预测因无调查问股利支付预期。
26.5.4 Asset Pricing Model with Epstein-Zin Preference
代理人定价主观增长不确定性 → §26.4.4 的特例(\(\nu\) 此处为常数),直接用结果。对数主观 SDF (26.80):\(\tilde m_{t+1}=\tilde\mu_m-\tilde\mu_t-\xi\sigma\tilde\epsilon_{t+1}\),\(\tilde\mu_m=\ln\delta-\frac12(1-\gamma)^2(1+U_v\nu)^2(1+\nu)\sigma^2\),\(\xi=[1-(1-\gamma)(U_v\nu+1)]\sqrt{1+\nu}\),\(U_v=\frac\delta{1-\delta}\)。无风险利率 \(r_{f,t}=-\tilde\mu_m+\tilde\mu_t-\frac12\xi^2\sigma^2\);财富组合收益 \(r_{w,t+1}=-\ln\delta+\Delta c_{t+1}\) (26.63);主观对数风险溢价 \(\tilde{rp}_{t+1}=\xi\sqrt{1+\nu}\sigma^2\) (26.81);客观 \(rp_{t+1}=(\mu-\tilde\mu_t)-\frac\nu2\sigma^2+\xi\sqrt{1+\nu}\sigma^2\) (26.82);差 \(rp-\tilde{rp}=(\mu-\tilde\mu_t)-\frac\nu2\sigma^2\) (26.68)。
进一步设 \(\Delta d_{t+1}\) 为 \(\Delta c_{t+1}\) 的函数 (26.83):\(\Delta d_{t+1}=\lambda\Delta c_{t+1}-\alpha(d_t-c_t-\mu_{dc})+\sigma_d\eta_{t+1}\)(\(\alpha>0\)),导出无穷远单期股利条 (dividend strip) 的对数风险溢价:主观 (26.84) 恒定;客观 (26.85) 时变反周期(当期支付高 → \(\tilde\mu_t\) 高 → 风险溢价低)。
26.5.5 / 26.5.6 Contribution & Discussion
贡献:渐忘记忆学习框架简单又贴近现实;分析详尽(含模拟);数据全面(合并多源得最长样本)。讨论(动机):作者用 Malmendier-Nagel (2016) 论证渐忘记忆,但 M-N 由跨年龄队列聚合增益参数得常增益(只要各队列份额不变)——这对聚合层面资产定价(代表性代理人)无碍,但渐忘记忆框架更适合刻画特定投资者类型,故应以非聚合论据(近因偏误)为特定投资者类型动机常增益,并以经验证据支持。早期(pre-1926)与调查/分析师数据标准质量未必可比,模拟分析或应只用更干净的子样本。
In (26.76)/(26.77), \(\tilde\mu_t\) can be constructed from both aggregate dividend payout (\(\tilde\mu_{d,t}\)) and aggregate return (\(\tilde\mu_{r,t}\)); \(\tilde\mu_{r,t}\) contains more noise than \(\tilde\mu_{d,t}\) (may contain investor sentiment orthogonal to market fundamentals). Empirical results (Figures 26.1–26.3 are regression-coefficient tables):
- Figure 26.1 (regression 26.76; Panel A uses \(\tilde\mu_{d,t}\), Panel B \(\tilde\mu_{r,t}\); dependent variable the log quarterly CRSP value-weighted return minus 3-month T-bill): the coefficient of \(\tilde\mu_t\) (both panels) is significantly negative, confirming (26.76).
- Figure 26.2 (Panel A regression 26.78, Panel B regression 26.77; dependent variable the average subjective expected stock return of survey respondents in quarter \(t\) minus the one-year treasury yield at the end of quarter \(t-1\)): Panel A's \(\tilde\mu_t\) coefficient is non-significantly different from zero (confirming 26.78); Panel B's \(\tilde\mu_{d,t}\) coefficient is significantly negative (confirming 26.77).
- Figure 26.3 (regression 26.72/26.79 \(\tilde\mu_{d,t+1}=(1-\nu)\tilde\mu_{d,t}+\nu\Delta d_{t+1}\); dependent variable the aggregate long-term EPS growth analyst forecast from I/B/E/S deflated by SPF inflation forecast): the \(\tilde\mu_{d,t}\) coefficient is significantly positive, confirming (26.72). Analyst forecasts are used because no survey asks expectations of dividend payout.
26.5.4 Asset Pricing Model with Epstein-Zin Preference
The agent prices subjective growth uncertainty → a special case of §26.4.4 (\(\nu\) constant here), so use the results directly. Log subjective SDF (26.80): \(\tilde m_{t+1}=\tilde\mu_m-\tilde\mu_t-\xi\sigma\tilde\epsilon_{t+1}\), \(\tilde\mu_m=\ln\delta-\frac12(1-\gamma)^2(1+U_v\nu)^2(1+\nu)\sigma^2\), \(\xi=[1-(1-\gamma)(U_v\nu+1)]\sqrt{1+\nu}\), \(U_v=\frac\delta{1-\delta}\). Risk-free rate \(r_{f,t}=-\tilde\mu_m+\tilde\mu_t-\frac12\xi^2\sigma^2\); wealth return \(r_{w,t+1}=-\ln\delta+\Delta c_{t+1}\) (26.63); subjective log risk-premium \(\tilde{rp}_{t+1}=\xi\sqrt{1+\nu}\sigma^2\) (26.81); objective \(rp_{t+1}=(\mu-\tilde\mu_t)-\frac\nu2\sigma^2+\xi\sqrt{1+\nu}\sigma^2\) (26.82); difference \(rp-\tilde{rp}=(\mu-\tilde\mu_t)-\frac\nu2\sigma^2\) (26.68).
Further assume \(\Delta d_{t+1}\) is a function of \(\Delta c_{t+1}\) (26.83): \(\Delta d_{t+1}=\lambda\Delta c_{t+1}-\alpha(d_t-c_t-\mu_{dc})+\sigma_d\eta_{t+1}\) (\(\alpha>0\)), deriving the log risk-premium of a dividend strip in the infinite future: subjective (26.84) constant; objective (26.85) time-varying and counter-cyclical (high current payout → high \(\tilde\mu_t\) → low risk premium).
26.5.5 / 26.5.6 Contribution & Discussion
Contribution: the fading-memory learning framework is simple yet realistic; the analysis is thorough (including simulation); the data are comprehensive (merging multiple sources for the longest sample). Discussion (motivation): the authors motivate fading memory via Malmendier-Nagel (2016), but M-N reach constant-gain by aggregating gain parameters across age cohorts (as long as each cohort's share is constant) — this is fine for aggregate-level asset pricing (representative agent), but the fading-memory framework is better suited to modeling specific investor types, so it should motivate constant-gain with a non-aggregate argument (recency bias) for specific investor types, supported by empirical evidence. The earlier (pre-1926) and survey/analyst data may not be of comparable standard and quality, so the simulation analysis should perhaps use a cleaner subset rather than the full sample.
References
- Barro, R. J. and J. F. Ursúa (2008). Macroeconomic crises since 1870. Brookings Papers on Economic Activity 2008(1), 255–350.
- Hansen, L. P., J. C. Heaton, and N. Li (2008). Consumption strikes back? Measuring long-run risk. Journal of Political Economy 116(2), 260–302.
- He, X. (2019a). Econometrics Notes by Xindi He.
- Malmendier, U. and S. Nagel (2016). Learning from inflation experiences. The Quarterly Journal of Economics 131(1), 53–87.
- Nagel, S. and Z. Xu (2019). Asset pricing with fading memory. The Review of Financial Studies (forthcoming).
- Piketty, T., E. Saez, and G. Zucman (2018). Distributional national accounts: Methods and estimates for the United States. The Quarterly Journal of Economics 133(2), 553–609.
- Shiller, R. J. (2005). Irrational Exuberance (2nd ed.). Princeton University Press.
- Wright, S. (2004). Measures of stock market value and returns for the US nonfinancial corporate sector, 1900–2002. Review of Income and Wealth 50(4), 561–584.