18. Robustness to Misspecification

Jun He May 30, 2026

资产定价Asset Pricing 稳健性Robustness 模型误设Misspecification 模糊厌恶Ambiguity Aversion 递归效用Recursive Utility 学习笔记Study Note

Note

本章综述对模型误设的稳健性 (robustness to misspecification) 文献核心思想（Hansen-Sargent 2001）。代理人不 100% 确信基准模型对应真实概率测度，于是做审慎调整 (cautious adjustment)：在所有"与基准模型偏离不太远"的概率测度上最小化未来连续价值，得到连续价值的保守下界。偏离的惩罚由条件相对熵 (conditional relative entropy) 度量，惩罚强度参数 $\xi$ 代表对基准模型的信心。核心结果：求解此最小化问题得到的最优似然比是指数倾斜 (exponential tilting) 形式 (18.8)，而把它代回目标函数 (18.9) 恰好等于 Epstein-Zin 递归效用的连续价值 (18.10)，且当 $\xi=\frac{1}{\gamma-1}$ 时二者完全相同。于是稳健性/模糊厌恶在数学上等价于更高的风险厌恶（更小的 $\xi$ ⟺ 更大的 $\gamma$）。

Note

This chapter surveys the core idea of the robustness to misspecification literature (Hansen-Sargent 2001). The agent is not 100% sure the baseline model corresponds to the true probability measure, so makes a cautious adjustment: minimizing the future continuation value over all probability measures "not too far" from the baseline, yielding a conservative lower bound on the continuation value. The penalty for deviating is measured by the conditional relative entropy, with penalty strength $\xi$ representing confidence in the baseline. Key result: the optimal likelihood ratio solving this minimization is an exponential tilting (18.8), and plugging it back into the objective (18.9) exactly equals the continuation value of Epstein-Zin recursive utility (18.10), being identical when $\xi=\frac{1}{\gamma-1}$. So robustness / ambiguity aversion is mathematically equivalent to higher risk aversion (smaller $\xi$ ⟺ larger $\gamma$).

18.1 Setup

设代理人有递归效用（即 Epstein-Zin 偏好），等价于 (9.1) 改写的 (18.1)：

The agent has recursive utility (Epstein-Zin preference), equivalent to (9.1) rewritten as (18.1):

$$V_t=\left[(1-\beta)C_t^{1-\rho}+\beta(R_t^{1-\rho})\right]^{\frac1{1-\rho}}\tag{18.1}$$

$$R_t=\mathbb E\!\left[V_{t+1}^{1-\gamma}\mid\mathcal F_t\right]^{\frac1{1-\gamma}}\tag{18.2}$$

$R_t$ 为连续价值，$V_t$ 为当前价值函数，$C_t$ 为即期消费，$\mathcal F_t$ 为 $t$ 时信息。对数对象 $v_t\equiv\ln V_t$、$c_t\equiv\ln C_t$、$r_t\equiv\ln R_t$。条件似然比 (conditional likelihood ratio) 记为 $\frac{N_{t+1}}{N_t}$，满足 $\mathbb E[\frac{N_{t+1}}{N_t}\mid\mathcal F_t]=1$（保证新概率测度积分为 1）与 $\frac{N_{t+1}}{N_t}\geq0$（保证非负）。故 $\frac{N_{t+1}}{N_t}$ 是合法的改变测度密度。

18.2 The Minimization Problem

代理人求解最小化问题 (18.3)：

$R_t$ is the continuation value, $V_t$ the current value function, $C_t$ instant consumption, $\mathcal F_t$ time-$t$ information. Log objects $v_t\equiv\ln V_t$, $c_t\equiv\ln C_t$, $r_t\equiv\ln R_t$. The conditional likelihood ratio $\frac{N_{t+1}}{N_t}$ satisfies $\mathbb E[\frac{N_{t+1}}{N_t}\mid\mathcal F_t]=1$ (so the new measure integrates to 1) and $\frac{N_{t+1}}{N_t}\geq0$ (non-negativity). So $\frac{N_{t+1}}{N_t}$ is a valid change-of-measure density.

18.2 The Minimization Problem

The agent solves the minimization (18.3):

$$\min_{\frac{N_{t+1}}{N_t}}\ \mathbb E\!\left[\frac{N_{t+1}}{N_t}v_{t+1}\mid\mathcal F_t\right]+\xi\,\mathbb E\!\left[\frac{N_{t+1}}{N_t}(\ln N_{t+1}-\ln N_t)\mid\mathcal F_t\right]\tag{18.3}$$

$$\text{s.t.}\quad\mathbb E\!\left[\frac{N_{t+1}}{N_t}\mid\mathcal F_t\right]=1,\qquad\frac{N_{t+1}}{N_t}\geq0.\tag{18.4}$$

(18.5) $\frac{N_{t+1}}{N_t}\geq0$ 自动满足（可写 $\ln N_{t+1}$、$\ln N_t$）。

解读。

似然比 $\frac{N_{t+1}}{N_t}$ 把概率测度由基准模型改为某个其他测度。
$\mathbb E[\frac{N_{t+1}}{N_t}v_{t+1}\mid\mathcal F_t]$ 是新测度下的期望连续价值。
$\xi\,\mathbb E[\frac{N_{t+1}}{N_t}(\ln N_{t+1}-\ln N_t)\mid\mathcal F_t]$ 称条件相对熵 (conditional relative entropy)，度量从基准模型改到其他测度的惩罚值。其凸性与非负性保证它是合理的"距离"惩罚（定义 $\varepsilon(n)=n\ln n$，$\varepsilon'(n)=1+\ln n$、$\varepsilon''(n)=\frac1n>0$，故 $\frac{N_{t+1}}{N_t}(\ln N_{t+1}-\ln N_t)$ 凸；由 Jensen $\mathbb E[\frac{N_{t+1}}{N_t}(\ln N_{t+1}-\ln N_t)\mid\mathcal F_t]\geq\underbrace{\mathbb E[\frac{N_{t+1}}{N_t}\mid\mathcal F_t]}_{=1}\ln(\underbrace{\mathbb E[\frac{N_{t+1}}{N_t}\mid\mathcal F_t]}_{=1})=0$，惩罚恒非负）。
$\xi$ 是惩罚项的拉格朗日乘子/惩罚载荷，代表对基准模型的信心：$\xi$ 越大，偏离惩罚越大，代理人越不愿偏离基准（$\xi\to\infty$ 时 $\frac{N_{t+1}}{N_t}\to1$，直接用基准模型）。
该审慎调整把连续价值取为所有概率测度下的保守下界（最小化），体现模糊厌恶 (ambiguity aversion)：代理人对真实概率测度本身不确定。
$\xi$ 可时变，意味着在宏观经济好/坏时对基准模型的信心不同。

(18.5) $\frac{N_{t+1}}{N_t}\geq0$ is automatically satisfied (one can write $\ln N_{t+1}$, $\ln N_t$).

Interpretation.

The likelihood ratio $\frac{N_{t+1}}{N_t}$ changes the measure from the baseline to some other measure.
$\mathbb E[\frac{N_{t+1}}{N_t}v_{t+1}\mid\mathcal F_t]$ is the expected continuation value under the new measure.
$\xi\,\mathbb E[\frac{N_{t+1}}{N_t}(\ln N_{t+1}-\ln N_t)\mid\mathcal F_t]$ is the conditional relative entropy, the penalty for moving from the baseline to another measure. Its convexity and non-negativity make it a sensible "distance" penalty (define $\varepsilon(n)=n\ln n$, $\varepsilon'(n)=1+\ln n$, $\varepsilon''(n)=\frac1n>0$, so $\frac{N_{t+1}}{N_t}(\ln N_{t+1}-\ln N_t)$ is convex; by Jensen $\mathbb E[\frac{N_{t+1}}{N_t}(\ln N_{t+1}-\ln N_t)\mid\mathcal F_t]\geq\underbrace{\mathbb E[\frac{N_{t+1}}{N_t}\mid\mathcal F_t]}_{=1}\ln(\underbrace{\mathbb E[\frac{N_{t+1}}{N_t}\mid\mathcal F_t]}_{=1})=0$, always non-negative).
$\xi$ is the Lagrange multiplier / penalty loading, representing confidence in the baseline: larger $\xi$ means a larger deviation penalty, so the agent is less willing to deviate ($\xi\to\infty$ gives $\frac{N_{t+1}}{N_t}\to1$, using the baseline directly).
This cautious adjustment sets the continuation value to a conservative lower bound across all measures (minimization), reflecting ambiguity aversion: the agent is unsure of the true measure itself.
$\xi$ may be time-dependent, meaning confidence in the baseline differs in good/bad macroeconomic times.

18.3 Solve the Minimization Problem

对 (18.3)–(18.4) 作拉格朗日（乘子 $\mu$），对 $\frac{N_{t+1}}{N_t}$ 取一阶条件 (18.6)：

Form the Lagrangian for (18.3)–(18.4) (multiplier $\mu$), take the f.o.c. w.r.t. $\frac{N_{t+1}}{N_t}$ (18.6):

$$\frac{N_{t+1}}{N_t}=e^{-\frac1\xi v_{t+1}}e^{-\frac1\xi\mu-1}.\tag{18.6}$$

代入约束 (18.4) 钉住乘子 (18.7)：$e^{-\frac1\xi\mu-1}=\frac{1}{\mathbb E[e^{-\frac1\xi v_{t+1}}\mid\mathcal F_t]}$。结合 (18.6) 得最优似然比 (18.8)：

Substituting into the constraint (18.4) pins the multiplier (18.7): $e^{-\frac1\xi\mu-1}=\frac{1}{\mathbb E[e^{-\frac1\xi v_{t+1}}\mid\mathcal F_t]}$. Combined with (18.6), the optimal likelihood ratio (18.8):

$$\frac{N_{t+1}}{N_t}=\frac{e^{-\frac1\xi v_{t+1}}}{\mathbb E\!\left[e^{-\frac1\xi v_{t+1}}\mid\mathcal F_t\right]}.\tag{18.8}$$

Note

Remark 18.1. (18.8) 是指数倾斜 (exponential tilting)：给较小的 $v_{t+1}$（坏结果）赋更高权重，故求解出的 $\frac{N_{t+1}}{N_t}$ 把测度倾向坏结果——保守。$\xi\to\infty$ 时 $\frac{N_{t+1}}{N_t}\to1$（回到基准，无倾斜）。

理解目标函数。 把 (18.8) 代回目标函数 (18.3)，化简得 (18.9)：

Note

Remark 18.1. (18.8) is exponential tilting: it places higher weight on smaller $v_{t+1}$ (bad outcomes), so the solved $\frac{N_{t+1}}{N_t}$ tilts the measure toward bad outcomes — conservative. As $\xi\to\infty$, $\frac{N_{t+1}}{N_t}\to1$ (back to baseline, no tilt).

Understanding the objective. Substituting (18.8) back into the objective (18.3) and simplifying gives (18.9):

$$\text{Objective (18.3)}=-\xi\ln\!\left(\mathbb E\!\left[e^{-\frac1\xi v_{t+1}}\mid\mathcal F_t\right]\right).\tag{18.9}$$

而由 (18.2)，递归效用的连续价值对数 (18.10)：$r_t=\frac{1}{1-\gamma}\ln(\mathbb E[e^{(1-\gamma)v_{t+1}}\mid\mathcal F_t])$。比较 (18.9) 与 (18.10)：若设

By (18.2), the log continuation value of recursive utility (18.10): $r_t=\frac{1}{1-\gamma}\ln(\mathbb E[e^{(1-\gamma)v_{t+1}}\mid\mathcal F_t])$. Comparing (18.9) and (18.10): if we set

$$\xi=\frac{1}{\gamma-1},$$

则 (18.9) 与 (18.10) 完全相同。

解读。

最小化问题的目标函数 (18.3) 恰是递归（Epstein-Zin）效用下的期望连续价值——这为递归效用提供了一个辩护：对风险持保守态度的代理人本就会拥有递归效用函数。
递归效用因此可理解为内嵌模糊厌恶：代理人在所有可能概率测度上最小化、用最坏不确定性下的期望连续价值。
之所以是最小化而非 max-min 问题，是因为消费序列外生、无可选；在带生产与消费-投资权衡的经济中，递归效用的连续价值才会是 max-min 问题的解。
$\gamma$ 很大 ⟺ $\xi$ 很小：偏离基准的惩罚很小、会选更保守的测度。故"选更保守测度（更小 $\xi$）"等价于"更高风险厌恶（更大 $\gamma$）"——再次印证稳健性与递归效用的联系。

偏好冲击 (preference shock)。

一种理解：偏好冲击通过 $\xi$ 体现。$\xi$ 时变会改变所选似然比 $\frac{N_{t+1}}{N_t}$ 与对应测度，从而改变连续价值。
另一种理解：通过 $C_t$。仍用 $C_t$ 表递归效用，但 $C_t$ 为有效消费，显式定义 $C_t=\tilde C_t D_t$，$\tilde C_t$ 为实际消费、$D_t$ 为外生偏好移位项（如平稳一阶差分过程、AR 过程、随机波动率过程等）。
两种方式都可把价格的一部分归于偏好冲击；但需谨慎："偏好冲击"的故事也可能其实经由其他渠道（如改变生产能力）影响资产价格，未必真经由偏好。

then (18.9) and (18.10) are exactly the same.

Interpretation.

The objective (18.3) of the minimization is exactly the expected continuation value under recursive (Epstein-Zin) utility — a justification for recursive utility: agents with conservative attitudes toward risk would naturally have a recursive utility function.
Recursive utility can thus be understood as embedding ambiguity aversion: the agent minimizes over all possible measures, using the expected continuation value under the worst uncertainty.
It is a minimization rather than a max-min problem because the consumption sequence is exogenous with no choice; in an economy with production and a consumption-investment trade-off, the recursive-utility continuation value would be the solution to a max-min problem.
Large $\gamma$ ⟺ small $\xi$: the deviation penalty is small and a more conservative measure is chosen. So "choosing a more conservative measure (smaller $\xi$)" equals "higher risk aversion (larger $\gamma$)" — again confirming the link between robustness and recursive utility.

Preference shock.

One view: the preference shock acts through $\xi$. A time-varying $\xi$ changes the selected likelihood ratio $\frac{N_{t+1}}{N_t}$ and its measure, hence the continuation value.
Another view: through $C_t$. Still use $C_t$ for recursive utility, but $C_t$ is effective consumption, explicitly $C_t=\tilde C_t D_t$, with $\tilde C_t$ the actual consumption and $D_t$ an exogenous preference shifter (e.g. a stationary first-difference, AR, or stochastic-volatility process).
Either way one can attribute part of the price to a preference shock; but be cautious — the "preference shock" story may in fact operate through other channels (e.g. changing production capacity) rather than truly through preferences.

References

Hansen, L. P. and T. J. Sargent (2001). Robust Control and Model Uncertainty. American Economic Review 91(2), 60–66.