10. Deterministic Dynamic Programming

Note

本章主题:确定性动态规划。 §10.1 无穷期一般形式贝尔曼方程 \(V(x)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y)\}\);假设 1–3(\(\Gamma\) 紧值连续、\(F\) 有界连续、\(\beta\in(0,1)\));贝尔曼算子 \(T\)、由压缩映射定理(\(T\tilde v\) 有界 + 连续【最大值定理】+ 压缩【Blackwell 单调性 & 贴现】)证 \(V\) 存在唯一;\(V\)/\(g\) 的单调性、凹性、可微性、\(g\) 的性质。§10.2 无外部性经济:人力资本 \(h\)、\(c+h'\le Ah^\alpha\)、对数和 \(\sum\beta^t u(c_t)\);状态空间 \([0,A^{1/(1-\alpha)}]\);\(V\) 良定/唯一/连续/严增/严凹/一次可微;f.o.c. \(u'(Ah^\alpha-y^\star)=\beta V'(y^\star)\)、EC \(V'(h)=A\alpha h^{\alpha-1}u'(\cdot)\)、EE、稳态 \(h_{ss}=(\beta A\alpha)^{1/(1-\alpha)}\);二阶近似动态(特征方程,\(\lambda_1\lambda_2=1/\beta\))。§10.3 正外部性—社会计划者:\(F(h,H)=Ah^\theta H^r\),\(h_t=H_t\) ⟹ 退化为 \(\alpha=\theta+r\)。§10.4 正外部性—递归竞争均衡:两状态 \(h,H\)、猜想 \(\Phi(H)\)、\(W(h,H)\)、递归竞争均衡 \(\phi^\star(h,h,\Phi^e(h))=\Phi^e(h)\);稳态 \(\tilde h_{ss}=(\beta A\theta)^{1/(1-\hat\alpha)}<\bar h_{ss}\)(正外部性 ⟹ 第一/第二福利定理失败)。§10.5 有限期例子:倒向归纳,\(V(x,N)=\max F\)、\(V(x,n)=\max\{F+\beta V(y,n+1)\}\),性质逐期传递。

Note

Chapter theme: deterministic dynamic programming. §10.1 General infinite-horizon form: the Bellman equation \(V(x)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y)\}\); Assumptions 1–3 (\(\Gamma\) compact-valued continuous, \(F\) bounded continuous, \(\beta\in(0,1)\)); the Bellman operator \(T\), with the Contraction Mapping Theorem (\(T\tilde v\) bounded + continuous [Theorem of Maximum] + contraction [Blackwell monotonicity & discounting]) proving \(V\) exists & unique; monotonicity, concavity, differentiability of \(V\)/\(g\), properties of \(g\). §10.2 Economy with no externality: human capital \(h\), \(c+h'\le Ah^\alpha\), log-type \(\sum\beta^t u(c_t)\); state space \([0,A^{1/(1-\alpha)}]\); \(V\) well-defined/unique/continuous/strictly increasing/strictly concave/once-differentiable; f.o.c. \(u'(Ah^\alpha-y^\star)=\beta V'(y^\star)\), EC \(V'(h)=A\alpha h^{\alpha-1}u'(\cdot)\), EE, steady state \(h_{ss}=(\beta A\alpha)^{1/(1-\alpha)}\); second-order approximation dynamics (characteristic equation, \(\lambda_1\lambda_2=1/\beta\)). §10.3 Positive externality — social planner: \(F(h,H)=Ah^\theta H^r\), \(h_t=H_t\) ⟹ reduces to \(\alpha=\theta+r\). §10.4 Positive externality — recursive competitive equilibrium: two states \(h,H\), conjecture \(\Phi(H)\), \(W(h,H)\), recursive competitive equilibrium \(\phi^\star(h,h,\Phi^e(h))=\Phi^e(h)\); steady state \(\tilde h_{ss}=(\beta A\theta)^{1/(1-\hat\alpha)}<\bar h_{ss}\) (positive externality ⟹ First/Second Welfare Theorems fail). §10.5 Finite-horizon example: backward induction, \(V(x,N)=\max F\), \(V(x,n)=\max\{F+\beta V(y,n+1)\}\), properties propagate period by period.

10.1 General Form of Deterministic Dynamic Programming with Infinite Horizon

10.1.1 贝尔曼方程

$$V(x)\equiv\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y)\}$$ 其中 \(x\) 是当前期状态变量、\(y\) 是下一期状态变量、\(F(x,y)\) 是期间回报函数、\(\beta<1\) 是贴现因子(\(\beta<1\) 对压缩映射定理的适用至关重要)。注意贝尔曼方程是泛函方程,其解是满足贝尔曼方程的函数 \(V(x)\)。

10.1.2 \(V\) 的假设与性质

Important

假设 1 \(X\subseteq\mathbb R^n\) 是凸集。\(\Gamma:X\rightrightarrows X\) 非空、紧值、连续。(对应 \(y=\Gamma(x)\) 连续,若它既下半连续又上半连续。)

Important

定义 10.1(下半连续 lower hemi-continuous) 对应 \(\Gamma:X\to Y\) 在 \(x\) 处下半连续,若 \(\Gamma\) 非空、且对每个 \(y\in\Gamma(x)\) 与每个序列 \(x_n\to x\),存在序列 \(\{y_n\}_{n=0}^\infty\) 与 \(N\ge1\) 使得 \(y_n\in\Gamma(x_n)\) 对 \(\forall n\ge N\)。

Important

定义 10.2(上半连续 upper hemi-continuous) 紧值对应 \(\Gamma:X\to Y\) 在 \(x\) 处上半连续,若 \(\Gamma\) 非空、且对每个序列 \(x_n\to x\) 与每个满足 \(y_n\in\Gamma(x_n)\)(\(\forall n\))的序列 \(\{y_n\}\),存在 \(\{y_n\}\) 的收敛子序列、其极限点 \(y\) 在 \(\Gamma(x)\) 中。

Tip

注记 10.1 关于 \(x_n\to x\) 的直觉:若对应下半连续,\(\Gamma(x_n)\) 可比 \(\Gamma(x)\) 小、但只小无穷小量(即 \(\Gamma(x_n)\) 不能比 \(\Gamma(x)\) 小很多)。若上半连续,\(\Gamma(x_n)\) 可比 \(\Gamma(x)\) 大、但只大无穷小量(即 \(\Gamma(x_n)\) 不能比 \(\Gamma(x)\) 大很多)。

Important

假设 2 期间回报函数 \(F(x,y):X\times X\to\mathbb R\) 有界(即 \(\exists\bar F\in\mathbb R\) 使 \(|F(x,y)|\le\bar F\) 对 \(\forall x\in X,y\in\Gamma(x)\))且关于 \(x\) 与 \(y\) 连续。同时 \(\beta\in(0,1)\)。

设 \(C(X)\) 为有界连续函数空间(带上确界范数 \(\|f\|=\sup_{x\in X}|f(x)|\))。考虑度量空间 \((C(X),\rho)\),对 \(f,g\in C(X)\),\(\rho(f,g)=\|f-g\|=\sup_{x\in X}|f(x)-g(x)|\)。对 \(\tilde v\in C(X)\),在度量空间 \((C(X),\rho)\) 中定义贝尔曼算子 \(T\): $$(T\tilde v)(x)\equiv\max_{y\in\Gamma(x)}\{F(x,y)+\beta\tilde v(y)\}$$

Important

假设 3 假设函数 \(\tilde v(x)\) 连续且有界。

Tip

注记 10.2 假设 \(\tilde v(x)\) 连续且有界是合理的,因为已假设 \(F(x,y)\) 连续、且被 \(\bar F\) 有界。显然 \(\tilde v(x)\le\frac{\bar F}{1-\beta}\),连续性也由此不等式得到。

10.1.1 The Bellman equation

$$V(x)\equiv\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y)\}$$ where \(x\) is the current period state variable, \(y\) is the next period state variable, \(F(x,y)\) is the period return function and \(\beta<1\) is the discounting factor (\(\beta<1\) is crucial for the Contraction Mapping Theorem to apply). Note that the Bellman equation is a functional equation, and its solution is a function \(V(x)\) that satisfies the Bellman equation.

10.1.2 Assumptions and Properties of \(V\)

Important

Assumption 1 \(X\subseteq\mathbb R^n\) is a convex set. \(\Gamma:X\rightrightarrows X\) is non-empty, compact-valued, and continuous. (A correspondence \(y=\Gamma(x)\) is continuous if it is both lower hemi-continuous and upper hemi-continuous.)

Important

Definition 10.1 (lower hemi-continuous) A correspondence \(\Gamma:X\to Y\) is lower hemi-continuous at \(x\) if \(\Gamma\) is not empty, and if for every \(y\in\Gamma(x)\) and every sequence \(x_n\to x\), there exists a sequence \(\{y_n\}_{n=0}^\infty\) and \(N\ge1\) such that \(y_n\in\Gamma(x_n)\) for \(\forall n\ge N\).

Important

Definition 10.2 (upper hemi-continuous) A compact valued correspondence \(\Gamma:X\to Y\) is upper hemi-continuous at \(x\) if \(\Gamma\) is not empty, and if for every sequence \(x_n\to x\) and every sequence \(\{y_n\}\) such that \(y_n\in\Gamma(x_n)\) for \(\forall n\), there exists a convergent sub-sequence of \(\{y_n\}\) whose limit point \(y\) is in \(\Gamma(x)\).

Tip

Remark 10.1 The intuition for \(x_n\to x\): if a correspondence is lower hemi-continuous, then \(\Gamma(x_n)\) can be smaller than \(\Gamma(x)\) but only by an infinitely small amount (i.e. \(\Gamma(x_n)\) cannot be much smaller than \(\Gamma(x)\)). If a correspondence is upper hemi-continuous, then \(\Gamma(x_n)\) can be larger than \(\Gamma(x)\) but only by an infinitely small amount (i.e. \(\Gamma(x_n)\) cannot be much larger than \(\Gamma(x)\)).

Important

Assumption 2 The period return function \(F(x,y):X\times X\to\mathbb R\) is bounded (i.e. \(\exists\bar F\in\mathbb R\) s.t. \(|F(x,y)|\le\bar F\) for \(\forall x\in X,y\in\Gamma(x)\)) and continuous in \(x\) and \(y\). Also \(\beta\in(0,1)\).

Let \(C(X)\) be the space of bounded, continuous functions with the sup norm \(\|f\|=\sup_{x\in X}|f(x)|\). Consider the metric space \((C(X),\rho)\) such that for \(f,g\in C(X)\), \(\rho(f,g)=\|f-g\|=\sup_{x\in X}|f(x)-g(x)|\). For \(\tilde v\in C(X)\), define the Bellman operator \(T\) in the metric space \((C(X),\rho)\) as $$(T\tilde v)(x)\equiv\max_{y\in\Gamma(x)}\{F(x,y)+\beta\tilde v(y)\}$$

Important

Assumption 3 Assume the function \(\tilde v(x)\) is continuous and bounded.

Tip

Remark 10.2 It is reasonable to assume \(\tilde v(x)\) is continuous and bounded since we have already assumed \(F(x,y)\) is continuous and bounded by \(\bar F\). Obviously \(\tilde v(x)\le\frac{\bar F}{1-\beta}\), and continuity also follows from this inequality.

\(V\) 良定(存在性、唯一性)。 为使压缩映射定理(Contraction Mapping Theorem) 适用,需 (1) \(T\tilde v\) 有界;(2) \(T\tilde v\) 连续;(3) \(T\) 是压缩。 - \(T\tilde v\) 有界,因为 \(F\) 假设有界、\(\tilde v\) 假设有界。 - \(T\tilde v\) 连续,因为 \(F\) 与 \(\tilde v\) 连续、\(\Gamma\) 作为对应紧值且连续,则最大值定理(Theorem of Maximum) 蕴含 \(T\tilde v\) 连续。 - \(T\) 是压缩,因为它满足 Blackwell 充分条件(单调性与贴现): - 单调性:对 \(\forall x\),设 \(v_1(x)\le v_2(x)\),则 $$F(x,y)+\beta v_1(y)\le F(x,y)+\beta v_2(y)\Rightarrow(Tv_1)(x)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta v_1(y)\}\le\max_{y\in\Gamma(x)}\{F(x,y)+\beta v_2(y)\}=(Tv_2)(x)$$ 故算子 \(T\) 的单调性满足。 - 贴现:由 \(\beta\in(0,1)\),对 \(a\ge0\), $$T(\tilde v+a)(x)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta(\tilde v(y)+a)\}=\max_{y\in\Gamma(x)}\{F(x,y)+\beta\tilde v(y)\}+\beta a=(T\tilde v)(x)+\beta a$$ 满足 \(T(\tilde v+a)(x)\le(T\tilde v)(x)+\beta a\),即贴现条件。 - 故压缩映射定理适用:\(T\) 有唯一不动点 \(V\) 使 \(TV=V\)。最优性原理成立,即 \(V=V^\star\),\(V^\star\) 是序列问题的解。

\(V\) 与 \(g\) 的性质。 - (1) \(V\) 的单调性:要求 1:\(\Gamma(x)\) 单调(即 \(x'\ge x\Rightarrow\Gamma(x)\subseteq\Gamma(x')\))。要求 2:\(F(x,y)\) 关于 \(x\) 弱递增。若 \(F(x,y)\) 关于 \(x\) 严格递增,则 \(V\) 严格递增。 - (2) \(V\) 的凹性:要求 1:\(F(x,y)\) 关于 \((x,y)\) 弱凹。若 \(F\) 关于 \(x\) 严格凹,则 \(V\) 严格凹。要求 2:\(\Gamma(x)\) 作为对应凸。 - (3) \(V\) 在 \(\hat x\) 处的可微性:可微函数空间对算子 \(T\) 不是闭集,因为 \(T\) 可把可微函数序列无限接近一个带尖点(kink)的极限函数。故需条件保证极限中不出现尖点形状,即对 \(\hat x\) 周围的邻域 \(D\),存在凹且可微的函数 \(W:D\to\mathbb R\) 使 \(W(\hat x)=V(\hat x)\)、\(W(x)\le V(x)\)(\(\forall x\in D\))。该条件可在要求 5 满足时达到: - 要求 1:\(X\subseteq\mathbb R^n\) 凸集。要求 2:\(V\) 凹。要求 3:\(\hat x\in\text{int}(X)\)。要求 4:\(g(\hat x)\in\text{int}(\Gamma(\hat x))\)。要求 5:\(F\) 在 \(\hat x\) 可微。 - 当要求满足时,可构造 \(W(x)=F(x,g(\hat x))+\beta V(g(\hat x))\),它满足 \(W(\hat x)=V(\hat x)\)、\(W(x)\le V(x)\),且 \(W(x)\) 在 \(\hat x\) 可微、关于 \(x\) 凹。 - \(V\) 的可微性不给出 \(g\) 的可微性。\(g\) 的可微性要求 \(V\) 二次可微、且 \(F(x,y)\) 关于 \(x,y\) 二次可微: $$V(x)=F(x,g(x))+V(g(x))\Rightarrow V'(x)=F_1(x,g(x))\ [\because\text{envelope thm}]\Rightarrow V''(x)=F_{11}(x,g(x))+F_{12}(x,g(x))g'(x)\Rightarrow g'(x)=\frac{V''(x)-F_{11}(x,g(x))}{F_{12}(x,g(x))}$$ 但只要 \(V\) 一次可微,只需 \(F(x,y)\) 关于 \(x\) 一次可微。 - (4) \(g\) 的性质:若 \(V(x)\) 关于 \(x\) 严格凹,则最优策略函数 \(g(x)\) 是单值函数(非对应)且关于 \(x\) 连续。若 \(V(x)\) 与 \(F(x,y)\) 都(严格)凹,则 \(g(x)\) 关于 \(x\)(严格)递增。

\(V\) is well-defined (existence, uniqueness). For the Contraction Mapping Theorem to apply, we need that (1) \(T\tilde v\) is bounded; (2) \(T\tilde v\) is continuous; (3) \(T\) is a contraction. - \(T\tilde v\) is bounded because \(F\) is assumed to be bounded and \(\tilde v\) is assumed to be bounded. - \(T\tilde v\) is continuous because \(F\) and \(\tilde v\) are continuous and \(\Gamma\) is compact valued and continuous as a correspondence; then the Theorem of Maximum implies that \(T\tilde v\) is continuous. - \(T\) is a contraction because it satisfies Blackwell's sufficient conditions (monotonicity and discounting): - monotonicity: for \(\forall x\), suppose \(v_1(x)\le v_2(x)\), then $$F(x,y)+\beta v_1(y)\le F(x,y)+\beta v_2(y)\Rightarrow(Tv_1)(x)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta v_1(y)\}\le\max_{y\in\Gamma(x)}\{F(x,y)+\beta v_2(y)\}=(Tv_2)(x)$$ Thus the monotonicity of operator \(T\) is satisfied. - discounting: since we assumed \(\beta\in(0,1)\), for \(a\ge0\), $$T(\tilde v+a)(x)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta(\tilde v(y)+a)\}=\max_{y\in\Gamma(x)}\{F(x,y)+\beta\tilde v(y)\}+\beta a=(T\tilde v)(x)+\beta a$$ which satisfies \(T(\tilde v+a)(x)\le(T\tilde v)(x)+\beta a\), i.e. the discounting condition. - Then the Contraction Mapping Theorem applies: \(T\) has a unique fixed point \(V\) such that \(TV=V\). The principle of optimality holds, i.e. \(V=V^\star\), where \(V^\star\) is the solution to the sequence problem.

Properties of \(V\) and \(g\). - (1) Monotonicity of \(V\): Requirement 1: \(\Gamma(x)\) is monotone (i.e. \(x'\ge x\Rightarrow\Gamma(x)\subseteq\Gamma(x')\)). Requirement 2: \(F(x,y)\) weakly increasing in \(x\). If \(F(x,y)\) is strictly increasing in \(x\), then \(V\) is strictly increasing. - (2) Concavity of \(V\): Requirement 1: \(F(x,y)\) weakly concave in \((x,y)\). If \(F\) is strictly concave in \(x\), then \(V\) is strictly concave. Requirement 2: \(\Gamma(x)\) is convex as a correspondence. - (3) Differentiability of \(V\) at \(\hat x\): A space of differentiable functions is not a closed set to the operator \(T\) because the operator \(T\) can push a sequence of differentiable functions infinitesimally close to a function in the limit with a kink. So we need a condition to make sure the kinked shape does not appear in the limit, i.e. for a neighborhood \(D\) around \(\hat x\), there is a concave and differentiable function \(W:D\to\mathbb R\) with \(W(\hat x)=V(\hat x)\) and \(W(x)\le V(x)\) for \(\forall x\in D\). Such a condition can be attained if Requirement 5 is satisfied: - Requirement 1: \(X\subseteq\mathbb R^n\) is a convex set. Requirement 2: \(V\) is concave. Requirement 3: \(\hat x\in\text{int}(X)\). Requirement 4: \(g(\hat x)\in\text{int}(\Gamma(\hat x))\). Requirement 5: \(F\) is differentiable at \(\hat x\). - when this requirement is satisfied, we can construct \(W(x)=F(x,g(\hat x))+\beta V(g(\hat x))\), which satisfies \(W(\hat x)=V(\hat x)\) and \(W(x)\le V(x)\), and \(W(x)\) is differentiable at \(\hat x\) and concave in \(x\). - Differentiability of \(V\) does not give differentiability of \(g\). Differentiability of \(g\) requires twice differentiability of \(V\), and twice differentiability of \(F(x,y)\) in \(x\) and \(y\): $$V(x)=F(x,g(x))+V(g(x))\Rightarrow V'(x)=F_1(x,g(x))\ [\because\text{envelope theorem}]\Rightarrow V''(x)=F_{11}(x,g(x))+F_{12}(x,g(x))g'(x)\Rightarrow g'(x)=\frac{V''(x)-F_{11}(x,g(x))}{F_{12}(x,g(x))}$$ But once differentiability of \(V\) only requires once differentiability of \(F(x,y)\) in \(x\). - (4) Property of \(g\): If \(V(x)\) is strictly concave in \(x\), then the optimal policy function \(g(x)\) is a single valued function (not a correspondence) and continuous in \(x\). If both \(V(x)\) and \(F(x,y)\) are (strictly) concave, then \(g(x)\) is (strictly) increasing in \(x\).

10.2 Economy with No Externality

10.2.1 设定

期间约束:\(c+h'\le Ah^\alpha\),\(A>0\),\(\alpha\in(0,1)\),\(c\) 当前期消费、\(h'\) 下一期人力资本、\(h\) 当前期人力资本、\(A\) 技术。 偏好:\(U(\{c_t\}_{t=1}^\infty)=\sum_{t=0}^\infty\beta^t u(c_t)\),\(\beta\in(0,1)\),\(u\) 连续可微、严格递增、严格凹、\(u'(0)=+\infty\)。

10.2.2 关于该模型的重要问题

什么是有用的状态空间? 状态变量是人力资本 \(h\),状态空间为 \([0,h_{max}]\),其中 \(h_{max}\) 满足 \(0+h_{max}=Ah_{max}^\alpha\),即 \(h_{max}=A^{1/(1-\alpha)}\)。故状态空间可写为 \(\left[0,A^{1/(1-\alpha)}\right]\)。

贝尔曼方程是什么? $$V(h)=\max_{y\in[0,Ah^\alpha]}\{u(Ah^\alpha-y)+\beta V(y)\}$$ 其中 \(h\in\left[0,A^{1/(1-\alpha)}\right]\)。

解释 \(V\) 的如下性质(严格证明见 Recursive Methods in Economic Dynamics 第 4 章): - 良定(存在)、唯一、连续:状态空间紧(有界闭),\(c_t\) 对 \(\forall t\) 有界,故 \(u(c_t)\) 有界;又 \(\beta<1\),故 \(V\) 有界。用压缩映射定理证唯一性。用集合 \(\Gamma(h)=[0,Ah^\alpha]\) 随 \(h\) 连续变化证 \(V\) 连续。 - 严格递增:\(\Gamma(h)=[0,Ah^\alpha]\) 作为对应单调,\(h'>h\Rightarrow\Gamma(h')\supseteq\Gamma(h)\)。由 \(u\) 严格递增,更大的 \(h\) 对任意 \(y\) 给出更大的 \(u(Ah^\alpha-y)\);由 \(\Gamma(h)\) 单调,更大的 \(h\) 有更多 \(y\) 的选择。合起来,更大的期间回报与更多 \(y\) 的选择保证严格更大的 \(V(h)\)。 - 严格凹:定义 \(h_\theta=\theta h_1+(1-\theta)h_2\),\(\theta\in(0,1)\)。由 \(Ah^\alpha\) 严格凹,\(\theta Ah_1^\alpha+(1-\theta)Ah_2^\alpha一次可微:定义 \(w(h)=u\left(Ah^\alpha-g(\hat h)\right)+\beta V\left(g(\hat h)\right)\)。由 \(u\) 在 \(\hat h\) 一次可微,\(w(h)\) 在 \(\hat h\) 一次可微,蕴含 \(w(h)\) 的支撑超平面 \(l\) 唯一。注意 \(w(h)\le V(h)\)、且除 \(h=\hat h\) 外严格不等。故 \(V(h)\) 在 \(h=\hat h\) 也有唯一超平面,排除了 \(V(h)\) 的尖点形状。因此 \(V(h)\) 在解 \(y^\star=g(h)\) 为内点(即 \(y^\star\in(0,Ah^\alpha)\))处一次可微;且 \(y^\star=g(h)\) 不可能是角点解,因 \(y^\star\ne Ah^\alpha\)(\(u'(0)=+\infty\))、\(y^\star\ne0\)(\(y^\star=0\) 使未来生产不可能)。故 \(V(h)\) 一次可微。 - 图示(已转述): \(V(h)\) 曲线在 \(\hat h\) 处有一条支撑超平面 \(l\);构造的 \(w(h)\) 在 \(\hat h\) 与 \(V(h)\) 相切、其余处低于 \(V(h)\),从而 \(V(h)\) 在 \(\hat h\) 的支撑超平面唯一、无尖点。

10.2.1 Set-up

Period constraint: \(c+h'\le Ah^\alpha\) for \(A>0\), \(\alpha\in(0,1)\), where \(c\) is current period consumption, \(h'\) is next period human capital, \(h\) is current period human capital, and \(A\) is technology. Preference: \(U(\{c_t\}_{t=1}^\infty)=\sum_{t=0}^\infty\beta^t u(c_t)\) for \(\beta\in(0,1)\), where \(u\) is continuously differentiable, strictly increasing, strictly concave, and \(u'(0)=+\infty\).

10.2.2 Important questions regarding this model

What is a useful state space? The state variable is human capital \(h\), and the state space is \([0,h_{max}]\) where \(h_{max}\) satisfies \(0+h_{max}=Ah_{max}^\alpha\), i.e. \(h_{max}=A^{1/(1-\alpha)}\). So the state space can be rewritten as \(\left[0,A^{1/(1-\alpha)}\right]\).

What is the Bellman equation? $$V(h)=\max_{y\in[0,Ah^\alpha]}\{u(Ah^\alpha-y)+\beta V(y)\}$$ with \(h\in\left[0,A^{1/(1-\alpha)}\right]\).

Explain the following properties of \(V\) (see rigorous proofs in Chapter 4 of Recursive Methods in Economic Dynamics): - well-defined (exists), unique and continuous: Since the state space is compact (bounded and closed), \(c_t\) is bounded for \(\forall t\), so \(u(c_t)\) is bounded for \(\forall t\). We also know that \(\beta<1\), so \(V\) is bounded. We can use the Contraction Mapping Theorem to prove uniqueness. We can use the fact that the set \(\Gamma(h)=[0,Ah^\alpha]\) varies continuously with \(h\) to prove continuity of \(V\). - strictly increasing: \(\Gamma(h)=[0,Ah^\alpha]\) is monotone as a correspondence, \(h'>h\Rightarrow\Gamma(h')\supseteq\Gamma(h)\). Since \(u\) is strictly increasing, a larger \(h\) means a larger \(u(Ah^\alpha-y)\) for any \(y\). Since \(\Gamma(h)\) is monotone, a larger \(h\) means more choices of \(y\). Combined together, the larger period return function for any \(y\) and more choices of \(y\) guarantees a strictly larger \(V(h)\). - strictly concave: Define \(h_\theta=\theta h_1+(1-\theta)h_2\) for \(\theta\in(0,1)\). Since \(Ah^\alpha\) is strictly concave, \(\theta Ah_1^\alpha+(1-\theta)Ah_2^\alphaonce differentiable: Define \(w(h)=u\left(Ah^\alpha-g(\hat h)\right)+\beta V\left(g(\hat h)\right)\). Since \(u\) is once differentiable at point \(\hat h\), \(w(h)\) is once differentiable at \(\hat h\), which implies the uniqueness of the supporting hyper-plane \(l\) of \(w(h)\). Note that \(w(h)\le V(h)\) and the inequality is strict unless \(h=\hat h\). So \(V(h)\) also has a unique hyper-plane at \(h=\hat h\), which rules out the kink shape of \(V(h)\). Therefore \(V(h)\) is once differentiable where the solution \(y^\star=g(h)\) is interior, i.e. \(y^\star\in(0,Ah^\alpha)\). And we know that \(y^\star=g(h)\) cannot be a corner solution since \(y^\star\ne Ah^\alpha\) (since \(u'(0)=+\infty\)) and \(y^\star\ne0\) (since \(y^\star=0\) makes future production impossible). So \(V(h)\) is once differentiable. - Figure (paraphrased): the \(V(h)\) curve has a supporting hyper-plane \(l\) at \(\hat h\); the constructed \(w(h)\) is tangent to \(V(h)\) at \(\hat h\) and below \(V(h)\) elsewhere, so the supporting hyper-plane of \(V(h)\) at \(\hat h\) is unique and there is no kink.

写出一阶条件(f.o.c.)与包络条件(EC)。 - 关于 \(y\) 的 f.o.c.:\(u'(Ah^\alpha-y^\star)=\beta V'(y^\star)\)。 - 关于 \(h\) 的 EC:\(V'(h)=A\alpha h^{\alpha-1}u'(Ah^\alpha-y^\star)\)。

图示(f.o.c. 策略函数图,已转述): 横轴下一期人力资本 \(y^\star\)。画 \(\beta V'(y^\star)\)(递增曲线)、\(u'(Ah^\alpha-y^\star)\)(随 \(y^\star\) 递增的曲线),二者交点定出 \(y^\star\)。考虑 \(\hat h>h\):由 \(u'\) 随 \(h\) 递减,\(u'(A\hat h^\alpha-y^\star)\) 对任意 \(y^\star\) 都在 \(u'(Ah^\alpha-y^\star)\) 下方。由图,\(\hat y^\star>y^\star\),即 \(g(\hat h)>g(h)\),故策略函数随 \(h\) 递增;又 \(u'(A\hat h^\alpha-\hat y^\star)c\),即 \(c(h)=Ah^\alpha-g(h)\) 斜率为正,即 \(g'(h)\in(0,A\alpha h^{\alpha-1})\)。

写出欧拉方程。 $$u'(Ah^\alpha-g(h))=\beta V'(g(h))=\beta A\alpha g(h)^{\alpha-1}u'(Ag(h)^\alpha-g(g(h)))$$ 或 $$u'(Ah_t^\alpha-h_{t+1})=\beta A\alpha h_{t+1}^{\alpha-1}u'(Ah_{t+1}^\alpha-h_{t+2})$$

刻画稳态 \(h_{ss}\)。 $$u'(Ah_{ss}^\alpha-h_{ss})=\beta A\alpha h_{ss}^{\alpha-1}u'(Ah_{ss}^\alpha-h_{ss})\Rightarrow1=\beta A\alpha h_{ss}^{\alpha-1}\Rightarrow h_{ss}=(\beta A\alpha)^{\frac{1}{1-\alpha}}$$ 注意 \(h_{ss}\) 随 \(\beta\)、\(A\)、\(\alpha\) 递增。

Write the first order condition (f.o.c.) and the envelop condition (EC). - The f.o.c. w.r.t. \(y\): \(u'(Ah^\alpha-y^\star)=\beta V'(y^\star)\). - The EC w.r.t. \(h\): \(V'(h)=A\alpha h^{\alpha-1}u'(Ah^\alpha-y^\star)\).

Figure (the f.o.c. policy function graph, paraphrased): the horizontal axis is next-period human capital \(y^\star\). Draw \(\beta V'(y^\star)\) (an increasing curve) and \(u'(Ah^\alpha-y^\star)\) (a curve increasing in \(y^\star\)); their intersection pins down \(y^\star\). Consider \(\hat h>h\): since \(u'\) is decreasing in \(h\), \(u'(A\hat h^\alpha-y^\star)\) is below \(u'(Ah^\alpha-y^\star)\) for any \(y^\star\). From the graph, \(\hat y^\star>y^\star\), i.e. \(g(\hat h)>g(h)\), so the policy function is increasing in \(h\); also \(u'(A\hat h^\alpha-\hat y^\star)c\), which means \(c(h)=Ah^\alpha-g(h)\) has positive slope, i.e. \(g'(h)\in(0,A\alpha h^{\alpha-1})\).

Write the Euler Equation. $$u'(Ah^\alpha-g(h))=\beta V'(g(h))=\beta A\alpha g(h)^{\alpha-1}u'(Ag(h)^\alpha-g(g(h)))$$ or $$u'(Ah_t^\alpha-h_{t+1})=\beta A\alpha h_{t+1}^{\alpha-1}u'(Ah_{t+1}^\alpha-h_{t+2})$$

Characterize the steady state \(h_{ss}\). $$u'(Ah_{ss}^\alpha-h_{ss})=\beta A\alpha h_{ss}^{\alpha-1}u'(Ah_{ss}^\alpha-h_{ss})\Rightarrow1=\beta A\alpha h_{ss}^{\alpha-1}\Rightarrow h_{ss}=(\beta A\alpha)^{\frac{1}{1-\alpha}}$$ Note that \(h_{ss}\) is increasing in \(\beta\), \(A\), and \(\alpha\).

10.2.3 重要事实小结

  1. 贝尔曼方程(BE):\(V(h)=\max_{y\in[0,Ah^\alpha]}\{u(f(h)-y)+\beta V(y)\}\),\(h\in\left[0,A^{1/(1-\alpha)}\right]\)、\(f(h)=Ah^\alpha\)。
  2. 存在唯一、有界、连续的函数 \(V\) 满足上述 BE,且 \(V=V^\star\)(\(V^\star\) 由序列问题生成)。
  3. \(V\) 严格递增、严格凹。
  4. 最优策略函数 \(y^\star=g(h)\) 单值、连续、严格递增,\(g'(h)\in(0,A\alpha h^{\alpha-1})\)。
  5. 刻画 \(y^\star\) 的 f.o.c.:\(u'(f(h)-y^\star)=\beta V'(y^\star)\)。
  6. \(u\) 的 Inada 条件(\(u'(0)=+\infty\))保证 \(y^\star\) 内点解。
  7. 包络条件(EC):\(V'(h)=f'(h)u'(f(h)-y^\star)\)。
  8. 欧拉方程(EE):\(u'(f(h_t)-h_{t+1})=\beta f'(h_{t+1})u'(f(h_{t+1})-h_{t+2})\)。
  9. 稳态 \(h_{ss}=(\beta A\alpha)^{1/(1-\alpha)}\),随 \(\beta\)、\(A\)、\(\alpha\) 递增。

10.2.4 动态

分析系统在受扰后落入稳态小邻域时的行为,线性近似很有用。对 EE 在稳态附近做二阶近似(记 \(c_t=f(h_t)-h_{t+1}\)): $$\begin{aligned}\text{LHS}=u'(c_t)&\approx u'(c_{ss})+u''(c_{ss})[f'(h_{ss})(h_t-h_{ss})-(h_{t+1}-h_{ss})]\\&=u'+u''f'(h_t-h_{ss})-u''(h_{t+1}-h_{ss})\\\text{RHS}=\beta f'(h_{t+1})u'(c_{t+1})&\approx\beta f'u'+\beta\left\{f''u'(h_{t+1}-h_{ss})+(f')^2 u''(h_{t+1}-h_{ss})-f'u''(h_{t+2}-h_{ss})\right\}\end{aligned}$$ 定义 \(z_t=h_t-h_{ss}\),用 \(\beta f'=1\) 重写: $$0=-\beta f'u''z_{t+2}+\beta\left(f''u'+(f')^2 u''+u''\right)z_{t+1}-u''f'z_t\tag{10.1}$$ 这是二阶线性差分方程,需两个边界条件:(1) 选定的初始条件 \(h_0\);(2) 收敛到 \(h_{ss}\)。(10.1) 的特征方程为 $$-\beta f'u''\lambda^2+\beta\left(f''u'+(f')^2 u''+u''\right)\lambda-u''f'=0$$ 有两根 \(\lambda_1\)、\(\lambda_2\),\(z_t\) 的解为 \(z_t=C_1\lambda_1^t+C_2\lambda_2^t\)。设 \(|\lambda_1|<1\),则由 Vieta 定理 \(\lambda_1\cdot\lambda_2=1/\beta>1\),故 \(|\lambda_2|>1\)。于是第一个边界条件钉住 \(C_1\) 以匹配 \(h_0\)、第二个边界条件要求 \(C_2=0\)。

10.2.3 Summary for the important facts

  1. The Bellman Equation (BE): \(V(h)=\max_{y\in[0,Ah^\alpha]}\{u(f(h)-y)+\beta V(y)\}\) with \(h\in\left[0,A^{1/(1-\alpha)}\right]\) and \(f(h)=Ah^\alpha\).
  2. There exists a unique, bounded, continuous function \(V\) satisfying the above BE, and \(V=V^\star\) where \(V^\star\) is generated by the sequence problem.
  3. \(V\) is strictly increasing and strictly concave.
  4. The optimal policy function \(y^\star=g(h)\) is single-valued, continuous, strictly increasing with \(g'(h)\in(0,A\alpha h^{\alpha-1})\).
  5. The f.o.c. that characterizes \(y^\star\): \(u'(f(h)-y^\star)=\beta V'(y^\star)\).
  6. The Inada condition on \(u\) (i.e. \(u'(0)=+\infty\)) guarantees an interior solution of \(y^\star\).
  7. The Envelop Condition (EC): \(V'(h)=f'(h)u'(f(h)-y^\star)\).
  8. The Euler Equation (EE): \(u'(f(h_t)-h_{t+1})=\beta f'(h_{t+1})u'(f(h_{t+1})-h_{t+2})\).
  9. The steady state \(h_{ss}=(\beta A\alpha)^{1/(1-\alpha)}\), increasing in \(\beta\), \(A\), and \(\alpha\).

10.2.4 Dynamics

We analyze the system's behavior after a disturbance makes it fall into the small neighborhood around the steady state; linear approximation is very useful. Take a second order approximation of the EE around the steady state (with \(c_t=f(h_t)-h_{t+1}\)): $$\begin{aligned}\text{LHS}=u'(c_t)&\approx u'(c_{ss})+u''(c_{ss})[f'(h_{ss})(h_t-h_{ss})-(h_{t+1}-h_{ss})]\\&=u'+u''f'(h_t-h_{ss})-u''(h_{t+1}-h_{ss})\\\text{RHS}=\beta f'(h_{t+1})u'(c_{t+1})&\approx\beta f'u'+\beta\left\{f''u'(h_{t+1}-h_{ss})+(f')^2 u''(h_{t+1}-h_{ss})-f'u''(h_{t+2}-h_{ss})\right\}\end{aligned}$$ Define \(z_t=h_t-h_{ss}\), and using the fact \(\beta f'=1\), rewrite: $$0=-\beta f'u''z_{t+2}+\beta\left(f''u'+(f')^2 u''+u''\right)z_{t+1}-u''f'z_t\tag{10.1}$$ This is a second-order linear difference equation. We need two boundary conditions: (1) the chosen initial condition \(h_0\); (2) convergence to \(h_{ss}\). The characteristic equation of (10.1) is $$-\beta f'u''\lambda^2+\beta\left(f''u'+(f')^2 u''+u''\right)\lambda-u''f'=0$$ which has two roots \(\lambda_1\), \(\lambda_2\), and \(z_t\) has the solution \(z_t=C_1\lambda_1^t+C_2\lambda_2^t\). Suppose \(|\lambda_1|<1\), then by Vieta's theorem \(\lambda_1\cdot\lambda_2=1/\beta>1\), so \(|\lambda_2|>1\). So the first boundary condition pins down \(C_1\) to match \(h_0\) and the second boundary condition requires that \(C_2=0\).

10.3 Positive Externality: The Social Planner's Problem

现在设社会平均人力资本 \(H\) 作为正外部性进入技术,技术为 $$F(h,H)=Ah^\theta H^r$$ 其中 \(\theta>0\)、\(r>0\)、\(\theta+r<1\)。

设经济中每个 agent 起始时有相同的初始人力资本 \(h_0\)。社会计划者给每个 agent 相同权重、要求他们在每期做完全相同的事。故每个 agent 的人力资本在每期彼此相同,即 \(h_t=H_t\) 对 \(\forall t\)。

于是社会计划者问题与之前(无外部性)完全相同——只要令 \(\alpha=\theta+r\),即 $$F(h,H)=Ah^\theta H^r=Ah^\theta h^r=Ah^{\theta+r}=Ah^\alpha=f(h)$$

10.4 Positive Externality: The Recursive Competitive Equilibrium

下面讨论 agent 各自为己做决策的经济。

10.4.1 设定

  • 与社会计划者问题相同,社会平均人力资本 \(H\) 作为正外部性进入技术:\(F(h,H)=Ah^\theta H^r\),\(\theta>0\)、\(r>0\)、\(\hat\alpha\equiv\theta+r<1\)。
  • agent 的问题有两个状态变量:\(h\) 与 \(H\)。
  • agent 对下一期平均人力资本的猜想记为 \(\Phi(H)\),基于当前期平均人力资本 \(H\)。
  • agent 的最优选择记为 \(y^\star=\phi(h,H,\Phi)\),依赖于 \(\Phi\)。
  • 值函数现写为 \(W(h,H;\Phi)\)。

10.4.2 贝尔曼方程 $$W(h,H)=\max_{y\in[0,F(h,H)]}\{u(F(h,H)-y)+\beta W(y,\Phi(H))\}$$ 这里 \(\Phi(H)\) 是对其他每个人选择的猜想。

10.4.3 递归竞争均衡

递归竞争均衡由函数 \(\Phi^e(H=h)\) 定义、具有如下性质: $$\phi^\star(h,H=h,\Phi^e(H=h))=\Phi^e(H=h)$$ 即 \(\phi^\star(h,h,\Phi^e(h))=\Phi^e(h)\)。

Tip

注记 10.3 \(\phi^\star(h,H=h,\Phi^e(H=h))=\Phi^e(H=h)\) 中的参数 \(H\) 取值 \(h\),是因为假设所有 agent 相同、且在当前期起始于相同的 \(h\),故当前期社会平均 \(H\) 恰为 \(h\)。递归竞争均衡的性质 \(\phi^\star(h,h,\Phi^e(h))=\Phi^e(h)\) 保证 agent 在下一期仍彼此相同。在递归竞争均衡中,猜想实现自身。

Now suppose the social average human capital \(H\) enters the technology as a positive externality, so the technology is $$F(h,H)=Ah^\theta H^r$$ where \(\theta>0\), \(r>0\), \(\theta+r<1\).

Suppose every agent in the economy starts with the same initial human capital \(h_0\). The social planner gives every agent equal weight and asks them to do exactly the same thing in every period. So every agent's human capital stays the same with each other in every period, i.e. \(h_t=H_t\) for \(\forall t\).

So the social planner's problem is exactly the same as before (without externality) if we set \(\alpha=\theta+r\), i.e. $$F(h,H)=Ah^\theta H^r=Ah^\theta h^r=Ah^{\theta+r}=Ah^\alpha=f(h)$$

10.4 Positive Externality: The Recursive Competitive Equilibrium

In the following discussion, we will consider the economy in which agents make individual decisions for their own.

10.4.1 Set-up

  • Same as in the social planner's problem, the social average human capital \(H\) enters the technology as a positive externality: \(F(h,H)=Ah^\theta H^r\), where \(\theta>0\), \(r>0\) and \(\hat\alpha\equiv\theta+r<1\).
  • There are two state variables for the agent's problem: \(h\) and \(H\).
  • The agent's conjecture of average human capital in the next period is denoted as \(\Phi(H)\), which is based on \(H\), the average human capital of the current period.
  • The agent's optimal choice is defined as \(y^\star=\phi(h,H,\Phi)\), which depends on \(\Phi\).
  • The value function is now written as \(W(h,H;\Phi)\).

10.4.2 The Bellman equation $$W(h,H)=\max_{y\in[0,F(h,H)]}\{u(F(h,H)-y)+\beta W(y,\Phi(H))\}$$ Here \(\Phi(H)\) is the conjecture about the choice of everyone else.

10.4.3 Recursive competitive equilibrium

A recursive competitive equilibrium is defined by a function \(\Phi^e(H=h)\) with the property $$\phi^\star(h,H=h,\Phi^e(H=h))=\Phi^e(H=h)$$ i.e. \(\phi^\star(h,h,\Phi^e(h))=\Phi^e(h)\).

Tip

Remark 10.3 The argument \(H\) in \(\phi^\star(h,H=h,\Phi^e(H=h))=\Phi^e(H=h)\) takes the value \(h\) because we assume all agents are identical and they start with the same \(h\) in the current period, so in the current period the social average \(H\) is exactly \(h\). The property \(\phi^\star(h,h,\Phi^e(h))=\Phi^e(h)\) of the recursive competitive equilibrium insures that agents will still be exactly the same in the next period. In the recursive competitive equilibrium, the conjecture realizes itself.

10.4.4 模型结果

  • 刻画 \(y^\star\) 的 f.o.c.:\(u'(F(h,H)-y^\star)=\beta W_1(y^\star,\Phi^e(H))\),即 \(u'(F(h,h)-y^\star)=\beta W_1(y^\star,y^\star)\)(\(W_1\) 表示关于 \(W\) 第一个参数的偏导)。
  • 包络条件(EC):\(W_1(h,H)=F_1(h,H)u'(F(h,H)-y^\star)\),即 \(W_1(h,h)=F_1(h,h)u'(F(h,h)-y^\star)\)(\(F_1\) 表示关于 \(F\) 第一个参数的偏导)。
  • 欧拉方程(EE):合并 f.o.c. 与包络条件: $$u'(F(h_t,h_t)-h_{t+1})=\beta F_1(h_{t+1},h_{t+1})u'(F(h_{t+1},h_{t+1})-h_{t+2})$$
  • 稳态满足: $$\begin{aligned}u'(F(h_{ss},h_{ss})-h_{ss})&=\beta F_1(h_{ss},h_{ss})u'(F(h_{ss},h_{ss})-h_{ss})\\\Rightarrow1&=\beta F_1(h_{ss},h_{ss})=\beta\theta Ah_{ss}^{\theta-1}h_{ss}^r=\beta\theta Ah_{ss}^{\theta+r-1}\\\Rightarrow\tilde h_{ss}&=(\beta A\theta)^{\frac{1}{1-(r+\theta)}}\end{aligned}$$ 若定义 \(\theta=\alpha\)、\(r+\theta=\hat\alpha\),则 \(\tilde h_{ss}=(\beta A\alpha)^{1/(1-\hat\alpha)}\)。
Tip

注记 10.4 回忆社会计划者带外部性的问题中 \(\bar h_{ss}=(\beta A\hat\alpha)^{1/(1-\hat\alpha)}\)。由 \(\hat\alpha>\alpha\),有 \(\bar h_{ss}>\tilde h_{ss}\),即社会计划者的稳态大于递归竞争均衡的稳态。换言之,递归竞争均衡非帕累托最优。第一福利定理在此不成立,因为人力资本积累中存在正外部性。为刻画一个有外部性的竞争均衡经济,我们必须使用易处理的递归竞争均衡、而非社会计划者问题,因为第二福利定理也失败——故社会计划者的结果无法由该真实经济的竞争均衡实现。

10.4.4 Results of the model

  • The f.o.c. that characterizes \(y^\star\): \(u'(F(h,H)-y^\star)=\beta W_1(y^\star,\Phi^e(H))\), i.e. \(u'(F(h,h)-y^\star)=\beta W_1(y^\star,y^\star)\) (\(W_1\) means the partial derivative w.r.t. \(W\)'s first argument).
  • The Envelop Condition (EC): \(W_1(h,H)=F_1(h,H)u'(F(h,H)-y^\star)\), i.e. \(W_1(h,h)=F_1(h,h)u'(F(h,h)-y^\star)\) (\(F_1\) means the partial derivative w.r.t. \(F\)'s first argument).
  • The Euler Equation (EE): combine the f.o.c. and the Envelop condition: $$u'(F(h_t,h_t)-h_{t+1})=\beta F_1(h_{t+1},h_{t+1})u'(F(h_{t+1},h_{t+1})-h_{t+2})$$
  • The steady state satisfies: $$\begin{aligned}u'(F(h_{ss},h_{ss})-h_{ss})&=\beta F_1(h_{ss},h_{ss})u'(F(h_{ss},h_{ss})-h_{ss})\\\Rightarrow1&=\beta F_1(h_{ss},h_{ss})=\beta\theta Ah_{ss}^{\theta-1}h_{ss}^r=\beta\theta Ah_{ss}^{\theta+r-1}\\\Rightarrow\tilde h_{ss}&=(\beta A\theta)^{\frac{1}{1-(r+\theta)}}\end{aligned}$$ If we define \(\theta=\alpha\) and \(r+\theta=\hat\alpha\), then \(\tilde h_{ss}=(\beta A\alpha)^{1/(1-\hat\alpha)}\).
Tip

Remark 10.4 Recall that in the social planner's problem with externality \(\bar h_{ss}=(\beta A\hat\alpha)^{1/(1-\hat\alpha)}\). Since \(\hat\alpha>\alpha\), we have that \(\bar h_{ss}>\tilde h_{ss}\), which means that the steady state for the social planner is greater than the steady state for the recursive competitive equilibrium. In other words, the recursive competitive equilibrium is not Pareto optimal. The First Welfare Theorem doesn't hold here because there is positive externality in the accumulation of human capital. In order to characterize a competitive equilibrium economy with externality, we have to use the tractable recursive competitive equilibrium instead of the social planner's problem because the Second Welfare Theorem also fails so the social planner's outcome cannot be achieved by that real economy's competitive equilibrium.

10.5 An Example of Deterministic Dynamic Programming with Finite Horizons

10.5.1 设定

与无穷期情形相同: - 状态变量 \(x\in X\),状态空间 \(X\in\mathbb R^n\) 凸且紧。 - 对应 \(\Gamma:X\rightrightarrows X\) 非空、紧值、连续。 - 期间回报函数 \(F(x,y):X\times X\to\mathbb R\) 有界、关于 \(x\) 与 \(y\) 连续。 - 贴现因子 \(\beta\in(0,1)\)。 - 另外,把期数固定为 \(N\)(有限)。

10.5.2 贝尔曼方程

可用倒向归纳(backward induction) 法:先导出最后一期的值方程,再每次往回一步导出之前各期的值函数。 - 末期 \(N\) 的贝尔曼方程:\(V(x,N)=\max_{y\in\Gamma(x)}F(x,y)\)。 - 期 \(n=1,2,\dots,N-1\) 的贝尔曼方程:\(V(x,n)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y,n+1)\}\)。

10.5.3 值函数的性质

现有值函数序列 \(\{V(x,n)\}_{n=1}^N\),考虑其性质。 - 对末期 \(N\),\(V(x,N)=\max_{y\in\Gamma(x)}F(x,y)\): - 存在:因 \(F(x,y)\) 在紧集上连续,故在 \(X\) 上取得最大。 - 唯一:若假设 \(F(x,y)\) 严格拟凹。 - 关于 \(x\) 连续:因 \(F(x,y)\) 关于 \(x,y\) 连续、\(\Gamma:X\rightrightarrows X\) 作为对应连续,用最大值定理。 - 关于 \(x\) 严格凹:若假设 \(F(x,y)\) 关于 \(x,y\) 严格凹、\(\Gamma:X\rightrightarrows X\) 作为对应凸。 - 关于 \(x\) 严格递增:若假设 \(F(x,y)\) 关于 \(x\) 严格递增、\(\Gamma:X\rightrightarrows X\) 作为对应单调。 - 关于 \(x\) 可微:若假设 \(F(x,y)\) 在 \(x\in\text{int}(X)\) 可微、\(g(x)\in\text{int}(\Gamma(x))\)。 - 对期 \(n=1,2,\dots,N-1\),\(V(x,n)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y,n+1)\}\):\(V(x,n)\) 的所有所需性质(存在、唯一、关于 \(x\) 连续、关于 \(x\) 严格凹、关于 \(x\) 严格单调、可微)成立,若这些性质对 \(V(x,n+1)\) 成立。

10.5.1 Set-up

Same as in the infinite horizon case, we have: - state variable \(x\in X\), with state space \(X\in\mathbb R^n\) convex and compact. - correspondence \(\Gamma:X\rightrightarrows X\) non-empty, compact valued and continuous. - period return function \(F(x,y):X\times X\to\mathbb R\) bounded and continuous in \(x\) and \(y\). - discounting factor \(\beta\in(0,1)\). - In addition, fix the number of periods as \(N\), which is finite.

10.5.2 Bellman equations

We can use the backward induction methodology to first derive the value equation for the last period and then go one step back each time to derive value functions for all the previous periods. - The Bellman equation for the ending period \(N\): \(V(x,N)=\max_{y\in\Gamma(x)}F(x,y)\). - The Bellman equation for period \(n=1,2,\dots,N-1\): \(V(x,n)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y,n+1)\}\).

10.5.3 Properties of the value functions

Now we have a sequence of value functions \(\{V(x,n)\}_{n=1}^N\). Let's consider the properties of them. - For the ending period \(N\), \(V(x,N)=\max_{y\in\Gamma(x)}F(x,y)\): - exists: because \(F(x,y)\) is continuous on a compact set, so it attains maximum in \(X\). - is unique: if we assume that \(F(x,y)\) is strictly quasi-concave. - is continuous in \(x\): because \(F(x,y)\) is continuous in \(x\) and \(y\) and \(\Gamma:X\rightrightarrows X\) is continuous as a correspondence, we can use the Theorem of Maximum. - is strictly concave in \(x\): if we assume \(F(x,y)\) is strictly concave in \(x\) and \(y\), and \(\Gamma:X\rightrightarrows X\) is convex as a correspondence. - is strictly increasing in \(x\): if we assume \(F(x,y)\) is strictly increasing in \(x\) and \(\Gamma:X\rightrightarrows X\) is monotone as a correspondence. - is differentiable in \(x\): if we assume \(F(x,y)\) is differentiable in \(x\in\text{int}(X)\) and \(g(x)\in\text{int}(\Gamma(x))\). - For period \(n=1,2,\dots,N-1\), \(V(x,n)=\max_{y\in\Gamma(x)}\{F(x,y)+\beta V(y,n+1)\}\): all the desired properties of \(V(x,n)\) (i.e. existence, uniqueness, continuity in \(x\), strict concavity in \(x\), strict monotonicity in \(x\), and differentiability) hold if these properties hold for \(V(x,n+1)\).