12. Stochastic Dynamic Programming: Commodity Model

Jun He June 01, 2026

宏观经济学Macroeconomics 随机动态规划Stochastic Dynamic Programming 商品模型Commodity Model 存储Storage 社会剩余Social Surplus 贝尔曼方程Bellman Equation 学习笔记Study Note

Note

本章主题：随机动态规划的商品模型。 §12.1 设定：需求平稳 $P=D(q)$（向下倾斜、严正、连续）；供给——代表性卖家每期获随机禀赋 $\omega_t\in(m,M)$（i.i.d.、密度 $\mu(\omega)$、无生产成本）；社会剩余 $u(q)=\int_0^q D(x)dx$（严增、严凹、连续可微）；存储成本 $\phi(y)$（严增、严凸、$\phi(0)=0$）。§12.2 贝尔曼方程：两状态 $V(x,\omega)=\max_{y\in[0,x+\omega]}\{u(x+\omega-y)-\phi(y)+\beta\int_m^M V(y,\omega')\mu(\omega')d\omega'\}$；单状态 $s=x+\omega$（$y$=结转存货）$V(s)=\max_{y\in[0,s]}\{u(s-y)-\phi(y)+\beta\int_m^M V(y+\omega')\mu(\omega')d\omega'\}$；f.o.c. $u'(s-y)+\phi'(y)\ge\beta\int_m^M V'(y+\omega')\mu(\omega')d\omega'$（内点取等，LHS 增 RHS 减 ⟹ 唯一 $y^\star=g(s)$ 且随 $s$ 递增）。

Note

Chapter theme: the commodity model of stochastic dynamic programming. §12.1 Set-up: stationary demand $P=D(q)$ (downward sloping, strictly positive, continuous); supply — the representative seller gets a random endowment $\omega_t\in(m,M)$ each period (i.i.d., density $\mu(\omega)$, no production cost); social surplus $u(q)=\int_0^q D(x)dx$ (strictly increasing, strictly concave, continuously differentiable); cost of storage $\phi(y)$ (strictly increasing, strictly convex, $\phi(0)=0$). §12.2 Bellman equation: two states $V(x,\omega)=\max_{y\in[0,x+\omega]}\{u(x+\omega-y)-\phi(y)+\beta\int_m^M V(y,\omega')\mu(\omega')d\omega'\}$; one state $s=x+\omega$ ($y$ = inventory carried over) $V(s)=\max_{y\in[0,s]}\{u(s-y)-\phi(y)+\beta\int_m^M V(y+\omega')\mu(\omega')d\omega'\}$; f.o.c. $u'(s-y)+\phi'(y)\ge\beta\int_m^M V'(y+\omega')\mu(\omega')d\omega'$ (equality at interior; LHS increasing, RHS decreasing ⟹ unique $y^\star=g(s)$ increasing in $s$).

12.1 Set-up

12.1.1 需求

设需求平稳（即不随 $t$ 变化）：$P=D(q)$，其中 $q$ 是需求且被消费的量。假设函数 $D$ 向下倾斜、严格为正、连续。

12.1.2 供给

每期，代表性卖家获得禀赋 $\omega_t\in(m,M)$，\(0

12.1.3 社会剩余

定义社会剩余为 $$u(q)=\int_0^q D(x)dx$$ 由于 $D$ 向下倾斜、严正、连续，$u$ 关于 $q$ 严格递增、严格凹、连续可微。

12.1.4 存储成本

定义把 $y$ 单位商品存储一期的成本为 $\phi(y)$，其中 $\phi(y)$ 连续、关于 $y$ 严格递增、严格凸、连续可微、且 $\phi(0)=0$。

12.1.1 Demand

Suppose the demand is stationary (i.e. doesn't change along with $t$) as $P=D(q)$ where $q$ is the amount demanded and consumed. Assume that the function $D$ is downward sloping, strictly positive, and continuous.

12.1.2 Supply

In each period, the representative seller gets an endowment $\omega_t\in(m,M)$ with \(0

12.1.3 Social surplus

Define the social surplus as $$u(q)=\int_0^q D(x)dx$$ Since we supposed that $D$ is downward sloping, strictly positive and continuous, $u$ is strictly increasing in $q$, strictly concave in $q$, and continuously differentiable.

12.1.4 Cost of storage

Define the cost of storing $y$ units of commodity for one period as $\phi(y)$ where $\phi(y)$ is continuous, strictly increasing in $y$, strictly convex in $y$, continuously differentiable, and $\phi(0)=0$.

12.2 The Bellman Equation

12.2.1 两状态变量的贝尔曼方程

考虑状态变量 $S=(x,\omega)$，其中内生状态 $x$ 是当前期从上期结转的初始存货、$\omega$ 是外生随机冲击。社会计划者的贝尔曼方程为 $$V(x,\omega)=\max_{y\in\Gamma(x,\omega)}\left\{u(x+\omega-y)-\phi(y)+\beta\int_m^M V(y,\omega')\mu(\omega')d\omega'\right\}$$ 其中 $\Gamma(x,\omega)=[0,x+\omega]$。

12.2.2 仅一个状态变量的贝尔曼方程

考虑状态变量 $s=x+\omega$，$x$、$\omega$ 同前。$s$ 表示当前期的总供给。注意 $y$ 不是下一期的总供给，而是结转到下一期的存货。社会计划者的贝尔曼方程为 $$V(s)=\max_{y\in\Gamma(s)}\left\{u(s-y)-\phi(y)+\beta\int_m^M V(y+\omega')\mu(\omega')d\omega'\right\}$$ 其中 $\Gamma(s)=[0,s]$。

12.2.3 一阶条件

仅一个状态变量的贝尔曼方程 $$V(s)=\max_{y\in\Gamma(s)}\left\{u(s-y)-\phi(y)+\beta\int_m^M V(y+\omega')\mu(\omega')d\omega'\right\}$$ 有一阶条件 $$u'(s-y)+\phi'(y)\ge\beta\int_m^M V'(y+\omega')\mu(\omega')d\omega'$$ 在 $y^\star>0$（内点解）时取等号。注意可对 $u$ 假设 Inada 条件以排除角点解 $y^\star=s$。由于 LHS 关于 $y$ 递增、RHS 关于 $y$ 递减，故有唯一解 $y^\star=g(s)$。此外，更大的 $s$ 把 LHS 下移、使最优策略更大，即 $\hat s>s\Rightarrow g(\hat s)>g(s)$，故 $g$ 关于 $s$ 递增。

12.2.1 Bellman equation with two state variables

Consider the state variable $S=(x,\omega)$ where the endogenous state $x$ is the current period's initial inventory carried over from last period and $\omega$ is the exogenous stochastic shock. The Bellman equation for the social planner is as follows. $$V(x,\omega)=\max_{y\in\Gamma(x,\omega)}\left\{u(x+\omega-y)-\phi(y)+\beta\int_m^M V(y,\omega')\mu(\omega')d\omega'\right\}$$ with $\Gamma(x,\omega)=[0,x+\omega]$.

12.2.2 Bellman equation with only one state variable

Consider the state variable $s=x+\omega$ where $x$ and $\omega$ are defined as before. Here $s$ means the total supply of the current period. And $y$ doesn't mean the total supply of next period; instead, $y$ means the inventory carried over to next period. The Bellman equation for the social planner is as follows. $$V(s)=\max_{y\in\Gamma(s)}\left\{u(s-y)-\phi(y)+\beta\int_m^M V(y+\omega')\mu(\omega')d\omega'\right\}$$ with $\Gamma(s)=[0,s]$.

12.2.3 The first-order condition

The Bellman equation with only one state variable $$V(s)=\max_{y\in\Gamma(s)}\left\{u(s-y)-\phi(y)+\beta\int_m^M V(y+\omega')\mu(\omega')d\omega'\right\}$$ has the f.o.c. $$u'(s-y)+\phi'(y)\ge\beta\int_m^M V'(y+\omega')\mu(\omega')d\omega'$$ with equality if $y^\star>0$ (interior solution). Note that we can assume an Inada condition on $u$ to rule out the corner solution that $y^\star=s$. Since the LHS is increasing in $y$ and the RHS is decreasing in $y$, there is a unique solution for $y^\star=g(s)$. Moreover, a larger $s$ will shift down the LHS so that the optimal policy is larger, i.e. for $\hat s>s$, $g(\hat s)>g(s)$. Therefore, $g$ is increasing in $s$.