7. Implications of Existence and Equivalence Theorems

Jun He June 02, 2026

中文English 存在性定理Existence Theorems Roll批判Roll Critique 样本内拟合In-Sample Fishing 联合假设Joint Hypothesis 因子个数Number of Factors

7. Implications of Existence and Equivalence Theorems

Note

本章导读本章（Cochrane 第 7 章）几乎不含公式，却是方法论上最重要的一章。贴现因子存在定理与三视角等价定理，对如何做与评价实证有深刻含义：① \(p=\mathbb E(mx)\) 本身"无害"（innocuous），全部内容都在 \(m=f(\text{data})\) 的设定里；② 定理对样本矩同样成立——任何能拟合样本的事后有效组合都"按构造"定价，故"在样本里捞模型 (fishing)"必然成功而毫无意义；③ 因此需要纪律（经济理论 + 样本外检验）；④ 因子模仿组合的含义；⑤ 理性/有效性的联合假设问题；⑥ "因子个数"是个伪问题；⑦ 为何贴现因子语言正取代均值方差/贝塔语言。

7. Implications of Existence and Equivalence Theorems

Note

Overview This chapter (Cochrane Ch 7) has almost no formulas but is the most important methodologically. The existence of a discount factor and the equivalence of the three views have deep implications for how we do and evaluate empirical work: ① \(p=\mathbb E(mx)\) is itself innocuous — all content lives in the specification \(m=f(\text{data})\); ② the theorems hold for sample moments too, so any ex-post efficient portfolio that fits a sample prices it "by construction," making "fishing" for a model guaranteed to succeed yet meaningless; ③ hence the need for discipline (economic theory + out-of-sample checks); ④ the meaning of factor-mimicking portfolios; ⑤ the joint-hypothesis problem for rationality/efficiency; ⑥ why "the number of factors" is a non-question; ⑦ why discount-factor language is taking over from mean-variance/beta.

\(p=\mathbb E(mx)\) 是"无害"的 / \(p=\mathbb E(mx)\) is Innocuous

Roll (1977) 之前，期望收益-贝塔表示总是在特定显式经济模型（尤其 CAPM）下导出的，任何贝塔模型的实证成功似乎都是整套结构的胜利。Roll 证明均值方差有效性蕴含单贝塔表示后，一切都变了：由于总存在某个均值方差有效收益，某个单贝塔表示总是存在。资产定价模型真正断言的，只是某个特定收益（比如"市场收益"）是否均值方差有效。于是"检验 CAPM"的关键，变成必须对参照组合极其挑剔，以防撞上某个恰好均值方差有效、因而"按构造"给资产定价的组合。

\(p=\mathbb E(mx)\) is Innocuous

Before Roll (1977), expected return-beta representations were derived inside specific explicit economic models (especially the CAPM), and any beta model's empirical success seemed to vindicate the whole structure. After Roll showed that mean-variance efficiency implies a single-beta representation, that changed: since some mean-variance efficient return always exists, some single-beta representation always exists. The asset pricing model only really claims that one particular return (say "the market") is mean-variance efficient. So "testing the CAPM" becomes a matter of being very choosy about the reference portfolio, to guard against stumbling on something that happens to be efficient and hence prices assets by construction.

Important

好消息与坏消息 / The good news and the bad news 由一价定律存在某个 \(m\) 使 \(p=\mathbb E(mx)\)，这只是 Roll 定理的现代重述。好消息：你总可以从 \(p=\mathbb E(mx)\) 出发，几乎没施加任何结构，无需担心隐藏的或有索取权或代表性消费者假设。坏消息：你也因此没走多远——全部经济、统计与预测内容都在挑选 \(m=f(\text{data})\) 这一步。That the law of one price gives some \(m\) with \(p=\mathbb E(mx)\) is just a modern restatement of Roll's theorem. Good news: you can always start from \(p=\mathbb E(mx)\) having imposed almost no structure, free of criticism about hidden contingent-claim or representative-consumer assumptions. Bad news: you have not gotten very far — all the economic, statistical, and predictive content comes in picking the discount factor model \(m=f(\text{data})\).

Roll 的洞见自然导向用更宽的财富指数（Stambaugh 1982）作参照组合，但这条路没流行起来：股票用股票因子、债券用债券因子定价；按规模、账面市值比、过往表现排序的股票，由按这些特征排序的组合定价（Fama and French 1993, 1996）。部分原因是各资产类别相关性不高、贝塔很小，故一类风险溢价对另一类平均收益影响甚微；含人力资本、房地产的更全面财富度量又缺乏高频价格数据。这留下一丝令研究者既不安又兴奋的怀疑：市场或许（尤其在高频上）是分割的。

事前与事后 / Ex Ante and Ex Post

定理对任何概率都成立——主观概率、客观总体概率、或某给定样本里实现的矩，皆可。因此事前、事后同样适用。这意味着：若一价定律在样本里成立，就能用样本矩构造一个 \(x^*\)，使 \(p(x)=E(x^*x)\) 在该样本里精确成立（\(p\) 是观测价格、\(E\) 是样本均值）；等价地，若收益的样本协方差阵非奇异，就存在一个事后均值方差有效组合，其样本平均收益与样本回归贝塔完全对齐。

Roll's insight led naturally to broader wealth indices (Stambaugh 1982) as the reference portfolio, but that route never caught on: stocks are priced by stock factors, bonds by bond factors; stocks sorted on size, book/market, and past performance are priced by portfolios sorted on those characteristics (Fama and French 1993, 1996). Part of the reason is that asset classes are not highly correlated and betas are small, so risk premia from one source barely affect another set of average returns; more comprehensive wealth measures (human capital, real estate) lack high-frequency price data. This leaves a nagging (and, to a researcher, exciting) suspicion that markets may be segmented, especially at high frequency.

Ex Ante and Ex Post

The theorems hold for any set of probabilities — agents' subjective probabilities, objective population probabilities, or the moments realized in a given sample. So they work ex ante and ex post alike. This means: if the law of one price holds in a sample, one can form an \(x^*\) from sample moments such that \(p(x)=E(x^*x)\) holds exactly in that sample (\(p\) is observed prices, \(E\) is the sample average); equivalently, if the sample covariance matrix of returns is nonsingular, there exists an ex-post mean-variance efficient portfolio whose sample average returns line up exactly with sample regression betas.

Warning

捞模型的危险 / The danger of fishing 这指出了"搜寻并统计评价临时 (ad hoc) 资产定价模型"这一普遍做法的巨大危险：只要对 \(m\) 或参照组合里放什么施加足够少的结构，模型在样本里注定成功。模型之所以不完美，唯一原因是研究者对因子的个数/身份、或因子到 \(m\) 的函数参数所施加的限制——而这些限制正是模型的全部内容，因此必须有趣、清晰刻画、动机充分！可惜大量实证研究恰恰相反：从一池 ad hoc 因子里捞一阵，报告"成功"（在定价一组组合时未被统计拒绝）。This exposes a great danger in the widespread exercise of searching for and statistically evaluating ad hoc asset pricing models: with little enough structure on what goes into \(m\) or the reference portfolios, a model is guaranteed success in-sample. The only reason it is not perfect is the restrictions the researcher imposes on the number/identity of factors, or the parameters of the function relating factors to \(m\) — and those restrictions are the model's entire content, so they had better be interesting, carefully described, and well motivated! Sadly, most empirical work does the opposite: fish around a pond of ad hoc factors and report "success" (not statistically rejected in pricing a set of portfolios).

纪律 / Discipline

那么，用一个"按构造"定价的事后有效组合或 \(x^*\) 有什么不对？错在：在这个样本里事后有效、因而给所有资产定价的组合，在下一个样本里几乎不可能仍均值方差有效，因而对未来的定价很差。同理，用样本二阶矩构造的 \(x^*=p'E(xx')^{-1}x\)，其所需权重 \(p'E(xx')^{-1}\) 会逐样本剧烈变化。即便 CAPM 为真、市场组合事前有效，它在任一给定样本里也几乎不会事后有效——事后有效组合是"星期一早上的马后炮"，让你重仓那些在样本里恰好走运的资产。"要是 1982 年买了微软就好了"，对今天构造有效组合毫无指导意义（事实上，均值回复与账面市值比效应表明，过去异常好的资产未来往往表现差！）。

Discipline

So what is wrong with using an ex-post efficient portfolio or \(x^*\) that prices by construction? The mistake: a portfolio ex-post efficient in this sample, and so pricing all assets in it, is unlikely to be efficient in the next sample, and so prices the future poorly. Likewise the \(x^*=p'E(xx')^{-1}x\) built from the sample second-moment matrix has weights \(p'E(xx')^{-1}\) that swing wildly from sample to sample. Even if the CAPM is true and the market is ex-ante efficient, it is unlikely to be ex-post efficient in any given sample — an ex-post efficient portfolio is a "Monday-morning quarterback," loading up on whatever was lucky in-sample. "If only I'd bought Microsoft in 1982…" is no guide to forming an efficient portfolio today (indeed, mean-reversion and book/market effects suggest past winners tend to do poorly going forward!).

唯一的出路是施加某种纪律，以免捞出虚假的样本内拟合。这与传统回归 \(y=x'\beta+\varepsilon\) 同理：盲目堆右侧变量可得任意好的拟合，但样本外不稳定、无用。计量经济学家纠结此事约 50 年，最好的答案是：(1) 用经济理论审慎设定右侧变量；(2) 用一整套跨样本与样本外稳健性检验。可惜两条都难行：经济理论对该放什么变量往往沉默，或允许极大范围——Fama (1991) 把 APT 与 ICAPM 称作"捕鱼执照 (fishing license)"，可为几乎任何想要的因子背书。样本外检验则常揭示模型不稳、需要修改；一改，就再没有样本外可查了。而且即便某研究者纯粹到肯等 50 年新样本，其竞争者和期刊编辑也不会这么有耐心。

依本书作者之见，寻找样本外、跨市场稳健的定价因子，最好的希望是理解风险的宏观经济根源——像那个命运多舛的消费模型经 \(m_{t+1}=\beta u'(c_{t+1})/u'(c_t)\) 把资产价格系于宏观事件那样。消费模型近年失宠，但经验定出的风险因子其个数与身份也不稳定（自 Merton 1973a、Ross 1976a 开创多因子模型以来，标准风险因子集约每两年换一茬）。Lettau and Ludvigson (2001a) 那种为经验因子寻找宏观解释的努力，或是有用的折中。

The only way out is to impose some discipline against dredging up spuriously good in-sample pricing. It is the same as in regression \(y=x'\beta+\varepsilon\): blindly adding right-hand variables yields arbitrarily good fit but is unstable and useless out of sample. Econometricians have wrestled with this for ~50 years, and the best answers are: (1) use economic theory to specify the right-hand side carefully; (2) use a battery of cross-sample and out-of-sample stability checks. Alas, both are hard: economic theory is usually silent on which variables to include, or allows a huge range — Fama (1991) calls APT and ICAPM a "fishing license" for almost any desired factor. Out-of-sample tests often reveal instability and force changes; once changed, there is no out-of-sample left. And even a researcher pure enough to wait 50 years for a fresh sample has impatient competitors and editors.

In the author's view, the best hope for pricing factors robust out-of-sample and across markets is to understand the macroeconomic sources of risk — tying asset prices to macro events the way the ill-fated consumption model does via \(m_{t+1}=\beta u'(c_{t+1})/u'(c_t)\). The consumption model has lost favor, but empirically determined factors are unstable too (since Merton 1973a and Ross 1976a launched multifactor models, the standard set has changed about every two years). Efforts like Lettau and Ludvigson (2001a) to find macro explanations for empirical factors may be a useful compromise.

我们当前评价模型与论文的统计方法论，对这种纪律有天然抵触：当上一位作者捞出一个产生 1% 平均定价误差的 ad hoc 模型，你很难说服读者、审稿人、编辑与客户，相信你那个有经济动机、却有 2% 平均误差的模型更有趣。你的模型也许真的更好、会在那个被捞出的模型随金融时尚凋零后继续样本外表现良好，但很难越过样本内拟合的统计度量。人们渴望一个对"捞鱼"次数的正式度量（如 \(R^2\) 的自由度校正）；没有数值校正，就只能用判断力，按产生这些表面统计成功所做的经济与统计捞鱼量，把它们打个折扣。

因子模仿组合 / Mimicking Portfolios

\(x^*=\operatorname{proj}(m|X)\) 也有实证含义：任何模型的定价含义都可由其因子模仿组合等价表示。若驱动 \(m\) 的经济变量含测量误差，则真 \(m\) 的因子模仿组合，会比用实测宏观变量估计的 \(m\) 定价更准。因此，拿经济上有意义的模型去和"用组合收益作因子"的模型做统计赛马，多半不明智：即使经济模型为真且测量完美，在大样本里它也只能等于自己的因子模仿组合；一旦有测量误差，它就会输给自己的模仿组合；而二者在样本内又总会输给那些找到近似事后有效组合的 ad hoc 因子模型。

Our current statistical methodology for evaluating models and papers naturally resists such discipline: when the last author fished up an ad hoc model with 1% average pricing errors, it is hard to persuade readers, referees, editors, and clients that your economically motivated model with 2% errors is more interesting. Your model may truly be better and keep performing out-of-sample after the fished one falls by the wayside of financial fashion — but it is hard to get past in-sample statistical fit. One hungers for a formal count of the hurdles imposed on a fishing expedition, like the degrees-of-freedom correction in \(R^2\). Absent a numerical correction, we must use judgment to discount apparent statistical successes by the amount of economic and statistical fishing that produced them.

Mimicking Portfolios

\(x^*=\operatorname{proj}(m|X)\) also has empirical bite: any model's pricing implications are equivalently captured by its factor-mimicking portfolio. If the economic variables driving \(m\) have measurement error, the mimicking portfolio for the true \(m\) will price assets better than an estimate of \(m\) using the measured macro variables. So it is usually unwise to run a statistical horse race of an economically interesting model against models that use portfolio returns as factors: even if the economic model is true and perfectly measured, in large samples it just equals its own factor-mimicking portfolio; with any measurement error it underperforms it; and both always lose in-sample to ad hoc factor models that find nearly ex-post efficient portfolios.

这并非说要绕过"理解真正宏观因子"的过程、径直去捞模仿组合。实务经验印证了这一忠告：靠大量统计分析（即捞鱼）得到的大型商业因子模型样本外表现糟糕——其因子与载荷 \(\beta\) 总在变。但确有一席之地留给"用收益作因子"的模型：找到底层宏观因子后，从业者宜每日（乃至每分钟）盯着因子模仿组合，因为模仿组合有高频好数据，许多用途并不需要理解模型的经济内涵。

非理性与联合假设 / Irrationality and the Joint Hypothesis

金融学长期争论市场的"理性/非理性"、"有效/无效"。许多实证论文被包装成"市场无效"或"投资者非理性"的证据（1987 年 10 月股灾、小公司/账面市值比/季节效应、长期可预测性等）。但这些谜题无一记录了可利用的套利机会。因此我们知道：必定存在一个"理性模型"——一个随机贴现因子、一个可用于单贝塔表示的有效组合——能把它们统统合理化。而且可以自信地预言这一状况会持续：真套利机会活不长！

This does not say to bypass understanding the true macro factors and just fish for mimicking portfolios. Practitioner experience bears this out: large commercial factor models from extensive statistical analysis (a.k.a. fishing) perform poorly out-of-sample, as shown by factors and loadings \(\beta\) that change all the time. But there is an important place for return-factor models: once the underlying macro factors are found, practitioners are well advised to watch the mimicking portfolio day-by-day (even minute-by-minute), since good high-frequency data on it are available and many purposes do not require understanding the economics.

Irrationality and the Joint Hypothesis

Finance has a long history of fighting over "rationality vs. irrationality" and "efficiency vs. inefficiency." Many empirical papers are sold as evidence of "inefficiency" or "irrationality" (the October 1987 crash; small-firm, book/market, seasonal effects; long-term predictability). But none of these puzzles documents an exploitable arbitrage opportunity. So we know there is a "rational model" — a stochastic discount factor, an efficient portfolio for a single-beta representation — that rationalizes them all. And we can confidently predict this will continue: real arbitrage opportunities do not last long!

Important

Fama 的联合假设 / Fama's joint hypothesis Fama (1970)：任何"有效性"检验都是有效性与某"市场均衡模型"（即一个资产定价模型、一个 \(m\) 的模型）的联合检验。仅凭资产市场数据，无法决定性地证明市场"理性"与否——难怪 30 年、数千篇论文未把争论推近分毫。市场当然可以非理性/无效而不含套利机会，当且仅当生成价格的贴现因子与实体经济中的边际替代率/转换率脱节；但这又把我们拉回到设定并检验 \(m\) 的经济模型。最多某谜题严重到所需 \(m\) 是"不合理"的边际替代率度量——可我们仍得先说清什么样的边际替代率才算合理。Fama (1970): any test of "efficiency" is a joint test of efficiency and a "model of market equilibrium" (i.e. an asset pricing model, a model of \(m\)). No test based only on asset-market data can conclusively show markets are "rational" or not — small wonder 30 years and thousands of papers have not moved the debate an inch. Markets can be irrational/inefficient without arbitrage, if and only if the discount factors generating prices are disconnected from marginal rates of substitution/transformation in the real economy; but that puts us right back to specifying and testing economic models of \(m\). At best a puzzle is so severe that the required \(m\) is an "unreasonable" measure of real marginal rates — but we still must say what a reasonable marginal rate looks like.

因子个数 / The Number of Factors

许多检验纠结于"给一组资产定价需要几个因子"。等价定理表明这是个伪问题。线性因子模型 \(m=b'f\)（及其等价贝塔模型）的表示不唯一：给定任意多因子表示，总能化为单贝塔表示——\(m=b'f\)、\(x^*=\operatorname{proj}(b'f|X)\) 或对应的 \(R^*\) 都给出与原多因子模型定价能力完全相同的单贝塔模型。也能轻易改写成不同（大于一）的因子数，例如把三因子 \(m=a+b_1f_1+b_2f_2+b_3f_3\) 合并为两因子 \(a+b_1f_1+b_2\hat f_2\)。有时人们确实关心多因子表示（因子有线性组合后丢失的经济解释），但因子的纯粹个数不是有意义的问题。

贴现因子 vs. 均值、方差、贝塔 / Discount Factors vs. Mean, Variance, Beta

为何均值方差与贝塔语言先发展，而贴现因子语言似乎正在接管？资产定价起初把收益的均值与方差放在坐标轴上，而非如今的"状态 1 的支付、状态 2 的支付"。Markowitz (1952) 的问题问得极好：想用微观经济学"苹果与橙子、无差异曲线与预算集"的框架处理资产，难点是坐标轴贴什么标签。"IBM 股票""GM 股票"显然不行——投资者重视的不是证券本身，而是其随机现金流的某些方面。其天才在于把组合收益的均值与方差放上坐标轴，作为投资者评价组合的"享乐维度 (hedonics)"：均值越多越好、方差越少越好，并赋予定义在均值-方差上的"效用函数"，均值方差前沿就是"预算集"。随后自然认识到：每只证券的均值收益度量其对组合均值的贡献，对总组合的回归贝塔度量其对组合方差的贡献——逐证券的均值-贝塔描述便顺势而生（Sharpe 1964）。

The Number of Factors

Many tests obsess over "how many factors are needed to price a cross-section." The equivalence theorems show this is a non-question. A linear factor model \(m=b'f\) (and its equivalent beta model) is not a unique representation: given any multifactor representation, one can always reduce it to a single-beta representation — \(m=b'f\), \(x^*=\operatorname{proj}(b'f|X)\), or the corresponding \(R^*\) all give single-beta models with exactly the multifactor model's pricing ability. One can equally rewrite it with a different number (greater than one) of factors, e.g. collapsing a three-factor \(m=a+b_1f_1+b_2f_2+b_3f_3\) into two factors \(a+b_1f_1+b_2\hat f_2\). Sometimes a multifactor representation is of interest (factors carry economic meaning lost on combination), but the pure number of factors is not meaningful.

Discount Factors vs. Mean, Variance, and Beta

Why did mean-variance and beta language develop first, while discount-factor language seems to be taking over? Asset pricing began by putting the mean and variance of returns on the axes, rather than payoff-in-state-1, payoff-in-state-2 as we do now. Markowitz (1952) posed it just right: treat assets in the apples-and-oranges, indifference-curve-and-budget-set framework of microeconomics — the trouble was what to label the axes. "IBM stock" and "GM stock" won't do; investors value not securities per se but aspects of their random cash flows. The brilliant move was to put portfolio mean and variance on the axes as the "hedonics" by which investors value portfolios (more mean, less variance), with "utility functions" over mean and variance and the mean-variance frontier as the "budget set." Then naturally: each security's mean return measures its contribution to portfolio mean, and its regression beta on the portfolio measures its contribution to portfolio variance — the mean-return-versus-beta description for each security followed (Sharpe 1964).

Tip

为何偏爱贴现因子语言 / Why prefer discount-factor language 把"状态 1 的消费、状态 2 的消费"放上坐标轴（对状态依存消费设定偏好与预算约束），比把均值方差放上坐标轴，是标准微观经济学到金融学更自然的映射——至少或有索取权的预算约束是线性的，而均值方差前沿不是。早期对均值方差的聚焦是历史的偶然。贴现因子语言胜在简洁、普适、数学便利、优雅：一切定价都可从 \(p=\mathbb E(mx)\) 一式起步，统摄股票、债券、期权、实物投资机会（而贝塔表述在后几类上很笨拙）。无套利（正支付负价格）从未进入等价讨论：贝塔模型只能事后硬贴"确保正支付组合有正价格"，而贴现因子模型只需加 \(m>0\)。它还便于处理不同期限与现值表述：\(p_t=\mathbb E_t\sum_j m_{t,t+j}x_{t+j}\)。语言之选无关正态性或收益分布——前述全部讨论未作任何分布假设；二阶矩（贝塔、前沿方差）之所以出现，仅因 \(p=\mathbb E(mx)\) 本身含二阶矩。Putting consumption-in-state-1, consumption-in-state-2 on the axes (preferences and budget constraints over state-contingent consumption) is a more natural mapping of standard microeconomics into finance than mean and variance — if only because contingent-claim budget constraints are linear while the mean-variance frontier is not. The early focus on mean-variance was an accident of history. Discount-factor language wins on simplicity, generality, mathematical convenience, and elegance: every calculation starts from the one equation \(p=\mathbb E(mx)\), covering stocks, bonds, options, and real investment opportunities (where the beta formulation is cumbersome). Arbitrage (positive payoffs, negative prices) never entered the equivalence discussion: a beta model can only tack it on after the fact, while a discount-factor model just adds \(m>0\). It also handles horizons and present values easily: \(p_t=\mathbb E_t\sum_j m_{t,t+j}x_{t+j}\). The choice is not about normality or return distributions — no distributional assumption appears anywhere above; second moments (betas, frontier variance) show up only because \(p=\mathbb E(mx)\) itself involves a second moment.

小结 / Summary

存在性定理意味着没有捷径可证"理性"或"非理性"：解释资产价格的唯一正道，是思考贴现因子的经济模型。\(p=\mathbb E(mx)\) 无害、贝塔表示无害——全部内容在 \(m=f(\text{data})\) 的设定与约束里。实证的真正挑战不是拟合样本（那"按构造"总能做到），而是施加有经济动机、样本外稳健的纪律。"因子个数"是伪问题；贴现因子语言因其线性预算约束、普适性与对无套利/多期的自然处理而胜出。

Summary

The existence theorems mean there are no quick proofs of "rationality" or "irrationality": the only game in town for explaining asset prices is thinking about economic models of the discount factor. \(p=\mathbb E(mx)\) is innocuous, the beta representation is innocuous — all content is in the specification and restrictions of \(m=f(\text{data})\). The real empirical challenge is not fitting a sample (that succeeds "by construction") but imposing economically motivated, out-of-sample-robust discipline. "The number of factors" is a non-question; discount-factor language wins for its linear budget constraints, generality, and natural handling of no-arbitrage and multiple horizons.