Benchmarks, Risk Appraisal, and Manager Skill

How Level III tests benchmark quality, liability-based and asset-based benchmarks, benchmark misspecification, and appraisal of manager skill.

At Level III, a benchmark is not just a comparison convenience. It is part of the logic of evaluation. If the benchmark is wrong, attribution and appraisal can both look precise while still being misleading.

Why This Lesson Matters

Candidates often know a few appraisal metrics but miss the real question:

  • is the benchmark appropriate
  • is the benchmark liability-aware or only asset-based
  • are the measured results telling the truth about skill
  • are alternative investments or illiquid assets making benchmarking harder

Level III often rewards benchmark judgment before metric calculation.

Benchmarks Come In Different Forms For Different Purposes

Benchmark typeBest use
Liability-based benchmarkWhen the investor’s obligations are central to success
Asset-based benchmarkWhen the mandate is framed around investable market opportunity sets
Peer or custom composite benchmarkWhen the portfolio structure is too specific for a broad public index alone

The stronger answer starts by asking what success actually means for the investor.

Benchmark Quality Should Be Tested, Not Assumed

Benchmark-quality testWhy it matters
UnambiguousThe benchmark should be defined clearly enough that evaluation is consistent
InvestableThe manager should be able to implement something meaningfully comparable
MeasurableData should exist to evaluate it reliably
AppropriateIt should match the mandate, objective, and opportunity set
Specified in advanceIt should not be changed after performance is known

This quality logic often matters more than the exact benchmark label.

Benchmark Misspecification Can Corrupt Attribution And Appraisal

If the benchmark is wrongWhat goes wrong next
Too easyThe manager may look skillful without true value add
Too hard or structurally mismatchedThe manager may look weak for reasons unrelated to process quality
Misaligned with liability or investor objective“Outperformance” may still fail the real purpose of the portfolio

Level III often tests whether the candidate can spot that the benchmark problem came first.

Some Appraisal Metrics Are Useful, But None Are Self-Sufficient

The Sortino ratio is often written as:

$$ \text{Sortino ratio} = \frac{R_P - R_T}{\sigma_D} $$

where (R_T) is the target or required return and (\sigma_D) is downside deviation.

The appraisal ratio is commonly written as:

$$ \text{Appraisal ratio} = \frac{\alpha_P}{\sigma_{\varepsilon}} $$

where active return is scaled by residual or specific risk.

MetricWhat it emphasizesLimitation
Sortino ratioDownside-risk-adjusted performanceDepends on the chosen target return
Appraisal ratioActive value added per unit of residual riskOnly useful if the benchmark model is sensible
Upside or downside capture ratiosHow the manager behaves in up and down marketsCan hide path dependency and regime effects
Maximum drawdown and drawdown durationCapital-loss depth and recovery painSensitive to sample period and path

The exam may ask you to compute or interpret one, but it still expects judgment about fit and limitation.

Manager Skill Requires Separation Of Process From Luck

Apparent good resultStronger Level III question
High active returnWas the benchmark appropriate and was the risk intentional?
Strong downside capture profileWas it caused by process quality or by one favorable regime?
Low drawdownWas the portfolio simply running lower market exposure than the benchmark?

This is why skill evaluation is more than comparing one ratio.

Benchmarking Alternatives Is Harder

Alternative investments complicate evaluation because of:

  • stale or smoothed valuations
  • irregular cash flows
  • private-market comparability problems
  • limited public benchmarks

Level III often uses this to test whether the candidate realizes that neat public-market benchmarking tools may not translate cleanly.

How CFA-Style Questions Usually Test This

  • by asking whether a liability-based or asset-based benchmark fits better
  • by testing whether the benchmark passes quality checks
  • by asking what benchmark misspecification does to attribution and appraisal
  • by interpreting Sortino, appraisal ratio, capture ratios, or drawdown metrics
  • by asking whether apparent manager skill survives better benchmark scrutiny

Mini-Case

A private DB plan outperformed its asset-based policy benchmark, but the liability benchmark still deteriorated because discount-rate sensitivity was managed poorly. The manager’s appraisal ratio versus the policy benchmark looks attractive.

A weak answer praises the manager’s strong risk-adjusted performance.

A stronger answer asks whether the benchmark used for appraisal ignored the plan’s actual liability problem, making the skill conclusion incomplete.

Common Traps

  • treating any broad market index as a valid benchmark
  • using appraisal metrics without checking benchmark quality
  • assuming good capture ratios prove broad skill
  • ignoring the benchmarking difficulty of alternatives and liabilities

Sample CFA-Style Question

Why can a strong appraisal ratio still be misleading?

Best answer: Because the ratio may look strong even when the benchmark is misspecified or misaligned with the portfolio’s true objective.

Why: Level III rewards evaluation logic, not blind metric worship.

Continue In This Chapter

Revised on Wednesday, April 15, 2026