Multiple Regression, Assumptions, and Coefficient Logic

How Level II tests multiple-regression setup, coefficient interpretation, underlying assumptions, and residual-based judgment.

Level II regression questions are not mainly about pushing buttons. They are about deciding what the model is trying to explain, what each coefficient really means, and whether the assumptions needed for interpretation are even close to plausible.

Why This Lesson Matters

Candidates often miss regression item sets for one of four reasons:

  • they cannot identify the dependent variable that actually matters for the investment problem
  • they read coefficients without holding other variables constant
  • they memorize assumptions as a list but do not connect them to residual evidence
  • they treat statistical output as economically meaningful before asking whether the model specification makes sense

The stronger reader starts with the model purpose.

The Multiple-Regression Frame

When the curriculum writes a multiple-regression model, the basic structure is:

$$ Y_i = b_0 + b_1 X_{1,i} + b_2 X_{2,i} + \cdots + b_k X_{k,i} + \varepsilon_i $$

The point is not the notation itself. The point is that each estimated coefficient is interpreted while holding the other included variables fixed.

Start With The Investment Question

Question typeTypical regression use
What explains cross-sectional stock returns?Multiple independent variables may capture risk, valuation, or style characteristics
What drives credit spread changes?Regression can separate benchmark-rate, liquidity, and issuer effects
What influences firm value or profitability?Explanatory variables can test operating, leverage, or market drivers

If the analyst does not know what the model is trying to explain, the output quickly becomes decorative.

Coefficient Interpretation Is Conditional

Output elementWhat it meansCommon Level II trap
InterceptPredicted value of the dependent variable when all included independent variables equal zeroTreating it as economically meaningful when zero is not a realistic state
Slope coefficientEstimated change in the dependent variable for a one-unit change in that variable, holding others constantForgetting the ceteris paribus condition
Sign of coefficientDirection of estimated relationAssuming sign alone proves causal logic
MagnitudeEstimated sensitivityIgnoring units and scale

Level II often tests whether the candidate reads the coefficient in the right economic units rather than simply calling it positive or negative.

The Assumptions Matter Because They Support Interpretation

Assumption areaWhy it matters
Linearity in parametersSupports the way the model is specified and estimated
Independent-variable variationThe model cannot learn much from variables that barely move
Residual propertiesHelp determine whether inference is reliable
Stable relation between variablesMakes the coefficients worth interpreting

The exam does not want a legalistic recital. It wants you to see why assumption failure weakens the result.

Residual Plots Are Diagnostic Evidence

Residual evidence is often the first clue that a pleasant-looking regression table should not be trusted.

Residual patternWhat it may suggest
Funnel-shaped spreadHeteroskedasticity
Clear curvatureFunctional-form misspecification
Clustering through timeSerial correlation or regime behavior
Extreme isolated observationsInfluential points or outliers

That is why the vignette may include a plot instead of another line of numeric output.

Statistical Significance And Economic Usefulness Are Different

A coefficient can be statistically significant and still be economically weak. It can also be economically important but estimated imprecisely because the sample is small or noisy.

Level II often uses that distinction to separate careful readers from table skimmers.

How CFA-Style Questions Usually Test This

  • by asking which dependent variable or set of independent variables matches the investment problem
  • by asking for the correct interpretation of a coefficient in words
  • by showing a residual plot that indicates a violated assumption
  • by tempting the candidate to overstate what a statistically significant coefficient proves

Mini-Case

An analyst regresses excess stock returns on size, leverage, and book-to-market. The size coefficient is negative and statistically significant.

A weak answer says small firms therefore cause lower returns.

A stronger answer says the model estimates a negative association between the size variable and returns, conditional on the other included variables, and then asks whether the specification and economic rationale support using that estimate.

Common Traps

  • confusing association with causation
  • interpreting a coefficient without holding other variables constant
  • treating the intercept as economically central when it is not
  • ignoring residual evidence because the regression table looks clean

Sample CFA-Style Question

What is the best interpretation of a slope coefficient in a multiple regression?

Best answer: It estimates the change in the dependent variable associated with a one-unit change in that independent variable, holding the other included variables constant.

Why: Level II often tests the conditional nature of coefficient interpretation more than the algebra itself.

Continue In This Chapter

Revised at Thursday, April 9, 2026