Multiple Regression, Assumptions, and Coefficient Logic

April 9, 2026

How Level II tests multiple-regression setup, coefficient interpretation, underlying assumptions, and residual-based judgment.

On this page

Level II regression questions are not mainly about pushing buttons. They are about deciding what the model is trying to explain, what each coefficient really means, and whether the assumptions needed for interpretation are even close to plausible.

Why This Lesson Matters

Candidates often miss regression item sets for one of four reasons:

they cannot identify the dependent variable that actually matters for the investment problem
they read coefficients without holding other variables constant
they memorize assumptions as a list but do not connect them to residual evidence
they treat statistical output as economically meaningful before asking whether the model specification makes sense

The stronger reader starts with the model purpose.

The Multiple-Regression Frame

When the curriculum writes a multiple-regression model, the basic structure is:

$$ Y_i = b_0 + b_1 X_{1,i} + b_2 X_{2,i} + \cdots + b_k X_{k,i} + \varepsilon_i $$

The point is not the notation itself. The point is that each estimated coefficient is interpreted while holding the other included variables fixed.

Start With The Investment Question

Question type	Typical regression use
What explains cross-sectional stock returns?	Multiple independent variables may capture risk, valuation, or style characteristics
What drives credit spread changes?	Regression can separate benchmark-rate, liquidity, and issuer effects
What influences firm value or profitability?	Explanatory variables can test operating, leverage, or market drivers

If the analyst does not know what the model is trying to explain, the output quickly becomes decorative.

Coefficient Interpretation Is Conditional

Output element	What it means	Common Level II trap
Intercept	Predicted value of the dependent variable when all included independent variables equal zero	Treating it as economically meaningful when zero is not a realistic state
Slope coefficient	Estimated change in the dependent variable for a one-unit change in that variable, holding others constant	Forgetting the ceteris paribus condition
Sign of coefficient	Direction of estimated relation	Assuming sign alone proves causal logic
Magnitude	Estimated sensitivity	Ignoring units and scale

Level II often tests whether the candidate reads the coefficient in the right economic units rather than simply calling it positive or negative.

The Assumptions Matter Because They Support Interpretation

Assumption area	Why it matters
Linearity in parameters	Supports the way the model is specified and estimated
Independent-variable variation	The model cannot learn much from variables that barely move
Residual properties	Help determine whether inference is reliable
Stable relation between variables	Makes the coefficients worth interpreting

The exam does not want a legalistic recital. It wants you to see why assumption failure weakens the result.

Residual Plots Are Diagnostic Evidence

Residual evidence is often the first clue that a pleasant-looking regression table should not be trusted.

Residual pattern	What it may suggest
Funnel-shaped spread	Heteroskedasticity
Clear curvature	Functional-form misspecification
Clustering through time	Serial correlation or regime behavior
Extreme isolated observations	Influential points or outliers

That is why the vignette may include a plot instead of another line of numeric output.

Statistical Significance And Economic Usefulness Are Different

A coefficient can be statistically significant and still be economically weak. It can also be economically important but estimated imprecisely because the sample is small or noisy.

Level II often uses that distinction to separate careful readers from table skimmers.

How CFA-Style Questions Usually Test This

by asking which dependent variable or set of independent variables matches the investment problem
by asking for the correct interpretation of a coefficient in words
by showing a residual plot that indicates a violated assumption
by tempting the candidate to overstate what a statistically significant coefficient proves

Mini-Case

An analyst regresses excess stock returns on size, leverage, and book-to-market. The size coefficient is negative and statistically significant.

A weak answer says small firms therefore cause lower returns.

A stronger answer says the model estimates a negative association between the size variable and returns, conditional on the other included variables, and then asks whether the specification and economic rationale support using that estimate.

Common Traps

confusing association with causation
interpreting a coefficient without holding other variables constant
treating the intercept as economically central when it is not
ignoring residual evidence because the regression table looks clean

Sample CFA-Style Question

What is the best interpretation of a slope coefficient in a multiple regression?

Best answer: It estimates the change in the dependent variable associated with a one-unit change in that independent variable, holding the other included variables constant.

Why: Level II often tests the conditional nature of coefficient interpretation more than the algebra itself.

Continue In This Chapter

Next lesson: Model Fit, Misspecification, and Regression Extensions
Related topic: Valuation Process, Required Return, and Model Choice
Back to chapter root: Quantitative Methods

Revised at Thursday, April 9, 2026

2.2 Fit, Misspecification & Extensions

Browse CFA Level II Study Guide