Module 5 of 16

Intermediate Models

Build reusable transformation steps without exposing half-finished business tables.

95 minutes1 exercisesFree

Watch as Slides Course overviewInline lab below

Start here

Learning objectives

Know when to create an intermediate model
Separate reusable logic from final reporting shape
Reduce duplication across marts

The Mental Model

Intermediate models hold reusable transformation logic that is too complex for staging but not final enough for business users.

Intermediate models are the prep bowls in a kitchen. They are useful while cooking, but you do not serve them as the final dish.

Tiny Example

We will use a small ecommerce dataset throughout the course. Think of these as the only tables in your first warehouse:

Table	Grain	Example columns
`raw_orders`	one row per order event	`order_id`, `customer_id`, `amount`, `status`, `created_at`
`raw_order_items`	one row per item inside an order	`order_id`, `product_id`, `quantity`, `item_price`
`raw_customers`	one row per customer	`customer_id`, `email`, `country`, `created_at`

Interactive Check

Question: Two marts need the same order refund calculation. Should both copy the SQL?

Reveal the answer

No. Put the shared refund logic in an intermediate model, then let both marts ref it.

Inline Practice Lab

This lab is intentionally small. You can solve it by reading the table, writing the SQL/YAML mentally, or pasting the snippet into any SQL scratchpad later.

-- Example starter table
select
  order_id,
  customer_id,
  amount,
  status,
  created_at
from raw_orders;

The goal is not tooling setup. The goal is learning the production habit: state the grain, clean one thing, test one assumption, and explain the downstream impact.

Self-Check Quiz

What is the grain of the table you are building?
Which downstream metric or dashboard would be wrong if this model broke?
What test would catch the most likely beginner mistake here?

Real world

Where this shows up

Reliable executive dashboards that do not disagree across teams
AI analytics agents that query governed metrics instead of guessing SQL
Auditable metric changes where owners can see downstream impact before merge

Production notes

Keep these close

Intermediate models are useful, but too many create a maze. Each one should remove real duplication or clarify complex logic.

Common mistakes

What usually breaks

Creating intermediate models for every tiny SELECT
Letting BI tools query intermediate models directly
Hiding important business definitions without documentation

Think like an engineer

Questions to answer before shipping

Can you explain the grain of this model in one sentence?
What breaks downstream if this field becomes null tomorrow?
Where should this logic live so it is reused instead of copied?

Key terms

Vocabulary used in this module