Module 1 of 13

Understanding Zero Trust Security

Why perimeter security fails and how identity-based security changes everything

3 hours2 labsFree

Watch as Slides Course overview Lab code

Start here

Learning objectives

Understand why traditional perimeter security fails in cloud-native systems
Learn the core principles of Zero Trust architecture
Differentiate between human identity and workload identity
Understand service-to-service authentication challenges

Before

Static secrets in environment variables
Shared certificates across services
Manual certificate rotation (or none at all)
Network location used as proxy for identity
IP-based firewall rules that break with auto-scaling

After

Automatic cryptographic workload identity
Unique short-lived certificate per service
Automatic rotation before expiry
Cryptographic verification on every request
Identity follows the workload regardless of location

For decades, enterprise security followed a simple model: build a strong perimeter, trust everything inside it. Firewalls guarded the boundary. VPNs extended the trusted zone. If a request came from inside the network, it was assumed to be legitimate.

This model is fundamentally broken in cloud-native systems. Containers spin up and down in seconds. Pods get random IP addresses. Services communicate across clusters, clouds, and regions. The “perimeter” does not exist anymore.

The Evolution of Enterprise Security

Enterprise security evolved through three eras:

Castle-and-Moat (1990s-2000s): Strong perimeter with firewalls. Trust everything inside the network. Works when servers are static and physical.
Defense in Depth (2000s-2010s): Multiple security layers. Network segmentation with VLANs. Still fundamentally trusts network location as a proxy for identity.
Zero Trust (2010s-present): Trust nothing. Verify every request. Identity is cryptographic, not network-based. Every service must prove who it is on every request.

Why Perimeter Security Fails

In a Kubernetes cluster, a pod’s IP address changes every time it restarts. Auto-scaling creates new replicas with new IPs. Service mesh proxies forward traffic, masking the original source. Network policies can restrict which namespaces can communicate, but they cannot verify which specific service is making the request.

The fundamental problem: network location is not identity. An attacker who compromises one pod in a namespace can impersonate any other pod in that namespace because network policies only check source/destination labels, not cryptographic identity.

Core Principles of Zero Trust

Never trust, always verify: Every request must be authenticated and authorized, regardless of source location.
Least privilege access: Services get only the permissions they need, nothing more.
Assume breach: Design systems so that a compromised component cannot access everything.
Verify explicitly: Use cryptographic identity, not network properties, to authenticate services.
Continuous verification: Identity is verified on every request, not just at connection establishment.

Human Identity vs Workload Identity

We solved human identity years ago with SSO, OAuth, and multi-factor authentication. A developer logs in with their credentials and gets a token that proves who they are. But what about the services themselves?

Workload identity gives every service, container, and process its own cryptographic identity - just like a human has a username and password, a workload gets a cryptographic certificate. This identity is automatically issued, short-lived, automatically rotated, and cryptographically verifiable without a central authority.

The Identity Crisis in Distributed Systems

Without workload identity, teams resort to shared secrets in environment variables (leaked in logs, source control, and crash dumps), long-lived API keys (never rotated, shared across environments), service account tokens (broad permissions, no mutual verification), and IP-based allow lists (break with auto-scaling and container orchestration).

Each of these is a security vulnerability waiting to happen. SPIFFE and SPIRE replace all of them with a single, standards-based identity system.

Zero Trust Maturity Model

Level	Description	Technologies
Level 0	No identity verification between services	Flat network, shared secrets
Level 1	Network segmentation and basic auth	NetworkPolicies, API keys
Level 2	Service-level mTLS with long-lived certs	cert-manager, Vault PKI
Level 3	Automatic workload identity with short-lived certs	SPIFFE/SPIRE
Level 4	Identity-aware authorization with policy engines	SPIRE + OPA + Envoy

This course takes you from Level 0 to Level 4.

What SPIFFE/SPIRE Does NOT Solve

Honest understanding of limitations builds real expertise:

Does not stop compromised workloads: If an attacker has code execution inside a legitimate workload, SPIFFE identity does not prevent the attacker from using that identity. Runtime security (Falco, seccomp) handles that layer.
Does not automatically provide authorization: SPIFFE proves identity. It does not decide what that identity is allowed to do. You need a policy engine (OPA) for authorization.
Does not replace runtime security: Container escape, privilege escalation, and lateral movement are addressed by runtime tools, not identity systems.
Does not eliminate policy design complexity: You still need to design who can talk to whom. SPIFFE makes enforcement possible - it does not design the rules for you.
Does not prevent all secret sprawl: Application-level secrets (database passwords, API keys for external services) still need management via Vault or similar tools.

Authentication vs Authorization: The Critical Distinction

Responsibility	Tool	Question Answered
Identity	SPIFFE (spec)	What is this workload called?
Identity Issuance	SPIRE (runtime)	How does it get its identity?
Authentication	mTLS / SVID verification	Is this really who they claim to be?
Authorization	OPA / Policy Engine	Is this workload allowed to do this action?

Many learners wrongly assume SPIFFE equals authorization. It does not. SPIFFE handles identity and authentication. Authorization is a separate concern addressed in Module 7.

Real world

Where this shows up

Kubernetes workload authentication - proving pod identity without shared secrets
Secure service mesh identity - mTLS between all services in a cluster
Multi-cluster trust - workloads across clusters verifying each other
Secure CI/CD pipelines - ephemeral build agents with cryptographic identity
AI agent identity - autonomous AI systems authenticating to services

Production notes

Keep these close

Network policies alone are NOT zero trust. They restrict traffic by label but cannot verify cryptographic identity. An attacker who compromises one pod with the right labels can impersonate any service.
Start your zero trust journey with one critical service path (e.g., API → database), not the entire mesh. Incremental rollout reduces risk and builds team confidence.

Common mistakes

What usually breaks

Treating network location as identity - IPs and namespaces are not cryptographic proof
Using long-lived shared secrets in environment variables - they leak in logs, crash dumps, and source control
Assuming Kubernetes NetworkPolicies provide zero trust - they only check labels, not cryptographic identity
Trying to implement zero trust everywhere at once instead of incrementally
Confusing authentication (who are you?) with authorization (what can you do?)

Security risks

Threats to watch

Shared secrets in environment variables can be read by any process in the container
Long-lived API keys are never rotated and accumulate over time
IP-based allow lists break silently when containers restart with new IPs
Service account tokens in Kubernetes pre-1.24 never expire

Think like an engineer

Questions to answer before shipping

If an attacker compromises one pod in your namespace, what can they access?
How would you prove the identity of a service that was just auto-scaled 30 seconds ago?
What happens to your security model when you move workloads between clouds?
How is machine-to-machine authentication different from human-to-machine authentication?

Key terms

Vocabulary used in this module

Zero Trust

Security model that requires verification of every request regardless of source location

Workload Identity

Cryptographic identity assigned to a service, container, or process

mTLS

Mutual TLS - both client and server verify each other's certificates

Perimeter Security

Traditional model that trusts everything inside the network boundary

SPIFFE

Secure Production Identity Framework For Everyone - the open standard for workload identity

SVID

SPIFFE Verifiable Identity Document - a certificate or JWT proving workload identity

Labs

Hands-on labs

Exploring Traditional vs Zero Trust Architectures

Compare network-based security with identity-based security by observing service communication patterns.

Deploy two services in a Kubernetes namespace without any security
Observe that any pod can call any other pod without authentication
Add NetworkPolicies and observe the limitations
Discuss what is missing: identity verification and encryption

View lab on GitHub

Inspecting Service Communication Flows

Capture and inspect unencrypted service-to-service traffic to understand the attack surface.

Deploy a simple microservice application (frontend + API + database)
Use tcpdump to capture traffic between pods
Observe unencrypted HTTP traffic with visible headers and payloads
Document the security gaps: no encryption, no identity, no authorization