ADR-015: Pyramid Documentation Architecture

Date: 2025-11-10 Status: ✅ Accepted Deciders: CTO (Claude Code AI), CEO Tags: #documentation #architecture #organization

Context

EPGOAT's documentation had grown to 151 files with significant quality issues: - 356 CRITICAL issues (broken links, duplicates, inconsistent organization) - 93 broken links - No clear navigation or audience routing - Organized by content-type (Guides, Reference, Troubleshooting) - industry standard but not optimal for our needs

Dual-Audience Challenge: - AI assistants need compressed, token-efficient docs for fast system understanding - Humans need verbose, example-rich docs for learning and onboarding - These needs are fundamentally opposed (compression vs verbosity)

CEO Directive: "Build world-class, unrivaled documentation with perfect organization (novel, logical, systematic)"

Decision

Implement a 5-layer Pyramid Architecture organized by audience and urgency, not content type:

📍 Layer 0: SIGNAL   - CEO ↔ CTO Communication (<5K tokens)
🤖 Layer 1: CONTEXT  - AI System Reference (<50K tokens)
🔧 Layer 2: WORK     - Daily Engineering (quick access)
📚 Layer 3: LEARN    - Educational HTML (humans only)
🏛️ Layer 4: ARCHIVE  - Historical Record (completed work)

Layer Purposes:

Layer 0 (SIGNAL): Executive communication
CEO-INBOX.md: CEO's priorities and directives
CTO-UPDATES.md: Weekly progress summaries
DECISIONS-PENDING.md: Architectural decisions awaiting approval
QUARTERLY-OBJECTIVES.md: Business goals alignment
Layer 1 (CONTEXT): AI reference documentation
Ultra-compressed (<50K tokens total, 95% reduction from 200K)
Four files: _SYSTEM, _STANDARDS, _CODEBASE, _WORKFLOWS
Enables AI to load entire system context in single session
Layer 2 (WORK): Daily engineering quick-reference
Copy-paste commands
Troubleshooting guides
Active work tracking (current sprint)
Layer 3 (LEARN): Educational content for humans
Generated HTML from Layer 1 MD
Verbose with examples, diagrams, explanations
Never read by AI (to avoid token waste)
Layer 4 (ARCHIVE): Historical record
Completed projects
ADRs (Architecture Decision Records)
Legacy documentation
Immutable, timestamped

Rationale

Why Audience-First Organization?

Traditional content-type organization (Guides, Reference, Troubleshooting): - ❌ Doesn't account for different audience needs (AI vs humans) - ❌ Leads to duplication (same info in Guide and Reference) - ❌ Unclear routing ("Is this a guide or reference?") - ❌ No token budget awareness

Pyramid (audience-first) organization: - ✅ Clear routing: "Who am I?" → Go to your layer - ✅ Optimized for each audience (compressed for AI, verbose for humans) - ✅ No duplication (each layer has distinct purpose) - ✅ Token budget enforced per layer

Why Pyramid Shape?

Inverted priorities: - Top (SIGNAL): Smallest, highest priority → CEO visibility - Layer 1 (CONTEXT): Small, high priority → AI fast loading - Layer 2 (WORK): Medium, daily use → Engineer quick access - Layer 3 (LEARN): Large, occasional use → Onboarding, learning - Layer 4 (ARCHIVE): Largest, rarely accessed → Historical record

Flow: Work flows UP the pyramid (completed projects → archive → decisions → CEO updates)

Why <50K Token Budget for Layer 1?

Claude Code has 200K token context limit
Layer 1 at ~11K tokens (22% of 50K budget) leaves 189K for code
Enables loading entire system context + significant codebase in one session
95% compression (200K → 11K) maintains all critical information

Alternatives Considered

Alternative 1: Diátaxis Framework (Tutorials, How-tos, Reference, Explanation)

Rejected because: - Doesn't solve dual-audience problem (AI vs humans) - No token budget awareness - Industry standard but not "novel" as CEO requested - Doesn't provide CEO-CTO communication layer

Alternative 2: Docusaurus/GitBook (Standard Doc Platforms)

Rejected because: - Optimized for humans only, not AI - No token compression - Platform lock-in - Doesn't support dual format (MD + HTML)

Alternative 3: Single Comprehensive Guide

Rejected because: - Would exceed token budget massively - No audience routing - Slow to navigate - Doesn't scale

Alternative 4: Wiki-Style (Lots of Small Pages)

Rejected because: - Fragmentation makes AI loading difficult - Link maintenance nightmare - No clear hierarchy - Hard to enforce standards

Consequences

Positive

Clear Routing: Every user (CEO, AI, Engineer, Learner, Historian) knows exactly where to go
Token Efficiency: AI can load entire system (11K tokens) in seconds
Dual Format Achieved: Compressed MD for AI, verbose HTML for humans
CEO Visibility: SIGNAL layer ensures executive priorities are always visible
Scalable: Each layer can grow independently without affecting others
Quality Enforced: Token budget, link validation automated per layer
Novel: No other company uses this organization (CEO goal achieved)

Negative

Learning Curve: New pattern requires explanation (this ADR!)
Migration Effort: 151 legacy files need categorization and migration
Two Files Per Concept: Layer 1 (compressed MD) + Layer 3 (expanded HTML)
Build System Dependency: Requires automation to keep formats in sync

Neutral

Maintenance: Automated via pre-commit hooks, build system, validation tools
Onboarding: START-HERE.md provides clear entry point with audience routing

Implementation

Phase 1 (Complete): Foundation - Created all 5 layer directories - Built Layer 0 (SIGNAL) - 4 files - Built Layer 1 (CONTEXT) - 4 files - Built Layer 2 (WORK) - 3 quick-ref files - Created Layer 4 (ARCHIVE) structure

Phase 2 (Complete): Automation - Static site generator (MD → HTML) - Link validator - Pre-commit hooks - Health dashboard

Phase 3 (In Progress): Content - Content expansion system (150+ educational expansions) - Legacy migration (42/155 files migrated)

Phase 4 (Future): Deployment - Cloudflare Pages hosting (internal.epgoat.tv) - Protected access - CDN optimization

Validation

Token Budget: ✅ - Layer 1: 11,017 / 50,000 tokens (22.0% used) - 78% headroom for future growth

Documentation Health: ✅ - Baseline: 15.0/100 - Current: 92.0/100 (+77 points)

User Feedback: ✅ - CEO: "I love this! This really moves us into a CEO - CTO relationship" - Clear enthusiasm for novel approach

Technical Validation: ✅ - Build system operational - Pre-commit hooks preventing degradation - Zero broken links in Pyramid layers

References

00-START-HERE.md - Entry point with audience routing
README.md - Comprehensive explanation of Pyramid system
HEALTH.md - Quality metrics dashboard
Living Document - Project tracking

Review Schedule

Next Review: After Phase 4 (Cloudflare deployment) complete
Review Trigger: If health score drops below 85/100
Success Criteria: Health score ≥95/100, all migrations complete

Notes

This architecture is the foundation of EPGOAT's documentation system. It solves the dual-audience problem (AI vs humans) in a novel way that aligns with the CEO's vision for "world-class, unrivaled" documentation.

The pyramid shape (inverted priority) ensures the most important information (CEO directives, AI context) is at the top, while large historical records sink to the bottom.

Key insight: Organizing by WHO (audience) is more effective than organizing by WHAT (content type) for dual-purpose documentation systems.