Files

T

Rodin 76f4bcc33e docs: architectural analysis of top repos (CockroachDB, Prometheus, Ecto, Oban)

Four documents examining codebases at module and ecosystem levels:
- architectural-analysis.md — internal structure, dependency flow
- ecosystem-analysis.md — consumer extension points, deliberate absences
- crosscutting-analysis.md — logging, config, retry, lifecycle
- testing-evolution-analysis.md — proof models, API evolution strategies

2026-04-30 10:50:54 -07:00

10 KiB

Raw Permalink Blame History

Architectural Patterns from Top Repos

CockroachDB: How to Organize 20,000 Files

The 116-Package Principle

CockroachDB has 116 packages under pkg/util/ averaging 4 files each. This is deliberate:

Force: A 2M-line codebase where developers work on different subsystems simultaneously. If pkg/util were 5 big packages, every PR would conflict.

Pattern: One concept = one package. circuit/ is 3 files (breaker, options, signal). quotapool/ is 5 files. stop/ is 2 files. The package boundary IS the API boundary — no internal debates about what is exported.

Naming: Single-concept nouns. No helpers, no common, no shared. Every package name tells you what it does: cancelchecker, ctxgroup, syncutil.

Dependency Layering

sql → kv → storage → util
 ↓     ↓       ↓
 ↓     ↓    roachpb (protobuf types)
 ↓     ↓       ↓
 ↓     keys ← util
 ↓
 settings, config

Critical insight: kv imports from sql AND sql imports from kv. They solved circular deps via interfaces + callback registration — not by eliminating the cycle. The internal/ package provides the bridge.

storage imports kv (for transaction types) but kv also imports storage. Again, interface boundaries break the cycle at compile time.

Lesson: Perfect layering is impossible in distributed databases. The real skill is knowing where to put the interface that breaks the cycle.

Error Handling at Scale

They use github.com/cockroachdb/errors — their own library that extends stdlib errors with:

Error marks: Tag errors with metadata without changing the error chain
Wrapping with causes: errors.Wrap(err, "context")
Safe printing: redact.Sprint for log-safe errors
Network encoding: Errors serialize across RPC boundaries

Pattern: Errors are first-class data that flows through the entire system, surviving serialization across nodes. Not just strings — structured, typed, matchable.

Circuit Breaker (not stdlib)

type Breaker struct {
    mu struct {
        syncutil.RWMutex
        errAndCh *errAndCh  // stable Signal() results
        probing  bool
    }
}

Key design: Signal() returns a channel + error getter (like context.Done() + context.Err()). The channel is stable — closing it doesn't affect callers who already have a reference. New callers get a new channel after reset.

Force: In a distributed DB, a broken replica should fail-fast all pending requests, then probe for recovery. Context cancellation isn't enough because you need to distinguish "gave up waiting" from "system is broken."

QuotaPool: Abstract Resource Allocation

type Resource interface{}
type Request interface {
    Acquire(ctx context.Context, r Resource) (
        fulfilled bool, tryAgainAfter time.Duration)
    ShouldWait() bool
}

Pattern: The pool is generic over any resource type. Concrete implementations include:

IntPool — weighted semaphore with FIFO ordering
Rate limiters (via tryAgainAfter)
Token buckets

Force: Different subsystems need different quota types but the same queueing/fairness semantics. Abstract once, instantiate many.

Prometheus: Interface-Driven Storage Architecture

The Contract Layer

storage/interface.go defines 15+ interfaces that form the entire query/storage contract:

Storage (top level)
├── Appendable → Appender (write path)
├── Queryable → Querier (read path)
├── ChunkQueryable → ChunkQuerier (bulk read)
├── ExemplarStorage (exemplars)
└── Searcher (experimental)

Force: Prometheus must support:

Local TSDB (the main implementation)
Remote read/write (federation)
Recording rules (virtual series)
Testing (mock implementations)

All through the same interface. The contract layer is the single point of truth for "what does storage mean."

Compile-Time Interface Verification

var _ storage.GetRef = &headAppender{}
var _ storage.Searcher = &blockBaseQuerier{}

Prometheus uses this pattern 8 times in tsdb/ alone. Every concrete type that claims to satisfy a storage interface proves it at compile time.

Why this matters at scale: Storage interfaces evolve. When Searcher was added, every type that should implement it needed updating. The var _ pattern makes the compiler tell you what you missed.

Plugin Discovery via Channel

type Discoverer interface {
    Run(ctx context.Context, up chan<- []*targetgroup.Group)
}

Brilliance: The entire service discovery system is one interface with one method. Consul, DNS, Kubernetes, AWS — all implement Run. They push target groups through a channel. The manager multiplexes.

Force: Prometheus supports 20+ discovery mechanisms. Adding one should require zero changes to the core. The channel-based push model means the manager never polls.

Atomic File Operations

Block lifecycle uses filesystem conventions:

.tmp-for-creation — incomplete write
.tmp-for-deletion — incomplete delete

On startup, scan and clean up. No WAL needed for block-level operations because rename is atomic on POSIX.

Force: TSDB blocks are large (hours of data). A WAL for block operations would be overkill. The suffix convention gives crash consistency with zero overhead.

Ecto: Composability Through Data

Query as Accumulating Struct

defstruct prefix: nil, sources: nil, from: nil,
          joins: [], wheres: [], select: nil,
          order_bys: [], limit: nil, offset: nil,
          group_bys: [], updates: [], havings: [],
          preloads: [], distinct: nil, lock: nil,
          windows: [], with_ctes: nil

Every query operation appends to a list or sets a field. Nothing is executed. The struct accumulates intent until Repo.all/Repo.one triggers planning + execution.

Force: Queries must be composable (build in one module, filter in another, paginate in a third). If operations executed immediately, composition would require the entire DB context at every step.

Macro → Builder → Planner Pipeline

User writes: from(u in User, where: u.age > 18)
                     ↓
Macro expands: Builder.Filter.build(query, expr, env)
                     ↓
Builder produces: %Ecto.Query.BooleanExpr{...}
                     ↓
Planner resolves: types, bindings, params
                     ↓
Adapter generates: SQL string

Each builder module handles one clause type. There are 15 builder modules (from, join, filter, select, etc.). The planner doesn't know about SQL — it resolves the query struct into a normalized form that any adapter can consume.

Force: Support multiple databases (Postgres, MySQL, SQLite) with the same query language. The adapter is the only part that knows SQL dialect.

Protocol for Extensibility

Ecto.Queryable protocol lets you pass:

A module atom (User) → resolved to schema query
A string ("users") → raw table
A tuple ({"filtered_users", User}) → view + schema
An Ecto.Query struct → identity

Force: Repo.all(X) should work with any "queryable thing." New queryable types can be added without touching Repo code.

Oban: Architecture for Testability

Engine Swap by Config

def get_engine(%{engine: engine, testing: :disabled}), do: engine
def get_engine(%{testing: :inline}), do: Oban.Engines.Inline
def get_engine(%{testing: :manual}), do: engine

Three modes:

disabled (production) — real engine
inline (unit test) — execute in caller process
manual (integration) — enqueue but don't execute

Force: Background jobs are inherently untestable without process control. Rather than making tests async (flaky), make the engine deterministic.

Flat Supervision with Named Registry

children = [
  {Notifier, conf: conf, name: Registry.via(name, Notifier)},
  {Nursery, conf: conf, name: Registry.via(name, Nursery)},
  {Peer, conf: conf, name: Registry.via(name, Peer)},
  {Sonar, conf: conf, name: Registry.via(name, Sonar)},
  {Harbor, conf: conf, name: Registry.via(name, Harbor)}
]

Every child gets its config via conf: and its identity via Registry.via. This means:

Multiple Oban instances can run in the same VM
Tests can start isolated Oban supervisors
No global state — everything is namespaced

Force: Libraries can't own global names. Enterprise apps run multiple Oban instances (different repos, different queues). The Registry pattern makes this possible without process naming conflicts.

Behaviour as Plugin Contract

# Plugin must be a GenServer AND implement these:
@callback start_link([option()]) :: GenServer.on_start()
@callback validate([option()]) :: :ok | {:error, String.t()}

Force: Plugins need lifecycle management (start, stop, crash recovery) AND configuration validation. By requiring both a behaviour AND OTP compliance, Oban gets:

Fault isolation (supervisor restarts crashed plugins)
Config validation at startup (fail fast)
No coupling (any GenServer works)

Cross-Cutting Insights

1. Interfaces at Boundaries, Structs Internally

All four codebases define interfaces at system boundaries (storage, engine, discovery) but use concrete types internally. The interface is the published contract; the struct is the implementation detail.

2. Config as Validated Struct, Not Map

Every system validates config at startup and stores it as a typed struct. Never a raw map floating around.

3. Testing is an Architecture Decision

Oban's engine swap, CockroachDB's stopper tracking, Prometheus's mock interfaces — testability isn't bolted on, it's designed in from day one.

4. Composition via Data, Not Inheritance

Ecto queries accumulate as data. Prometheus discoverers push through channels. CockroachDB quota requests are data objects. Nobody uses class hierarchies.

5. The Cycle Problem is Solved with Interfaces

CockroachDB has circular dependencies between sql↔kv↔ storage. They break cycles with interface packages that both sides depend on. This is the only way at scale.

6. Small Packages > Large Packages

CockroachDB: 4 files average per package. Oban: focused modules (engine, worker, plugin). Ecto: one builder per clause type. The package boundary forces you to define the API.

10 KiB Raw Permalink Blame History