Files
alknet/docs/architecture/decisions/019-vault-assembly-layer-only.md
glm-5.2 7dda6eec68 docs(architecture): add ADR-025 — vault local-only dispatch, drop irpc
Drops irpc from alknet-vault entirely. The vault's dispatch is now direct
method calls on VaultServiceHandle — no VaultProtocol enum, no
VaultMessage, no VaultServiceActor, no mpsc channel, no Service trait, no
RemoteService trait, no postcard serialization. The vault is local-only by
construction.

The core security argument: irpc made the vault remote-capable by default
(RemoteService generated unless no_rpc is passed). The IrohProtocol handler
forwards all messages without auth. The docs framed 'register an ALPN' as a
server-setup change. This is the default-insecure anti-pattern — security
should be opt-in, not opt-out. ADR-025 inverts the default: local-only is
the only mode, and remote access requires building a separate vault-server
crate (a visible architectural act, not a flag flip).

The actor path was already dead code — service.md said 'prefer
VaultServiceHandle directly — no channel, no serialization.' The actor
existed only to make irpc's Service trait work, which existed only to make
RemoteService work, which was the footgun. VaultServiceHandle's
Arc<RwLock> provides concurrent reads and exclusive writes — better
throughput than the actor's sequential processing.

DerivedKey serialization simplifies: always redact on serialize (for
logging safety), reject '[REDACTED]' on deserialize with an error. No
'postcard preserves bytes' path. This resolves review #002 W8 (silent
corruption on JSON-deserialized DerivedKey).

Resolves:
- OQ-21: remote vault access — resolved (not deferred). Not a vault crate
  feature; if needed, a separate vault-server crate with its own ADR.
- C7: vault-server-crate question decided — not created now, not precluded.
- C8: operation access policy table dissolved — all operations local-only
  by default; if a vault-server crate exposes some remotely, that crate
  defines the policy.
- W8: DerivedKey JSON deserialization — resolved (reject redacted payloads).

Amends ADR-005 (irpc remains for alknet-call, not for alknet-vault),
ADR-018 (vault is even more standalone — zero RPC framework deps),
ADR-019 (vault is the only layer, not just the only direct-caller layer),
ADR-008 (vault integration point unchanged, but now local-only by
construction).
2026-06-22 14:53:52 +00:00

7.5 KiB

ADR-019: Vault Assembly-Layer-Only Access

Status

Accepted

Context

ADR-008 established that the vault is a capability source — the CLI binary unlocks it at startup, derives and decrypts the credentials each handler needs, and injects the results into handler capabilities. ADR-014 specified the injection mechanism (Capabilities on OperationContext) and locked the constraint that no vault operations are registered in the call protocol.

These ADRs answer how the vault integrates with the rest of alknet. This ADR answers a narrower question that the vault's own spec needs to be explicit about: what is the vault's access model from its own perspective?

The vault provides a VaultServiceHandle with unlock, lock, derive_ed25519, derive_encryption_key, derive_ethereum_key, derive_password, encrypt, and decrypt methods. Who is allowed to call these, and through what path?

The candidates:

  1. Handlers call the vault directly — each handler holds a VaultServiceHandle and derives keys at call time. This was the pre-ADR-008 model and is rejected: it exposes the vault to every handler, requires the vault to enforce per-handler path restrictions itself, and means the master seed is reachable from every call path.

  2. The call protocol exposes vault operationsvault/derive, vault/decrypt, vault/unlock registered as operations. This was the contradiction ADR-014 resolved: the master seed and mnemonics would cross the wire.

  3. The assembly layer is the sole caller — the CLI binary (or an embedded assembly layer) holds the VaultServiceHandle, calls vault methods at startup and (rarely) at call time through scoped capabilities, and injects results into handlers. Handlers never hold a vault reference.

Decision

The assembly layer is the sole direct caller of the vault. This restates ADR-008/ADR-014 from the vault's perspective and makes the access model explicit in the vault's own spec.

What the assembly layer does

At startup:

  1. Constructs VaultServiceHandle::new()
  2. Unlocks with a mnemonic (from a secure prompt, a file, or a hardware token) and optional passphrase
  3. Derives the keys each handler needs (identity, SSH host, TLS identity, signing keys)
  4. Decrypts the credentials each handler needs (LLM provider API keys, OAuth tokens)
  5. Constructs handlers with the derived/decrypted material injected into their Capabilities
  6. Registers the handlers in the OperationRegistry
  7. Starts the endpoint

After startup, the vault is typically not called again. The common case is construction-time injection — a handler holds a static decrypted API key for its lifetime.

What handlers do NOT do

Handlers never:

  • Hold a VaultServiceHandle reference
  • Call derive_*, encrypt, or decrypt directly
  • Receive the master seed or mnemonic
  • Import alknet_vault as a dependency

Handlers receive secret material through OperationContext.capabilities (ADR-014). The Capabilities type holds non-serializable, zeroized secret material that the assembly layer populated at construction time.

The scoped-capability exception

The narrow exception is a handler that needs a child key at an unpredictable path determined by call input (e.g., signing for a specific GitHub repo). This handler receives a scoped capability — a restricted handle that performs a specific derivation at a restricted path set and returns the result in-process. The handler never sees the master seed and never holds a full VaultServiceHandle.

The scoped capability is still a capability (it lives on OperationContext.capabilities), not a vault reference. Whether it is a distinct type or a pre-derived key injected at construction is a two-way door for the alknet-call and alknet-agent crate specs (ADR-014).

No vault operations on the wire

The vault has no ALPN (ADR-003, ADR-008). No vault operation is registered in the call protocol's OperationRegistry (ADR-014). The master seed, mnemonics, and derived private keys never appear in call.requested payloads, call.responded payloads, or OperationContext.metadata (ADR-014).

If a future use case requires exposing a vault operation over the call protocol (e.g., a restricted vault/public-key operation that returns only public key material for identity verification), it requires its own ADR with an explicit threat model justification. This decision does not close that door; it simply does not open it.

Consequences

Positive:

  • The master seed is reachable from exactly one place: the assembly layer. The attack surface for the root of trust is a single process boundary, not a distributed set of handlers.
  • Handlers don't need to enforce path restrictions — they don't have the vault. The scoped-capability mechanism enforces restrictions by construction.
  • The vault's API is consumed by one caller. This simplifies the vault's threat model: it doesn't need per-caller authentication, rate limiting, or path-based access control. The assembly layer is trusted.
  • The vault can be tested in isolation — VaultServiceHandle::new()unlock_new(24)derive_* is the test pattern, with no networking or handler mockery.

Negative:

  • The assembly layer has more construction-time responsibility: it must know which handlers need which credentials and wire them. This is expected — the CLI assembles everything (ADR-008).
  • Adding a new handler that needs a new credential requires updating the assembly layer, not just registering an operation. This is a feature: it forces an explicit decision about what secret material a handler needs.
  • Remote vault administration (unlock a running node's vault over the network) is not supported. The vault is local-only by construction (ADR-025) — no remote dispatch capability exists in the vault crate. If remote vault access is needed in the future, it requires a separate vault-server crate that depends on both alknet-core (for auth) and alknet-vault (for the handle), with a heavily restricted mechanism (admin scope, mTLS-only, never expose the mnemonic over an unauthenticated channel) and its own ADR.

Assumptions

  1. The assembly layer is trusted. The CLI binary holds the vault handle and is the trust boundary. If the assembly layer is compromised, all handlers' capabilities are compromised. This is the same trust boundary as ADR-008 and ADR-014.

  2. Handlers need credentials at construction time or at call time, not dynamically discovered at call time. If a handler needs to derive a key at an unpredictable path determined by call input, the scoped-capability model covers it (the handler holds a scoped vault access), but the surface area is larger. The assumption is that this case is rare.

  3. No legitimate use case requires returning a private key over the wire. Public key sharing (identity verification, encryption to a recipient) is the only cross-node key material flow. If a use case for returning a private key emerges (e.g., a key-escrow service), it needs its own ADR and a very different threat model.

References

  • ADR-003: Crate decomposition (alknet-vault is standalone)
  • ADR-008: Vault integration point (CLI-embedded, capability source)
  • ADR-014: Secret material flow and capability injection (the injection mechanism this ADR relies on)
  • ADR-018: Vault as standalone crate (the independence this ADR preserves)
  • crates/vault/service.md
  • crates/vault/README.md