docs: correct ecosystem dependency direction and add integration context

Architecture docs previously referenced the hub as the authoritative source for call/identity specs. In reality, call protocol, identity, and access control come from @alkdev/operations; call graph schemas from @alkdev/flowgraph; task graph schemas from @alkdev/taskgraph; event transport from @alkdev/pubsub. The hub is a consumer of @alkdev/storage, not the other way around. Key changes: - overview.md: add Ecosystem Integration section with dependency direction diagram, What Comes From Where table, repo layer bridging pattern, and circular dependency avoidance guidance - overview.md: promote repo-layer vs operations-bridging from open question to explicit decision (CRUD in storage, bridging in consumer) - overview.md: add zero-ecosystem-dependency statement; fix taskgraph type names (TaskGraphNodeAttributes, DependencyEdge) - overview.md: fix terminology (hub is consumer, not authority) - metagraph.md: add Ecosystem Context section; replace hub references with correct ecosystem sources; fix GraphStatus/GraphBaseType enum mischaracterization (C1); unify empty-array semantics with sqlite-host (C2); clarify repo layer does NOT import operations (C3); add flowgraph canonical schema note; add versioning cross-reference to graph_types table - encrypted-data.md: reframe hub as provenance not authority; update What Lives Where table; fix standalone table advice; update references - sqlite-host.md: fix actors table description; unify empty-array semantics; contextualize hub as reference consumer; add operations identity reference
2026-05-28 14:25:16 +00:00
parent bb544469fd
commit 33a5b0816d
4 changed files with 229 additions and 58 deletions
--- a/docs/architecture/overview.md
+++ b/docs/architecture/overview.md
@@ -57,14 +57,14 @@ the main `mod.ts` re-exports it. Importing from either `@alkdev/storage` or
 | Term                    | Definition                                                                                                                                                                                                                   |
 | ----------------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
 | **Metagraph**           | A type system where graph types define schemas, node types define data shapes within those graphs, and edge types define typed relationships. Graph instances are concrete data conforming to these type definitions.        |
-| **Hub**                 | The central service in the hub-spoke architecture. Runs PostgreSQL, hosts API endpoints, coordinates spokes, and is the authoritative data store. `@alkdev/storage`'s PostgreSQL host (not yet implemented) targets the hub. |
-| **Spoke**               | A local/embedded instance that runs per-project or per-session. Uses SQLite for local storage. `@alkdev/storage`'s SQLite host targets spokes.                                                                               |
+| **Hub**                 | The central service in the hub-spoke architecture. A consumer of `@alkdev/storage` — uses the PostgreSQL host for persistent graph storage. The hub also depends on `@alkdev/operations`, `@alkdev/pubsub`, `@alkdev/flowgraph`. |
+| **Spoke**               | A local/embedded instance that runs per-project or per-session. A consumer of `@alkdev/storage` — uses the SQLite host for local graph storage.                                                                               |
 | **Graph type**          | A class of graphs (e.g., "call-graph", "acl"). Defines structural constraints (directed/undirected/mixed, multi-edges, self-loops) and the valid node/edge type vocabularies. Stored in the `graph_types` table.             |
 | **Node type**           | A category of node within a graph type. Defines the attribute schema for nodes of that type. Stored in the `node_types` table.                                                                                               |
 | **Edge type**           | A category of edge within a graph type. Defines the attribute schema and optionally restricts which node types can be source/target. Stored in the `edge_types` table.                                                       |
 | **Graph instance**      | A concrete graph belonging to a graph type. Contains nodes and edges conforming to its type definitions. Stored in the `graphs` table.                                                                                       |
-| **Consumer**            | Code that imports `@alkdev/storage` (or a subpath) to define graph types and persist graph data. The hub and spokes are consumers.                                                                                           |
-| **Repository layer**    | ⚠️ Not yet implemented. The typed CRUD functions (insert, find, update, delete) that sit between consumer code and raw Drizzle queries. Performs schema validation before writes.                                            |
+| **Consumer**            | Code that imports `@alkdev/storage` (or a subpath) to define graph types and persist graph data. The hub, spokes, and other @alkdev packages are consumers.                                                                 |
+| **Repository layer**    | ⚠️ Not yet implemented. The typed CRUD functions (insert, find, update, delete) that sit between consumer code and raw Drizzle queries. Performs schema validation before writes. No dependency on `@alkdev/operations` — the consumer wires CRUD into the registry. |
 | **Validation boundary** | The line where schema validation is enforced. In this package, validation happens in the SchemaBuilder (at type definition time) and the repository layer (at mutation time), NOT in the database.                           |

 ## Design Decisions
@@ -137,6 +137,12 @@ in PG). This ensures every row has auditability and extensibility.
 `@alkdev/typebox` and `@alkdev/drizzlebox` are npm packages (not yet on JSR).
 JSR handles npm dependencies natively.

+**Ecosystem packages are not runtime dependencies of `@alkdev/storage`.** All
+ecosystem references in this document describe consumer-side data shapes and
+integration patterns, not import dependencies. The `@alkdev/operations`,
+`@alkdev/pubsub`, `@alkdev/flowgraph`, and `@alkdev/taskgraph` packages are
+consumed by the hub and spokes, not by storage itself.
+
 ## What Exists vs. What's Needed

 ### Implemented
@@ -151,21 +157,108 @@ JSR handles npm dependencies natively.
 | Gap                                       | Priority     | Notes                                                                                               |
 | ----------------------------------------- | ------------ | --------------------------------------------------------------------------------------------------- |
 | Encrypted data node type + crypto utility | **Critical** | ⚠️ Not yet implemented. API keys and secrets at rest. See [encrypted-data.md](./encrypted-data.md). |
-| Repository/CRUD layer                     | High         | ⚠️ Not yet implemented. Typed insert, find, update, delete functions for graphs, nodes, edges       |
+| Repository/CRUD layer                     | High         | ⚠️ Not yet implemented. Typed insert, find, update, delete functions for graphs, nodes, edges. No dependency on `@alkdev/operations` — consumer wires CRUD into registry. |
 | Tests                                     | High         | Zero tests exist. Needed before any real use.                                                       |
 | PostgreSQL host                           | Medium       | Same table shapes, `pgTable` + `jsonb` + `timestamp` + `pgEnum`. Stub only.                         |
-| ACL graph type                            | Medium       | Access control as a graph. Depends on encrypted data and CRUD layer.                                |
-| Call graph type                           | Low          | Hub-specific, uses metagraph. Deferred until hub consumes this package.                             |
-| Session/message models                    | Low          | Hub-specific, may remain domain tables.                                                             |
+| Call graph type                           | Medium       | Informed by `@alkdev/flowgraph`'s `CallNodeAttrs`/`CallEdgeAttrs` schemas and `@alkdev/operations`' call protocol events. Not hub-specific — any consumer that tracks call invocations needs this. |
+| ACL graph type                            | Medium       | Access control as a graph. Informed by `@alkdev/operations`' `Identity` and `AccessControl`. Depends on encrypted data and CRUD layer. |
+| Task graph type                           | Low          | Informed by `@alkdev/taskgraph`'s `TaskGraphNodeAttributes` and `DependencyEdge` schemas.                  |
+
+## Ecosystem Integration
+
+`@alkdev/storage` is a **data layer package** consumed by other packages in the
+@alkdev ecosystem. It does not depend on the hub — the dependency flows the
+other way. The hub consumes storage (along with operations, pubsub, flowgraph,
+and taskgraph) as part of its architecture.
+
+### Dependency Direction
+
+```
+@alkdev/pubsub          ← transport only (no storage dependency)
+    ↑
+@alkdev/operations      ← call protocol, registry, identity, access control
+    ↑                   (depends on: @alkdev/pubsub, @alkdev/typebox)
+@alkdev/flowgraph       ← call graph schema, operation graph, workflow templates
+    ↑                   (depends on: @alkdev/operations [peer], @alkdev/typebox)
+@alkdev/taskgraph       ← task dependency graph schema, cost-benefit analysis
+                        (depends on: @alkdev/typebox)
+
+@alkdev/storage         ← YOU ARE HERE — typed graph persistence
+                        (depends on: @alkdev/typebox, @alkdev/drizzlebox)
+
+    ↑                       ↑
+    |                       |
+Hub / Spoke          Any consumer that needs
+(consumes all)       persistent graph storage
+```
+
+The key insight: `@alkdev/storage` provides the **persistence primitives**
+(schemas, tables, repository layer). The **domain semantics** (what a call graph
+means, what identity looks like, how access control works) are defined by the
+packages above. Storage stores the shapes those packages define; it does not
+define the semantics itself.
+
+### What Comes from Where
+
+| Concept | Source package | Storage's role |
+|---------|---------------|----------------|
+| Call protocol events (`call.requested`, `call.responded`, etc.) | `@alkdev/operations` | Storage persists the outcomes — graphs with `CallNodeAttrs` nodes |
+| Identity (`id`, `scopes`, `resources`) | `@alkdev/operations` | Storage stores identity as node attributes; `Identity` is a data shape, not a storage concept |
+| Access control (`AccessControl`, `requiredScopes`) | `@alkdev/operations` | Storage's ACL graph type mirrors the operations `AccessControl` schema as graph structure |
+| Call graph schema (`CallNodeAttrs`, `CallEdgeAttrs`, `CallStatus`) | `@alkdev/flowgraph` | Storage persists these in-memory shapes to the database |
+| Task graph schema (`TaskGraphNodeAttributes`, `DependencyEdge`) | `@alkdev/taskgraph` | Storage persists task dependency shapes |
+| Event transport (`TypedEventTarget`, `EventEnvelope`) | `@alkdev/pubsub` | Storage is not involved in event routing; it stores the events' outcomes |
+
+### Repository Layer Bridging Pattern (Consumer-Side Concern)
+
+The repository layer in `@alkdev/storage` provides typed CRUD — no `@alkdev/operations`
+dependency. A **consumer-side** bridging module can then wire these CRUD functions
+into the `@alkdev/operations` registry, analogous to how `drizzle-graphql`
+auto-generates a GraphQL schema from Drizzle tables — but using operations
+(queries, mutations, subscriptions) instead of GraphQL resolvers. This works
+because:
+
+1. `@alkdev/operations` already maps closely to GraphQL's
+   queries/mutations/subscriptions (it was modeled after that pattern)
+2. `@alkdev/pubsub` provides the subscription transport (forked from
+   graphql-yoga's pubsub with additions)
+3. `@alkdev/storage`'s metagraph tables are the data source, analogous to
+   Drizzle tables for drizzle-graphql
+
+The bridging module would live in a consumer package (e.g., the hub or a
+dedicated `@alkdev/storage-operations` adapter), not in `@alkdev/storage` itself,
+to avoid circular dependencies:
+
+```
+@alkdev/storage → defines types + tables (no operations dependency)
+@alkdev/operations → defines call protocol + registry (no storage dependency)
+Consumer (hub / adapter) → imports both, generates operations from schemas
+```
+
+### Avoiding Circular Dependencies
+
+Neither `@alkdev/storage` nor `@alkdev/operations` should depend on each
+other directly. Storage defines the schema types and database tables; operations
+defines the call protocol and execution model. The consumer (hub, spoke, or
+adapter package) imports both and bridges them. This preserves the
+single-responsibility principle and allows each package to evolve independently.
+
+If shared type definitions are needed (e.g., `Identity` referenced in both
+storage node attributes and operations call events), they should either:
+1. Be duplicated in each package with a documented correspondence (acceptable
+   for small, stable types)
+2. Be extracted to a minimal shared types package if the duplication becomes
+   burdensome

 ## Open Questions

 1. **Should `actors` be a node type or a standalone table?** Currently `actors`
   is a standalone table in the SQLite host that isn't referenced by any
-   relation. If identity/authentication is a graph (ACL nodes), actors become
-   node types. If identity is a domain concept that needs special query patterns
-   (auth lookups, session joins), standalone tables may be better. Decision:
-   defer until ACL design.
+   relation. If identity/authentication is a graph (ACL nodes based on
+   `@alkdev/operations`'s `Identity` interface), actors become node types. If
+   identity is a domain concept that needs special query patterns (auth lookups,
+   session joins), standalone tables may be better. Decision: defer until ACL
+   design, informed by `@alkdev/operations`'s `AccessControl` model.

 2. **Should the repository layer be host-specific or host-agnostic?** A
   host-agnostic repository (insert graph, find nodes by type) requires an
@@ -190,9 +283,24 @@ JSR handles npm dependencies natively.
   application-level. See [metagraph.md](./metagraph.md) for the versioning
   approach.

+6. **~~Should the repository layer live in `@alkdev/storage` or in a consumer
+   package?~~** Decision: the repository CRUD layer (host-specific typed
+   queries, schema validation before writes) belongs in `@alkdev/storage`. The
+   operations bridging layer (generating `OperationSpec`s from metagraph schemas)
+   belongs in a consumer or adapter package. These are separate concerns — CRUD
+   is a storage concern; call protocol integration is an application concern.
+   The repository layer in `@alkdev/storage` has **no dependency on
+   `@alkdev/operations`**. It performs typed inserts, finds, updates, and
+   deletes with schema validation. The consumer then wires these CRUD functions
+   into the operations registry if desired.
+
 ## References

- Hub storage spec: `/workspace/@alkdev/hub/docs/architecture/storage/`
+- Operations architecture: `/workspace/@alkdev/operations/docs/architecture/README.md`
+- Pubsub architecture: `/workspace/@alkdev/pubsub/docs/architecture/README.md`
+- Flowgraph architecture: `/workspace/@alkdev/flowgraph/docs/architecture/README.md`
+- Taskgraph architecture: `/workspace/@alkdev/taskgraph_ts/docs/architecture/README.md`
+- drizzle-graphql (reference for repo bridging pattern): `/workspace/drizzle-graphql/`
 - Source heritage: `@ade/ade-v0/packages/core/graphs` and
  `@ade/ade-v0/packages/storage_sqlite`
 - Drizzle ORM: https://orm.drizzle.team/