Architecture updates based on gaps discovered during live deployment testing: - ADR-023: HTTP/2 client-facing support via ALPN-based protocol detection. The spec previously said HTTP/2 was out of scope, but the deployment revealed that modern browsers negotiate HTTP/2 via ALPN. The proxy now correctly detects the negotiated ALPN protocol and uses the appropriate HTTP server builder (http2::Builder for h2, auto::Builder for http/1.1). Upstream connections remain HTTP/1.1. Host resolution now falls back to URI host for HTTP/2 :authority pseudo-headers. - ADR-024: ANSI-disabled logging. All tracing-subscriber layers now use with_ansi(false) to prevent ANSI escape codes in log output, which broke fail2ban regex matching in Docker deployments. Also documents the fail2ban regex anchor fix (^RATE_LIMIT → RATE_LIMIT). Bug fixes found by architecture review: - Fix missing ALPN protocols in manual TLS mode. build_manual_server_config and build_multi_domain_server_config did not set alpn_protocols, meaning manual TLS mode could not support HTTP/2. Added h2 and http/1.1 ALPN entries to both functions (acme-tls/1 only in ACME mode). - Fix missing with_ansi(false) in JSON log format. The init_json function with file output did not disable ANSI on stdout or file layers, which would break fail2ban in production JSON logging mode. Other spec updates: - All document statuses updated from draft to reviewed - proxy.md: documented Server header removal, upstream HTTPS client, two-phase timeout enforcement, HTTP/2 host resolution, connect timeout - tls.md: documented ALPN configuration differing by mode (ACME vs manual) - overview.md: added HTTP/2 client-facing support to scope, updated crate deps (hyper-rustls, rustls-native-certs, hyper-util), clarified out-of-scope - config.md: fixed http_port type (u16→u32) to match implementation, added ANSI-disabled note for LoggingConfig - operations.md: documented ANSI-disabled logging, fail2ban regex anchor - open-questions.md: updated OQ-09 resolution (connect timeout fully implemented), OQ-10 (C2 bug is fixed)
7.5 KiB
7.5 KiB
status, last_updated
| status | last_updated |
|---|---|
| reviewed | 2026-06-12 |
Open Questions
TLS
OQ-01: Should cipher suites be restricted beyond rustls defaults?
- Origin: tls.md
- Status: resolved
- Priority: medium
- Resolution: Restrict cipher suites to match the nginx scope: four ECDHE-AES-GCM suites for TLS 1.2 plus all TLS 1.3 suites. This provides behavioral parity during migration. See ADR-012.
- Cross-references: ADR-005, ADR-012
OQ-02: What log format should fail2ban consume?
- Origin: operations.md, proxy.md
- Status: resolved
- Priority: high
- Resolution: Custom structured log format with
key=valuepairs andRATE_LIMITprefix. A corresponding custom fail2ban filter will be provided. See ADR-007. - Cross-references: ADR-007
OQ-07: Should per-site TLS overrides be supported for mixed ACME/manual domains?
- Origin: tls.md, config.md
- Status: resolved
- Priority: low
- Resolution: Resolved by introducing
[[listeners]]configuration. Each listener is an independent TLS endpoint with its own bind address, TLS config, and site routing. This supports both deployment models: (1) shared-IP multi-domain (one listener, SAN certificate, SNI routing) and (2) dedicated-IP single-domain (multiple listeners, each with its own IP/cert/domain). Mixed ACME/manual configurations are naturally supported since each listener has its own TLS mode. See ADR-019. - Cross-references: ADR-011, ADR-019
Logging and Monitoring
OQ-03: Should the health check endpoint be on a separate port?
- Origin: operations.md
- Status: resolved
- Priority: low
- Resolution: Add a configurable local health check port (default: 9900)
bound to
127.0.0.1only. Health checks work even when TLS is misconfigured. There is no/healthroute on the main HTTPS listener — health checking is handled exclusively by the local port and admin socket. See ADR-013 and ADR-022. - Cross-references: ADR-013, ADR-022
Configuration
OQ-04: Should config reload support a Unix domain socket API in addition to SIGHUP?
- Origin: config.md
- Status: resolved
- Priority: low
- Resolution: Yes. Add a Unix domain socket admin API alongside SIGHUP.
The socket accepts a
reloadcommand and returns structured success/failure responses. SIGHUP is retained as a fallback. See ADR-014. - Cross-references: ADR-014
Deployment
OQ-05: Should the proxy bind to multiple addresses or just one?
- Origin: overview.md
- Status: resolved
- Priority: low
- Resolution: A single
bind_addrper listener entry is sufficient. ADR-019 introduced[[listeners]], where each listener has its ownbind_addr. This supports multiple bind addresses in a single process — one per listener — without needing an array of addresses on a single listener. See ADR-016 and ADR-019. - Cross-references: ADR-016, ADR-019
Proxy
OQ-06: Should upstream timeouts be configurable per-site?
- Origin: proxy.md
- Status: resolved
- Priority: low
- Resolution: Resolved by ADR-015. Per-site upstream timeout overrides with sensible defaults (5s connect, 60s request). Optional fields in SiteConfig that override global defaults when specified.
- Cross-references: ADR-015, ADR-017
OQ-08: Should the /health path use a less common endpoint to avoid upstream collision?
/health path use a less common endpoint to avoid upstream collision?- Origin: Implementation review finding W5, proxy.md
- Status: resolved
- Priority: medium
- Resolution: The
/healthroute does not belong on the main listener at all. Health checking is an operational concern served by the dedicated local port (9900) and the admin socket'sstatuscommand — not by intercepting traffic on the public-facing proxy. Serving/healthon the main listener creates collision with upstream applications, requires special-case routing logic before host-based matching, and is architecturally wrong: the main listener's job is to proxy requests, not to serve operational endpoints. The local health check port (bound to127.0.0.1:9900) and the admin socket are the sole health/status mechanisms. See ADR-022. - Cross-references: ADR-013, ADR-022
OQ-09: How should upstream_connect_timeout_secs be enforced?
upstream_connect_timeout_secs be enforced?- Origin: Implementation review finding W4, ADR-015, ADR-017
- Status: resolved
- Priority: medium
- Resolution: Implemented using a two-phase
tokio::time::timeoutapproach. The inner timeout uses the per-siteupstream_connect_timeout_secs(default 5s) for the connect + first-byte phase, and the outer timeout usesupstream_request_timeout_secs(default 60s) for the full request/response cycle. Additionally,HttpConnector::set_connect_timeout()enforces the TCP-level connect timeout on both HTTP and HTTPS clients. The implementation is inhandler.rsandcreate_http_client()/create_https_client(). No new ADR needed; the decision was already made in ADR-015. - Cross-references: ADR-015, ADR-017
OQ-10: Should ACME contact email be a required config field?
- Origin: Implementation review finding C2, tls.md, config.md
- Status: resolved
- Priority: high
- Resolution: This is not an open question — the architecture already
specifies
acme_contactas a required field in ACME mode (config.md validation rule 19). The field is defined in theListenerConfigtable and shown in TOML examples. Let's Encrypt requires a contact email for production certificate requests. The implementation bug (C2:contact: vec![]) has been fixed —acme_contactis now correctly wired from config to the ACME state machine. No new ADR needed — the decision is already documented in config.md and tls.md. - Cross-references: ADR-004
OQ-11: How should X-Forwarded-Proto be derived per-listener?
X-Forwarded-Proto be derived per-listener?- Origin: Implementation review finding W14, proxy.md
- Status: resolved
- Priority: medium
- Resolution: The hardcoded
is_https: truebehavior is correct for a TLS-terminating reverse proxy. The proxy only proxies requests on the HTTPS listener, which always setsX-Forwarded-Proto: https. The HTTP redirect listener sends a 301 redirect and does NOT proxy requests, soX-Forwarded-Protois not set there. This behavior is correct and matches the architecture spec (proxy.md). The implementation should add a comment documenting this rationale to prevent future "fixes" that would change the behavior. No ADR or spec change needed — just a code comment. - Cross-references: ADR-021
Operations
OQ-12: Should request access logging be mandatory or optional?
- Origin: Implementation review finding W13, operations.md
- Status: resolved
- Priority: high
- Resolution: Access logging is mandatory and always-on at
infolevel. The architecture spec (operations.md) already states: "Access logging is always-on — it is the primary observability mechanism for the proxy and is required for fail2ban integration. There is no configuration option to disable access logging." Thelog_request!macro exists in the codebase but is not called — this is an implementation gap (W13), not an architectural question. No ADR needed; ADR-007 already covers the log format. - Cross-references: ADR-007