Roadmap

Statewave is purpose-built for support-agent workflows — the first use case where structured memory clearly outperforms naive history stuffing and simple RAG. The roadmap reflects this: trust and reliability first, then support-agent superiority, then operator experience.

v0.1 — Local MVP ✅

[x] Core domain model (Episode, Memory, ContextBundle)
[x] FastAPI server with all 6 endpoints
[x] Heuristic memory compiler
[x] Context assembly with token estimation
[x] PostgreSQL + pgvector schema
[x] Docker Compose local deployment
[x] Python SDK v0.1.0, TypeScript SDK v0.1.0

v0.2 — Production Hardening ✅

[x] Idempotent compilation, pluggable compilers, token-bounded context
[x] Ranked retrieval (kind × recency × relevance)
[x] Structured errors, request-ID, CORS, health endpoints, structured logging
[x] LLM compilation via LiteLLM (100+ providers)
[x] Semantic search via pgvector
[x] Authentication (API keys), rate limiting (in-memory)
[x] Python SDK v0.2.0, TypeScript SDK v0.2.0

v0.3 — Advanced Features ✅

[x] Temporal reasoning, memory conflict resolution
[x] Webhooks, multi-tenant (experimental)
[x] Middleware ordering, validation, LLM thread-pool fix

v0.4 — Adoption Readiness ✅

[x] Batch episode ingestion (up to 100)
[x] OpenTelemetry tracing (optional)
[x] Deployment guide (Docker, Fly.io, Railway)
[x] SDK publish readiness, getting started guide
[x] Support-agent benchmark & "Why Statewave" comparison doc

v0.5 — Reliability & Trust ✅

[x] Reliable webhook delivery — persistent queue, exponential backoff, dead-letter
[x] SDK retry with backoff — automatic retry on 429/5xx with jitter
[x] Durable async compilation — Postgres-backed job queue
[x] True multi-tenant isolation — app-layer query scoping
[x] Distributed rate limiting — Postgres-backed
[x] Backup/restore tooling — subject-level export/import
[x] Admin introspection — jobs + webhooks
[x] Compilation status API

v0.6 — Support-Agent Superiority ✅

[x] Session-aware context assembly
[x] Resolution tracking (open/resolved/unresolved)
[x] Handoff context packs (structured escalation briefs)
[x] Repeat-issue detection (prior resolution surfacing)
[x] Support-specific ranked retrieval
[x] Customer health scoring (0–100, explainable factors)
[x] Health-aware handoff (risk level + factors in briefs)
[x] Proactive health alerts (webhooks on state transitions)
[x] SLA tracking (response time, resolution time, breach flags)
[x] SLA integration into health + handoff
[x] Product website (statewave.ai)
[x] Proof layer: 3 eval suites (54 assertions), 2 benchmarks (9/9 vs 2/9)

v0.7 — Operator & Cloud Experience ← CURRENT

Goal: Make Statewave trustworthy to operate at scale. An operator should be able to deploy, monitor, upgrade, and scale Statewave without surprises.

[ ] Deep health checks — /readyz verifies DB connectivity, queue health, LLM reachability
[x] Migration safety — preflight script, startup schema guard, /ops/migrations endpoint, runbook
[x] Admin dashboard (read-only) — system health, jobs, webhooks, counts, health distribution
[x] Usage metering — episodes/month, compiles/month, per-tenant usage
[ ] Memory TTL / expiry policies — automatic cleanup of stale memories
[ ] Horizontal scaling guide — read replicas, connection pooling, tested patterns
[ ] Helm chart + Kubernetes deployment guide

v0.8 — Adoption & Ecosystem (planned)

Goal: Make it trivial for teams to adopt Statewave and integrate it into existing stacks.

[ ] SDK convenience methods for support endpoints (health, SLA, handoff, resolutions)
[ ] Framework integrations (LangChain, CrewAI, AutoGen)
[ ] Webhook event filters (subscribe to specific event types)
[ ] Memory templates for common patterns
[ ] Design partner onboarding package
[ ] Head-to-head benchmark against Mem0 / Zep

Design principles

Raw truth first — episodes are immutable, memories are derived
Self-hosted, operator-friendly — you own your data and infra
Support-agent wedge — optimize here, prove it, then expand
Multi-provider — LiteLLM means no vendor lock-in
Trust over features — reliability beats feature count
Honest about limitations — document what doesn't work yet

Roadmap ​

v0.1 — Local MVP ✅ ​

v0.2 — Production Hardening ✅ ​

v0.3 — Advanced Features ✅ ​

v0.4 — Adoption Readiness ✅ ​

v0.5 — Reliability & Trust ✅ ​

v0.6 — Support-Agent Superiority ✅ ​

v0.7 — Operator & Cloud Experience ← CURRENT ​

v0.8 — Adoption & Ecosystem (planned) ​

Design principles ​