Brain — Telemetry & Usage Analytics for Deployed AI Agents | M8TRX

Component	Responsibility	Tech
Brain SDK	Buffer + flush events from inside the agent process; never raise.	Python package `m8trx_brain` (pip-installable from a private index or git URL).
Ingestion API	`POST /v1/events`, bearer auth, schema validation, redaction, DB write. Stateless.	FastAPI + uvicorn in a Docker container on EC2 (t3.small) behind an ALB.
Redaction pipeline	In-process module. Routes raw summaries through Claude Haiku to strip PII based on the customer's privacy tier.	`anthropic` SDK; Haiku (`claude-haiku-4-5`).
Postgres (RDS)	Primary store. JSONB payload column for schema flexibility during iteration.	RDS Postgres 16, db.t4g.medium, gp3 storage, 7-day automated snapshots.
Metabase	Read-only analytics UI for the team.	Open-source Metabase on a small EC2 / ECS task; uses a read-only DB role.

`event_type`	Emitted when	Payload fields
`session.start`	Agent picks up a unit of work.	`category` (free string, agent-tagged, e.g. `"email_reply"`); `source` (optional, e.g. `"inbox"`).
`session.end`	Unit of work completes, fails, or escalates.	`status: "success"\|"failed"\|"escalated"`; `duration_ms`; `tool_calls: [{name, ms, ok}]`; `llm_usage: {model, input_tokens, output_tokens, cost_cents}`; `summary_raw?` (redaction input).
`error`	Uncaught error in the agent.	`message`, `kind`, `stack_hash` (sha256 of stack trace, no body) — groupable without leaking code paths.
`heartbeat`	SDK background thread, every 5 min.	`agent_version`, `pid_uptime_s`, `dropped_events_since_last`.

¶ 1. Goal & non-goals