Ingestion, retrieval, generation, and evaluation: a complete reference architecture.
Notes on AWS, cost & reliable systems
Practical, hard-won notes on AWS, cost optimization, machine learning, core services, and running things reliably in the cloud. A new post every couple of weeks.
AWS re:Invent 2025: my recap and the launches that matter
Field notes from Las Vegas: the keynotes, the standout announcements, and what I am taking back to production.
Read the recap
Recent posts
Canaries, gradual rollouts, and instant rollback: decoupling deploy from release.
A serverless, distributed SQL database: what it is and where it fits today.
A back-of-envelope model for what idle headroom really costs you per year.
Queue-backed inference for large payloads and bursty traffic, scaling to zero between bursts.
Wiring CloudWatch, X-Ray, and OpenTelemetry into one coherent view.
Service-to-service auth and access policies without managing a mesh.








