About

Who I am

I'm Max. I work on AI evals and reliability for agentic systems — measuring what actually works versus what just demos well.

Skeptical of tooling hype, mostly because I keep measuring it. The findings end up here and on YouTube.