Benedikt Stroebl

I'm the cto and cofounder of a company part of YC S25.

Before that, I was a PhD student at Princeton University, advised by Arvind Narayanan.

My research focuses on AI agents, with a focus on enhancing their real-world usefulness and reliability. Part of that is developing rigorous evaluation frameworks and studying the limitations of inference scaling techniques. I have also done work on making models speak multiple languages and be faithful to the underlying culture.

[Google Scholar] [GitHub] [X]

Selected Publications

(* indicates equal contribution)

Blog Posts

Workshops

Talks & Press