Curunir Evals

Agentic eval results — local vs cloud models on tool use, planning, memory, and more.

Agentic Eval Series

This page has moved. Redirecting to the series homepage.