IS  //  Input Systems

Notes

Blog

Experiments, model evaluations, and things worth writing down from building at Input Systems.

June 13, 2026 · Model evaluation

Do open-weight coding models live up to their benchmarks?

Six open-weight models, ten fresh and uncontaminated coding tasks, one level playing field. The leaderboard rank barely transferred — and the benchmark headline turned out to be the least useful number on the page. What actually separated them was cost and reasoning efficiency, and the cheapest model quietly won.

Read →

More to come.  ·  Back to inputsystems.ai