April 18, 2026
Building evals for agents inside SDLC
Replayable episodes, executable judges, and an autonomy scorecard
Alexander PolomodovTechnical Director & Fellow · T-Technologies
/ AI Evals for SDLC Agents 20261
Replayable episodes, executable judges, and an autonomy scorecard