00 · Experiment, score, pick winnersAI EvalsRun experiments on live traffic, keep cohorts stable, and compare variants against real outcome signals. The substrate for evaluating models, prompts, and rewrites in production.EVENTsend.emailupdate.crmscore.leadnotify.slackeventuser.signed_up · 4 listenersPatterns01Run experiments in productionUse group.experiment() to split traffic, keep cohorts stable, compare variants, and roll changes forward safely.Next primitive →Durable Workflows