Better Harness: A Recipe for Harness Hill

The article presents a compelling case for using evaluations ("evals") as a form of training data to refine AI agent behavior, drawing a clear parallel to classical machine learning. The strongest version of this narrative is that evals provide a structured way to encode desired behaviors, enabling iterative improvement through a feedback loop that includes human oversight and holdout sets to prevent overfitting. This approach is grounded in practical engineering principles, such as data quality...

Better Harness: A Recipe for Harness Hill

Facts Only

Executive Summary

Full Take

Sentinel — Human