SWE-CI: Evaluating Agent Capabilities in Maintaining Codebases via CI

(arxiv.org)

38 points | by mpweiher  2 hours ago

3 comments