In the SWE-bench test, Devin was able to correctly resolve 13.86% of GitHub issues without any assistance, performing far better than GPT-4.
Posted by:VentureBeat
Posted on: 3/12/2024
BACK TO ISSUE