The SWE-agent project uses LLMs to try to automatically fix issues in
GitHub repositories. I found their paper interesting, mostly because
they make extensive use of SymPy as a test repository.
https://swe-agent.com/

Apparently there are quite a few SymPy issues in the SWE-bench
dataset, which is a dataset of issues and corresponding pull requests
in open source projects. https://www.swebench.com/

Their model was able to fix 10% of SymPy issues in the dataset. That's
obviously not going to replace human developers any time soon, but
it's still interesting. From what I could tell, the issues seem to be
biased towards more easy/straightforward ones (i.e., ones that are
easy to verify if a fix is correct or not). But still, if LLMs are
reaching a point where they can fix bugs completely automatically that
could be very useful.

Aaron Meurer

-- 
You received this message because you are subscribed to the Google Groups 
"sympy" group.
To unsubscribe from this group and stop receiving emails from it, send an email 
to sympy+unsubscr...@googlegroups.com.
To view this discussion on the web visit 
https://groups.google.com/d/msgid/sympy/CAKgW%3D6JvWJBCnfbQjsQOo5VCOFw2jG-jXr36UYbkaQZ07TtRtA%40mail.gmail.com.

Reply via email to