Hacking Redefined: How LLM Agents Took on University Hacking Competition
For the first time, Team Atlanta deployed their hybrid system, powered by LLM agents—Atlantis —to compete in Georgia Tech’s flagship CTF event, TKCTF 2024 . During the competition, Atlantis concentrated on two pivotal areas: vulnerability analysis and automatic vulnerability remediation. Remarkably, the system uncovered 10 vulnerabilities and produced 7 robust patches1, showcasing the practicality and promise of the team's approach in a real-world hacking competition.