The previous thread glossed over how our LLM Agents actually work.
The truth is, it took us a long time to figure out how to get reliable and impressive results from agents.
By the end, we learned general strategies to build effective LLM agents, which we're now sharing. 🧵
This high confidence allows us to run our exploiter and patcher agents on every vulnerability, often resulting in both a PoC and a Patch. We run multiple copies of each agent and cross-check the results against one-another.