Agents are significantly more powerful than standalone LLM calls. But, debugging them is a nightmare.
You can trace their reasoning and tool use, but traces get huge and are impossible to parse.
To solve this, we spent the lasts several months building Gentrace for Agents, which puts our own agents to work on yours.
In Gentrace for Agents, you can:
• Chat with AI to debug agent traces
• Create smart monitoring columns
• Build out tailored evaluations
It’s like a giant AI powered spreadsheet over your trace data, with a Cursor-style chat sidecar. If it sounds a little meta, it is, but it is very powerful in practice.
We recorded this video to show you how it works. Take a look, and let me know what you think: