Nov 25, 2025 at 5:15 AM EST
Ask HN: What do u use for agent/agentic evals?
Mood
informative
Sentiment
neutral
Category
ask_hn
Key topics
Evaluation
Agent Testing
Machine Learning
Experimentation
Right now looking at MLFlow/Braintrust but find it hard to compare acrosss versions of agents, and a/b testing of agents, and mcp tools. Also obvious things like runaway agents (stuck in a loop), or token/spend optimalisation.
What do you all use?
Discussion Activity
No activity data yet
We're still syncing comments from Hacker News.
Generating AI Summary...
Analyzing up to 500 comments to identify key contributors and discussion patterns
Discussion (0 comments)
Discussion hasn't started yet.
ID: 46044370Type: storyLast synced: 11/25/2025, 10:16:07 AM
Want the full context?
Jump to the original sources
Read the primary article or dive into the live Hacker News thread when you're ready.