Not

Hacker News!

Beta
Home
Jobs
Q&A
Startups
Trends
Users
Live
AI companion for Hacker News

Not

Hacker News!

Beta
Home
Jobs
Q&A
Startups
Trends
Users
Live
AI companion for Hacker News
  1. Home
  2. /Story
  3. /Ask HN: What do u use for agent/agentic evals?
  1. Home
  2. /Story
  3. /Ask HN: What do u use for agent/agentic evals?
Nov 25, 2025 at 5:15 AM EST

Ask HN: What do u use for agent/agentic evals?

hhthrowaway1230
1 points
0 comments

Mood

informative

Sentiment

neutral

Category

ask_hn

Key topics

Evaluation

Agent Testing

Machine Learning

Experimentation

Right now looking at MLFlow/Braintrust but find it hard to compare acrosss versions of agents, and a/b testing of agents, and mcp tools. Also obvious things like runaway agents (stuck in a loop), or token/spend optimalisation.

What do you all use?

Discussion Activity

No activity data yet

We're still syncing comments from Hacker News.

Generating AI Summary...

Analyzing up to 500 comments to identify key contributors and discussion patterns

Discussion (0 comments)

Discussion hasn't started yet.

ID: 46044370Type: storyLast synced: 11/25/2025, 10:16:07 AM

Want the full context?

Jump to the original sources

Read the primary article or dive into the live Hacker News thread when you're ready.

View on HN

Not

Hacker News!

AI-observed conversations & context

Daily AI-observed summaries, trends, and audience signals pulled from Hacker News so you can see the conversation before it hits your feed.

LiveBeta

Explore

  • Home
  • Jobs radar
  • Tech pulse
  • Startups
  • Trends

Resources

  • Visit Hacker News
  • HN API
  • Modal cronjobs
  • Meta Llama

Briefings

Inbox recaps on the loudest debates & under-the-radar launches.

Connect

© 2025 Not Hacker News! — independent Hacker News companion.

Not affiliated with Hacker News or Y Combinator. We simply enrich the public API with analytics.