
Braintrust.dev
Langfuse
Helicone AI
LangSmith
Future AGI
Galileo AI
LangChain
Humanloop
Autoblocks
Openlayer
Giskard.ai
Langfuse
LangChain
Postman
LangWatch
Sentry.io
Braintrust.dev
AutoblocksBased on our record, Braintrust.dev seems to be more popular. It has been mentiond 3 times since March 2021. We are tracking product recommendations and mentions on various public social media platforms and blogs. They can help you identify which product is more popular and what people think of it.
Braintrust focuses on evaluation-driven development: the idea that monitoring LLM applications means continuously scoring outputs against quality criteria, not just tracking latency and error rates. It's an eval platform first, with observability features built on top of the evaluation infrastructure. - Source: dev.to / 19 days ago
You're monitoring production traffic. You need Langfuse / Phoenix / Helicone / Braintrust for that. Online eval is a different problem class: implicit feedback, drift detection, hallucination rates on your data, not on HellaSwag. - Source: dev.to / 29 days ago
Same approach works with Langfuse, Phoenix, Braintrust, or your existing OTel pipeline โ the metadata.userId pattern is the universal part. - Source: dev.to / about 1 month ago
Langfuse - Langfuse is an open-source LLM engineering platform that helps teams collaboratively debug, analyze, and iterate on their LLM applications.
Openlayer - Test, fix, and improve your ML models
Helicone AI - Open-source LLM Observability for Developers
Giskard.ai - Open-source & Collaborative Quality Testing for AI models
LangSmith - Build and deploy LLM applications with confidence
Future AGI - Open-source engineering stack for self-improving AI Agents