Groundedness Score

Stays within observable conversation scope

RAG-Compatible0-100
What we measure

Does the response stay within information that's observable from the conversation? Does it introduce unverifiable policies, features, or specifics not discussed?

Why it matters

This metric works with RAG-based AI systems where knowledge bases may be incomplete. Making ungrounded claims (even if true elsewhere) creates confusion when users can't verify the information.

Research Foundation
  • Reference-free evaluation frameworks (Evidently AI RAG Guide)
  • Assesses quality without requiring external ground truth
  • No documentation access needed – evaluates based on conversation alone