#evaluation
2 posts found.
llm
5 min read
LLM quality is stabilized when managed through datasets, evaluation criteria, online feedback, and regression detection loops, not sentence tuning.

2 min read
Standard for managing search quality by interpreting Recall@K, MRR, and NDCG according to the service context