#evaluation

2 posts found.

llm
5 min read
LLM quality stabilizes when it is managed through datasets, evaluation criteria, online feedback, and regression-detection loops, not through sentence-level tuning.
Vector search evaluation metric design
2 min read
A standard for managing search quality by interpreting Recall@K, MRR, and NDCG in the context of the service.