#reliability
5 posts found.
llm
4 min read
From a systems perspective, the essence of an LLM feature is not the prompt itself but its boundaries, contracts, state, and failure handling.
llm
4 min read
Why retries, timeouts, fallbacks, and circuit breakers in LLM systems should be designed differently from those for regular APIs, and the operating patterns that follow.

2 min read
An outbox pipeline design that ensures both transactional consistency and reliable event delivery.

2 min read
Defining idempotency key scope, TTL, and response-reuse policies to handle duplicate requests safely.

2 min read
Designing limits, buffers, and retries to protect the system from producer-consumer rate imbalances.