#sre

The phenomenon of connection draining not ending due to WebSocket persistent connection is analyzed from the perspective of TCP, Nginx proxy, and application, and five solutions are presented.

devops sre network load-balancer nginx websocket

Read article

devops

2026년 3월 4일4 min read

Part 5. Non-disruptive deployment strategy in VM + Nginx environment

The Zero Downtime Deployment procedure that can actually be operated in the L4 -> Nginx -> App -> WebSocket architecture is presented in a 7-step runbook.

devops sre network load-balancer nginx websocket

Read article

devops

2026년 3월 4일3 min read

Part 6. Analysis of actual failure cases (SRE perspective)

Actual failure patterns caused by server removal without drain, WebSocket drain failure, and keepalive setting imbalance are analyzed through symptoms, logs, causes, and solutions.

devops sre network load-balancer nginx websocket

Read article

devops

2026년 3월 4일4 min read

Part 7. Operation checklist and verification method

We present a practical checklist to verify Connection Draining and Graceful Shutdown readiness within 10 minutes immediately before deployment in the L4/Nginx/App/WebSocket structure.

devops sre network load-balancer nginx websocket

Read article

llm

2026년 3월 3일4 min read

Part 3. Reliability Design: Retry, Timeout, Fallback, Circuit Breaker

We summarize the reasons and operating patterns for retries, timeouts, fallbacks, and circuit breakers in LLM systems that should be designed differently from regular APIs.

llm agent system-design reliability sre resilience

Read article

llm

2026년 3월 3일4 min read

Part 11. Reference Architecture: End-to-End Operational Design

We present an LLM/Agent reference architecture that combines prompting, evaluation, reliability, cost, security, and observability into one operating system.

llm agent system-design reference-architecture platform sre

Read article