Operational Considerations for Managing Stateful Workloads
https://dev.to/pampatzoglou/operational-considerations-for-managing-stateful-workloads-20c3
When managing stateful workloads, whether in Kubernetes or traditional infrastructure, operational concerns like isolation, lifecycle management, security, disaster recovery, scalability, and observability take center stage. While the examples focus on AWS, PostgreSQL, and Kubernetes, the principles and best practices discussed here are broadly applicable to any environment. This article approaches these topics from an operations perspective, prioritizing reliability, maintainability, and resilience. The goal is not just to run a database, but to ensure it operates efficiently, scales properly, and remains secure in real-world conditions. We’ll explore key aspects of running stateful workloads, from managing failure domains to ensuring observability, and how these impact both operations teams and developers. Whether you’re running a database in a cloud-native setup or on bare metal, these strategies will help you build a robust, well-managed system.
Can Configuration Languages (config DSLs) solve configuration complexity?
https://itnext.io/can-configuration-languages-dsls-solve-configuration-complexity-eee8f124e13a
GKE Cost Cutting — Three Key Lookout Points to view your Potential Savings
https://medium.com/google-cloud/gke-cost-cutting-three-key-lookout-points-to-view-your-potential-savings-10f271dc4fa9
How Kubernetes HPA Decides Which Pod to Terminate When Scaling Down
https://medium.com/@AlexanderObregon/how-kubernetes-hpa-decides-which-pod-to-terminate-when-scaling-down-6675ebbdf56f
Load Balancing gRPC traffic with Istio
https://dev.to/visepol/load-balancing-grpc-traffic-with-istio-1k49
Why Every Platform Engineer Should Care About Kubernetes Operators
https://www.pulumi.com/blog/why-every-platform-engineer-should-care-about-kubernetes-operators
Demystifying Swap in Kubernetes: A Handbook for DevOps Engineers
https://medium.com/@robertbotez/demystifying-swap-in-kubernetes-a-handbook-for-devops-engineers-e5ef934593e3
Argo Rollouts — Canary Deployment with Istio
https://medium.chuklee.com/argo-rollouts-canary-deployment-with-istio-b432bc141ba9
kpatch
https://github.com/dynup/kpatch
kpatch is a Linux dynamic kernel patching infrastructure which allows you to patch a running kernel without rebooting or restarting any processes. It enables sysadmins to apply critical security patches to the kernel immediately, without having to wait for long-running tasks to complete, for users to log off, or for scheduled reboot windows. It gives more control over uptime without sacrificing security or stability.
Understanding the Circuit Breaker: A Key Design Pattern for Resilient Systems
https://dzone.com/articles/circuit-breaker-pattern-resilient-systems
The Circuit Breaker Pattern is a key design pattern for building resilient systems by preventing cascading failures and ensuring graceful degradation.
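As a rough illustration of the pattern described above (not code from the linked article), here is a minimal sketch in Python. The names `CircuitBreaker`, `CircuitOpenError`, `max_failures`, `reset_timeout`, and `fallback` are hypothetical and chosen only for this example. The breaker opens after a run of consecutive failures, rejects calls while open so a failing dependency is not hammered, and lets a single trial call through once the timeout has elapsed.

```python
import time


class CircuitOpenError(Exception):
    """Raised when a call is rejected because the circuit is open."""


class CircuitBreaker:
    """Illustrative breaker; names and defaults are assumptions."""

    def __init__(self, max_failures=3, reset_timeout=30.0, clock=time.monotonic):
        self.max_failures = max_failures    # consecutive failures before opening
        self.reset_timeout = reset_timeout  # seconds to stay open before a trial call
        self.clock = clock                  # injectable so the timing can be tested
        self.failures = 0
        self.opened_at = None               # None means the circuit is closed

    def call(self, func, *args, fallback=None, **kwargs):
        if self.opened_at is not None:
            if self.clock() - self.opened_at < self.reset_timeout:
                # Open: fail fast without touching the dependency.
                if fallback is not None:
                    return fallback()
                raise CircuitOpenError("circuit is open")
            # Timeout elapsed (half-open): fall through and allow one trial call.
        try:
            result = func(*args, **kwargs)
        except Exception:
            self.failures += 1
            # A failed trial call, or too many failures in a row, (re)opens the circuit.
            if self.opened_at is not None or self.failures >= self.max_failures:
                self.opened_at = self.clock()
            if fallback is not None:
                return fallback()  # graceful degradation instead of an error
            raise
        # Success closes the circuit and resets the failure count.
        self.failures = 0
        self.opened_at = None
        return result
```

Passing a `fallback` such as a cached response is one way to get the graceful degradation the description mentions; without it, callers receive the original exception or `CircuitOpenError`.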
Load Testing with Impulse at Airbnb
https://medium.com/airbnb-engineering/load-testing-with-impulse-at-airbnb-f466874d03d2
Comprehensive Load Testing with Load Generator, Dependency Mocker, Traffic Collector, and More
🤔 How do you keep a system of 1,500 microservices from falling apart under peak load? And what do you do during a DDoS attack at 1 million RPS?
The Yandex Market team has published a detailed breakdown of its reliability engineering. Inside is an honest account of how Graceful Degradation works in practice, why war rooms are needed, and how the team runs load tests directly in production.
✅ The philosophy of Graceful Degradation.
✅ Must-have architectural patterns.
✅ Distribution of processes during incidents.
✅ Load testing in production.
The article will be useful to anyone who builds and maintains high-load, distributed systems. It is a good chance to look under the hood of an e-commerce giant and compare its approaches with your own.
Advertisement. Advertiser: Yandex.Taxi LLC. INN 7704340310
unregistry
https://github.com/psviderski/unregistry
Unregistry is a lightweight container image registry that stores and serves images directly from your Docker daemon's storage.
Terraform: Working with the State File in Memory
https://medium.com/@pilitsyn/terraform-working-with-the-state-file-in-memory-930a262dd154
terrafetch
https://github.com/RoseSecurity/terrafetch
Terrafetch is the Neofetch of Terraform—because your infrastructure deserves a little flair. It scans your Terraform repository and displays key statistics like the number of variables, resources, modules, outputs, and more—all in a stylish, terminal-friendly format. Perfect for CLI screenshots, repo intros, or just flexing your infra hygiene.