Helm Error: UPGRADE FAILED - Another Operation in Progress

Helm Error: UPGRADE FAILED - Another Operation in Progress When working with Helm in Kubernetes, you might encounter the following error: Error: UPGRADE FAILED: another operation (install/upgrade/rollback) is in progress This usually happens when a Helm release is already in the middle of an action, but another upgrade or rollback is triggered. Common Pending States Helm releases can get stuck in several states: pending-install — Helm started installing, but something went wrong before it finished. pending-upgrade — Helm tried to upgrade, but the process didn’t complete. pending-rollback — A rollback started but got stuck in the middle. These states prevent you from running another helm upgrade or helm rollback. ...

September 26, 2025 · 2 min · 258 words · John Cena

What is Packer? Features and Examples

What is Packer? Packer is an open-source tool from HashiCorp that automates the creation of machine images. Think of it as a “factory” that takes a base image (Ubuntu, CentOS, Windows) and produces consistent, pre-configured VM or cloud images ready for deployment. Instead of manually preparing servers or VMs, you define everything in a template, and Packer does the work. Key Features Multi-platform builds — Create images for AWS AMI, Azure, Google Cloud, VMware, VirtualBox, Docker, and more, all at once. Immutable infrastructure — Instead of configuring servers after they start, you ship pre-baked images. Automation-friendly — Works well with CI/CD pipelines. Extensible — Supports plugins for provisioners (e.g., Ansible, Chef, Puppet, shell scripts). Example: Simple Packer Template { "builders": [{ "type": "docker", "image": "ubuntu:20.04", "commit": true }], "provisioners": [{ "type": "shell", "inline": ["apt-get update", "apt-get install -y nginx"] }] } What happens here: Start with ubuntu:20.04 Docker image. Run provisioning (install Nginx). Save the result as a new Docker image. When to Use Packer Building goldeggn images for production (with pre-installed dependencies). Standardizing environments across multiple clouds. Speeding up autoscaling by using pre-baked images. Conclusion Packer helps developers and DevOps engineers avoid “snowflake servers” and instead work with predictable, automated images. If your infrastructure spans multiple platforms, Packer is a great way to keep things consistent. ...

September 26, 2025 · 2 min · 217 words · John Cena

How to Automatically Restart Deployment on ConfigMap Change

By default, Kubernetes does not automatically restart a Deployment when its ConfigMap changes. This can lead to situations where your pods keep running with outdated configuration until you trigger a rollout manually. Fortunately, there are common patterns to solve this. Why It Happens Kubernetes mounts ConfigMaps into pods as files or environment variables, but the Deployment controller does not track changes in ConfigMap content. That means no automatic restart. Solution Checksum annotations: Add a hash of the ConfigMap into the Deployment’s pod template annotations. Example in Helm: ...

September 25, 2025 · 1 min · 197 words · John Cena

What are SLO, SLA, and SLI? Simple Explanation with Examples

What are SLO, SLA, and SLI? If you’ve ever dived into SRE (Site Reliability Engineering) or service monitoring, you’ve likely seen three mysterious abbreviations: SLO, SLA, and SLI. They might look similar, but they play different roles in how we measure and guarantee reliability. SLI — Service Level Indicator This is a metric that tells you how your service is doing. Examples: Latency (e.g., “response time of API requests”) Availability (e.g., “percentage of successful requests”) Error rate (e.g., “number of 5xx responses”) 👉 Think of SLI as a thermometer — it measures the state of your system. ...

September 25, 2025 · 2 min · 297 words · John Cena

What is a Percentile in Observability? Simple Explanation with Examples

What is a Percentile in Observability? When we talk about observability, especially metrics like latency, we often hear terms such as p50, p95, or p99. These are percentiles. They give us a way to understand not just the average behavior of a system, but how it performs for the majority (or the unlucky few) of requests. Simple Definition A percentile tells you the value below which a given percentage of measurements fall. ...

September 25, 2025 · 2 min · 301 words · John Cena