Job description:
**About GovEagle**
We build an AI‑powered platform that helps government contractors draft proposals faster and more securely. Our stack: Python micro‑services, Celery workers, Redis, Kubernetes, and LLM integrations in AWS GovCloud & Azure GovCloud.
\*\*The Gig (1 Month, ≈ 20 hrs/week)\*\*
Help us hit a strict uptime & latency SLA by auditing our infrastructure and knocking out the highest‑impact fixes within four weeks.
**What You’ll Tackle**
* Reliability audit of K8s workloads, Celery queues, Redis caching, and cloud networking
* Prioritized action plan
- rapid “quick win” implementations (HPA tuning, alerts, rollout strategies)
* Guidance (or prototype) on adopting Temporal where Celery falls short
* Clear documentation so our small team can maintain the improvements
**Tech You’ll Touch**
Python · Redis · Kubernetes (Helm/Kustomize) · Celery · Docker
_Bonus_: Temporal, Terraform, Prometheus/Grafana, GitHub Actions
**What We Need**
* 5+ yrs running high‑availability production systems
* Deep experience scaling Python services on Kubernetes
* Track record with queue‑based architectures and observability
* FedRAMP / GovCloud familiarity is a plus
**How to Apply**
Message me here with a short intro, relevant project highlights, and your hourly rate. We’ll move fast.