Job Title: DevOps - Platform Engineer
Company Name: August Health
Job Details: Remote, Full Time
Job Url: https://hiring.cafe/viewjob/uq2x1h6gw7he6r91
Job Description:
Posted 1d ago
United States, Remote, Full Time

Responsibilities: Infrastructure as code, Kubernetes platform, CI/CD pipelines
Requirements Summary: Hands-on AWS, Kubernetes in production, IaC (Pulumi/Terraform), GitHub Actions, security/compliance (SOC 2/HIPAA), observability (Prometheus), data pipeline infra (Snowflake/NiFi), strong documentation and self-direction.
Technical Tools Mentioned: Pulumi, Kubernetes, GitHub Actions, Prometheus, Snowflake, Apache NiFi, Tailscale, Langfuse, Phoenix, Helicone

About August Health
At August Health, our mission is to empower the essential work of caring for our elders.
We achieve this by providing a modern operating platform and electronic health record (EHR) that enables senior living operators to deliver high-quality care with confidence.
Caregivers are the heart of senior living communities, embodying care, compassion, and well-being. Yet they face increasing challenges: higher resident acuity, complex workflows, and staffing shortages. At August Health, we build tools that simplify their tasks, eliminate inefficiencies, and provide the insights they need to focus on what truly matters: caring for residents.
At August, we strive to live our values each day, in every interaction with our customers and with each other.
Be responsible – leave things better than you found them
Take ownership – be decisive and take action
Be ambitious – build something great
Keep an open mindset – communicate candidly and welcome new ideas
Be humble – celebrate each other's successes and learn from our mistakes
Stay positive – assume best intent

About the Role
We're looking for a Platform Engineer who thrives at the intersection of reliability, security, and developer productivity.
You'll be a core contributor to our infrastructure, owning the systems that keep August Health fast, secure, and resilient as we scale.
This is a high-autonomy, high-impact role. You'll work closely with our engineering team to shape how we build, deploy, and operate software, with real influence over architecture decisions and engineering culture.

Infrastructure as code: managing and evolving our AWS infrastructure using Pulumi, with a focus on reliability, cost efficiency, and maintainability
Kubernetes platform: operating and improving our K8s clusters, including workload scheduling, resource management, networking, and observability
CI/CD pipelines: owning and optimizing our GitHub Actions workflows to keep builds fast, feedback tight, and deployments safe
Security & compliance: hardening our infrastructure posture, supporting audit readiness, and implementing controls that meet the requirements of operating in healthcare
Data pipeline infrastructure: supporting the reliable operation of our data engineering workflows
LLM tooling: deploying and maintaining prompt tracing, evaluation, and observability tools as we integrate AI capabilities into our product
Network & access: managing secure, zero-trust connectivity via Tailscale across our distributed infrastructure
Disaster recovery & incident response: designing, documenting, and regularly testing DR/IR processes so we're always ready

About You
Strong hands-on experience with AWS, particularly EKS, Cognito, Aurora, RDS, Lambda, and VPC; you can make smart tradeoff decisions across services and know when to reach for each
Proficiency with Kubernetes in production: you've operated clusters at scale and know how to debug when things go wrong
Experience with infrastructure as code, ideally Pulumi or a similar tool (Terraform, CDK)
Comfort with GitHub Actions or similar CI/CD systems: you've built and optimized pipelines, not just used them
A security-minded approach: you think about least privilege, secrets management, and compliance by default; experience working toward or maintaining SOC 2 and/or HIPAA compliance is important, not just a nice-to-have
Solid observability experience: you're comfortable with Prometheus, have instrumented backend services before, and can look at an existing metrics setup and form a point of view on what's missing or misleading
Familiarity with data pipeline infrastructure, including tools like Snowflake and Apache NiFi
Strong communication skills: you can explain infrastructure decisions to non-infrastructure engineers, and you write good documentation
Self-direction: you can identify what needs doing, prioritize well, and drive projects to completion without heavy oversight

Nice to Have
Experience with Tailscale or other zero-trust networking tools
Ability to read and write backend code in Scala or Java: the ideal candidate is comfortable enough to review or contribute to application code, not just the infrastructure around it
CKA (Certified Kubernetes Administrator) certification from the Linux Foundation
Prior work in a healthcare or other regulated industry, with hands-on experience maintaining SOC 2 or HIPAA compliance
Experience deploying LLM observability or evaluation tooling (e.g., Langfuse, Phoenix, Helicone, or similar)

About our team
Our team brings together deep expertise in technology, healthcare, and company-building. We've led teams at Apple, Google, Landmark Health, and Adobe, co-founded and exited multiple companies, shipped products used by hundreds of millions of users, and managed clinical teams caring for thousands of patients.
Backed by top-tier Silicon Valley investors, we are partnering with some of the largest senior care organizations in the U.S. to transform the future of senior living.

We offer market-competitive compensation based on experience and ability, including significant equity option grants.
Our benefits prioritize your well-being with 100% company-paid premiums for health, dental, and vision coverage, along with company contributions to your HSA. We help you plan for the future with a 2% 401(k) match, and support your physical and mental health through services like Rightway Health Advocacy and Spring Health Mental Wellness. Beyond traditional benefits, we offer a flexible time off policy, 100% paid family leave, and all-expenses-paid, in-person company offsites twice a year.