Job Url: https://boards.greenhouse.io/embed/job_app?token=4016468009&utm_source=jobright Job Description: Dev Ops / Infrastructure Engineer - 3 Month Contract at 10a Labs (View all jobs) New York City About 10a Labs: 10a Labs is the safety and threat-intelligence layer trusted by frontier AI labs, AI unicorns, Fortune 10 companies, and leading global technology platforms. Our adversarial red teaming, model evaluations, and intelligence collection enable engineering, safety, and security teams to stay ahead of evolving threats and deploy AI systems safely. 3 Month Contract | Remote | High-Impact About the Role: We’re looking for an infrastructure-focused devOps / MLOps engineer who thrives at the intersection of machine learning, systems, and product delivery. This is a hands-on 3-month contract role responsible for deploying, monitoring, and scaling the testing and deployment infrastructure for a real-time ML-powered content moderation system used to detect and triage abuse, threats, and edge-case language.  In This Role, You Will: Design, build, and document a maintainable GCP cloud infrastructure CI/CD pipeline for real-time model serving and data workflows Deploy and optimize APIs for low-latency ML systems Automate model deployment, retraining, and evaluation (CI/CD for ML) Build observability tooling to monitor rollouts, errors, integration testing, and drift in ML pipelines Ensure infrastructure meets security, compliance, and uptime requirements We’re Looking for Someone Who: Has 3–8 years of  DevOps/Platform engineering experience deploying machine learning systems or high-availability backend systems. Ability to build CI/CD pipelines from scratch; familiarity with GitHub Actions or similar. Expert-level proficiency with Git and GitHub workflows and strong scripting abilities in Python, Bash, and/or Go. Experience with Google Cloud Run and Docker. Experience with Google Cloud Platforms, Docker, Kubernetes, Terraform . Familiarity with SOC 2 compliance requirements and security best practices (IAM, secrets, etc). Experience implementing monitoring, logging, and alerting systems (e.g., Prometheius, Grafana, ELK/EFK, OpenTelemetry). Can work cross-functionally with ML, security, and engineering teams to deploy safely and iterate fast. Brings a builder's mindset and bias for ownership in ambiguous environments. What Success Looks Like in the  3 Months: You’ve deployed and monitored a real-time ML inference system with well-defined observability. You’ve implemented an API with latency under 1000ms for classifier-based inference. You’ve partnered with ML engineers to streamline deployment and retraining workflows. You’ve built logging and monitoring that gives insight into system performance and classifier behavior. Apply for this Job* Required First Name * Last Name * Email * Phone Resume/CV * Attach Dropbox or enter manually (File types: pdf, doc, docx, txt, rtf) Cover Letter Attach Dropbox or enter manually (File types: pdf, doc, docx, txt, rtf) School * School * Degree * Degree * Discipline * Discipline * Start Date (Year*) End Date (Year*) + Add another education LinkedIn Profile * Website Will you now or in the future require employer sponsorship for work authorization (e.g., H-1B or other employment-based sponsorship) to work in the U.S.? *