Job Url: https://www.remoterocketship.com/company/bench/jobs/senior-software-engineer-site-reliability-united-states-remote Job Description: Benchmark Website LinkedIn All Job Openings Benchmark is a global product realization services company that specializes in providing comprehensive solutions in advanced computing, commercial aerospace, defense, medical technologies, and semiconductor capital equipment. The company offers a range of services from design engineering and precision machining to full-system electronic assembly and lifecycle management, ensuring reliable support for innovative products in demanding markets. With a collaborative approach that leverages cross-functional teams, Benchmark aims to be a trusted partner in delivering customized solutions tailored to complex challenges. Advanced Technology β€’ Design Engineering β€’ Manufacturing β€’ Order Fulfillment β€’ Design 10,000+ employees Founded 1979 πŸš€ Aerospace βš•οΈ Healthcare Insurance Senior Software Engineer, Site Reliability 42 minutes ago πŸ‡ΊπŸ‡Έ United States – Remote ⏰ Full Time 🟠 Senior πŸ¦… H1B Visa Sponsor AWS Cloud Docker Grafana Java Kubernetes PHP Prometheus Python Terraform Apply Now Receive Emails with Similar Jobs Report problem πŸ“‹ Description β€’ Contribute to the design, development, and delivery of features that enhance system reliability and scalability. β€’ Define, measure, and improve SLIs, SLOs, and error budgets in collaboration with engineering teams. β€’ Participate in building a culture of reliability through knowledge sharing, documentation, and process improvements. β€’ Implement and improve observability tooling and practices to monitor the health and performance of production systems. β€’ Participate in incident management, including on-call rotations, root cause analysis, and postmortem reviews. β€’ Lead smaller initiatives or components of larger projects, ensuring technical quality and operational readiness. β€’ Collaborate with software engineering, security, and product teams to ensure resilient and secure system design. β€’ Mentor junior engineers, sharing expertise in SRE principles and AWS best practices. β€’ Contribute to automation efforts to reduce toil and improve efficiency of operational processes. 🎯 Requirements β€’ 5+ years of experience in Site Reliability Engineering, DevOps, or Software Engineering with a focus on production operations. β€’ Strong knowledge of AWS cloud services and cloud-native architectures. β€’ Proficiency in scripting or programming languages (e.g., Python, Bash). β€’ Experience with observability tools (e.g., CloudWatch, Datadog, Prometheus, Grafana). β€’ Familiarity with infrastructure-as-code tools (e.g., Terraform, CloudFormation) and CI/CD pipelines. β€’ Strong problem-solving skills and ability to work cross-functionally. β€’ Some experience mentoring or coaching junior engineers. β€’ Preferred Qualifications: AWS certifications (e.g., AWS Certified Solutions Architect – Associate or AWS Certified DevOps Engineer – Associate). β€’ Experience leading on-call rotations, capacity planning, and chaos engineering initiatives. β€’ Experience with containerization (Docker, ECS, Kubernetes/EKS). β€’ Familiarity with incident response best practices and operational readiness processes. β€’ Knowledge of PHP or Java is a plus.