Company Name: Articul8 AI Job Details: Be an Early Applicant 2 Locations Remote Senior level Job Url: https://builtin.com/job/senior-software-development-engineer-test-sdet-chaos-engineering-specialist-brazil/6815557 Job Description: Company OverviewAt Articul8 AI, we're building the next generation of resilient, scalable software systems that help organizations transform their operations. Our commitment to quality and reliability drives our engineering culture, where we continuously test and improve our systems under real-world conditions.Why Join Articul8 AI?Make an Impact: Shape the resilience and reliability of AI-driven systems at scale.Build with Modern Tech: Leverage cutting-edge tools and platforms (Multi-cloud, AI-first tooling).Ownership & Growth: Take ownership of chaos engineering initiatives and influence engineering culture across teams.Continuous Learning: Collaborate with top engineers, participate in mentoring, and stay ahead in chaos engineering and SRE practices.Position SummaryWe are seeking a Senior SDET specializing in chaos engineering and monitoring to join our Quality Engineering team. You will design and implement sophisticated test automation frameworks, create and run chaos experiments to validate our systems' resilience against real-world failures, while ensuring comprehensive monitoring capabilities that provide actionable insights during both testing and production scenarios.Key ResponsibilitiesDesign, develop, and maintain advanced test automation frameworks that incorporate chaos engineering principlesCreate and execute chaos experiments that simulate various failure modes and edge cases in our distributed systemsImplement monitoring solutions that effectively track system performance, resilience, and failure recoveryEstablish observability practices that provide deep insights into system behavior during chaos experimentsCollaborate with development teams to build resilience into our applications from the ground upDevelop metrics and dashboards to visualize system reliability and the impact of chaos experimentsLead post-mortem analyses to identify system weaknesses discovered through chaos testingIntegrate chaos testing into CI/CD pipelines to validate system resilience continuouslyMentor engineers through code reviews, technical sessions, and hands-on guidance in test automation, chaos engineering, and monitoring best practices.Contribute to the company's overall testing strategy and quality assurance practicesQualificationsRequiredBachelor's degree in Computer Science, Engineering, or related field5+ years of experience in software testing and quality assurance, with at least 2 years focused on chaos engineeringStrong programming skills in languages such as Python, Go, and/or RustExperience with chaos engineering tools such as Chaos Monkey, Gremlin, or similar frameworksIn-depth knowledge of monitoring systems like Prometheus, Grafana, ELK Stack, or similar toolsExperience implementing observability practices (metrics, logging, tracing) in distributed systemsFamiliarity with container orchestration platforms like Kubernetes and related chaos toolsExperience with SRE practices and principlesStrong understanding of CI/CD pipelines and how to integrate testing workflowsExperience with cloud platforms (AWS, GCP, Azure) and their monitoring capabilitiesExcellent communication skills with the ability to present technical findings to various stakeholdersPreferredMaster’s degree in Computer Science, Engineering, or related fieldKnowledge of statistical analysis for evaluating test results and system performanceExperience with distributed systems and microservice architecturesContributions to open-source testing or chaos engineering projectsFamiliarity with AI/ML systems and their unique testing challengesRelevant certifications in cloud platforms, testing methodologies, or chaos engineeringReady to shape the future of resilient software systems? Apply now and help drive the reliability of tomorrow’s AI at Articul8 AI!