Job Url: https://www.linkedin.com/jobs/search/?currentJobId=4361608219&distance=25.0&f_AL=true&f_TPR=r86400&f_WT=2&geoId=103644278&keywords=software%20engineer&origin=JOB_SEARCH_PAGE_JOB_FILTER
Job Description:
SRE/DevOps Engineer
ScitiX · California, United States (Remote) · Full-time
Posted 8 hours ago · Over 100 applicants
Hiring team: May Wu, Hiring Manager (job poster)

About the Company
ScitiX is a next-generation provider of intelligent computing infrastructure, dedicated to empowering artificial intelligence innovation through high-performance, scalable solutions. Built on deep technical expertise and large-scale cluster operation experience, ScitiX offers a full-stack platform spanning IaaS, PaaS, and MaaS, delivering end-to-end support for AI development workflows.

About the Role
Responsible for Kubernetes deployment, daily operation and maintenance, and troubleshooting of each training cluster.

Responsibilities
Design and develop monitoring and automation features for the cluster management platform, and continuously improve cluster management and control capabilities.
Assist in analyzing and troubleshooting issues related to cluster containers, operating systems, networks, storage, etc.
Manage the cluster quota of each business, analyze utilization rates, and carry out subsequent capacity planning.
Participate in operation and maintenance on-call duty, promptly handle faults, and respond to user issues and requirements.

Qualifications
Bachelor's degree or above in computer science or a related major.
3+ years of industry experience, including solid Linux platform operation, maintenance, and debugging skills, with proficiency in troubleshooting, configuration optimization, and performance analysis.
Proficient in at least one programming language such as Python, Go, or Shell.
Familiar with the Kubernetes architecture, with an understanding of the functional characteristics of each component and extensive practical experience in deploying and optimizing Kubernetes CNI, CSI, LB, etc.
Experience in large-scale training cluster construction and optimization is preferred.

Preferred Skills
Good communication and coordination skills.
Demonstrated independent thinking and troubleshooting skills.