Job Title: Senior DevOps Engineer (ex-SysOps/SRE track)
Company Name: Social Links
Job Details: Remote, Full Time
Job Url: https://hiring.cafe/viewjob/xzfz0ftwm25tpf0r
Job Description:
Posted 5d ago
United States, Remote, Full Time

Responsibilities: design infra, build infra, maintain clusters
Requirements Summary: 5+ years Linux/DevOps/SRE; 3+ years GCP; strong Kubernetes; Terraform; Ansible; Bash/Python; networking; observability; GitOps; CI/CD; English level B2+
Technical Tools Mentioned: GCP, Kubernetes, Terraform, Ansible, Python, Bash, GitLab CI, GitHub Actions, Cloud Build, Prometheus, Grafana, Loki, Tempo, OpenTelemetry, Cloud Monitoring, Cloud Logging, Helm, Kustomize, Cloud SQL, AlloyDB, Memorystore, Kafka, RabbitMQ, Elasticsearch/OpenSearch, GKE, GitOps, ArgoCD, Flux

We are a global OSINT company headquartered in the US, empowering investigators and security professionals with cutting-edge AI-powered products. Our technology collects and analyzes massive volumes of data from open sources, including social media, messengers, and the dark web, to create a comprehensive picture for data-driven investigations and decision-making. Our customers include S&P 500 companies and law enforcement agencies in 80+ countries worldwide.

Social Links is scaling rapidly, growing 2x annually, with the ambition of becoming a unicorn valued at $1B+. We are actively migrating our infrastructure from on-prem to a modern hybrid/cloud stack with a primary focus on Cloud and Kubernetes.

We’re looking for a strong Senior DevOps Engineer to lead this transformation. We need someone with deep Linux expertise, proven experience building reliable infrastructure, and a strong focus on Cloud.

Your Tasks Will Be:
Design, build, and maintain cloud infrastructure in GCP (VPC, GKE Autopilot/Standard, Cloud SQL, AlloyDB, Memorystore, Cloud Storage, Cloud Run, Artifact Registry, Cloud NAT, Cloud Armor, etc.)
Manage hybrid environments (remaining on-prem + GCP)
Deploy and operate production-grade Kubernetes clusters (GKE + on-prem k8s)
Manage all infrastructure as code using Terraform (mandatory) + Helm + Kustomize
Configuration management with Ansible (existing playbooks) while evolving to more modern practices
Ensure high availability and disaster recovery for databases and queues (Cloud SQL, AlloyDB, Memorystore Redis, managed Kafka/RabbitMQ, Elasticsearch/OpenSearch on GKE)
Build a modern observability stack: Prometheus + Grafana + Loki/Tempo + OpenTelemetry, integrated with Cloud Monitoring and Cloud Logging
Design and implement CI/CD pipelines (GitLab CI, GitHub Actions, Cloud Build)
Participate in security & compliance processes (IAM, KMS, Secret Manager, VPC Service Controls, Security Command Center, hardening)
Join the on-call rotation (we are building an SRE culture)
Mentor mid/junior engineers and participate in architecture reviews

Our Ideal Candidate Has:
5+ years of Linux system administration / DevOps / SRE experience
3+ years of hands-on production experience with Google Cloud Platform (GCP)
Deep expertise with Kubernetes (GKE is mandatory; CKA/CKAD/CKS certification is a big plus)
Strong proficiency in Terraform (complex modules, state management, remote backends, workspaces)
Solid experience writing and maintaining Ansible roles/collections
Strong scripting skills: Bash + Python 3 (mandatory)
In-depth networking knowledge (VPC, subnets, firewall rules, Cloud NAT, Cloud Armor, Private Service Connect, Hybrid Connectivity)
Hands-on experience with observability stacks (Prometheus/Grafana/Alertmanager + Cloud Operations Suite)
Understanding of GitOps practices (ArgoCD / Flux is a plus)
Proven experience building and supporting CI/CD pipelines in GitLab CI or Cloud Build
English: B2 or higher

Nice To Haves:
GCP Professional Cloud DevOps Engineer or Professional Cloud Architect certification
Experience leading large-scale on-prem → GCP migrations
Istio / Anthos Service Mesh
OPA/Gatekeeper or other policy-as-code tools
Production experience with HashiCorp Vault or Google Secret Manager
Chaos Engineering practices
Real-world SELinux policy writing
Experience with XCP-ng / Proxmox (small on-prem footprint remains)
Russian: advanced level or higher

What We Offer:
Fully remote position (excluding Russia and Belarus)
Stock options and long-term incentives
Clear and transparent career progression to Lead DevOps / SRE / Cloud Architect
Learning & certification budget (including GCP Professional certifications)
Modern toolchain and real influence on product architecture
Genuinely complex and interesting challenges in OSINT and AI

We are an equal-opportunity employer and are committed to fostering a diverse and inclusive environment for all candidates.