Job Title: Senior DevOps Engineer (ex-SysOps/SRE track)

Company Name: Social Links

Job Details: RemoteFull,Time

Job Url: https://hiring.cafe/viewjob/xzfz0ftwm25tpf0r

Job Description: Posted 5d agoSenior DevOps Engineer (ex-SysOps/SRE track)@ Social LinksView All JobsWebsiteUnited StatesRemoteFull TimeResponsibilities:design infra, build infra, maintain clustersRequirements Summary:5+ years Linux/DevOps/SRE; 3+ years GCP; strong Kubernetes; Terraform; Ansible; Bash/Python; networking; observability; GitOps; CI/CD; English level B2+Technical Tools Mentioned:GCP, Kubernetes, Terraform, Ansible, Python, Bash, GitLab CI, GitHub Actions, Cloud Build, Prometheus, Grafana, Loki, Tempo, OpenTelemetry, Cloud Monitoring, Cloud Logging, Helm, Kustomize, Cloud SQL, AlloyDB, Memorystore, Kafka, RabbitMQ, Elasticsearch/OpenSearch, GKE, GitOps, ArgoCD, Flux
We are a global OSINT company headquartered in the US, empowering investigators and security professionals with cutting-edge AI-powered products. Our technology collects and analyzes massive volumes of data from open sources, including social media, messengers, and the dark web, to create a comprehensive picture for data-driven investigations and decision-making.
Our customers include S&P 500 companies and law enforcement agencies in 80+ countries worldwide. Social Links is scaling rapidly, growing 2x annually, with the ambition of becoming a unicorn valued at $1B+.
We are actively migrating our infrastructure from on-prem to a modern hybrid/cloud stack with a primary focus on Cloud and Kubernetes. We’re looking for a strong Senior DevOps Engineer to lead this transformation. We need someone with deep Linux expertise, proven experience building reliable infrastructure, and a strong focus on Cloud.

Your Tasks Will Be:

Design, build, and maintain cloud infrastructure in GCP (VPC, GKE Autopilot/Standard, Cloud SQL, AlloyDB, Memorystore, Cloud Storage, Cloud Run, Artifact Registry, Cloud NAT, Cloud Armor, etc.)
Manage hybrid environments (remaining on-prem + GCP)
Deploy and operate production-grade Kubernetes clusters (GKE + on-prem k8s)
Manage all infrastructure as code using Terraform (mandatory) + Helm + Kustomize
Configuration management with Ansible (existing playbooks) while evolving to more modern practices
Ensure high availability and disaster recovery for databases and queues (Cloud SQL, AlloyDB, Memorystore Redis, managed Kafka/RabbitMQ, Elasticsearch/OpenSearch on GKE)
Build a modern observability stack: Prometheus + Grafana + Loki/Tempo + OpenTelemetry, integrated with Cloud Monitoring and Cloud Logging
Design and implement CI/CD pipelines (GitLab CI, GitHub Actions, Cloud Build)
Participate in security & compliance processes (IAM, KMS, Secret Manager, VPC Service Controls, Security Command Center, hardening)
Join the on-call rotation (we are building an SRE culture)
Mentor mid/junior engineers and participate in architecture reviews


Our Ideal Candidate Has:

5+ years of Linux system administration / DevOps / SRE experience
3+ years of hands-on production experience with Google Cloud Platform (GCP)
Deep expertise with Kubernetes (GKE is mandatory; CKA/CKAD/CKS certification is a big plus)
Strong proficiency in Terraform (complex modules, state management, remote backends, workspaces)
Solid experience writing and maintaining Ansible roles/collections
Strong scripting skills: Bash + Python 3 (mandatory)
In-depth networking knowledge (VPC, subnets, firewall rules, Cloud NAT, Cloud Armor, Private Service Connect, Hybrid Connectivity)
Hands-on experience with observability stacks (Prometheus/Grafana/Alertmanager + Cloud Operations Suite)
Understanding of GitOps practices (ArgoCD / Flux is a plus)
Proven experience building and supporting CI/CD pipelines in GitLab CI or Cloud Build
English — B2 or higher


Nice To Haves:

GCP Professional Cloud DevOps Engineer or Professional Cloud Architect certification
Experience leading large-scale on-prem → GCP migrations
Istio / Anthos Service Mesh
OPA/Gatekeeper or other policy-as-code tools
Production experience with HashiCorp Vault or Google Secret Manager
Chaos Engineering practices
Real-world SELinux policy writing
Experience with XCP-ng / Proxmox (small on-prem footprint remains)
Russian – advanced level or higher


What We Offer:

Fully remote position (excluding Russia and Belarus)
Stock options and long-term incentives
Clear and transparent career progression to Lead DevOps / SRE / Cloud Architect
Learning & certification budget (including GCP Professional certifications)
Modern toolchain and real influence on product architecture
Genuinely complex and interesting challenges in OSINT and AI

We are an equal-opportunity employer and are committed to fostering a diverse and inclusive environment for all candidates.