Company Name: Twelve Labs

Job Details: $145-182k plus equity; Kubernetes, AWS, GCP, Go, Azure, REST API, Golang, Docker, Prometheus, Grafana, Spring Boot; Senior and Expert level; San Francisco Bay Area; Remote from US

Job Url: https://app.welcometothejungle.com/jobs/cNK-7h2m?theme=take-another-look

Job Description:

Role

Who you are

Experience: 5+ years of backend engineering experience, with a proven track record of designing and delivering scalable web services and APIs
API Expertise: Advanced proficiency in designing and implementing RESTful APIs, adhering to OpenAPI/Swagger specifications, with experience in modern frameworks (e.g., Go’s Gin or Echo, Spring Boot, or similar)
Technical Expertise: Deep expertise in service-oriented architecture (SOA), microservices, and distributed systems, with strong knowledge of scalable database design (e.g., relational, NoSQL) and effective use of event-driven architecture
Cloud Proficiency: Extensive experience with cloud-native development and deployment on platforms like AWS, GCP, or Azure, leveraging tools such as Docker, Kubernetes, or serverless frameworks to ensure scalability and resilience
Analytical & Collaborative Skills: Strong first-principles thinking to address complex technical challenges, combined with effective communication skills and a collaborative approach to working with cross-functional teams

Even if there are a few checkboxes that aren’t ticked through your prior experience, we still encourage you to apply!
If you are a 0-1 achiever, a ferocious learner, and a kind and fun team player who motivates others, you will find a home at Twelve Labs.

Desirable

AI/ML Familiarity: Strong understanding of AI/ML concepts, particularly related to video analysis (e.g., object detection, motion tracking, or video summarization), and experience integrating backend systems with AI models or data pipelines
Video Technology Experience: Hands-on knowledge of video-specific tools and frameworks (e.g., FFmpeg, AWS Media Services) to support video processing workflows
Startup Agility: Experience thriving in fast-paced startup environments, with a demonstrated ability to adapt quickly and deliver results with agility
Go Proficiency: Proficiency with Go (Golang) and its ecosystem, aligning with team preferences
DevOps Practices: Exposure to CI/CD pipelines and observability tools (e.g., Prometheus, Grafana) for building and monitoring scalable systems

What the job involves

As a Senior Product Backend Engineer at Twelve Labs, you’ll architect scalable APIs and systems to power our AI video platform. You’ll collaborate with cross-functional teams to integrate video foundation models, optimizing for performance and adaptability in a dynamic startup environment.

Design and implement scalable RESTful APIs adhering to OpenAPI specifications, powering features like video search, generation, and embedding, integrated with model inference pipelines
Architect high-throughput, service-oriented backend systems to support enterprise-grade SaaS solutions for diverse customers, leveraging cloud-native tools (e.g., AWS, GCP, Azure)
Optimize performance and reliability of distributed systems, processing large-scale video data with low latency and high availability
Collaborate with cross-functional teams (product managers, frontend engineers, AI/ML teams) to deliver end-to-end video solutions
Apply video-specific technologies (e.g., encoding, transcoding, streaming, metadata extraction) to enhance product capabilities and meet strategic goals

Company

Company benefits

Full health, dental, and vision benefits
Extremely flexible PTO and parental leave policy. Office closed the week of Christmas and New Year's
Remote-flexible, offices in San Francisco and Seoul and coworking stipend
VISA support (such as H1B and OPT transfer for US employees)

Funding (last 2 of 7 rounds)

Oct 2025: $3m (Early VC)
Dec 2024: $30m (Early VC)
Total funding: $110.2m

Our take

Developing an algorithm that can understand text or images is (relatively) straightforward. However, the challenge escalates when it comes to understanding video, where these modes merge with audio, and context becomes much harder to grasp.

Twelve Labs has developed a machine learning solution that tackles this challenge by making the inner content of videos both indexable for developers and highly searchable for users. This technology could prove immensely valuable, with use cases extending far beyond simple searchability for end users. It could be employed for more accurate monitoring of community guidelines on social media, enterprise knowledge searches, and a deeper understanding of the value of video content.

With recent funding under its belt, Twelve Labs is set for rapid growth. The investment will drive R&D, nearly double its workforce, and advance its video understanding technology. As it expands, Twelve Labs is well-positioned to lead the future of multimodal AI and revolutionize how organizations extract value from video content.

Steph, Company Specialist at Welcome to the Jungle