Job Title: Founding Engineer (Full-Stack - Platform Control Plane & Cloud Infrastructure) Company Name: Buildfunctions Job Url: https://www.simplyhired.com/job/4VZKX86ZPteAfzCsh2Vws1Qq_XolsZxI_q-1erwqduh4kPW5sdC_Yw Job Description: Founding Engineer (Full-Stack - Platform Control Plane & Cloud Infrastructure) Buildfunctions Remote Job Details Full-time Qualifications GPU programming Cloud identity and access management (IAM) Performance tuning Node.js Continuous Delivery (CD) implementation Load balancers Amazon EC2 Startup experience Firmware System design Scalable systems OS Kernels AWS Incident response Docker Virtualization SDKs Managing IT infrastructure Scalability Developing and maintaining backend systems Web applications S3 Serverless cloud services Linux Distributed computing Senior level AI Leadership TypeScript Linux administration Shell Scripting Virtual Private Clouds Debugging Route 53 AWS Lambda Full Job Description Company: Buildfunctions provides a simple solution to serverless computing, enabling developers to build, deploy, and scale serverless functions designed for AI at a rapid pace — all without worrying about infrastructure or servers. Role: We are seeking visionary Founding Engineers to join our core team. This is an equity-based role with the potential for a salary upon successful fundraising. We’re looking for individuals with deep technical expertise and a proven track record of building serverless compute products — engineers who excel in fast-paced startup environments and are driven to shape the future of AI compute. This role owns the control plane and application-level infrastructure of the platform. It focuses on orchestration, correctness, scalability, and reliability of CPU/GPU workloads across a distributed cloud system, sitting between customer-facing surfaces (SDK, Web App, APIs) and the low-level execution layer (MicroVMs, bare metal, performance systems). Responsibilities - Own the platform control plane, including backend services responsible for job orchestration, lifecycle management, request routing, and coordination between customer requests and CPU/GPU execution. - Design and maintain backend APIs (REST and streaming) that power the SDK, compute infrastructure, web app, and internal services, ensuring correctness, observability, and backward compatibility. - Build and operate Node.js/TypeScript services that manage job submission, scheduling signals, capacity-aware admission, scaling decisions, retries, and failure handling across a distributed system. - Own cloud infrastructure glue, including AWS resources such as EC2, EFS, autoscaling groups, AMIs, networking, Route 53, IAM, load balancers, and service-to-service connectivity. - Implement and maintain autoscaling logic, capacity planning signals, and coordination with low-level infrastructure systems to ensure reliability under variable load. - Ensure platform reliability and correctness, including error handling, retries, idempotency, consistency guarantees, and graceful degradation during partial failures. - Build observability into the platform, including structured logging, metrics, tracing hooks, and operational visibility for debugging and incident response. - Collaborate closely with the MicroVM Infrastructure team, consuming low-level capabilities (VM lifecycle, startup readiness, execution signals) without owning kernel, hypervisor, or image internals. - Participate in on-call and incident response, diagnosing platform-level issues across services, infrastructure, and cloud environments. Qualifications: - Strong backend engineering experience with Node.js and TypeScript, including building production systems that handle concurrency, streaming data, and long-running processes. - Experience designing and operating distributed backend systems, with a strong understanding of failure modes, retries, backpressure, idempotency, and state management. - Proven experience working with AWS infrastructure and infrastructure-as-code (Pulumi or equivalent). - Experience building and operating control plane systems that coordinate work across multiple services or compute nodes. - Strong understanding of cloud-native operational concerns, including deployment pipelines, CI/CD, environment promotion, rollbacks, and observability. - Comfortable debugging complex production issues across application code, cloud infrastructure, and distributed services. - Solid Linux fundamentals and systems intuition, without needing to be a kernel or performance specialist. Industry Insight: AI and machine learning are advancing at an unprecedented speed. We are looking for someone ready to move quickly, as there are excellent industry opportunities happening in the near future. Compensation: Equity stake based on experience, contributions, and other factors. Details will be shared during interviews. Salaried positions are expected once funding is secured. Apply Now: If you're an experienced builder with a strong vision for the future of serverless computing and AI, then we invite you to connect with us! Job Type: Full-time People with a criminal record are encouraged to apply Application Question(s): Are you comfortable working in a remote setting? Have you worked in a fast-paced, early-stage startup environment? Have you built or worked on applications that stream responses? Rate your expertise (1–10, with the 10 highest) working with cloud and AWS services, including SDK, EC2, EFS, Lambda, ALB, VPC, Route 53, and IAM. Rate your expertise (1–10, with the 10 highest) with shell scripting. Will you be able to commit your full focus to Buildfunctions, without other jobs, once funding is achieved and a stage-appropriate salary is provided? Will you be able to commit your full focus to Buildfunctions, without other jobs, in the near future, before funding is achieved? Will you be able to jump in and start right away if selected for the position? If you can’t start immediately, how many weeks would you need after the interview process is complete and you’re selected? Do you use AI as a tool while coding in your day-to-day work? Do you have experience leading a technical team? Do you currently hold a visa or work authorization that requires you to maintain employment (e.g., STEM OPT, student, or temporary visas)? Rate your expertise (1–10, with the 10 highest) with Node.js. Rate your expertise (1–10, with the 10 highest) with virtual machine monitors like Firecracker or alternatives. Work Location: Remote