Job Url: https://hiring.cafe/?searchState=%7B%22dateFetchedPastNDays%22%3A14%2C%22searchQuery%22%3A%22ai%22%2C%22locations%22%3A%5B%7B%22id%22%3A%22FxY1yZQBoEtHp_8UEq7V%22%2C%22types%22%3A%5B%22country%22%5D%2C%22address_components%22%3A%5B%7B%22long_name%22%3A%22United+States%22%2C%22short_name%22%3A%22US%22%2C%22types%22%3A%5B%22country%22%5D%7D%5D%2C%22formatted_address%22%3A%22United+States%22%2C%22population%22%3A327167434%2C%22workplace_types%22%3A%5B%22Remote%22%5D%2C%22options%22%3A%7B%22flexible_regions%22%3A%5B%22anywhere_in_continent%22%2C%22anywhere_in_world%22%5D%7D%7D%5D%2C%22securityClearances%22%3A%5B%22None%22%5D%7D Job Description: Senior AI Engineer @ PubNub View All Jobs Website United States $149k-$200k/yr Remote Full Time Responsibilities: architecting services, building pipelines, delivering tooling Requirements Summary: 5+ years in production systems; 1+ year AI/LLM features; scaling services; ML tooling expertise; containerization; strong English communication. Technical Tools Mentioned: Python, TypeScript, Rust, CUDA, TensorRT, vLLM, Docker, Kubernetes, PyTorch, TensorFlow, Transformers, vector search, CI/CD, CLI Save Mark Applied Hide Job Report & Hide Job Description Copy Job Description About PubNub PubNub powers the world’s most engaging real-time experiences—chat, live updates, and interactive applications—for over 2,000 companies including Verizon, Autodesk, Zillow, and Dropbox. Our global data network processes trillions of messages each month with sub-100 ms latency across 15+ data centers. Backed by $130M in funding, we’re shaping the future of how the world connects. We’re now building something new: an intelligence layer that lets developers weave large language models (LLMs) and deep-learning pipelines directly into high-speed streams. We believe AI should be as real-time as the data it reasons about, and we’re hiring founding engineers to make that vision real. The Role As a Senior AI Engineer, you’ll architect and build cloud-native services that combine PubNub’s real-time streams with state-of-the-art AI. From retrieval-augmented generation and low-latency inference to developer tooling, you’ll create the foundation of PubNub’s intelligence platform. This is a greenfield opportunity to define architecture, drive scale, and deliver AI capabilities that power products across industries. What You’ll Do Architect and build services that fuse real-time data streams with NLP, moderation, recommendation, and custom models Own the full ML lifecycle: pipelines, fine-tuning, evaluation, packaging, inference, and observability Develop internal tooling (SDKs, CLI, CI/CD hooks) so teams can add AI with a single API call Optimize for sub-100 ms inference at global scale using CUDA, TensorRT, vLLM, Rust, and caching strategies Partner with product and solution architects to deliver reusable AI capabilities: content safety, sentiment, personalization, anomaly detection, and more Champion responsible AI practices: robust evaluation, guardrails, governance, and transparent feedback loops What We’re Looking For Required 5+ years building production systems in Python, TypeScript, or Rust (ideally more than one) 1+ year delivering AI/LLM features to external users Experience scaling services beyond 100k req/s or equivalent event volumes Deep knowledge of ML tooling (PyTorch/TensorFlow, transformers, vector search, distributed training, experiment tracking) Strong containerization/orchestration skills (Docker, Kubernetes) Comfortable using AI coding copilots as part of your workflow Excellent written and verbal English communication Preferred Experience with streaming platforms (Kafka, Kinesis, Redpanda) Hands-on work with cloud AI services (AWS Bedrock, GCP Vertex, Azure OpenAI) Knowledge of low-level performance tuning (CUDA kernels, SIMD, memory profiling) Why Join PubNub Build a greenfield intelligence platform at internet scale Ship features that land directly in customer-facing products across healthcare, fintech, gaming, and streaming Competitive Compensation in the range of USD149000-200000 Remote-friendly culture with Open PTO Equity in a profitable, fast-growing infrastructure company A team that values craftsmanship over ego If you’re excited to make AI real-time, we’d love to hear from you.