Job Url: https://www2.jobdiva.com/portal/?a=rljdnwckmemsdoqhlhgx3mm4i19wpt03f36ca3yntxn3p4i6ji7k0ik9ye6vmpxj#/jobs/25746755?jobtitle=Senior+Generative+AI+Engineer Job Description: Senior Generative AI Engineer#25-00114 Nj, NJ Fully Remote Full Time/Contract Apply Now Share on Job Description Senior Generative AI Engineer Remote Job Summary: We are looking for a Senior Generative AI Engineer to lead the development of Proofs of Concept (POCs) and transition them into robust, scalable production-grade solutions. The ideal candidate has strong expertise in LLMs, prompt engineering, RAG, and deploying GenAI-powered applications. You'll collaborate across product, data, and engineering teams to rapidly prototype ideas and deliver AI-first features that create business impact. Key Responsibilities: • Drive end-to-end development of POCs using Generative AI models (OpenAI, Claude, Gemini, Mistral, open-source LLMs). • Translate business problems into AI-powered use cases and prototypes with clear outcomes. • Architect and build production-ready systems from validated POCs. • Implement Retrieval-Augmented Generation (RAG) pipelines, vector databases (e.g., Pinecone, FAISS, Weaviate), and embedding-based search. • Optimize prompts, model selection, fine-tuning, and response pipelines for reliability and cost-efficiency. • Build API services, microservices, or SDKs for GenAI functionalities and expose them to frontend or enterprise systems. • Evaluate open-source and proprietary models and recommend fit-for-purpose solutions. • Ensure secure, ethical, and responsible AI use in compliance with organizational and regulatory guidelines. • Collaborate closely with product managers and software engineers to integrate GenAI into real-world applications. Required Skills & Qualifications: • Strong experience with LLMs, transformer-based architectures, and NLP pipelines. • Proven track record building and deploying GenAI-powered POCs or applications. • Hands-on experience with OpenAI, Anthropic, Google Gemini, Hugging Face, Llama, etc. • Experience in Python, LangChain, LlamaIndex, or similar orchestration frameworks. • Working knowledge of vector databases, embedding models, and RAG architecture. • Cloud experience (AWS/GCP/Azure) including AI/ML services, serverless architecture, and containerization. • Familiarity with API design, backend development, and microservice architecture. • Strong understanding of model safety, cost optimization, prompt chaining, token limits, and response streaming. Preferred Skills: • Experience with fine-tuning open-source models (e.g., LLaMA, Mistral, Falcon). • Familiarity with agentic workflows (e.g., AutoGPT, CrewAI, LangGraph). • Exposure to MLOps tools (MLflow, Kubeflow, SageMaker Pipelines). • Ability to handle unstructured data (PDFs, audio, images, structured logs) and convert into usable GenAI formats.