Company Name: FuriosaAI

Job Details: 9,Locations,In-Office,or,Remote,Junior

Job Url: https://builtin.com/job/ai-software-engineer-platform-software/7771026

Job Description: About the JobFuriosaAI is looking for passionate AI Software Engineers to join our Platform Team. You will participate in the research and development of models optimized for our NPU accelerator.Our team builds the production-grade, streamlined AI software that makes up our SDK. This includes the runtime, LLM serving framework, and PyTorch models/extensions.Your work on these critical parts of the SDK will directly enable AI developers to efficiently deploy optimized AI models on FuriosaAI NPUs.ResponsibilitiesDevelop and optimize DNN model implementations in PyTorch for FuriosaAI's Tensor Contraction Processor (TCP) architectureAnalyze the features, implementations, CUDA and Triton kernels of existing AI model inference frameworks such as vLLM, TensorRT-LLM, and DeepSpeed-MIIResearch and implement generative AI models, parallelism strategies, and inference techniques to improve performance and efficiencyCollaborate closely with the compiler team to optimize and enable models.Minimum QualificationsBS degree in Computer Science, Engineering, or a related field, or equivalent industry experienceProficiency in Python programming skillExperience in developing AI models in DNN frameworks (e.g., PyTorch)Solid understanding of machine learning, deep learning, natural language processing (NLP), and/or generative AI modelsStrong communication skills with the ability to collaborate effectively across cross-functional teamsPreferred QualificationsHands-on experience with PyTorch 2.0 technologies (e.g., TorchDynamo) or DNN compiler technologies, such as Triton and MLIRProficiency in C++/CUDA or Rust programming skillsHands-on experience deploying and optimizing large-scale ML models in productionHands-on experience in model training and fine-turning of pre-trained modelsExperience in LLM inference frameworks: vLLM, TensorRT-LLM, and DeepSpeed-MIIStrong background in model quantizations and model evaluationsStrong background in machine learning, generative AI, and model evaluation techniquesProven track record of contributing to open-source projectsContactywkim@furiosa.ai