Job Title: Staff Machine Learning Engineer
Company Name: Terra AI
Job Details: Remote, Full Time
Job Url: https://hiring.cafe/viewjob/fbc1wnw8w91w02rp
Job Description:

Posted 2d ago
Location: Redwood City, California, United States (Remote, Full Time)

Responsibilities: design diffusion, train models, adapt diffusion
Requirements Summary: Lead development of generative diffusion models for 3D geoscience data; expert in PyTorch and large-scale DL; strong data curation and software engineering skills.
Technical Tools Mentioned: PyTorch, Diffusion models, Transformers, Cloud computing, GPU clustering

About Terra AI

We are building the state-of-the-art AI platform for the discovery and development of clean energy and mineral resources. We bring the most advanced techniques in generative AI, foundation modeling, and autonomous decision optimization to tackle the most important problems in the geosciences. These systems can help more reliably identify critical resource deposits, more rapidly measure and characterize them, and design more efficient and sustainable production plans.

We are backed by Khosla Ventures and other leading venture investors. We are now looking to grow our team from ~15 to ~30 by the end of the year to continue to mature our technology and support deployment with our world-class mineral and clean energy partners.

Role description

In the same way image generators have shown the remarkable ability to produce a diverse set of realistic pictures conditioned on a text prompt (and other inputs), we are developing a generative model that produces 3D geological models conditioned on geophysical surveys, borehole measurements, and other forms of physical observation. The outputs of the generative model capture what we know and don’t know about the state of the subsurface, allowing explorers to make maximally informed decisions about how and where to explore for critical resources.
We are looking for a talented deep learning engineer or scientist to lead the development of this model that will revolutionize decision making in the earth subsurface for a wide range of clean energy applications.

Role Responsibilities

- Design, train, test, and iterate on diffusion models for 3D geological models
- Design, train, test, and iterate on an approach for conditioning generation on geophysical data and other observations
- Inform the generation of synthetic data to improve model performance
- Adapt the diffusion modeling approach to specific real-world projects in collaboration with project teams

Qualifications

Required Qualifications:

- Extensive PyTorch Experience: Deep understanding of PyTorch, including writing custom modules, optimizing training, and debugging issues in large-scale models.
- Expertise in Developing Large Deep Learning Models from Scratch: Proven ability to design, implement, and train complex deep learning architectures from the ground up.
- Data Curation Skills: Hands-on experience in creating, cleaning, and maintaining high-quality datasets tailored for machine learning applications.
- Strong Software Engineering and Design Experience: Proficient in software development best practices, including version control, testing, and code optimization. Familiarity with designing scalable and maintainable systems.

Bonus points if you have:

- Experience with Generative Models: Familiarity with generative architectures, particularly diffusion models, and an emphasis on posterior sampling methods.
- Knowledge of Transformer Architectures: Experience building and training transformers, especially in applications involving 3D data.
- Scaling Models Across Large GPU Clusters: Expertise in parallelizing models across multiple GPUs and optimizing distributed training pipelines.
- Cloud Infrastructure Expertise: Experience setting up, managing, and optimizing cloud environments for machine learning workloads, including provisioning resources and managing costs.