Job Title: Machine Learning Engineer

Company Name: Stack

Job Url: https://boards.greenhouse.io/embed/job_app?token=5033361007&utm_source=jobright&jr_id=69743ce137785856350e36b3

Job Description: About the Role:

In the ML Data Understanding team, our mission is to provide trusted and useful data to efficiently power all of Stack's ML applications end-to-end from mining to training to safety evaluation. We work hand in hand with AV autonomy teams to provide cutting edge solutions to all their data needs, working across data engineering, mining, modeling and infrastructure. In particular, we provide services to find (data mining), curate (datasets), annotate (data labeling), search and serve (high throughput data access) data for all ML needs.

Data Mining: We are building a framework and infrastructure to find interesting events quickly and flexibly. As part of this mission, you would be setting the direction for and helping us build an inference service using LLMs, open-world models and vector databases.
Semantic Search for Data Mining: We are building the infrastructure of a highly scalable semantic search service for multimodal data to find interesting events quickly and flexibly. As part of this mission, you would be setting the direction for and helping us build an inference service using the latest AI models & approaches. 
Dataset management for training: We are building state of the art infrastructure to support machine learning training and inference workloads using OSS components such as Ray, Spark, Lance and Iceberg.
Responsibilities:

Build state-of-art multimodal data mining and semantic search solutions to power AV product development.
Develop data understanding platform infrastructure for real-time querying/vector databases and batch/stream processing using technologies like Ray, Spark, Lance, or similar.
Deliver end-to-end data mining solutions that span onboard (C++) and offboard (ML & Data Infra) infrastructure to accelerate AV product development. 
Develop e2e solution for real-time semantic search services (text/images/videos) and vector DBs.
Discover and identify key issues in existing ML infra and proactively improve system performance.
Build low latency/high throughput batch or stream processing pipelines.
Drive technical discussions across multiple orgs and deliver solutions on a timely basis.
Architect and tune ETL pipelines to maximize GPU/CPU/Ram utilization.
Write readable and high-performance Python/C++ code.
Qualifications: 

Experience with both ML platforms and building ML-based applications (modeling experience is a bonus).
Proven track record of building scalable, reliable infrastructure in a fast-paced environment.
Ability to collaborate effectively across teams.
Experience building or using ML infrastructure for a large number of customer teams.
Deep understanding of design trade-offs with the ability to articulate those trade-offs and achieve alignment with others.
Experience in building ML models or infrastructure in domains such as autonomous