Job Url: https://www.linkedin.com/jobs/search/?currentJobId=4327876307&distance=25&f_TPR=r86400&f_WT=2&geoId=103644278&keywords=software%20engineer&origin=JOB_SEARCH_PAGE_JOB_FILTER&refresh=true&sortBy=R&start=925
Job Description:
Founding Data Engineer
Miraei AI · United States (Remote)
14 hours ago · 97 applicants
$150K/yr - $180K/yr · Remote · Full-time
Hiring team: Davin Cho, Founder at Miraei AI | Helping Clinical Trial Vendors Win Biopharma Partnerships (job poster)

About the job
Founding Data Engineer, Clinical Trials and Oncology Data
Location: Hybrid, San Francisco and/or Los Angeles
Experience: 3 to 7 years
Type: Full-time
Stage: Early, founding engineering hire

About Miraei
Miraei is building the deal engine for life sciences. Business development in life sciences is still driven by fragmented data, manual research, and slow, relationship-heavy workflows. Miraei changes that by structuring and continuously tracking clinical trials and scientific data, then transforming it into actionable intelligence that powers how deals are identified, evaluated, and executed. We start by helping vendors and diagnostics companies identify and engage the right biopharma partners around active and emerging clinical trials.
Over time, Miraei becomes the platform where life sciences deals occur end to end, from vendors to biopharma, biopharma to biotechs, and cross-border partnerships such as biopharma seeking assets and collaborators internationally. We are venture-backed and are generating revenue from enterprise customers.

The role
We are hiring a Founding Data Engineer to design and own the core data architecture, pipelines, and processes that power Miraei. This role is responsible for building the canonical data models for clinical trial intelligence and ensuring our data pipelines are scalable and reliable as we ingest more sources and trials and send out real-time updates.

This is a hands-on individual contributor role. You will write production code, make architectural decisions, and shape the long-term data foundation of the company.

What you will do
- Design and implement core data schemas for clinical trial data and data sources related to clinical assets, including:
  - Trials, arms, cohorts, endpoints, biomarkers, sponsors, and timelines
  - Longitudinal versioning across abstracts, amendments, and readouts
  - Press releases, news, and publications
- Build hierarchical taxonomies and ontologies for oncology and clinical research:
  - Indications, modalities, mechanisms of action, biomarkers, endpoints
- Architect and maintain data ingestion pipelines from:
  - Conference abstracts
  - Clinical trial registries
  - Publications and structured internal outputs
- Enable longitudinal tracking and alerting as trials evolve over time
- Partner closely with product and ML to ensure the data model supports downstream reasoning and user workflows
- Make pragmatic early-stage tradeoffs and evolve the system as the company scales

What we’re looking for
- 3 to 7 years of experience as a data engineer or analytics engineer
- Prior experience working with clinical trial or life sciences data strongly preferred
  - Pharma, biotech, diagnostics, CRO, real-world data, or clinical informatics
- Startup experience required
  - You have built systems in ambiguous, fast-moving environments
- Strong fundamentals in:
  - Database design (OLTP/OLAP), data modeling, metadata management, and schema design
  - Building reliable ETL/ELT pipelines, data integration, transformation, validation, and orchestration
  - SQL/Python/Bash scripting
  - Cloud-based data infrastructure (AWS/GCP)
- Experience with modern software development tools, such as version control (git), automation and CI/CD (GitHub Actions, Jenkins, etc.), and Docker containerization
- Comfortable owning systems end to end as a senior IC
- Clear communicator who can explain tradeoffs and push back when needed
- Must be authorized to work in the United States. Visa sponsorship is not available for this position.

Nice to have
- Oncology domain expertise or familiarity
- Experience with ontologies, RAG/knowledge graphs, vector databases, or other information retrieval systems
- Exposure to ML feature pipelines, context engineering, prompt engineering, and other AI-adjacent systems