Job Url: https://www.remoterocketship.com/company/sidebycare/jobs/senior-data-engineer-healthcare-data-ai-systems-united-states-remote Job Description: Wix Website LinkedIn All Job Openings Create your own professional web presence—exactly the way you want. Free Website Builder • Web Design • Mobile Websites • HTML5 Websites • Web Apps 1001 - 5000 employees Senior Data Engineer - Healthcare Data & AI Systems June 25 ⛰️ Colorado – Remote 💵 $80k - $110k / year ⏰ Full Time 🟠 Senior 🚰 Data Engineer Airflow Amazon Redshift AWS BigQuery Cloud EC2 ETL Kafka Pulsar Python PyTorch Tensorflow Apply Now Receive Emails with Similar Jobs Report problem 📋 Description • Architect and implement robust data pipelines between EMRs, internal systems, and Snowflake, ensuring scalability, reliability, and data provenance • Lead the design of warehouse schemas for multiple use cases: transactional processing, reporting (BI), and statistical/ML analysis • Define and enforce standards for data semantics, integrity, quality, lineage, and access control • Collaborate with data scientists and ML engineers to enable production-grade ML workflows (e.g., TensorFlow pipelines, model monitoring, A/B testing infrastructure) • Experiment with and support the deployment of LLMs to enable reasoning, summarization, and classification on structured and unstructured data (e.g., clinical notes) • Build monitoring and alerting around pipeline health and data trustworthiness • Integrate and normalize complex healthcare data sources (FHIR/HL7, custom APIs, third-party vendors) into a unified analytics model • Partner with engineering and product teams to deliver data-driven features, dashboards, and insights 🎯 Requirements • 5+ years of experience in data engineering or backend systems, with senior or staff-level contributions • Deep Python proficiency, with production experience in ETL, data validation, and orchestration frameworks (e.g., Airflow, Dagster, dbt) • Strong experience with data warehouse design, including star/snowflake schemas, denormalization strategies, and performance optimization • Strong understanding of data privacy and security practices, especially in healthcare (HIPAA, de-identification, audit logging, etc.) • Proven experience managing complex integrations with EMRs or clinical systems • Familiarity with LLM and ML development tools (e.g., TensorFlow, PyTorch, LangChain, transformers, vector DBs) • Experience deploying or supporting predictive models in production environments • Expertise in Snowflake or similar cloud data platforms (e.g., BigQuery, Redshift) • Strong grasp of data modeling, provenance, and semantics for analytical and AI purposes • Experience working with AWS services such as S3, Lambda, Batch, Event Bridge, Cloud Front, EC2, etc 🏖️ Benefits • Help make a new form of AI-driven virtual care available for millions of people with gut-brain conditions (like Irritable Bowel Syndrome and more than 30 other conditions) • Be a foundational contributor to a modern healthcare data stack and AI platform • Shape how LLMs and ML are responsibly deployed in real-world clinical settings • Work with a small, fast-moving, mission-driven team of engineers and clinicians • Competitive pay • Flexible remote work culture