Job Title: Lead Data Engineer
Company Name: Banner Health
Job Url: https://www.simplyhired.com/job/vfavWby80lowYthhEiUt3oiJdF9_Mof6KVdKQs7NX5x3t-TT5mprPg
Job Description: Lead Data Engineer Banner Health New York, NY
Job Details Full-time $53.63 - $89.38 an hour 5 hours ago
Benefits Health insurance
Qualifications Data model design Performance tuning Databricks Azure 6 years Bachelor's degree in information technology Computer Science Data lake Continuous Delivery (CD) implementation Data modeling projects Infrastructure as Code (IaC) DevOps Spark Data reporting Scalable systems HL7 Java SQL Data quality management EMR/EHR Data Architecture Design (Architecture design skills) AWS Solution architecture design Bachelor's degree IT experience within healthcare Machine learning Mentoring Cloud Native Design Scalability Metadata Distributed computing Business requirements Real-time data processing implementation Senior level AI Batch data processing Cross-functional collaboration Bachelor's degree in computer science Implementing cost-saving initiatives Communication skills Project stakeholder communication Python Generative AI Cross-functional communication Bachelor's degree in data science Metadata management Data Science Analytics Information Technology Database software proficiency
Full Job Description
Department Name: Digital Transform-Foundation
Work Shift: Day
Job Category: Information Technology
Estimated Pay Range: $53.63 - $89.38 / hour
Banner Health is committed to pay equity and transparency. The posted compensation range is a reasonable estimate that extends from the lowest to the highest pay Banner Health in good faith believes it might pay for this particular job, based on the circumstances at the time of posting. This range is based on possible base salaries and does not include the value of our total rewards package.
Actual pay determined at offer will be based on years of relevant work experience, education, certifications, skills, and geographic location, along with a review of current employees in similar roles to ensure pay equity is achieved and maintained.
Banner Health was named to Fortune’s Most Innovative Companies in America 2025 list for the third consecutive year and named to Newsweek's list of Most Trustworthy Companies in America for the second year in a row. We’re proud to be recognized for our commitment to the latest health care advancements and excellent patient care.
At Banner Health, data is central to how we improve patient experiences, clinical outcomes, operational performance, and decision-making across the enterprise. The Lead Data Engineer helps design, build, and scale our next-generation data platform on the cloud. This role will lead the engineering of reliable, secure, and scalable data products and pipelines that power analytics, reporting, AI, and operational insights across clinical and business domains. You will work closely with architects, analysts, data scientists, product teams, and business leaders to modernize how data is ingested, governed, transformed, and consumed across the organization.
Your pay and benefits (Total Rewards) are important components of your Journey at Banner Health. Banner Health offers a variety of benefit plans to help you and your family. We provide health and financial security options, so you can focus on being the best at what you do and enjoying your life.
Within Banner Health Corporate, you will have the opportunity to apply your unique experience and expertise in support of a nationally-recognized healthcare leader. We offer stimulating and rewarding careers in a wide array of disciplines. Whether your background is in Human Resources, Finance, Information Technology, Legal, Managed Care Programs or Public Relations, you'll find many options for contributing to our award-winning patient care.
POSITION SUMMARY
This position helps to design, build, and scale Banner Health’s next-generation data platform on the cloud. This role leads the engineering of reliable, secure, and scalable data products and pipelines that power analytics, reporting, AI, and operational insights across clinical and business domains. Works closely with architects, analysts, data scientists, product teams, and business leaders to modernize how data is ingested, governed, transformed, and consumed across the organization.
CORE FUNCTIONS
1. Designs, builds, and optimizes scalable batch and streaming data pipelines for enterprise analytics and operational use cases. Contributes to the evolution of the enterprise data platform to support advanced analytics, self-service consumption, and AI/ML use cases.
2. Leads the development of curated, high-quality data products across lakehouse, warehouse, and domain-oriented data architectures.
3. Builds and enhances cloud-native data solutions using modern platforms such as Databricks, Spark, Delta Lake, and AWS services.
4. Establishes and enforces engineering standards for code quality, testing, CI/CD, observability, lineage, and documentation.
5. Drives best practices for data quality, schema evolution, performance tuning, reliability, and cost optimization.
6. Partners with architecture, governance, security, and analytics teams to implement trusted and compliant data solutions.
7. Supports ingestion and transformation of complex healthcare and enterprise data sources, including structured, semi-structured, and high-volume event data.
8. Mentors engineers, provides technical leadership, and contributes to solution design, estimation, and delivery planning.
9. Translates business and operational requirements into scalable technical designs and production-ready data pipelines.
MINIMUM QUALIFICATIONS
Must possess strong knowledge of data engineering and analytics as normally obtained through the completion of a Bachelor's degree in Data Science, Computer Science, Information Technology or a related field.
Must have 6+ years of experience in data engineering, big data, analytics engineering, or data platform development.
Must have 3+ years in a senior or lead engineering role driving architecture, standards, and delivery across large-scale data environments.
Must have strong hands-on experience with Databricks, Apache Spark, Delta Lake, and cloud-based data platforms on AWS, Azure, or GCP.
Deep expertise in SQL and strong programming skills in Python and/or Java.
Experience building and operating large-scale distributed data systems, including data lakes, lakehouses, warehouses, or mesh-oriented platforms.
Must have a strong understanding of data modeling, partitioning, storage design, metadata management, and performance optimization.
Experience implementing data quality, lineage, observability, and operational monitoring in production environments.
Familiarity with orchestration, DevOps, and CI/CD practices for data platforms.
Must have strong communication skills and the ability to work effectively with technical and non-technical stakeholders.
Proven ability to balance multiple priorities in a fast-paced environment while maintaining high engineering standards.
PREFERRED QUALIFICATIONS
Experience in healthcare, payer-provider, clinical, or regulated data environments.
Familiarity with EHR, claims, FHIR, HL7, interoperability, or other healthcare data standards.
Experience with governance and secure access models using tools such as Unity Catalog, Lake Formation, or equivalent.
Experience supporting AI/ML, feature engineering, vector or unstructured data pipelines, or data products for GenAI use cases.
Exposure to infrastructure-as-code, automated testing, and platform engineering practices.
Experience mentoring engineers and influencing cross-functional technical direction.
Additional related education and/or experience preferred.
Anticipated Closing Window (actual close date may be sooner): 2026-07-23
EEO Statement: EEO/Disabled/Veterans
Our organization supports a drug-free work environment.
Privacy Policy: Privacy Policy