Job Url: https://cloudera.wd5.myworkdayjobs.com/External_Career/job/US-Texas-Remote/Principal-Engineer----Apache-Spark_250521-1
Job Description: Principal Software Engineer - Apache Spark
Remote type: Remote
Locations: US-Texas-Remote, US-Florida-Remote, US-Delaware-Remote, US-Georgia-Atlanta, US-Georgia-Remote (9 locations in total)
Time type: Full time
Posted: Today
Job requisition ID: 250521
Business Area: Engineering
Seniority Level: Director

At Cloudera, we empower people to transform complex data into clear and actionable insights. With as much data under management as the hyperscalers, we're the preferred data partner for the top companies in almost every industry. Powered by the relentless innovation of the open source community, Cloudera advances digital transformation for the world’s largest enterprises.

Cloudera is seeking an experienced Principal Engineer with strong distributed systems expertise to work on the Cloudera distribution of Apache Spark. We are looking for senior engineers with experience in large-scale, distributed systems and data processing to help build our enterprise-grade system, designed for customers running Spark on thousands of nodes and processing petabytes of data.

We are looking for a passionate individual who is ready to lead a team that already supports production systems at many of the biggest companies and is looking to expand and take on even more projects to drive the next-generation Data Engineering experience. You will be working with a distributed team, spread across the United States and Hungary, including multiple committers on Apache Spark.

As a Principal Software Engineer, you will:
- Architect, design, and implement resilient and scalable solutions for distributed data processing at massive scale, with a focus on fault tolerance, performance optimization, query planning, and resource management
- Take ownership of critical distributed systems components, solving complex challenges related to network communication, concurrency, data consistency, and system reliability across clusters of thousands of nodes
- Develop advanced monitoring, debugging, and performance analysis tools for large-scale distributed systems
- Act as a tech lead for Cloudera’s Spark team
- Work with and contribute to the latest open source technologies, including Apache Spark, Iceberg, and Parquet
- Develop new features in Scala/Java on modern platforms
- Gain a solid understanding and deep technical knowledge of components across the Cloudera Data Engineering Experience stack, with a focus on Iceberg and Spark
- Debug system-level deployment issues, perform root cause analysis and system test analysis, and resolve failures
- Work on improving internal infrastructure
- Collaborate with other team members and stakeholders

We are excited about you if you have:
- Bachelor’s degree in Computer Science or equivalent, and 10+ years of experience; OR Master’s degree and 6+ years of experience; OR PhD and 4+ years of experience
- Experience with systems design and development specifically for large-scale distributed environments
- Experience leading and delivering complex product enhancements
- A strong understanding of Java and Scala, the two languages we use in our projects
- Passion for programming, clean coding habits, attention to detail, and a focus on quality
- Strong oral and written communication skills
- Strong ability to research and solve problems independently without constant supervision
- (Most importantly) An open mind and a desire to learn new things and build great products

You may also have:
- Experience with large-scale, distributed systems design and development with an understanding of scaling, performance, and scheduling
- In-depth understanding of distributed query processing and planning
- Experience with using/developing Apache Spark or other related technologies
- In-depth understanding of distributed systems concepts like consensus algorithms, distributed transactions, and fault tolerance
- Experience working with automated query optimization
- Solid experience with at least one cloud service (AWS, Azure, GCP, OpenShift)
- Contributions to open-source projects

This role is not eligible for immigration sponsorship.

What you can expect from us:
- Generous PTO Policy
- Support work life balance with Unplugged Days
- Flexible WFH Policy
- Mental & Physical Wellness programs
- Phone and Internet Reimbursement program
- Access to Continued Career Development
- Comprehensive Benefits and Competitive Packages
- Paid Volunteer Time
- Employee Resource Groups

EEO/VEVRAA

Recruitment Fraud Alert
It has come to our attention that job seekers have been contacted about fake job opportunities with Cloudera from individuals fraudulently posing as Cloudera employees. These recruiting fraud schemes often include requests for personal information and payments. Be aware that Cloudera will never request a payment as part of its recruitment process. Additionally, Cloudera will never make a job offer without conducting an interview process. Any information submitted to Cloudera in relation to a job application should be submitted only through our official career portal (https://www.cloudera.com/careers.html). Email communications from Cloudera will come from an email address ending in @cloudera.com. If you are the target of a recruiting scam, consider filing a report with law enforcement authorities.
Cloudera is not responsible for fraudulent job offers and/or any claims, damages, expenses, or other inconvenience connected to recruiting scams.

If you have any questions about our privacy practices, please contact us at privacy@cloudera.com. If you need assistance with applying for a position, please email our office at talentacquisition@cloudera.com.