Data Processing Principal Engineer
DataPelago
Lead architecture, design, and implementation of next-generation data processing engine built to exploit accelerated computing.
The developer will have the following specific responsibilities in achieving these objectives.
Responsibilities
• Work with cross-functional teams to advance the architecture at the core of Datapelago’s parallel and distributed execution engine based on accelerated computing
• Lead the execution engine team in the design, implementation, and rollout of enterprise grade, highly reliable acceleration engine for data processing
• Design, implement, test, and maintain major components of the execution engine • Analyze and identify areas for differentiation and improvements in the execution engine • Collaborate with other teams and team members to drive code reviews, design reviews,
performance and reliability reviews, and development process and to drive continuous improvement in all of these
Qualifications
• B.S. EE/CS or equivalent with 15+ years experience or MS with 10+ years experience • 10+ years of experience developing core components of an enterprise-grade database or analytics execution engine serving large-scale data processing workloads. Experience developing for platforms such as Apache Spark, Gluten, Velox, DataFusion preferred. • Experienced developing high-performance parallel implementations of data processing operators and functions, such as joins, aggregations, sorts
• Experience leading 10+ teams in designing, developing, and releasing high-performance data processing engines for large production deployments
• Strong programming ability in C, C++, and Rust
• Strong development experience on Linux platforms