Lead Data Engineer - Pyspark

Relanto logo

Relanto

View Salaries, Reviews, and more  

Job Summary


Job Type
-

Seniority

Years of Experience
Information not provided

Tech Stacks
Python SQL Airflow Azure Git Apache pySpark CI Analytics Data Extraction AWS Snowflake ETL

Job Description

Job Summary

We are looking for an experienced Data Engineer with 7+ years of expertise in building scalable data pipelines and modern data platforms. The ideal candidate will have strong hands-on experience in PySpark, SQL, Python, Airflow, dbt, and Snowflake, with the ability to design robust data architectures that support analytics and business intelligence needs.

Responsibilities

  • Design, develop, and maintain scalable and efficient data pipelines using PySpark and Python
  • Build and optimize data workflows and orchestration pipelines using Apache Airflow
  • Develop and manage data transformation models using dbt (Data Build Tool)
  • Implement and maintain data warehouse solutions on Snowflake
  • Write complex and optimized SQL queries for data extraction, transformation, and analysis
  • Ensure data quality, integrity, and reliability across pipelines and systems
  • Collaborate with Data Scientists, Analysts, and Business teams to understand data requirements and deliver solutions
  • Optimize data processing performance and cost efficiency in cloud environments
  • Implement best practices for data governance, security, and compliance
  • Troubleshoot and resolve data-related issues in production environments
  • Mentor junior engineers and contribute to improving engineering standards and practices

Required Skills

  • 7+ years of experience in Data Engineering or related roles
  • Must have strong experience in DBT
  • Strong proficiency in Python and PySpark for large-scale data processing
  • Advanced knowledge of SQL for data manipulation and performance tuning
  • Hands-on experience with Apache Airflow for workflow orchestration
  • Strong experience with dbt for data transformation and modeling
  • Expertise in Snowflake including data modeling, performance tuning, and cost optimization
  • Experience working with cloud platforms (AWS, Azure, or GCP)
  • Strong understanding of data warehousing concepts and ETL/ELT frameworks
  • Familiarity with version control systems (e.g., Git) and CI/CD pipelines
  • Excellent problem-solving and analytical skills

Interview Questions of Lead Data Engineer - Pyspark at Relanto

Currently, there aren't any interview questions for this role at Relanto shared by other job seekers.
View more interview questions of similar roles from other companies โ†’
banner icon
Prepare For Your Interview in 1 Week?
Equip yourself with possible questions that interviewers might ask you, based on your work experience and job description.
Get Started!

Salary Insights of Lead Data Engineer - Pyspark at Relanto

Currently, there aren't any salaries for this role at Relanto shared by other job seekers.

View more salaries from Relanto โ†’

Achieve your dream job with our top-notch tools!

Resume Checker Illustration

Resume Checker

Our free resume checker analyzes the job description and identifies important keywords and skills missing from your resume in just a minute!

Check Now
Interview Preparation Illustration

AI InterviewPrep

Utilizing advanced AI, our tool generates tailored interview questions based on your industry, role, and experience. Practice and receive feedback on your answers in real time!

Check Now
Resume Builder Illustration

Resume Builder

Let us show you the differences between a bad, good, and great resume, and guide you in building a resume that helps you stand out to employers, ensuring you land your next position faster!

Check Now