Project Role : Data Engineer
Project Role Description : Design, develop and maintain data solutions for data generation, collection, and processing. Create data pipelines, ensure data quality, and implement ETL (extract, transform and load) processes to migrate and deploy data across systems.
Must have skills : Microsoft Azure Databricks
Good to have skills : NA
Minimum 7.5 Year(s) Of Experience Is Required
Educational Qualification : 15 years full time education
Summary:
As a Data Engineer, a typical day involves designing, developing, and maintaining comprehensive data solutions that support the generation, collection, and processing of data. The role includes creating efficient data pipelines and ensuring the integrity and quality of data throughout its lifecycle. Additionally, the position requires implementing processes to extract, transform, and load data, facilitating seamless migration and deployment across various systems. This dynamic environment encourages continuous collaboration and innovation to meet evolving data needs and support organizational objectives.
Roles & Responsibilities:
- Expected to be an SME, collaborate and manage the team to perform.
- Responsible for team decisions.
- Engage with multiple teams and contribute on key decisions.
- Provide solutions to problems for their immediate team and across multiple teams.
- Lead efforts to optimize data workflows and improve system performance.
- Mentor junior team members to foster skill development and knowledge sharing.
- Coordinate cross-functional initiatives to align data strategies with business goals.
- Build and operate scalable Lakehouse pipelines on Databricks/Azure. Own ELT/streaming, Delta Lake optimization, Unity Catalog governance, and CI/CD. Integrate ADLS/ADF/Synapse, and deliver high-quality data sets for BI/ML/GenAI.
- Must-haves: PySpark, SQL, Databricks (Delta, DLT/Workflows), Azure data services, Unity Catalog, CI/CD.
MLflow/Feature Store, Power BI, streaming/CDC, vector search/RAG, Terraform.
Professional & Technical Skills:
- Must To Have Skills: Proficiency in Microsoft Azure Databricks.
- Experience in designing and implementing scalable data pipelines and ETL processes.
- Strong knowledge of cloud-based data storage and processing solutions.
- Ability to troubleshoot and resolve complex data integration issues.
- Familiarity with data governance and data quality best practices.
- Skilled in performance tuning and optimization of data workflows.
Additional Information:
- The candidate should have minimum 7.5 years of experience in Microsoft Azure Databricks.
- This position is based at our Pune office.
- A 15 years full time education is required.