Data Engineer

Xerox logo

Xerox

View Salaries, Reviews, and more  

Job Description


About Xerox Holdings Corporation

For more than 100 years, Xerox has continually redefined the workplace experience. Harnessing our leadership position in office and production print technology, we’ve expanded into software and services to sustainably power the hybrid workplace of today and tomorrow. Today, Xerox is continuing its legacy of innovation to deliver client-centric and digitally-driven technology solutions and meet the needs of today’s global, distributed workforce. From the office to industrial environments, our differentiated business and technology offerings and financial services are essential workplace technology solutions that drive success for our clients. At Xerox, we make work, work. Learn more about us at www.xerox.com.

Job Role: Data Engineer

Job Description: A Data Engineer with AI/ML focus combines traditional data engineering responsibilities with the technical requirements for supporting Machine Learning (ML) systems and artificial intelligence (AI) applications. This role involves not only designing and maintaining scalable data pipelines but also integrating advanced AI/ML models into the data infrastructure. The role is critical for enabling data scientists and ML engineers to efficiently train, test, and deploy models in production. This role is also responsible for designing, building, and maintaining scalable data infrastructure and systems to support advanced analytics and business intelligence. This role often involves leading mentoring junior team members, and collaborating with cross-functional teams.

Core Competencies

  • Data Engineering: Hands on experience on Databricks, Azure data factory, ETL/ELT and Data Warehouse knowledge, Python, PySpark, Strong SQL skills
  • Cloud Platforms: Azure (must), Amazon (Good to have)
  • Languages: Python, SQL, Scala, Bash
  • DevOps & Infra: CI/CD pipelines(must) , Docker, Kubernetes, Terraform (Good to have)
  • ML/AI Integration: MLflow, Feature Store, TensorFlow, PyTorch, Hugging Face (Good to have)
  • GenAI: OpenAI API, Vector DBs (Good to have)

Key Responsibilities:

Data Infrastructure for AI/ML:

  • Design and implement robust data pipelines that support data preprocessing, model training, and deployment.
  • Ensure that the data pipeline is optimized for high-volume and high-velocity data required by ML models.
  • Build and manage feature stores that can efficiently store, retrieve, and serve features for ML models.

AI/ML Model Integration:

  • Collaborate with ML engineers and data scientists to integrate machine learning models into production environments.
  • Implement tools for model versioning, experimentation, and deployment (e.g., MLflow, Kubeflow, TensorFlow Extended).
  • Support automated retraining and model monitoring pipelines to ensure models remain performant over time.

Data Architecture & Design

  • Design and maintain scalable, efficient, and secure data pipelines and architectures.
  • Develop data models (both OLTP and OLAP).
  • Create and maintain ETL/ELT processes.

Data Pipeline Development

  • Build automated pipelines to collect, transform, and load data from various sources (internal and external).
  • Optimize data flow and collection for cross-functional teams.

MLOps Support:

  • Develop CI/CD pipelines to deploy models into production environments.
  • Implement model monitoring, alerting, and logging for real-time model predictions.

Data Quality & Governance

  • Ensure high data quality, integrity, and availability.
  • Implement data validation, monitoring, and alerting mechanisms.
  • Support data governance initiatives and ensure compliance with data privacy laws (e.g., GDPR, HIPAA).

Tooling & Infrastructure

  • Work with cloud platforms (AWS, Azure, GCP) and data engineering tools like Apache Spark, Kafka, Airflow, etc.
  • Use containerization (Docker, Kubernetes) and CI/CD pipelines for data engineering deployments.

Team Collaboration & Mentorship

  • Collaborate with data scientists, analysts, product managers, and other engineers.
  • Provide technical leadership and mentor junior data engineers.

Soft Skills:

  • Strong problem-solving and critical-thinking skills.
  • Excellent communication and collaboration abilities.
  • Leadership experience and the ability to guide technical decisions.

Educational Qualifications:

  • Bachelor's or Master's degree in Computer Science, Engineering, or related field.
  • 4+ years of experience in data engineering.
  • Strong understanding of data modeling, ETL/ELT concepts, and distributed systems.
  • Experience with big data tools and cloud platforms.

Interview Questions of Data Engineer at Xerox

Currently, there aren't any interview questions for this role at Xerox shared by other job seekers.
View more interview questions of similar roles from other companies →
banner icon
Prepare For Your Interview in 1 Week?
Equip yourself with possible questions that interviewers might ask you, based on your work experience and job description.
Get Started!

Salary Insights of Data Engineer at Xerox

Currently, there aren't any salaries for this role at Xerox shared by other job seekers.

View more salaries from Xerox →

Achieve your dream job with our top-notch tools!

Resume Checker Illustration

Resume Checker

Our free resume checker analyzes the job description and identifies important keywords and skills missing from your resume in just a minute!

Check Now
Interview Preparation Illustration

AI InterviewPrep

Utilizing advanced AI, our tool generates tailored interview questions based on your industry, role, and experience. Practice and receive feedback on your answers in real time!

Check Now
Resume Builder Illustration

Resume Builder

Let us show you the differences between a bad, good, and great resume, and guide you in building a resume that helps you stand out to employers, ensuring you land your next position faster!

Check Now